CN104579790B - A kind of method determining link prediction reduction limit quantity - Google Patents
A kind of method determining link prediction reduction limit quantity Download PDFInfo
- Publication number
- CN104579790B CN104579790B CN201510037313.9A CN201510037313A CN104579790B CN 104579790 B CN104579790 B CN 104579790B CN 201510037313 A CN201510037313 A CN 201510037313A CN 104579790 B CN104579790 B CN 104579790B
- Authority
- CN
- China
- Prior art keywords
- limit
- node
- index
- add
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The invention belongs to network reconfiguration and link prediction field, be specifically related to a kind of method determining link prediction reduction limit number, the present invention is based on the interlinking of link prediction and limit cluster, utilize the segmentation density index in the cluster of limit to determine the reduction limit quantity in link prediction process.Use link Predicting Technique to calculate the right connection probability of all nodes not connecting limit, and sort according to connection probability.Sound out in a network successively according to clooating sequence and add each limit, all to re-start limit cluster analysis to network, in the cluster process of limit, computed segmentation density index, by adding the change of network maximum fractionation density before and after a certain bar limit, and then determine whether add this limit.The invention solves after link prediction, determine the problem of reduction limit quantity; The principle of the invention is simple, clear process, be easy to realize, and can provide the support in decision-making, improve the efficiency of decision-making for policymaker.
Description
Technical field
The invention belongs to network reconfiguration and link prediction field, be specifically related to a kind of method determining link prediction reduction limit number.
Background technology
Link prediction is to be solved is to the disappearance reduction on limit and the problem of prediction in complex network, existing link prediction method, for the consideration of computational speed and computation complexity, uses the link prediction based on similitude in practical application.Existing link prediction technology cannot determine to reduce the quantity on limit.By existing method, policymaker can only the right connection probability of the company of calculating limit (not having limit to be directly connected in observation grid) node and connect the sequence of probability, and do not know to reduce the quantity on limit, but this is the problem that must solve in network reconfiguration process.
Introduce several existing link prediction index based on similitude below.
Node v in define grid G set of node
x, the set of its neighbor node is designated as Γ (x), and the number of neighbor node is designated as k
x.Link prediction method based on similitude is thought, two node v
xand v
ysimilitude s
xylarger, between these two nodes, the possibility on the company of generation limit is larger.
(1) common neighbours' index
Common neighbours' similitude can be described as structural equivalence again, if namely two nodes have a lot of common neighbor node, so these two nodes are similar.Visible, structural equivalence pay close attention to be two some whether be in same environment.If it is more that two points share common neighbours, then their the more company of tending to limits.
Common neighbours' index mathematical definition is: two node v
xand v
ysimilitude be their common neighbours' number, that is:
s
xy=|Γ(x)∩Γ(y)|
If consider the impact of the degree of two end nodes on similarity between two nodes from different perspectives, produce again following 6 kinds of similarity indices.
1) Salton index
[1]
Salton index is also known as cosine similarity, and it reflects the ratio of common neighbours' number and two-end-point geometric mean, is defined as
2) Jaccard index
[2]
This index was just suggested before more than 100 years, its reflection be the ratio of two node common neighbours numbers and whole neighbours' quantity, be defined as
3) Sorenson index
[3]
This index is usually used in ecological data research, and it reflects that the ratio of common neighbours' number and two-end-point algebraic mean number is defined as
4) the favourable index of magnanimous node (HPI)
[4]
This index is thought, spend larger node, the centrality shown in a network is more obvious, and easy and other nodes produce and connect limit, and it is defined as
Because the less node of denominator degree of being determines, under this definition, between known magnanimous node and other nodes, more easily there is high similarity.
5) the unfavorable index of magnanimous node (HDI)
This index is contrary with HPI index, and it thinks that the large node of network moderate is unfavorable for the connection with other nodes.Contrary with HPI index, in similarity indices formula, denominator gets the maximum of two node degrees, is defined as
6) LHN-I index
[5]
This index is proposed by Leicht, Holme and Newman, is defined as
Wherein, denominator k
xk
ybe proportional to node v
xand v
ythe desired value of common neighbours' number, i.e. E (| Γ (x) ∩ Γ (y) |).
(2) AA index
[6]
Adamic-Adar index (AA index) considers the information of the degree of the common neighbours of two nodes.The contribution of common neighbours to similitude that its thought degree of being is little is larger.This is readily appreciated that in social networks: such as in microblogging, two people not theed least concerned can pay close attention to a star (having more magnanimous node), but two people of focus attentions equally on domestic consumer (having the node of less degree) are probably friends.AA index is that each common neighbor node gives a weighted value, is defined as
(3) Resourse Distribute index index
[7]
To consider in network G non-conterminous two node v arbitrarily
xand v
ysince network is communicated with, and can think v
xsome information can be transmitted to v
y, and their common neighbours just can regard the medium that information is transmitted as.Suppose the resource of each common neighbours Dou Youyige unit, and mean allocation is passed to its neighbours, then from v
xv can be delivered to
ynumber of resources just can be defined as node v
xand v
ysimilarity, that is:
[1]SaltonG,McGillMJ.Introductiontomoderninformationretrieval[J].1983.
[2]JaccardP.EtudecomparativedeladistributionfloraledansuneportiondesAlpesetduJura[M].Impr.Corbaz,1901.
[3]SorensenT.AmethodofestablishinggroupsofequalamplitudeinplantsociologybasedonsimilarityofspeciescontentanditsapplicationtoanalysesofthevegetatianonDanishcommons[J].BiologiskeSkrifter,1948,5(4):1-34
[4]RavaszE,SomeraAL,MongruDA,etal.Hierarchicalorganizationofmodularityinmetabolicnetworks[J].science,2002,297(5586):1551-1555.
[5]LeichtEA,HolmeP,NewmanMEJ.Vertexsimilarityinnetworks[J].PhysicalReviewE,2006,73(2):026120.
[6]AdamicLA,AdarE.Friendsandneighborsontheweb[J].Socialnetworks,2003,25(3):211-230.
[7]ZhouT,LüL,ZhangYC.Predictingmissinglinksvialocalinformation[J].TheEuropeanPhysicalJournalB,2009,71(4):623-630.
The present invention is intended to solve the problem determining reduction limit number in link prediction (link prediction is to be solved is to the disappearance reduction on limit and the problem of prediction in complex network) process.Method due to existing link prediction only gives the right connection probability of non-connected node in network, and one does not determine the method on reduction how many limits, and for policymaker, a kind of method of reduction limit quantity that can instruct has great importance.
Summary of the invention
Reduce limit count issue for solving in above-mentioned link reduction process, the invention provides and propose a kind of method can determining to reduce limit quantity, concrete technical scheme is as follows:
Determine a method for link prediction reduction limit quantity, comprise the following steps:
(1) initialization sets reduction limit quantity n as 0, selects link prediction similarity indices, does not connect the connection probability that the node on limit is right in computing network, and according to connection probability from big to small by node to sequence composition sequence collection;
(2) judge whether sequence sets is empty set, if empty set, goes to step (5), otherwise, from sequence sets, select the node pair of current connection maximum probability, judge whether to there is multiple node to connection maximum probability simultaneously; If exist, for connecting the equal node of probability to processing, in current network respectively heuristically these nodes between add a limit, calculate the maximum fractionation density after adding, the rear maximum fractionation density calculated is added from big to small according to souning out respectively, these being connected the equal node of probability to carrying out partial ordering's composition local sequence sets, entering step (3); If do not exist, directly enter step (4);
(3) selection node pair is successively concentrated from local sequence, calculate and compare this node between add the maximum fractionation density of a forward and backward network in limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, if node corresponding to this limit is to being last node pair that local sequence is concentrated, from sequence sets, then delete all nodes pair that local sequence is concentrated, renewal sequence collection, goes to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(4) calculate and compare this node between add the maximum fractionation density of network before and after a limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, node corresponding for this limit is deleted from sequence sets, renewal sequence collection; Go to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(5) end loop, Output rusults n value.
Adopt the technique effect that the present invention obtains: the invention solves after link prediction, determine the problem of reduction limit quantity; The principle of the invention is simple, clear process, be easy to realize, and can provide the support in decision-making, improve the efficiency of decision-making for policymaker;
Accompanying drawing explanation
Fig. 1 is that the present invention determines reduction limit quantitative approach flow chart;
Fig. 2 is limit of the present invention similitude exemplary graph;
Tu3Shi karate club network topological diagram.
Embodiment
Below in conjunction with the drawings and specific embodiments, the present invention will be further described.
As shown in Figure 1, the present invention is based on the interlinking of link prediction and limit cluster, utilize the segmentation density index in the cluster of limit to determine the reduction limit quantity in link prediction process.Link Predicting Technique to be used before the present invention to calculate the right connection probability of all nodes not connecting limit using, and sort according to connection probability.Sound out in a network successively according to clooating sequence and add each limit, all to re-start limit cluster analysis to network, in the cluster process of limit, computed segmentation density index, by adding the change of network maximum fractionation density before and after a certain bar limit, and then determine whether add this limit.
Introduce the computational methods of network maximum fractionation density below.
First according to the index that link prediction uses, limit similarity indices formula is determined.As shown in Figure 2, limit similitude is for adjacent edge (having two limits of public vertex).In the present embodiment, link prediction uses Jaccard index, and Similarity measures formula is:
so in the cluster process of limit, limit similarity definition is
wherein, e
ik, e
jkfor there being two limits of common node, Γ (i), Γ (j) are node v
i, v
jneighbor node set.
After determining limit similarity indices formula, the limit similitude of all adjacent edges in computing network.Before the cluster of limit, make every bar limit alone as a class.Maximum for limit similitude two limits (not also being classified as a class) are classified as a class, until all limits are all classified as a class in network by each cluster.
After each cluster, first calculate the Connection Density D of each class inside
c:
Wherein, C represents the class that cluster obtains, m
cthe number of the internal edges of representation class, n
cthe number (concrete list of references AHNY-Y, BAGROWJP, LEHMANNS.Linkcommunitiesrevealmultiscalecomplexityinnetw orks.Nature, 2010,466:761-764.) of the point that representation class comprises.After each cluster, the segmentation density D of whole network is exactly the weighted average of the inside Connection Density of all subclasses, is calculated by following formula:
Wherein, M represents limit number total in network.Network every bar limit before the cluster of limit, alone as a class, is all classified as in a class process to final all limits, and each cluster all calculates once splits density, and the maximum of all segmentation density is exactly the maximum fractionation density of network.
For karate club network, network topology is illustrated in fig. 3 shown below, and number in the figure 1-33 represents node.Perform step 1, initialization n=0, use Jaccard index to carry out link prediction, calculate each node to connection probability, and arrange from big to small, composition sequence collection is as follows:
Node pair | Connect probability |
{7,12} | 0.571429 |
{4,5} | 0.5 |
{6,9} | 0.5 |
{13,15} | 0.5 |
{24,25} | 0.5 |
{24,26} | 0.5 |
{25,26} | 0.5 |
{24,27} | 0.5 |
{25,27} | 0.5 |
{26,27} | 0.5 |
{24,28} | 0.5 |
{25,28} | 0.5 |
{26,28} | 0.5 |
{27,28} | 0.5 |
{12,14} | 0.428571 |
{13,14} | 0.4 |
… | … |
{3,23} | 0.041667 |
Perform step 2, sequence sets is not empty set, selects the node of connection maximum probability to { 7,12} does not exist with { 7,12} node, to connecting other equal nodes pair of probability, directly enters step 4.
Perform step 4, network adds limit, and { the maximum fractionation density after 7,12} is greater than interpolation limit { the maximum fractionation density before 7,12}, therefore { value of 7,12}, n adds 1 (now n=1) to add limit, by node to 7,12} deletes from sequence sets, and now sequence sets is as follows:
Node pair | Connect probability |
{4,5} | 0.5 |
{6,9} | 0.5 |
{13,15} | 0.5 |
{24,25} | 0.5 |
{24,26} | 0.5 |
{25,26} | 0.5 |
{24,27} | 0.5 |
{25,27} | 0.5 |
{26,27} | 0.5 |
{24,28} | 0.5 |
{25,28} | 0.5 |
{26,28} | 0.5 |
{27,28} | 0.5 |
{12,14} | 0.428571 |
{13,14} | 0.4 |
… | … |
{3,23} | 0.041667 |
Enter step 2.
Perform step 2, sequence sets is not empty set, selects the node of connection maximum probability to { 4,, there is node to { 6,9}, { 13 in 5}, 15}, { 24,25}, { 24,26}, { 25,26}, { 24,27}, { 25,27}, { 26,27}, { 24,28}, { 25,28}, { 26,28}, { 27,28} and node are to { 4,5} has equal connection probability.Calculate the maximum fractionation density distinguished in a network after these limits of exploratory interpolation, arrange from big to small, composition local sequence sets, as follows:
Enter step 3.
Perform step 3, in order the exploratory limit of adding local sequence and concentrating.{ before the maximum fractionation density after 4,5} is greater than interpolation, so add limit, { 4,5}, n add 1 (now n=2), by node to { 4,5} deletes from the sequence of local to add limit.Now local sequence sets is:
{ before the maximum fractionation density after 6,9} is greater than interpolation, so add limit, { 6,9}, n add 1 (now n=3), by node to { 6,9} deletes from the sequence of local to add limit.Now local sequence sets is:
{ before the maximum fractionation density after 13,15} is greater than interpolation, so add limit, { 13,15}, n add 1 (now n=4), by node to { 13,15} deletes from the sequence of local to add limit.Now local sequence sets is
{ before the maximum fractionation density after 24,25} is less than interpolation, so do not add limit, { 24,25} goes to step 5 to add limit.
Perform step 5, delete all sequences collection, exit circulation, terminate.N=4, the limit of adding is { 7,12}, { 4,5}, { 6,9}, { 13,15}.
Limit similarity indices is relevant with the similarity indices used by the link prediction stage.In real world applications, different similarity indices is used to carry out link prediction, different limit similarity indices will be used to determine the quantity on reduction limit, the limit Similarity measures formula in link prediction use Jaccard index situation is given in this specification, but be not limited thereto, use other similarity indices situations when link prediction under, corresponding limit Similarity measures formula is as follows:
E in table
xk, e
ykfor there being two limits of common node, Γ (x), Γ (y) are node v
x, v
yneighbor node set.
Content described in this specification embodiment is only enumerating the way of realization that the present invention conceives; should not being considered as of protection scope of the present invention is only limitted to the concrete form that embodiment is stated, protection scope of the present invention also and conceive the equivalent technologies means that can expect according to the present invention in those skilled in the art.
Claims (2)
1. determine a method for link prediction reduction limit quantity, it is characterized in that, comprise the following steps:
(1) initialization sets reduction limit quantity n as 0, selects link prediction similarity indices, does not connect the connection probability that the node on limit is right in computing network, and according to connection probability from big to small by node to sequence composition sequence collection;
(2) judge whether sequence sets is empty set, if empty set, goes to step (5), otherwise, from sequence sets, select the node pair of current connection maximum probability, judge whether to there is multiple node to connection maximum probability simultaneously; If exist, for connecting the equal node of probability to processing, in current network respectively heuristically these nodes between add a limit, calculate the maximum fractionation density after adding, the rear maximum fractionation density calculated is added from big to small according to souning out respectively, these being connected the equal node of probability to carrying out partial ordering's composition local sequence sets, entering step (3); If do not exist, directly enter step (4);
(3) selection node pair is successively concentrated from local sequence, calculate and compare this node between add the maximum fractionation density of a forward and backward network in limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, if node corresponding to this limit is to being last node pair that local sequence is concentrated, from sequence sets, then delete all nodes pair that local sequence is concentrated, renewal sequence collection, goes to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(4) calculate and compare this node between add the maximum fractionation density of network before and after a limit, if add the maximum fractionation density behind this limit be more than or equal to interpolation before maximum fractionation density, then add this limit in a network, the value of n adds 1, node corresponding for this limit is deleted from sequence sets, renewal sequence collection; Go to step (2); If add the maximum fractionation density behind this limit be less than interpolation before maximum fractionation density, then do not add this limit, go to step (5);
(5) end loop, Output rusults n value.
2. a kind of method determining link prediction reduction limit quantity as claimed in claim 1, it is characterized in that, described link prediction similarity indices is any one index in Salton index, Jaccard index, Sorenson index, HPI, HDI, LHN-I index and AA index, wherein HPI represents the favourable index of magnanimous node, HDI represents the unfavorable index of magnanimous node, the index abbreviated name that LHN-I index expression Leicht, Holme and Newman propose, AA index expression Adamic-Adar index.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510037313.9A CN104579790B (en) | 2015-01-26 | 2015-01-26 | A kind of method determining link prediction reduction limit quantity |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510037313.9A CN104579790B (en) | 2015-01-26 | 2015-01-26 | A kind of method determining link prediction reduction limit quantity |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104579790A CN104579790A (en) | 2015-04-29 |
CN104579790B true CN104579790B (en) | 2016-01-20 |
Family
ID=53095055
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510037313.9A Active CN104579790B (en) | 2015-01-26 | 2015-01-26 | A kind of method determining link prediction reduction limit quantity |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104579790B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104866558B (en) * | 2015-05-18 | 2018-08-10 | 中国科学院计算技术研究所 | A kind of social networks account mapping model training method and mapping method and system |
CN105183796A (en) * | 2015-08-24 | 2015-12-23 | 同济大学 | Distributed link prediction method based on clustering |
CN105228185B (en) * | 2015-09-30 | 2017-03-22 | 中国人民解放军国防科学技术大学 | Method used for identifying identity of fuzzy redundant node in communication network |
CN107194069A (en) * | 2017-05-23 | 2017-09-22 | 浙江工业大学 | A kind of link prediction method favourable based on Bayesian Estimation and magnanimous node |
CN110942178B (en) * | 2019-11-01 | 2022-08-23 | 南京邮电大学 | Charging pile recommendation method based on link prediction method of resource allocation index |
CN112434437B (en) * | 2020-12-02 | 2023-08-25 | 大连大学 | Method for constructing equipment support super-network dynamic evolution model by considering node recombination |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101960329A (en) * | 2006-12-14 | 2011-01-26 | 波音公司 | Method and device for trilateration using los link prediction and pre-measurement los path filtering |
CN103905246A (en) * | 2014-03-06 | 2014-07-02 | 西安电子科技大学 | Link prediction method based on grouping genetic algorithm |
-
2015
- 2015-01-26 CN CN201510037313.9A patent/CN104579790B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101960329A (en) * | 2006-12-14 | 2011-01-26 | 波音公司 | Method and device for trilateration using los link prediction and pre-measurement los path filtering |
CN103905246A (en) * | 2014-03-06 | 2014-07-02 | 西安电子科技大学 | Link prediction method based on grouping genetic algorithm |
Also Published As
Publication number | Publication date |
---|---|
CN104579790A (en) | 2015-04-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104579790B (en) | A kind of method determining link prediction reduction limit quantity | |
Inglesi-Lotz et al. | The role of renewable versus non-renewable energy to the level of CO2 emissions a panel analysis of sub-Saharan Africa’s Βig 10 electricity generators | |
CN103324980B (en) | A kind of method for forecasting | |
Delvenne et al. | Centrality measures and thermodynamic formalism for complex networks | |
CN103064917B (en) | The high-impact customer group of a kind of specific tendency towards microblogging finds method | |
Valverde-Rebaza et al. | Link prediction in complex networks based on cluster information | |
CN104933624A (en) | Community discovery method of complex network and important node discovery method of community | |
CN106991617B (en) | Microblog social relationship extraction algorithm based on information propagation | |
CN105095419A (en) | Method for maximizing influence of information to specific type of weibo users | |
CN103179198B (en) | Based on the topic influence individual method for digging of many relational networks | |
CN105550275A (en) | Microblog forwarding quantity prediction method | |
CN103838804A (en) | Social network user interest association rule mining method based on community division | |
CN107317704A (en) | A kind of complex network important node sort method based on tight ness rating and structural hole | |
CN105976048A (en) | Power transmission network extension planning method based on improved artificial bee colony algorithm | |
CN106411904B (en) | A kind of network risks control method based on microstate prediction | |
CN104077412A (en) | Micro-blog user interest prediction method based on multiple Markov chains | |
CN110995597A (en) | Method and system for selecting safe link of power communication network | |
Yeh | A sequential decomposition method for estimating flow in a multi-commodity, multistate network | |
CN105045967A (en) | Group degree based sorting method and model evolution method for important nodes on complex network | |
CN106327026A (en) | Wind power prediction optimization method | |
CN101859315A (en) | Heuristic solving method for maximizing influence of social network | |
CN107463551A (en) | The analysis method and device of social networks interpersonal relationships | |
CN104715034A (en) | Weighed graph overlapping community discovery method based on central persons | |
CN106451431A (en) | Transmission grid loss allocation method in hybrid transaction mode | |
Bagheri et al. | FSIM: A fast and scalable influence maximization algorithm based on community detection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |