CN106817251B - Link prediction method and device based on node similarity - Google Patents

Link prediction method and device based on node similarity Download PDF

Info

Publication number
CN106817251B
CN106817251B CN201611207950.7A CN201611207950A CN106817251B CN 106817251 B CN106817251 B CN 106817251B CN 201611207950 A CN201611207950 A CN 201611207950A CN 106817251 B CN106817251 B CN 106817251B
Authority
CN
China
Prior art keywords
nodes
node
common neighbor
similarity
cluster coefficient
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201611207950.7A
Other languages
Chinese (zh)
Other versions
CN106817251A (en
Inventor
刘大伟
柯枫
刘玮
隋雪青
程学旗
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yantai Branch Institute Of Computing Technology Chinese Academy Of Science
Original Assignee
Yantai Branch Institute Of Computing Technology Chinese Academy Of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yantai Branch Institute Of Computing Technology Chinese Academy Of Science filed Critical Yantai Branch Institute Of Computing Technology Chinese Academy Of Science
Priority to CN201611207950.7A priority Critical patent/CN106817251B/en
Publication of CN106817251A publication Critical patent/CN106817251A/en
Application granted granted Critical
Publication of CN106817251B publication Critical patent/CN106817251B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12Discovery or management of network topologies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00Routing or path finding of packets in data switching networks
    • H04L45/02Topology update or discovery

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a link prediction method based on node similarity, which comprises the following steps: carrying out node and link representation on a network to be analyzed; acquiring two nodes which are not directly connected by a link in a network; reading respective neighbor node sets of the two nodes without direct link connection; combining and taking intersection of neighbor node sets of the two nodes which are not connected by the direct link to obtain a common neighbor set; taking the common neighbor set as a subnet, and calculating the global cluster coefficient of the subnet and the cluster coefficient of any common neighbor node in the subnet; calculating the similarity of the two nodes without direct link connection according to the global cluster coefficient of the subnet and the cluster coefficient of any common neighbor node in the subnet; and performing link prediction according to the calculated node similarity. The invention defines a new index for calculating the node similarity based on the local cluster coefficient from the viewpoint of the cluster coefficient when analyzing the interrelation of the nodes in the local structure of the complex network.

Description

Link prediction method and device based on node similarity
Technical Field
The present invention relates to the field of link prediction, and in particular, to a link prediction method and apparatus based on node similarity.
Background
The mode and evolution analysis of the complex network have important research and application values, and the calculation of the node similarity in the network is the basis for solving many practical problems such as link prediction, attribute inference and the like. The method for calculating the node similarity in the complex network can support applications such as biological network analysis, social network friend recommendation, personalized content recommendation, information network attribute inference, label classification and the like.
In the research of link prediction, an undirected and unweighted network is the most basic depicting form, and a similarity-based method is the simple and effective method which is proposed and proved at the earliest. On one hand, the current network data is much richer than the past in description, and taking a social network as an example, a user represented by a node has more attribute information including personalized information such as interests, preferences, regions, behaviors and the like; the use edges can represent static relations such as friends or concerns among users, and can also represent dynamic relations such as interaction, space-time association and the like. Abundant network attribute information and topological physical meaning provide a wide research object for similarity definition; on the other hand, the scale of network data is rapidly increased, the requirement on algorithm performance in practical application is more severe, although some complex modeling methods and probability models such as likelihood analysis are superior to similarity methods in accuracy indexes, the research value is limited to understanding of network structure characteristics, and the limitation of processing node scale and overhigh time complexity determine that the applicability is not strong.
Existing similarity-based algorithms mainly include similarity indicators based on local information, path-based and random walks, where the similarity indicator based on local information common neighbors is the most common method. The similarity of two nodes in the common neighbor index is characterized by the number of the common neighbor nodes, and the index is proved to be effective in an empirical network; meanwhile, some improved methods consider operating a set of common neighbors of two nodes, such as raising a SALTON index by utilizing cosine similarity. Other optimization methods consider the respective degrees of the two nodes and provide indexes which are beneficial to the large-degree node or the small-degree node; and (3) taking the degree of the common neighbor node into consideration, providing an Adamic-Adar index: the contribution of the common neighbor node with small degree is larger than that of the common neighbor node with large degree, and the reciprocal of the logarithm of the degree of the node is decreased; the network resource allocation index is proposed by the inspiration of the network resource allocation process, is similar to the Adamic-Adar index in nature, and is decreased by the reciprocal of the degree of the node; and (4) taking the mutual relation among the common neighbor nodes into consideration, and providing a node gravity index.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the conventional method for calculating the node similarity is high in time complexity and low in accuracy, and provides a link prediction method and device based on the node similarity.
The technical scheme for solving the technical problems is as follows: a link prediction method based on node similarity comprises the following steps:
step 1: carrying out node and link representation on a network to be analyzed;
step 2: acquiring two nodes which are not directly connected by a link in a network;
and step 3: reading respective neighbor node sets of the two nodes without direct link connection;
and 4, step 4: combining and taking intersection of neighbor node sets of the two nodes which are not connected by the direct link to obtain a common neighbor set;
and 5: taking the common neighbor set as a subnet, and calculating the global cluster coefficient of the subnet and the cluster coefficient of any common neighbor node in the subnet;
step 6: calculating the similarity of the two nodes without direct link connection according to the global cluster coefficient of the subnet and the cluster coefficient of any common neighbor node in the subnet;
and 7: and performing link prediction according to the calculated node similarity.
The invention has the beneficial effects that: the invention defines a new index of node similarity calculation based on the local cluster coefficient from the perspective of the cluster coefficient when analyzing the interrelation of the nodes in the local structure of the complex network, and provides a node similarity calculation mode from a new perspective.
On the basis of the technical scheme, the invention can be further improved as follows.
Further, the step 5 specifically includes:
step 5.1: local cluster coefficient c for any one common neighbor node z in the sub-networkzRandomly taking two common neighborsWhen the degree of z is more than 1, the local cluster coefficient c iszIs composed of
Figure BDA0001190394310000031
A is the number of links between the three nodes z, v and w, B is the number of links between z and v, between z and w and between v and w; when the degree of z is 1 or less, czIs 0;
step 5.2: global cluster coefficient C for common neighbor set according to formula
Figure BDA0001190394310000041
And calculating, wherein t is the number of all nodes in the common neighbor set subnet connected into a triangle, and s is the number of all nodes in the common neighbor set subnet connected into a V shape.
Further, the step 6 specifically includes:
representing the two nodes without direct link connection by x and y, and the similarity is according to a formula
Figure BDA0001190394310000042
A calculation is performed where C is the global cluster coefficient of the common neighbor set, Czis the local cluster coefficient of the common neighbor node z, and Γ (x) ∩ Γ (y) is the common neighbor set of nodes x and y.
Further, the step 7 specifically includes:
step 7.1: repeating the steps 2-6 to obtain the similarity of any two nodes which are not directly connected by the link in the complex network;
step 7.2: and sorting the node similarity from large to small.
Further, the network is an undirected and unauthorized complex network.
Corresponding to the above method, the technical solution of the present invention further includes a link prediction apparatus based on node similarity, including: the device comprises a representation module, a calculation module, a traversal module and a prediction module;
the representation module is used for representing nodes and links of the complex network to be analyzed;
the computing module is used for acquiring two nodes which are not connected by a direct link in the complex network to obtain respective neighbor node sets of the two nodes, acquiring an intersection of the neighbor node sets of the two nodes to obtain a common neighbor set, taking the common neighbor set as a subnet, computing a subnet global cluster coefficient and any one common neighbor node cluster coefficient of the subnet, and computing node similarity according to the global cluster coefficient and the any one common neighbor node cluster coefficient;
the traversal module is used for traversing all nodes of the complex network according to the calculation module to obtain the similarity of any two nodes which are not directly connected by the link in the complex network;
and the prediction module is used for predicting the link according to the similarity obtained by the traversal module and the calculation module.
The invention has the beneficial effects that: the invention defines a new index of node similarity calculation based on the local cluster coefficient from the perspective of the cluster coefficient when analyzing the interrelation of the nodes in the local structure of the complex network, and provides a node similarity calculation mode from a new perspective.
Further, the calculating the node similarity includes:
representing the two nodes without direct link connection by x and y, and the similarity is according to a formula
Figure BDA0001190394310000051
A calculation is performed where C is the global cluster coefficient of the common neighbor set, Czis a local cluster coefficient of a common neighbor node z, and Γ (x) ∩ Γ (y) is a common neighbor set of nodes x and y;
local cluster coefficient c for any one common neighbor node z in the sub-networkzRandomly taking two common neighbor nodes v, w different from z in two common neighbor sets, and when the degree of z is more than zLocal cluster coefficient of 1 hour czIs composed of
Figure BDA0001190394310000052
A is the number of links between the three nodes z, v and w, B is the number of links between z and v, between z and w and between v and w; when the degree of z is 1 or less, czIs 0;
for the global cluster coefficient C of the common neighbor set, according to a formula
Figure BDA0001190394310000053
And calculating, wherein t is the number of all nodes in the common neighbor set subnet connected into a triangle, and s is the number of all nodes in the common neighbor set subnet connected into a V shape.
Further, the prediction module sorts the node similarity from large to small according to size.
Further, the network is an undirected and unauthorized complex network.
Drawings
Fig. 1 is a flowchart illustrating a method and an apparatus for link prediction based on similarity according to the present invention.
Detailed Description
The principles and features of this invention are described below in conjunction with the following drawings, which are set forth by way of illustration only and are not intended to limit the scope of the invention.
As shown in fig. 1, a link prediction method based on node similarity according to an embodiment of the present invention includes the following steps:
step 1: carrying out node and link representation on a network to be analyzed;
step 2: acquiring two nodes which are not directly connected by a link in a network;
and step 3: reading respective neighbor node sets of the two nodes without direct link connection;
and 4, step 4: combining and taking intersection of neighbor node sets of the two nodes which are not connected by the direct link to obtain a common neighbor set;
and 5: taking the common neighbor set as a subnet, and calculating the global cluster coefficient of the subnet and the cluster coefficient of any common neighbor node in the subnet;
step 6: calculating the similarity of the two nodes without direct link connection according to the global cluster coefficient of the subnet and the cluster coefficient of any common neighbor node in the subnet;
and 7: and performing link prediction according to the calculated node similarity.
Wherein, the step 5 specifically comprises:
step 5.1: local cluster coefficient c for any one common neighbor node z in the sub-networkzRandomly taking two common neighbor nodes v, w different from z in two common neighbor sets, and when the degree of z is more than 1, obtaining a local cluster coefficient czIs composed of
Figure BDA0001190394310000071
A is the number of links between the three nodes z, v and w, B is the number of links between z and v, between z and w and between v and w; when the degree of z is 1 or less, czIs 0.
Step 5.2: global cluster coefficient C for common neighbor set according to formula
Figure BDA0001190394310000072
And calculating, wherein t is the number of all nodes in the common neighbor set subnet connected into a triangle, and s is the number of all nodes in the common neighbor set subnet connected into a V shape.
Wherein, the step 6 specifically comprises:
representing the two nodes without direct link connection by x and y, and the similarity is according to a formula
Figure BDA0001190394310000073
A calculation is performed where C is the global cluster coefficient of the common neighbor set, Czis the local cluster coefficient of the common neighbor node z, and Γ (x) ∩ Γ (y) is the common neighbor set of nodes x and y.
Wherein, the step 7 specifically comprises:
step 7.1: repeating the steps 2-6 to obtain the similarity of any two nodes which are not directly connected by the link in the complex network;
step 7.2: and sorting the node similarity from large to small.
Wherein the network is an undirected and unauthorized complex network.
The embodiment of the invention also provides a link prediction device based on the node similarity, which comprises the following steps: the device comprises a representation module, a calculation module, a traversal module and a prediction module;
the representation module is used for representing nodes and links of the complex network to be analyzed;
the computing module is used for acquiring two nodes which are not connected by a direct link in the complex network to obtain respective neighbor node sets of the two nodes, acquiring an intersection of the neighbor node sets of the two nodes to obtain a common neighbor set, taking the common neighbor set as a subnet, computing a subnet global cluster coefficient and any one common neighbor node cluster coefficient of the subnet, and computing node similarity according to the global cluster coefficient and the any one common neighbor node cluster coefficient;
the traversal module is used for traversing all nodes of the complex network according to the calculation module to obtain the similarity of any two nodes which are not directly connected by the link in the complex network;
and the prediction module is used for predicting the link according to the similarity obtained by the traversal module and the calculation module.
Wherein the calculating the node similarity comprises:
representing the two nodes without direct link connection by x and y, and the similarity is according to a formula
Figure BDA0001190394310000081
A calculation is performed where C is the global cluster coefficient of the common neighbor set, Czis a local cluster coefficient of a common neighbor node z, and Γ (x) ∩ Γ (y) is a common neighbor set of nodes x and y;
for any one of the subnetworks in commonLocal cluster coefficient c of same neighbor node zzRandomly taking two common neighbor nodes v, w different from z in two common neighbor sets, and when the degree of z is more than 1, obtaining a local cluster coefficient czIs composed of
Figure BDA0001190394310000091
A is the number of links between the three nodes z, v and w, B is the number of links between z and v, between z and w and between v and w; when the degree of z is 1 or less, czIs 0;
for the global cluster coefficient C of the common neighbor set, according to a formula
Figure BDA0001190394310000092
And calculating, wherein t is the number of all nodes in the common neighbor set subnet connected into a triangle, and s is the number of all nodes in the common neighbor set subnet connected into a V shape.
And the prediction module sorts the node similarity from large to small according to the size.
Wherein the network is an undirected and unauthorized complex network.
TABLE 1 similarity index similar to the present invention
Figure BDA0001190394310000093
As shown in Table 1, x and y are two nodes with similarity to be calculated, Γ (x) is a neighbor node set of the node x, Γ (y) is a neighbor node set of the node y, and the local structure is a common neighbor set of the two nodes, namely | Γ (x) ∩ Γ (y) |, kxDegree, k, of node xyDegree of node y, kzDegree of common neighbor node, ezIs the number of links in the common neighbor.
Except for the node gravity index, the information used in table 1 is only the number of the common neighbor nodes and their respective degrees. The common neighbor index and the Salton index are considered equivalent for each pair of nodes for all nodes in the common neighbor set. The common neighbor node is also the networkThe general individuals in (1) also have unique characteristics, which are reflected in the network differences. Based on the method, the Adamic-Adar index and the network resource allocation index model the difference as the node degree, namely the link number of the node in the whole network range, according to observation and demonstration, the influence of the degree of the common neighbor node on the node similarity is in an inverse relation, and the corresponding index is designed to be in an inverse or exponential inverse mode. The node gravity index further takes the degree of the common neighbor node into consideration and adds a positive correlation factor ezthat is, the number of links between the node z and other common neighbors and with the nodes x and y is equivalent to the degree in a local set Γ (x) ∩ Γ (y) ∪ { x, y }, and the degree of similarity is essentially affected more by nodes with large local degrees.
And (D) representing the complex network to be analyzed by using an undirected weightless graph, wherein the G is (N, E), the N is a set of nodes, and the E is a set of links. Whether two nodes generate links or not is in direct proportion to the similarity degree of the two nodes, namely the greater the similarity of the two nodes is, the greater the possibility of generating the links is, the similarity value (forming a similarity matrix S) between every two nodes in the network is calculated in a given network data set G according to the similarity index, and the probability sequence of generating the links between the two nodes can be given according to the sequence of the similarity value. In the embodiment, when the similarity between the nodes is measured, other node attributes caused by network particularity, such as personalized interests and behavior attributes of the nodes representing the user in the social network, are not considered from the analysis of the topological structure of the network.
Representing the two nodes without direct link connection by x and y, and the similarity is according to a formula
Figure BDA0001190394310000111
A calculation is performed where C is the global cluster coefficient of the common neighbor set, Czis the local cluster coefficient of a common neighbor node z, and is the common neighbor set of nodes x and yzRandomly taking two common neighbor nodes v, w different from z in two common neighbor sets, and when the degree of z is more than 1Local cluster coefficient czIs composed of
Figure BDA0001190394310000112
A is the number of links between the three nodes z, v and w, B is the number of links between z and v, between z and w and between v and w; when the degree of z is 1 or less, czIs 0.
Figure BDA0001190394310000113
Wherein t is the number of triangles formed by connecting all nodes in the common neighbor set subnet, s is the number of V-shaped triangles formed by connecting all nodes in the common neighbor set subnet, and the similarity between the nodes x and y can be calculated according to the three formulas. The advantages of the new index from the aspect of cluster coefficient and other indexes in the aspect of node similarity calculation are compared through actual calculation.
6 data sets such as a biological network, a social network and an information network are selected and compared with 5 other similarity calculation methods, and the selected data sets comprise: the American aviation network (UA) is composed of 332 airports and 2126 air routes of the American air transportation system; a U.S. political blog network (PB) consisting of hyperlinks between 1222 and 19021 blogs; the green wave microblog network (WB) is composed of 8735 attention relations among 500 green wave microblog VIP users and is acquired by utilizing API provided by the green wave microblog; a nematode neural network (CE) consisting of 297 neurons and 2345 nerves of the nematode; a Wikipedia network (WK) consisting of 7066 users and 103663 voting information of Wikipedia; the electronic mail network (EM) is composed of 1133 electronic mailbox accounts of certain university of Spain and 5451 mail exchanges among the electronic mailbox accounts. Meanwhile, common local similarity indexes are selected for comparison, a common neighbor index (CN) and an improvement method thereof are provided, and a Salton index (SA) with cosine similarity is utilized; Adamic-Adar index (AA) biased towards a less common neighbor node; an indicator (RA) based on network resource allocation; and a newly proposed node gravity Index (IA). It is noted that the framework of the link prediction steps is the same, and only the node similarity calculation methods are different.
TABLE 2 predicted AUC evaluation results of different similarity calculation methods
AUC CN SA AA RA IA The invention
UA 0.9509 0.9304 0.9619 0.9665 0.9662 0.9723
PB 0.9206 0.8728 0.9228 0.9238 0.9251 0.9528
CE 0.8581 0.7993 0.8768 0.8787 0.8794 0.8909
WB 0.8122 0.8005 0.8111 0.8089 0.8100 0.8766
WK 0.8637 0.7935 0.8641 0.8585 0.8660 0.8788
EM 0.8498 0.8510 0.8548 0.8539 0.8556 0.8598
The AUC evaluation method for the link prediction effect is as follows: AUC ═ N0+0.5Ne) and/N. Dividing the network into a training set and a testing set (the proportion is 9:1), and randomly dividing the network into the testing set and a nonexistent edge set each time after calculating the similarity score between every two nodes in the network through the training setRespectively selecting one edge for comparison, and if the similarity degree value of the edge in the test set is greater than that of the edge without the edge, adding 1 (N in total)0Second); if the two similarity scores are equal, add 0.5 points (N in total)eNext) of the first and second processes,
the above are independently compared n times. If all scores were randomly generated, then the resulting AUC was 0.5, and the degree to which the AUC value for each algorithm was greater than 0.5 measures how accurate the algorithm was compared to random selection. According to the results in table 2, the node similarity method provided by the invention has a better effect than a comparison algorithm in the link prediction problem, and the link prediction is performed according to the node similarity calculated by the invention, so that the greater the node similarity is, the greater the possibility that two nodes generate link connection is.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (8)

1. A link prediction method based on node similarity is characterized by comprising the following steps:
step 1: carrying out node and link representation on a network to be analyzed;
step 2: acquiring two nodes which are not directly connected by a link in a network;
and step 3: reading respective neighbor node sets of the two nodes without direct link connection;
and 4, step 4: combining and taking intersection of neighbor node sets of the two nodes which are not connected by the direct link to obtain a common neighbor set;
and 5: taking the common neighbor set as a subnet, and calculating the global cluster coefficient of the subnet and the cluster coefficient of any common neighbor node in the subnet;
step 6: calculating the node similarity of the two nodes without direct link connection according to the global cluster coefficient of the subnet and the cluster coefficient of any one common neighbor node in the subnet, specifically, representing the two nodes without direct link connection by x and yConnected nodes, the similarity of which is according to the formula
Figure FDA0002410506580000011
A calculation is performed where C is the global cluster coefficient of the common neighbor set, Czis a local cluster coefficient of any one common neighbor node z, and is a common neighbor set of nodes x and y;
and 7: and performing link prediction according to the calculated node similarity.
2. The method for link prediction based on node similarity as claimed in claim 1, wherein the step 5 comprises:
step 5.1: local cluster coefficient c for any one common neighbor node z in the sub-networkzRandomly taking two common neighbor nodes v, w different from z in the common neighbor set, and when the degree of z is more than 1, obtaining a local cluster coefficient czIs composed of
Figure FDA0002410506580000012
Wherein A is the number of links between the three nodes z, v and w, B is the number of links between z and v, between z and w and between v and w, and when the degree of z is less than or equal to 1, c iszIs 0;
step 5.2: global cluster coefficient C for common neighbor set according to formula
Figure FDA0002410506580000021
And calculating, wherein t is the number of all nodes in the common neighbor set subnet connected into a triangle, and s is the number of all nodes in the common neighbor set subnet connected into a V shape.
3. The method for link prediction based on node similarity according to any of claims 1-2, wherein the step 7 comprises:
step 7.1: repeating the steps 2-6 to obtain the similarity of any two nodes which are not directly connected by the link in the complex network;
step 7.2: and sorting the node similarity from large to small.
4. The method according to claim 1, wherein the network is a non-directional and non-weighted complex network.
5. A link prediction apparatus based on node similarity, comprising: the device comprises a representation module, a calculation module, a traversal module and a prediction module;
the representation module is used for representing nodes and links of a network to be analyzed;
the computing module is used for acquiring two nodes which are not connected by a direct link in the network to obtain respective neighbor node sets of the two nodes, acquiring an intersection of the neighbor node sets of the two nodes to obtain a common neighbor set, using the common neighbor set as a subnet, computing a subnet global cluster coefficient and any one common neighbor node cluster coefficient of the subnet, and computing node similarity according to the global cluster coefficient and the any one common neighbor node cluster coefficient, specifically, representing the two nodes which are not connected by the direct link by x and y, wherein the similarity is computed according to a formula
Figure FDA0002410506580000022
A calculation is performed where C is the global cluster coefficient of the common neighbor set, Czis a local cluster coefficient of any one common neighbor node z, and is a common neighbor set of nodes x and y;
the traversal module is used for traversing all nodes of the network according to the calculation module to obtain the similarity of any two nodes which are not directly connected by the link in the network;
and the prediction module is used for predicting the link according to the similarity obtained by the traversal module and the calculation module.
6. The link predictor based on node similarity as claimed in claim 5Measuring device, characterized in that the local cluster coefficient c for any one common neighbor node z in said sub-networkzRandomly taking two common neighbor nodes v, w different from z in the common neighbor set, and when the degree of z is more than 1, obtaining a local cluster coefficient czIs composed of
Figure FDA0002410506580000031
A is the number of links between the three nodes z, v and w, B is the number of links between z and v, between z and w and between v and w; when the degree of z is 1 or less, czIs 0;
for the global cluster coefficient C of the common neighbor set, according to a formula
Figure FDA0002410506580000032
And calculating, wherein t is the number of all nodes in the common neighbor set subnet connected into a triangle, and s is the number of all nodes in the common neighbor set subnet connected into a V shape.
7. The apparatus of claim 5, wherein the prediction module ranks the node similarities according to their sizes.
8. The apparatus according to claim 5, wherein the network is a non-directional and non-weighted complex network.
CN201611207950.7A 2016-12-23 2016-12-23 Link prediction method and device based on node similarity Active CN106817251B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611207950.7A CN106817251B (en) 2016-12-23 2016-12-23 Link prediction method and device based on node similarity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611207950.7A CN106817251B (en) 2016-12-23 2016-12-23 Link prediction method and device based on node similarity

Publications (2)

Publication Number Publication Date
CN106817251A CN106817251A (en) 2017-06-09
CN106817251B true CN106817251B (en) 2020-05-19

Family

ID=59109984

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611207950.7A Active CN106817251B (en) 2016-12-23 2016-12-23 Link prediction method and device based on node similarity

Country Status (1)

Country Link
CN (1) CN106817251B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019014894A1 (en) * 2017-07-20 2019-01-24 深圳大学 Link prediction method and device
CN107623586B (en) * 2017-07-20 2021-06-04 深圳大学 Network link prediction method and device
CN107920133A (en) * 2017-12-12 2018-04-17 浙江工业大学 A kind of recommendation method based on ternary closure information in localized network
CN108063810A (en) * 2017-12-12 2018-05-22 浙江工业大学 A kind of recommendation method based on the filtering of network part structural information
CN109194703B (en) * 2018-06-29 2021-06-18 平安科技(深圳)有限公司 Processing method of communication load between cloud platform hosts, electronic device and medium
CN109726297B (en) * 2018-12-28 2022-12-23 沈阳航空航天大学 Bipartite network node prediction algorithm based on mutual exclusion strategy
CN110290165B (en) * 2019-04-04 2022-01-28 平安科技(深圳)有限公司 Method for regulating and controlling communication load between network hosts, electronic device and readable storage medium
CN110113228B (en) * 2019-04-25 2020-12-18 新华三信息安全技术有限公司 Network connection detection method and device
CN111159768B (en) * 2019-12-31 2022-04-08 广西师范大学 Evaluation method for link privacy protection effect of social network
CN111461440B (en) * 2020-04-02 2022-05-31 河北工程大学 Link prediction method, system and terminal equipment
CN111522891A (en) * 2020-07-03 2020-08-11 支付宝(杭州)信息技术有限公司 Method and device for determining relationship closeness between two entities and electronic equipment
CN112765491B (en) * 2021-04-07 2021-06-22 中国人民解放军国防科技大学 Link prediction method and device considering node local area link compactness

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104376116A (en) * 2014-12-01 2015-02-25 国家电网公司 Search method and device for figure information
CN104765825A (en) * 2015-04-10 2015-07-08 清华大学 Method and device for predicting social network links based on cooperative fusion theory

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9082082B2 (en) * 2011-12-06 2015-07-14 The Trustees Of Columbia University In The City Of New York Network information methods devices and systems

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104376116A (en) * 2014-12-01 2015-02-25 国家电网公司 Search method and device for figure information
CN104765825A (en) * 2015-04-10 2015-07-08 清华大学 Method and device for predicting social network links based on cooperative fusion theory

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Behavior Analysis of Internet Traffic via Bipartite Graphs and One-Mode Projections;Kuai Xu等;《IEEE/ACM TRANSACTIONS ON NETWORKING》;20140630;全文 *
Link prediction with node clustering coefficient;Zhihao Wu等;《Physica A》;20160229;全文 *

Also Published As

Publication number Publication date
CN106817251A (en) 2017-06-09

Similar Documents

Publication Publication Date Title
CN106817251B (en) Link prediction method and device based on node similarity
US11659050B2 (en) Discovering signature of electronic social networks
Gregory Fuzzy overlapping communities in networks
CN105608179B (en) The method and apparatus for determining the relevance of user identifier
Gustafsson et al. Comparison and validation of community structures in complex networks
CN104199818B (en) Method is recommended in a kind of socialization based on classification
CN103795613A (en) Method for predicting friend relationships in online social network
Li et al. Effective link prediction based on community relationship strength
CN111274485A (en) Personalized recommendation method based on community discovery
Xu et al. Effective community division based on improved spectral clustering
CN112148994B (en) Information push effect evaluation method and device, electronic equipment and storage medium
Huang et al. Information fusion oriented heterogeneous social network for friend recommendation via community detection
CN107767155A (en) A kind of method and system for assessing user's representation data
Kotzias et al. Addressing the Sparsity of Location Information on Twitter.
CN107909498B (en) Recommendation method based on area below maximized receiver operation characteristic curve
Wang et al. An Edge-based Clustering Algorithm to Detect Social Circles in Ego Networks.
Liu et al. From strangers to neighbors: Link prediction in microblogs using social distance game
Nie et al. Identifying users based on behavioral-modeling across social media sites
CN105786979B (en) Hidden link-based behavior analysis method and system for user to participate in hot topic
CN109934407A (en) A kind of volunteers working intention prediction technique based on Logistic generalized linear regression model
Sharma et al. Comparative analysis of different algorithms in link prediction on social networks
Amin et al. Advanced community identification model for social networks
Liakos et al. On the impact of social cost in opinion dynamics
Yu et al. A new method for link prediction using various features in social networks
Zhang et al. Community detection based on structural balance in signed networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant