CN114978983A

CN114978983A - Influence node identification method and system based on second-order H index and voting mechanism

Info

Publication number: CN114978983A
Application number: CN202210684358.5A
Authority: CN
Inventors: 马志新; 王赟栋; 徐玉生; 刘莉
Original assignee: Gansu Daily Newspaper Group Co ltd; Lanzhou University
Current assignee: Gansu Daily Newspaper Group Co ltd; Lanzhou University
Priority date: 2022-06-17
Filing date: 2022-06-17
Publication date: 2022-08-30
Anticipated expiration: 2042-06-17
Also published as: CN114978983B

Abstract

The application discloses an influence node identification method and system based on a second-order H index and a voting mechanism. Data to be tested are acquired and preprocessed to obtain a preprocessing result; initialization is carried out based on the preprocessing result, and a voting result is obtained through voting; screening is carried out based on the voting result to obtain a screening result; and updating is carried out based on the screening result, a specified number of seed nodes are selected, and node identification is completed. In the application, the initial voting capability of every node is set to the SHIKS value of the node, second-order neighbors are taken into account in the voting stage, and in the updating stage the voting capability of all nodes in the second-order neighborhood is updated according to the shortest path length between each node and the seed node. The selected seed nodes are therefore far away from each other, the propagation effect is better, and the accuracy is higher than that of traditional identification methods.

Description

Influence node identification method and system based on second-order H index and voting mechanism

Technical Field

The application belongs to the field of influence node identification algorithms based on a second-order H index, and particularly relates to an influence node identification method and system based on a second-order H index and a voting mechanism.

Background

The H-index is a hybrid quantitative index originally proposed by Jorge Hirsch to measure both the number of an author's papers and their influence. It is defined as follows: an author has an H-index of h if h of the author's papers have each been cited at least h times, so the index reflects the author's academic ability to some extent.
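As a minimal illustration of this definition, the following Python sketch computes the H-index from a list of citation counts. The function name `h_index` and the sample data are assumptions made for illustration and are not part of the application.

```python
def h_index(citations):
    """Return the largest h such that at least h items have a value >= h."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Hypothetical citation counts for five papers.
print(h_index([10, 8, 5, 4, 3]))
```

Sorting in descending order means the loop can stop at the first paper whose citation count falls below its rank.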

Hirsch et al. then introduced the H-index as a centrality indicator into the influence maximization problem in order to identify influential nodes.

The H index measures the influence of a node on the basis of the neighbor nodes directly connected to it. Used alone, however, it cannot accurately mine a group of influential nodes: many nodes may share the same H index, so the differences in their influence cannot be quantified further. The influence of a node is therefore measured by introducing the second-order H index while also taking the node's own degree into account, and combining these with the node information entropy.

Although the traditional VoteRank algorithm performs remarkably well compared with some classical algorithms when mining influential nodes, it also has certain limitations. The algorithm assumes that every node has the same initial voting capability, set to 1, whereas in many real-life voting scenarios the importance of each participant differs, so the initial voting capabilities should differ as well. In addition, in the voting stage the voting score of each node comes only from its first-order neighbor nodes, although second-order or more distant neighbors also play an important role in mining seed nodes. Furthermore, VoteRank reduces voting capability in the updating stage by a fixed amount, whereas the reduction should be determined by the distance between the node and the seed node: the farther a node is from the seed node, the less its voting capability should be weakened, and the closer it is, the more it should be weakened. Based on this analysis, the application combines the SHIKS algorithm with the VoteRank algorithm and provides a new influence node identification algorithm, SHIKS-VoteRank.

SHIKS-VoteRank is an influence node identification algorithm that combines SHIKS and VoteRank and offers better accuracy and execution efficiency. In the initialization stage, the algorithm calculates the influence of each node with the SHIKS algorithm and uses this value as the initial voting capability of each node in the voting mechanism, namely $va_v = \text{H-index}_2(v)$. In the voting process of VoteRank, the voting score of a node comes only from the nodes directly connected to it; that is, the algorithm does not consider the influence of neighbor nodes two or three hops away. In SHIKS-VoteRank, therefore, the calculation of the voting score takes both the first-order and the second-order neighbor nodes of a node into account. In the updating stage, the voting capability of every node that participated in the voting is weakened by a variable amount that depends on the distance between the node and the seed node: the closer the node is to the seed node, the more it is weakened, and vice versa. To this end, the shortest path length between nodes is introduced on the basis of the original attenuation factor, and a new attenuation factor is defined. Because the voting stage takes second-order neighbor nodes into account, the voting capability of second-order neighbor nodes is also updated in the updating stage.

Disclosure of Invention

The application provides an influence node identification method and system based on a second-order H index and a voting mechanism.

In order to achieve the above purpose, the present application provides the following solutions:

the influence node identification method based on the second-order H index and the voting mechanism comprises the following steps:

acquiring data to be tested;

based on the data to be tested, data preprocessing is carried out to obtain a data preprocessing result, and the preprocessing result comprises: the importance of the node and the SHIKS value, and initialization is carried out;

voting based on the initialized preprocessing result to obtain a voting result;

screening based on the voting result to obtain a screening result;

and updating based on the screening result, selecting a specified number of seed nodes, and finishing node identification.

Preferably, the data preprocessing method includes:

calculating a second-order H index of each node of the data to be tested based on the obtained data to be tested, and obtaining the second-order H index of each node of the data to be tested;

and calculating the importance and the SHIKS value of each node based on the second-order H index of each node of the data to be tested.

Preferably, the initialization method includes:

and initializing the voting score and the voting capacity, wherein the voting score is 0, and the voting capacity is the SHIKS value of the node.

Preferably, the voting method includes:

the first-order neighbors and the second-order neighbors both participate in voting, i.e., the voting score is

$$S_u = \sum_{v \in \Gamma_u} va_v$$

where $\Gamma_u$ is the set of first-order and second-order neighbor nodes of node u, the subscript v denotes a node, va denotes voting capability, and $va_v$ denotes the voting capability of node v.

Preferably, the screening method comprises: after the voting is finished, the node with the highest voting score is added to the seed node set, and the voting capability and voting score of the node are set to 0, so that the node no longer participates in subsequent voting and cannot be elected a second time.

Preferably, the updating method includes: the voting capacity of the first-order and second-order neighbor nodes of the seed node needs to be attenuated, the attenuation is determined by the length of the shortest path between the node and the seed node, namely an attenuation factor

where $\langle k \rangle$ represents the average degree of the network, and d represents the shortest path length between the node and the seed node.

To better implement the above technical content, the application also provides an influence node identification system based on a second-order H index and a voting mechanism, comprising: a data acquisition module, a data preprocessing module, a voting mechanism module, a voting screening module and a node identification module;

the data acquisition module is used for acquiring data to be tested;

the data preprocessing module is used for preprocessing data based on the data to be tested and obtaining a data preprocessing result, wherein the preprocessing result comprises: the importance of the node and the SHIKS value, and initialization is carried out;

the voting mechanism module is used for voting based on the initialized preprocessing result to acquire a voting result;

the voting screening module is used for screening based on the voting result to obtain a screening result;

and the node identification module is used for updating based on the screening result, selecting a specified number of seed nodes and completing node identification.

Preferably, the initialization method in the data initialization module includes:

Preferably, the screening method in the voting screening module comprises: after the voting is finished, the node with the highest voting score is added to the seed node set, and the voting capability of the node is set to 0 so that it no longer participates in subsequent voting.

where $\langle k \rangle$ represents the average degree of the network and d represents the shortest path length between the node and the seed node.

The beneficial effects of the application are as follows: the application discloses an influence node identification method and system based on a second-order H index and a voting mechanism, in which the initial voting capability of every node is set to the SHIKS value of the node, second-order neighbors are taken into account in the voting stage, and in the updating stage the voting capability of all nodes in the second-order neighborhood is updated according to the shortest path length between each node and the seed node. The selected seed nodes are therefore far away from each other, the propagation effect is better, and the method and system are more accurate than traditional identification methods.

Drawings

In order to more clearly illustrate the technical solution of the present application, the drawings needed to be used in the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present application, and it is obvious for a person skilled in the art to obtain other drawings without any inventive exercise.

FIG. 1 is a schematic flow chart of a method according to an embodiment of the present application;

FIG. 2 is a diagram illustrating a specific example of a VoteRank algorithm flow according to an embodiment of the present application;

FIG. 3 is a graph of the infection scale $F(t)$ as a function of time $t$ in an embodiment of the present application;

FIG. 4 is a line graph of the final infection scale $F(t_c)$ as a function of the seed node proportion $\rho$ in an embodiment of the present application;

FIG. 5 is a line graph of the average shortest distance $L_s$ between seed nodes as a function of the seed node proportion $\rho$ in an embodiment of the present application;

FIG. 6 is a schematic flow chart of the VoteRank algorithm according to the embodiment of the present application;

FIG. 7 is a schematic diagram of a system according to an embodiment of the present application.

Detailed Description

The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present application, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.

In order to make the aforementioned objects, features and advantages of the present application more comprehensible, the present application is described in further detail with reference to the accompanying drawings and the detailed description.

As shown in fig. 1, the method for identifying an influence node based on a second-order H index and a voting mechanism specifically includes the following steps:

acquiring data to be tested;

based on the data to be tested, data preprocessing is carried out to obtain a data preprocessing result, and the result comprises: the importance of the node and the SHIKS value, and initialization is carried out;

voting based on the initialized result;

screening based on the voted result to obtain a screening result;

The data preprocessing method comprises the following steps:

The initialization method comprises the following steps:

The voting method comprises the following steps:

The screening method comprises: after the voting is finished, the node with the highest voting score is added to the seed node set, and the voting capability and voting score of the node are set to 0, so that the node no longer participates in subsequent voting and cannot be elected a second time.

The updating method comprises the following steps: the voting capacity of the first-order and second-order neighbor nodes of the seed node needs to be attenuated, the attenuation is determined by the length of the shortest path between the node and the seed node, namely an attenuation factor

where $\langle k \rangle$ represents the average degree of the network and d represents the shortest path length between the node and the seed node; the specified number of seed nodes is selected through updating, completing node identification.

In this embodiment, the specific operation steps are as follows:

Step one: the degrees of the first-order and second-order neighbor nodes of each node are calculated, and the second-order H index of each node is then obtained according to the concept of the second-order H index.

The concept of degree is as follows:

Degree is the simplest and most direct index for describing the attributes of a node in a network; the degree of a node is the number of first-order neighbor nodes it has. The degree of node i, $degree_i$, is defined as:

The second-order H index is defined as follows:

if the second-order H index of a node is k, then among the first-order and second-order neighbor nodes of the node there are at least k nodes whose degree is no less than k.
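The definition above can be sketched in Python as follows. This is an illustrative sketch, not the applicant's implementation: it assumes an undirected network stored as a dictionary mapping each node to the set of its neighbors, and it reads the definition as "the largest k such that at least k nodes within two hops have degree at least k". The function names and the small example network are hypothetical.

```python
def two_hop_neighbors(adj, node):
    """Return the first- and second-order neighbors of `node` (node excluded)."""
    first = set(adj[node])
    second = set()
    for u in first:
        second.update(adj[u])
    return (first | second) - {node}


def second_order_h_index(adj, node):
    """Largest k such that at least k nodes within two hops have degree >= k."""
    degrees = sorted((len(adj[u]) for u in two_hop_neighbors(adj, node)),
                     reverse=True)
    k = 0
    for rank, degree in enumerate(degrees, start=1):
        if degree >= rank:
            k = rank
        else:
            break
    return k


# Hypothetical 5-node network given as an adjacency dictionary.
example = {
    1: {2, 3},
    2: {1, 3, 4},
    3: {1, 2, 4},
    4: {2, 3, 5},
    5: {4},
}
print(second_order_h_index(example, 1))
```

In this example, node 1 reaches nodes 2, 3 and 4 within two hops, each of degree 3, so its second-order H index under this reading is 3.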

Step two: the importance of each node is calculated according to formula 2, and the SHIKS value of each node is then calculated according to formula 3.

The importance calculation method of the node v is shown as formula 2:

where $\text{H-index}_2(v)$ represents the second-order H index of node v, $degree_v$ represents the degree of node v, the fraction term is the ratio of the degree of node v to the sum of the degrees of all nodes in the network, and N represents the total number of nodes in the network.

The node information entropy calculation mode of the node v is shown as formula 3:

where the neighbor-set term denotes the set of k-th order neighbor nodes of node v, $Im_p$ denotes the importance of node p, and $Im_q$ denotes the importance of node q.

Step three: initialization stage. The voting score and voting capability of each node are initialized: the voting score is 0 and the voting capability is the SHIKS value of the node, i.e., $(S_u, va_u) = (0, \text{SHIKS}(u))$.

Step four: voting stage. The first-order neighbors and second-order neighbors (if any) of each node participate in the vote, i.e., the voting score is

$$S_u = \sum_{v \in \Gamma_u} va_v$$

where $\Gamma_u$ is the set of first-order and second-order neighbor nodes of node u. Notably, if a node has been selected as a seed node in a previous round, its voting score is set to 0 so that it is not selected twice.
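The voting stage can be sketched in Python as follows. This is a minimal sketch under stated assumptions: the network is an adjacency dictionary, the voting capabilities `va` are supplied by the caller (the values below are hypothetical placeholders, not SHIKS values from the application), and the function names are illustrative.

```python
def two_hop_neighbors(adj, node):
    """First- and second-order neighbors of `node`, excluding the node itself."""
    first = set(adj[node])
    second = set()
    for u in first:
        second.update(adj[u])
    return (first | second) - {node}


def voting_scores(adj, va, seeds=()):
    """Voting score of each node: sum of voting capability within two hops."""
    scores = {}
    for u in adj:
        if u in seeds:
            scores[u] = 0.0  # already-selected seeds cannot be elected again
        else:
            scores[u] = sum(va[v] for v in two_hop_neighbors(adj, u))
    return scores


# Hypothetical path network a-b-c-d with placeholder voting capabilities.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"}}
va = {"a": 1.0, "b": 2.0, "c": 3.0, "d": 4.0}
print(voting_scores(path, va))
```

Node b, for instance, receives votes from a, c and d, so its score is 1.0 + 3.0 + 4.0 = 8.0.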

Step five: screening stage. After each round of voting, the voting scores of all nodes are counted, and the node with the highest voting score is selected and added to the seed node set. At the same time, the voting capability of the node is set to 0 so that it does not participate in subsequent voting.

Step six: updating stage. The voting capability of the first-order and second-order neighbor nodes of the seed node is attenuated, and the attenuation is determined by the length of the shortest path between the node and the seed node, i.e., by the attenuation factor

where $\langle k \rangle$ represents the average degree of the network and d represents the shortest path length between the node and the seed node.
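The updating stage can be sketched in Python as follows. The exact expression for the attenuation factor is not reproduced in this text, so the sketch assumes the form $f = 1/(\langle k \rangle \cdot d)$, which only satisfies the stated property that closer nodes are weakened more; it should not be read as the formula of the application. Clamping the voting capability at 0 is likewise an assumption carried over from VoteRank, and all function names are hypothetical.

```python
from collections import deque


def distances_within(adj, source, max_hops=2):
    """Shortest path length from `source` to every node within `max_hops`."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if dist[u] == max_hops:
            continue  # do not expand beyond the second-order neighborhood
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    del dist[source]
    return dist


def assumed_attenuation_factor(avg_degree, d):
    # ASSUMPTION: f = 1 / (<k> * d); the source formula is not available here.
    return 1.0 / (avg_degree * d)


def attenuate(adj, va, seed, factor=assumed_attenuation_factor):
    """Set the seed's capability to 0 and weaken its first/second-order neighbors."""
    avg_degree = sum(len(nbrs) for nbrs in adj.values()) / len(adj)
    va[seed] = 0.0
    for v, d in distances_within(adj, seed).items():
        va[v] = max(0.0, va[v] - factor(avg_degree, d))  # clamp at 0 (assumed)
    return va


# Hypothetical path network a-b-c-d, every node starting at capability 2.0.
path = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c"}}
print(attenuate(path, {node: 2.0 for node in path}, seed="a"))
```

Passing the factor as a parameter keeps the sketch usable with a different expression for f.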

Step seven: steps four to six are repeated until the specified number of seed nodes has been selected.
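Steps three to seven can be combined into one loop, sketched below in Python. The sketch makes several assumptions: the SHIKS values are passed in precomputed, because the importance and entropy formulas are not reproduced in this text; the attenuation factor is assumed to be $1/(\langle k \rangle \cdot d)$; voting capability is clamped at 0; and ties are broken by iteration order. The function names and the example values are hypothetical.

```python
from collections import deque


def hop_distances(adj, source, max_hops=2):
    """Shortest path length from `source` to every node within `max_hops`."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        if dist[u] == max_hops:
            continue
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    del dist[source]
    return dist


def shiks_voterank(adj, shiks, num_seeds):
    """Select seeds; `shiks` maps each node to a precomputed SHIKS value."""
    va = dict(shiks)  # step three: initial voting capability = SHIKS value
    avg_degree = sum(len(nbrs) for nbrs in adj.values()) / len(adj)
    reach = {u: hop_distances(adj, u) for u in adj}
    seeds = []
    while len(seeds) < min(num_seeds, len(adj)):
        # Step four: votes come from first- and second-order neighbors.
        scores = {u: sum(va[v] for v in reach[u])
                  for u in adj if u not in seeds}
        # Step five: elect the top-scoring node and silence it.
        best = max(scores, key=scores.get)
        seeds.append(best)
        va[best] = 0.0
        # Step six: weaken neighbors; ASSUMED factor f = 1 / (<k> * d).
        for v, d in reach[best].items():
            va[v] = max(0.0, va[v] - 1.0 / (avg_degree * d))
    return seeds


# Hypothetical path network 0-1-2-3-4-5 with placeholder SHIKS values.
path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4}}
placeholder_shiks = {0: 1.0, 1: 2.0, 2: 3.0, 3: 4.0, 4: 5.0, 5: 6.0}
print(shiks_voterank(path, placeholder_shiks, 2))
```

Precomputing the two-hop neighborhoods once avoids repeating the breadth-first search in every round.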

To more intuitively illustrate the concept of the SHIKS-VoteRank algorithm, the network shown in FIG. 2 is used to illustrate the process of selecting a group of influential nodes, here taking the selection of two nodes as an example.

FIG. 2 shows an example network with 7 nodes, denoted $G = (V, E)$, where V is the set of all nodes in the network and E is the set of edges between nodes. $\text{H-index}_2(v)$ denotes the second-order H index of node v, $\text{SHIKS}(v)$ denotes the SHIKS value of node v, $(S_v, va_v)$ denotes the voting score and voting capability of node v, f denotes the attenuation factor, $\langle k \rangle$ denotes the average degree of the network, and d denotes the length of the shortest path between node v and the seed node.

The initial voting capability of each node, i.e., its SHIKS value, must be calculated first. Taking node A as an example, according to the definition of the second-order H index, the second-order H index of node A is calculated to be 3, i.e., $\text{H-index}_2(A) = 3$. The SHIKS value of node A is then obtained according to formula 2 and formula 3.

Likewise, the SHIKS values of all other nodes can be calculated with the above formulas. Table 1 shows the SHIKS value of each node.

TABLE 1

After that, each node is initialized: its initial voting capability is set to its SHIKS value and its voting score to 0, as shown in FIG. 2. After initialization, the voting stage begins. The voting score of each node is equal to the sum of the voting capabilities of its first-order and second-order neighbors.

The voting score of node A can be calculated according to formula 4, where $\Gamma_A$ is the set of first-order and second-order neighbors of node A:

$$S_u = \sum_{v \in \Gamma_u} va_v \tag{4}$$

where $\Gamma_u$ represents the set of first-order and second-order neighbor nodes of node u.

Likewise, the voting scores of the other nodes can be calculated with this formula, as shown in FIG. 2. The screening stage then begins: the voting score of node B is 240.8921, the highest among all nodes, so node B is selected as a seed node. Note that after node B is selected as a seed node, both its voting capability and its voting score are set to 0: the voting capability so that the node does not participate in subsequent voting, and the voting score so that the node is not elected again. After the election, the updating stage begins. Because nodes A, C, D, E, F and G voted for node B, their voting capability must be attenuated, i.e., $va_v = va_v - f$, where $f$ is the attenuation factor.

Taking node a as an example, the new voting capacity of node a is:

The voting capability of each node after the update is shown in FIG. 2. The second round of voting then begins, and a new voting score for each node is obtained according to formula 4. As shown in FIG. 2, the voting score of node C is 385.3528, the highest among all nodes, so node C is selected as a seed node and its voting score and voting capability are set to 0. The number of seed nodes has now reached two, and the algorithm stops.

Step eight: results and analysis of the experiments

S8.1 Experimental setup

S8.1.1 data set

To evaluate the performance and accuracy of the SHIKS-VoteRank algorithm more realistically, the experiments use 12 real benchmark network data sets of different scales and characteristics: Jazz, USAir97, Email, Celegansneural, Hamster, Polblogs, Power, Router, Yeast, Facebook, CEnew and USAir2010.

S8.1.2 comparison algorithm

Seven influence node identification algorithms are used for comparison in the experiments: DC, CC, BC, MCDE, H-index, VoteRank and SHIKS.

S8.1.3 evaluation index

In the experiments of this embodiment, the infection scale $F(t)$ at each time step, the final infection scale $F(t_c)$ and the average shortest path length between seed nodes $L_s$ are used as indicators to evaluate the accuracy and effectiveness of the algorithm.
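The third indicator can be sketched in Python as follows. The text does not give a formula for $L_s$, so the sketch assumes it is the mean shortest path length over all unordered pairs of seed nodes, with unreachable pairs skipped; both choices are assumptions, and the function names are hypothetical.

```python
from collections import deque


def bfs_distances(adj, source):
    """Shortest path length from `source` to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def average_seed_distance(adj, seeds):
    """Mean shortest path length over all pairs of seeds (assumed form of L_s)."""
    seeds = list(seeds)
    total, pairs = 0, 0
    for i, u in enumerate(seeds):
        dist = bfs_distances(adj, u)
        for v in seeds[i + 1:]:
            if v in dist:  # ASSUMPTION: unreachable pairs are ignored
                total += dist[v]
                pairs += 1
    return total / pairs if pairs else 0.0


# Hypothetical path network 0-1-2-3-4-5 with seeds 0, 3 and 5.
path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4}}
print(average_seed_distance(path, [0, 3, 5]))
```

A larger value indicates that the selected seeds are spread further apart in the network.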

S8.1.4 Experimental Environment

In this embodiment, all experiments are performed on a 64-bit Windows 11 operating system with an 11th Gen Intel(R) Core(TM) i7-1165G7 @ 2.80 GHz CPU and 16 GB of memory. The algorithms involved are implemented in Python 3.8, and the experimental results are plotted with the Python plotting library Matplotlib.

TABLE 2

S8.1.5 parameter setting

To evaluate the propagation capability of the seed nodes mined by the proposed algorithm, this embodiment uses the SIR model for all propagation-scale comparison experiments. In the SIR model, each node is in one of three states: susceptible, infected or recovered. An infected node infects susceptible nodes with probability $\beta$ and recovers with probability $\gamma$. In the experiments of this embodiment, the recovery rate $\gamma$ is set to 0.01. The choice of infection rate is important: if it is too low, propagation may be poor or may fail entirely; if it is too high, the infection may break out across the whole network, making it difficult to distinguish the influence of individual nodes. Therefore, in this embodiment the infection rate $\beta$ is set slightly larger than the transmission rate $\lambda_c$,

where $\langle k \rangle$ represents the average degree of the network and $\langle k^2 \rangle$ represents the mean of the squared node degrees in the network. The infection rate $\beta$ and transmission rate $\lambda_c$ used for the 12 real networks are shown in Table 2. Because the result of each simulation contains some error, the number of simulations in this embodiment is set to 2000, and the average over the 2000 simulations is taken as the final result.
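A discrete-time SIR simulation of the kind described can be sketched in Python as follows. The details are assumptions, since the text does not specify them: in each step every infected node tries to infect each susceptible neighbor independently with probability `beta` and then recovers with probability `gamma`, and the infection scale counts infected plus recovered nodes. The function names, the step count and the example network are hypothetical.

```python
import random


def simulate_sir(adj, seeds, beta, gamma, steps, rng=None):
    """Return F(t) for t = 0..steps: share of infected plus recovered nodes."""
    if rng is None:
        rng = random.Random()
    infected, recovered = set(seeds), set()
    n = len(adj)
    scale = [len(infected) / n]
    for _ in range(steps):
        newly_infected = set()
        for u in infected:
            for v in adj[u]:
                susceptible = v not in infected and v not in recovered
                if susceptible and rng.random() < beta:
                    newly_infected.add(v)
        # Only nodes infected before this step may recover in this step.
        newly_recovered = {u for u in infected if rng.random() < gamma}
        infected = (infected | newly_infected) - newly_recovered
        recovered |= newly_recovered
        scale.append((len(infected) + len(recovered)) / n)
    return scale


def mean_final_scale(adj, seeds, beta, gamma, steps, runs, seed=0):
    """Average the final infection scale over `runs` independent simulations."""
    rng = random.Random(seed)
    finals = [simulate_sir(adj, seeds, beta, gamma, steps, rng)[-1]
              for _ in range(runs)]
    return sum(finals) / runs


# Hypothetical path network; beta = 1 and gamma = 0 spread one hop per step.
path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
print(simulate_sir(path, [0], beta=1.0, gamma=0.0, steps=4))
```

With the settings of this embodiment, `gamma` would be 0.01 and `runs` would be 2000; `beta` would be taken from Table 2.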

S8.2 analysis of Experimental results

FIG. 3 shows how the infection scale (the proportion of infected and recovered nodes among all nodes in the network) changes over time, with the X-axis representing the infection time $t$ and the Y-axis representing the infection scale $F(t)$. The number of initial seed nodes in this experiment was set to 20% of the total number of nodes. As can be seen from the figure, in the Celegansneural, CEnew, USAir2010, Facebook, Polblogs and Router networks, the number of infected nodes at each time step for SHIKS-VoteRank is clearly greater than for the other comparison algorithms such as SHIKS, VoteRank and H-index; that is, its seed nodes have a stronger infection capability and a larger infection range. The SHIKS-VoteRank curve is also steeper than those of the other algorithms, which shows that the seed nodes mined by SHIKS-VoteRank propagate faster. In the USAir97 and Power networks, when $t < 200$ the performance of SHIKS-VoteRank is only slightly lower than that of VoteRank, and when $t > 200$ the performance of SHIKS-VoteRank is essentially equal to that of VoteRank and SHIKS. Overall, in most real networks the SHIKS-VoteRank algorithm performs well and the mined seed nodes have better propagation capability, which verifies the accuracy and effectiveness of the algorithm.

FIG. 4 shows the final infection scale $F(t_c)$ as a function of the initial seed node proportion $\rho$. The X-axis represents the seed node proportion, with values 0.04, 0.08, 0.12, 0.16 and 0.20; the Y-axis represents the final infection scale. In the Email, Celegansneural, USAir2010, USAir, Facebook, Hamster, Polblogs and Router networks, SHIKS-VoteRank outperforms the other algorithms at every seed node proportion. In the Jazz network, SHIKS-VoteRank is less stable: when $\rho = 0.04$ its performance is only slightly higher than that of VoteRank; its performance improves markedly as the seed node proportion increases; when $\rho = 0.12$ it exceeds H-index; and thereafter SHIKS-VoteRank and SHIKS alternate in the lead. In the USAir97 and Power networks, when $\rho > 0.1$ the number of nodes infected by SHIKS-VoteRank exceeds that of the SHIKS and VoteRank algorithms and remains in the lead thereafter. In the Yeast network, when $\rho > 0.14$ the performance of SHIKS-VoteRank is higher than that of SHIKS.

In general, although the seed node proportion is different, the SHIKS-VoteRank algorithm is superior to other comparison algorithms in the final infection scale in most networks.

FIG. 5 is a line graph of the average shortest distance between seed nodes as a function of the seed node proportion. The X-axis is the seed node proportion $\rho$, with values 0.005, 0.010, 0.015, 0.020, 0.025 and 0.030, and the Y-axis is the average shortest distance $L_s$ between seed nodes. As can be seen from the figure, except in the Jazz and Email networks, the seeds selected by SHIKS-VoteRank are spread more widely across the network than those selected by the other algorithms, i.e., the selected seed nodes are more dispersed, especially in the Celegansneural, USAir97, USAir2010, Power and Router networks. In the CEnew and USAir97 networks, when $\rho > 0.08$ the performance of SHIKS-VoteRank improves markedly and exceeds that of SHIKS. In the Facebook, Hamster and Yeast networks, the effect is poor when $\rho < 0.12$ but improves significantly when $\rho > 0.12$. Overall, the effect is moderate when the seed node proportion is low, but the advantages of the algorithm become apparent as the proportion increases.

This embodiment first describes the idea and specific steps of the traditional VoteRank algorithm in detail through an example network, analyzes the limitations of VoteRank, and then proposes SHIKS-VoteRank, an influence node identification algorithm with higher accuracy that addresses these shortcomings. Second, it describes the idea and specific steps of the SHIKS-VoteRank algorithm in detail: the algorithm sets the initial voting capability of every node to the SHIKS value of the node and takes second-order neighbors into account in the voting stage; in the updating stage, the voting capability of all nodes in the second-order neighborhood is updated according to the shortest path length between each node and the seed node. Finally, the algorithm is applied to 12 real networks and compared experimentally with SHIKS and other classical influence node identification algorithms. The results show that the seed nodes selected by SHIKS-VoteRank are far away from each other, so the algorithm achieves a better propagation effect, which verifies its accuracy and effectiveness.

Example two

As shown in fig. 7, the present application further provides an influence node identification system based on a second-order H-index and voting mechanism,

the data acquisition module is used for acquiring data to be tested;

the data preprocessing module is used for preprocessing data based on the data to be tested and acquiring a data preprocessing result, and the result comprises: the importance of the node and the SHIKS value, and initialization is carried out;

the voting mechanism module is used for voting based on the initialized result;

and the node identification module is used for updating based on the screening result, selecting a designated number of seed nodes and finishing node identification.

Specifically, the initialization method in the data initialization module includes:

The voting mechanism module carries out voting based on the initialization result to acquire a voting result;

The screening method in the voting screening module comprises: after the voting is finished, the node with the highest voting score is added to the seed node set, and the voting capability and voting score of the node are set to 0, so that the node no longer participates in subsequent voting and cannot be elected a second time.

And the node identification module is used for updating based on the screening result, acquiring the updating result, selecting a specified number of seed nodes and finishing node identification.

The updating method in the node identification module comprises the following steps: the voting capacity of the first-order and second-order neighbor nodes of the seed node needs to be attenuated, the attenuation is determined by the length of the shortest path between the node and the seed node, namely an attenuation factor

VoteRank is an algorithm proposed by Zhang et al. that uses a voting mechanism to identify a group of influential nodes in a complex network. VoteRank assigns each node u a tuple $(S_u, va_u)$ that records the node's voting score and voting ability after each round of voting, where $S_u$ is the voting score of the node and $va_u$ is its voting ability. Initially, the voting score and voting ability of each node are set to 0 and 1, respectively, and the voting stage then begins. In the voting stage, the voting score of each node is obtained by summing the voting abilities of its neighbor nodes, and the calculation formula is as follows:

$S_u = \sum_{v \in \Gamma_u} va_v$

where $\Gamma_u$ denotes the set of first-order and second-order neighbor nodes of node u. In the screening stage, the node with the highest voting score is added to the seed node set, and its voting score and voting ability are set to 0, so that it is not selected again in subsequent rounds and no longer participates in subsequent voting. Finally, in the updating stage, the voting ability of the neighbor nodes of the seed node is weakened; in each round it is reduced by

$\frac{1}{\langle k \rangle}$

until the voting ability reaches 0, where $\langle k \rangle$ is the average degree of the network. The process of voting, screening, and updating is then repeated until the specified number of seed nodes has been elected. To illustrate the VoteRank algorithm more intuitively, its steps are described in detail using the network shown in FIG. 7.
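The voting, screening, and updating loop of VoteRank can be sketched in Python as follows. This is a minimal sketch rather than a reference implementation: the function name `voterank`, the adjacency-dictionary representation, and the toy graph are assumptions made for illustration. The sketch sums votes over direct neighbors only, which is consistent with the worked example that follows (node A's second-round score comes only from node B), and it assumes the per-round decrement is 1/<k>, as in the standard VoteRank algorithm.

```python
def voterank(adj, num_seeds):
    """Sketch of VoteRank on an undirected graph given as {node: set(neighbors)}."""
    # Average degree <k> of the network.
    avg_degree = sum(len(nbrs) for nbrs in adj.values()) / len(adj)
    ability = {u: 1.0 for u in adj}  # initial voting ability of every node
    seeds = []
    elected = set()
    while len(seeds) < num_seeds:
        # Voting stage: a node's score is the sum of its neighbors' abilities.
        scores = {
            u: sum(ability[v] for v in adj[u])
            for u in adj
            if u not in elected
        }
        if not scores:
            break  # every node has already been elected
        # Screening stage: elect the node with the highest voting score.
        best = max(scores, key=scores.get)
        seeds.append(best)
        elected.add(best)
        ability[best] = 0.0
        # Updating stage: weaken the neighbors of the new seed by 1/<k>.
        for v in adj[best]:
            ability[v] = max(0.0, ability[v] - 1.0 / avg_degree)
    return seeds


# Hypothetical toy graph, unrelated to the network in the figures.
edges = [("c", "a"), ("c", "b"), ("c", "d"), ("c", "e"), ("c", "f"),
         ("h", "i"), ("h", "j"), ("h", "k"), ("e", "h")]
adj = {}
for u, v in edges:
    adj.setdefault(u, set()).add(v)
    adj.setdefault(v, set()).add(u)

print(voterank(adj, 2))
```

On this toy graph the first round elects `c` (score 5) and the second round elects `h`, whose score is lowered only by the weakened vote of the shared neighbor `e`.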

FIG. 6 shows the voting scores and voting abilities of the nodes after the first round of voting. Taking node C as an example, its voting score is:

$S_C = va_A + va_B + va_D + va_E + va_F + va_G = 6$

The voting scores of the remaining nodes are obtained in the same way. As can be seen in the figure, node C has the highest voting score, so node C is added to the seed node set. Since node C has been elected, it no longer participates in subsequent voting, and its voting score and voting ability are therefore set to 0. In addition, the voting ability of each of node C's neighbor nodes A, B, D, E, F, and G is reduced by $1/\langle k \rangle$.

The voting ability of each node after the update is shown in FIG. 7. The second round of voting then begins. Taking node A as an example, since the voting ability of node C has been set to 0, the voting score of node A comes only from node B; in the previous round the voting ability of node B was weakened to 0.5714, so the voting score of node A is 0.5714. The second-round voting scores of the remaining nodes are obtained in the same way. As can be seen in the figure, node H ranks first with a voting score of 4.1428, so node H is added to the seed node set in the second round. The voting ability of the neighbor nodes of node H is then weakened in turn. This process is repeated until a sufficient number of seed nodes have been selected.
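The figures quoted in this example can be cross-checked with a short calculation, assuming the per-round decrement is $1/\langle k \rangle$ as in the standard VoteRank algorithm. The value of $\langle k \rangle$ below is inferred from the quoted voting ability of node B and is not stated in the text.

```latex
va_B = 1 - \frac{1}{\langle k \rangle} = 0.5714
\;\Rightarrow\; \frac{1}{\langle k \rangle} = 0.4286,
\quad \langle k \rangle \approx 2.33 \\
S_A = va_C + va_B = 0 + 0.5714 = 0.5714
```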

The above-described embodiments are merely illustrative of the preferred embodiments of the present application, and do not limit the scope of the present application, and various modifications and improvements made to the technical solutions of the present application by those skilled in the art without departing from the spirit of the present application should fall within the protection scope defined by the claims of the present application.

Claims

1. The influence node identification method based on the second-order H index and the voting mechanism is characterized in that,

acquiring data to be tested;

voting based on the initialized preprocessing result to obtain a voting result;

screening based on the voting result to obtain a screening result;

2. The method for identifying an influence node based on a second-order H-index and voting mechanism according to claim 1,

the data preprocessing method comprises the following steps:

3. The influence node identification method based on the second-order H index and the voting mechanism according to claim 2,

the initialization method comprises the following steps:

4. The method for identifying an influence node based on a second-order H index and voting mechanism according to claim 1,

the voting method comprises the following steps:

the first-order neighbors and the second-order neighbors participate in the voting, i.e., the voting score $S_u$ of node u is

$S_u = \sum_{v \in \Gamma_u} va_v$

where $\Gamma_u$ is the set of first-order and second-order neighbor nodes of node u, the subscript v denotes a node, va denotes voting ability, and $va_v$ denotes the voting ability of node v.

5. The influence node identification method based on the second-order H index and the voting mechanism according to claim 4,

6. The influence node identification method based on the second-order H index and the voting mechanism according to claim 5,

7. The influence node identification system based on the second-order H index and the voting mechanism is characterized in that,

the data acquisition module is used for acquiring data to be tested;

8. The influence node identification system based on the second-order H index and the voting mechanism according to claim 7,

the initialization method in the data initialization module comprises the following steps:

9. The influence node identification system based on the second-order H index and the voting mechanism according to claim 7,

10. The influence node identification system based on the second-order H index and the voting mechanism according to claim 9,