CN110138619A

CN110138619A - A kind of start node choosing method that realizing maximizing influence and system

Info

Publication number: CN110138619A
Application number: CN201910448351.1A
Authority: CN
Inventors: 周旭; 刘勇刚; 姜文君; 肖国庆; 罗文晟; 李肯立; 李克勤
Original assignee: Hunan University
Current assignee: Hunan University
Priority date: 2019-05-28
Filing date: 2019-05-28
Publication date: 2019-08-16
Anticipated expiration: 2039-05-28
Also published as: CN110138619B

Abstract

The invention discloses a kind of start node choosing methods for realizing maximizing influence, its scene for influences a variety of in social networks while propagation, the consciousness of comforming is introduced into communication process, it proposes inversely for the propagation model for the consciousness of comforming using the method for sampling, start node choosing method and start node estimation method, it is reverse using sampling to network progress is influenced first, then according to inversely using the sample of sampling, iteratively calculate start node, until meeting required precision using income using estimation method judgement, otherwise sampling scale is doubled, repeat above step.The propagation model for consciousness of comforming is more scientific and truly modelled propagation process, start node choosing method can accurately and efficiently choose start node, and can adapt to large scale network structure, improves the timeliness of start node choosing method.

Description

A kind of start node choosing method that realizing maximizing influence and system

Technical field

The invention belongs to computer information technology fields, realize the initial of maximizing influence more particularly, to a kind of Node selection method and system.

Background technique

The life that the development of internet is not only provided convenience for the mankind, more changes human lives and working method.With The network applications rise such as Facebook, microblogging and mobile network's terminal it is universal, online social networks will be dispersed in differently Domain possesses heterogeneous beliefs, is under the jurisdiction of country variant and the people of tissue link together, and forms a huge information transportation net Network.Huge economy and society value is contained in the information propagation of large-scale social networks (such as can be used for advertisement marketing and political affairs Plan is promoted), therefore start node how is chosen to realize maximizing influence (Influence maximization, abbreviation IM), have become social networks technical field one important to study a question.

Existing start node selection method is mainly based upon independent cascade (Independent cascade, abbreviation IC) Model or linear threshold (Linear threshold, abbreviation LT) model use the method, heuristic based on analogue simulation Method or the method for reverse influence sampling determine the influence of start node.However, existing start node selection method institute The model used still has the technical issues of can not ignore: since it does not account for conformity behavior, so that the model established is not It is enough true, thus the influence value obtained is inaccurate, the start node for eventually leading to selection is not ideal enough.

Summary of the invention

Aiming at the above defects or improvement requirements of the prior art, the first of maximizing influence is realized the present invention provides a kind of Beginning node selection method and system, it is intended that solving model used by existing start node selection method due to not having Consider conformity behavior, so that the model established is not true enough, leads to the start node that the influence value obtained is inaccurate, selects not Enough ideal technical problems；In addition, the authenticity and science of the propagation model enhancing propagation model proposed in the present invention, also mention The scope of application of propagation model is risen；Finally, the start node selection method proposed in the present invention improves start node selection The timeliness of method.

To achieve the above object, according to one aspect of the present invention, it provides and a kind of realizes the initial of maximizing influence Node selection method, comprising the following steps:

(1) influence diagram is constructed according to established propagation model, and start node collection S is generated according to the influence diagram of building_C, For propagating other influences for having competitive relation；

(2) according to preset precision parameter ε and δ, threshold value T is obtained using SSA algorithm₀, to the influence diagram obtained in (1) into Row T₀Secondary T reverse using sampling, and that all samplings are obtained₀A sample is put into sample setIn；

(3) according to the sample set returned in step (2)Iteratively k is selected to use edge income from influence diagram Maximum node obtains the Biased estimator of the use of start node collection S as start node collectionWherein 0 < k < n；

(4) counter c10=0 is set, executes step (5) reverse number using sampling, and setting steps for indicating (3) the total of the start node collection S obtained in uses SUM₂=0；

(5) inversely using sampling, calculate the start node collection S obtained in step (3) in this sampling to influence diagram Middle acquisition uses A_c10, and update the total using SUM of the start node collection S obtained in step (3)₂=SUM₂+A_c10；

(6) judge whether counter c10 is less than threshold value T₂And the total of start node collection S uses SUM₂Whether threshold value T is less than₃, C10=c10+1, and return step (5) are if it is set, otherwise exportedAnd enter step (7)；

(7) Biased estimator of the start node collection S obtained in judgment step (3)It is first with being obtained in step (6) The unbiased esti-mator of beginning node collection SWhether meetIt is saved if it is satisfied, then directly output is initial Otherwise point set S updates sampling number T used in step (2) as a result, process terminates₀=2*T₀, and return step (2).

Preferably, influence diagram is represented as G=(V, E, p), and wherein V indicates the set of node, and E indicates the collection of directed edge Close, p indicates the function for the activation probability that all sides are preassigned, each edge have a pre-assigned activation Probability p (u, V) [0,1] ∈ indicates the probability of node u activation node v.

Preferably, the communication process of propagation model is as follows:

Firstly, different influences activates respective start node, at 0 moment to start communication process.Different influences can Select identical node as start node；

Then, in t moment, become state of activation at this moment in the node that the t-1 moment is activated, and attempt to activate The neighbor node not being activated, if activation trial and success, the node being activated will receive being had an impact for activation node, if Failed, then not to be activated node holding unactivated state is attempted in activation, and after this, the node in state of activation will According to comforming using probability defined in function, using one of its received influence, and adoption status is ultimately become；

That comforms is as follows using function h (u, I) definition probability:

Wherein, h (u, I_i) indicate node u using influence I_iProbability,It is the set of had an impact composition, NA (u, I_i) Expression, which has activated node u and has propagated to it, influences I_iNeighbor node.

Finally, communication process termination when not new node is activated.

Sample in step (2) is the data that single is inversely obtained using sampling, and including node set R, node set R In the set { d (u) | u ∈ R } of distance between each node w and other nodes, in node set R each node activation node Set { NA (u) | u ∈ R }, node w and the start node collection S of collection_CDistance d_CWith node w by start node collection S_CInterior joint institute The total degree Time of the influence activation of propagation_C。

Preferably, single includes inversely following sub-step using the process of sampling in step (2):

(2-1) is randomly chosen a node w from influence diagram, and node w and itself distance d (w is arranged_c1)=0, by w It is put into node set R, and start node set S is set_CWith node w distance d_CVariable curDist=0 is arranged in=∞, CurDist indicate currently processed node at a distance from node w, abbreviation current distance；

(2-2) judges whether node w is start node collection S_CIn element, if it is, record node w by competition shadow Ring the total degree Time of activation_CWith start node set S_CWith node w distance d_c=0, it is transferred to step (2-17), is otherwise entered Step (2-3)；

Counter c1=1, current distance curDist=curDist+1 is arranged in (2-3), judges whether c1 value is greater than node w Enter neighbor node sum, if it is, being transferred to step (2-17), be otherwise transferred to step (2-4)；

(2-4) enters neighbor node w according to c1 of node w_c1Activate the probability predicate node w of node w_c1Activate node w Whether succeed, if it is, record node w_c1With node w distance d (w_c1)=curDist, by node w_c1It is put into its activation section Point set NA (w_c1), by node w_c1It is put into node set V, and is transferred to step (2-5), be otherwise transferred to step (2-6)；

(2-5) judges node w_c1It whether is start node collection S_CIn element, if it is by start node set S_C With node w distance d_CIt is set as current distance curDist, and is transferred to step (2-6), is otherwise transferred to step (2-6)；

(2-6) judges that whether c1 value be equal to node w enters neighbor node sum, no if it is, be transferred to step (2-8) Then it is transferred to step (2-7)；

Counter c1=c1+1, and return step (2-4) is arranged in (2-7)；

(2-8) judges node set V and start node collection S_CWhether have whether intersection or node set V are empty set, such as Fruit is then to calculate and record node w by node set V and S_CThe total degree for the competitive influence activation that intersection interior joint is propagated Time_C, and it is transferred to step (2-17), otherwise all nodes in node set V are put into node set R, and be transferred to step (2-9)；

Counter c2=1 and current distance curDist=curDist+1 is arranged in (2-9)；

(2-10) selects the c2 node w from node set V_c2, and counter c3=1 is set, judge counter c3's Whether value is greater than node w_c2Enter neighbor node sum, if it is, being transferred to step (2-14), be otherwise transferred to step (2-11)；

(2-11) judges node w_c3Whether not in node set R, and according to node w_c2C3 enter neighbor node w_c3 Activate node w_c2Probability predicate node w_c3Activate node w_c2Whether succeed, if it is, setting node w_c3With node w away from From d (w_c3) it is equal to curDist, by node w_c2Activation node collection NA (w_c2) in element be put into node w_c3Activate node collection NA (w_c3) in, by node w_c3It is put into node set V_nextIn, and it is transferred to step (2-12), otherwise it is transferred to step (2-13)；

(2-12) judges node w_c3It whether is start node collection S_CIn element, if it is be arranged start node set S_C With node w distance d_CEqual to curDist, and it is transferred to step (2-13), is otherwise directly transferred to step (2-13)；

(2-13) judges whether the value of counter c3 is greater than node w_c2Enter neighbor node sum, if it is, being transferred to step Suddenly counter c3=c3+1, and return step (2-11) is otherwise arranged in (2-14)；

(2-14) judges whether counter c2 value is equal to the node total number in node set V, if yes then enter step (2- 15) counter c2=c2+1, and return step (2-10), are otherwise set；

(2-15) judges node set V_nextWith start node collection S_CWhether intersection or node set V are had_nextWhether be Empty set, if it is, obtaining node w by node set V_nextWith S_CTotal time of the competitive influence activation that intersection interior joint is propagated Number Time_C, and it is transferred to step (2-17), otherwise it is transferred to step (2-16)；

(2-16) uses node set V_nextIn all nodes replacement node set V in all nodes, and by node collection All nodes closed in V are put into node set R, and return step (2-8)；

(2-17) obtain node set R, in node set R the distance between each node and node w set d (u) | u ∈ R }, in node set R the activation node collection of each node set { NA (u) | u ∈ R }, node w and start node collection S_C's Distance d_CAnd node w is by start node collection S_CThe total degree Time for the influence activation that interior joint is propagated_C。

Preferably, the k mistake using the node of edge Income Maximum as start node collection is iteratively selected in step (3) Journey includes following sub-step:

(3-1) settingCounter c4=1 is set, and each node is in sample set in setting influence diagramOn The initial value always used is 0, and start node collection S is in sample setOn use summation SUM₁=0；

Counter c5=1 is arranged in (3-2)；

(3-3) is from sample setThe c5 sample R of middle taking-up_c5, counter c6=1 is set；

(3-4) is from sample R_c5The c6 node w of middle taking-up_c6, judge in R_c5Interior joint w_c6With node w distance d (w_c6) Whether d is less than_C, if it is, setting node w_c6In R_c5On be adopted as 1, node w_c6It is total using plus 1, be then transferred to step (3-5)；Otherwise, node w is set_c6In R_c5On be adopted asUpdate w_c6In sample setOn it is total It is adopted as node w_c6In sample setOn it is total using addingThen it is transferred to step (3-5)；Wherein NA(w_c6) and Time_CIt is to respectively indicate sample R_c5In activation node collection and start node collection S_CThe influence that interior joint is propagated swashs Sample R living_c5In node w total degree.

(3-5) judges whether counter c6 is less than | R_c5|, if it is, setting c6=c6+1, then return step (3- 4), otherwise it is transferred to step (3-6)；

(3-6) judges whether counter c5 is less thanIf it is, setting c5=c5+1, and return step (3-3), it is no Then it is transferred to step (3-7)；

(3-7) exists to nodes all in influence diagram according to each node using big root heapOn the size that always uses carry out Descending arrangement；

(3-8) judges counter C₄Whether preset threshold k is less than, if it is, taking outOn always use maximum section Point w_c4, by node w_c4Start node collection S is added, SUM is set₁=SUM₁+A(w_c4), wherein A (w_c4) it is node w_c4?On It is total to use, and the deletion of node w from big root heap_c4, and step (3-9) is gone to, otherwise obtains start node set S and used with it Biased estimatorThen it is transferred to step (3-22).

(3-9) is from sample setMiddle take out all includes node w_c4Sample, formed sample setAnd meter is set Number device c7=1；

(3-10) takes outIn the c7 sample R_c7, and judgement sample R_c7Whether it is marked as determining use, if It is to go to step (3-21), otherwise goes to step (3-11)；

(3-11) judges w_c4With sample R_c7In node w distance d (w_c4) whether it is less than R_c7In start node set S_C With node w distance d_C, if it is marker samples R_c7It is used to determine, and goes to step (3-12), otherwise go to step (3- 15)；

Counter c8=1 is arranged in (3-12)；

(3-13) takes out sample R_c7In the c8 node, update w_c8In sample setOn be always adopted as node w_c8 In sample setOn it is total using subtracting node w_c8In R_c7On use, and more new node w_c8Sequence in big root heap.

(3-14) judges whether counter c8 is less than | R_c7|, if it is, setting c8=c8+1, and go to step (3- 13) step (3-21), otherwise, is gone to；

Counter c9=1 is arranged in (3-15)；

(3-16) takes out sample R_c7In the c9 node w_c9, judge w_c9With sample R_c7In node w distance d (w_c9) Whether R is less than_c7In start node set S_CWith node w distance d_C, step (3-17) is if it is gone to, step is otherwise gone to Suddenly (3-18)；

(3-17) more new node w_c9In R_c7On be adopted as node w_c9In R_c7On use subtract node w_c4In R_c7On Using more new node w_c9In sample setOn be always adopted as node w_c9In sample setOn it is total using subtracting node w_c4In R_c7On use；

(3-18) more new node w_c9In sample setOn be always adopted as node w_c9In sample setOn total use Subtract node w_c9In R_c7On use, more new node w_c9In R_c7On be adopted asWhereinFor swashing for set S interior joint The size of the union of movable joint point set, i.e., start node collection S is in R_c7On activation total degree, and more new node w again_c9In sample SetOn be always adopted as node w_c9In sample setOn it is total using add node w_c9In R_c7Upper updated use；

Sequence of (3-19) the more new node wc9 in big root heap；

(3-20) judges whether counter c9 is less than | R_c7|, if it is, setting c9=c9+1, and go to step (3- 16) step (3-21), otherwise, is gone to；

(3-21) judges whether counter c7 is less thanIf it is, setting c7=c7+1, and go to step (3- 10) step (3-8), otherwise, is gone to；

(3-22) judges SUM₁Whether threshold value T is greater than₁, if it is, entering step (4), otherwise updating makes in step (2) Sampling number T₀=2*T₀, and return to (2).According to SSA algorithm, threshold value T₁Calculation formula beWhereinε₂=ε₃=(1-1/e)^-1ε/2。

Preferably, start node collection S's using f (S) is obtained using following formula:

Wherein, S is start node collection, and n is nodes sum, A_iIt is the use income of i-th sampling,It is real The number of border sample,It isIn all samples average use.

Preferably, influence diagram is carried out inversely using sampling in step (5), and calculates the initial section obtained in step (3) What point set S was obtained in this sampling uses A_c10Process include following sub-step:

(5-1) is randomly chosen a node z from influence diagram, and node z is put into node set R, start node collection S_CWith node z distance d_C=∞, start node collection S and node z distance d_S=∞, current distance curDist=0, it is initial to save Point set S uses A what this was sampled_c10=0；

(5-2) judges whether node z is start node collection S_CIn element, if it is d_C=curDist is transferred to step Suddenly (5-3) otherwise enters step (5-3)；

(5-3) judges whether node z is element in start node collection S, if it is d_S=curDist is transferred to step Suddenly (5-4) otherwise enters step (5-4)；

(5-4) judges d_CWhether it is equal to curDist, is if it is transferred to step (5-5), otherwise enters step (5-6)；

(5-5) judges d_SWhether curDist, the use that if it is start node collection S is sampled at this are equal toWherein Time_CIt is the total degree that node z is activated by competitive influence, and is transferred to step (5-30), otherwise just Beginning node collection S uses A what this was sampled_c10=0, enter step (5-30)；

(5-6) judges whether to be equal to curDist, and if it is start node collection S uses A what this was sampled_c10=1, turn Enter step (5-30), otherwise enters step (5-7)；

Counter c11=1, current distance curDist=curDist+1 is arranged in (5-7)；

(5-8) judges that whether c11 value be greater than node z enters neighbor node sum, if it is, be transferred to step (5-13), Otherwise it is transferred to step (5-9)；

(5-9) enters neighbor node z according to c11 of node z_c11Activate the probability predicate node z of node z_c11Activation section Whether point z succeeds, if it is, by node z_c11It is put into it and activates node collection NA (z_c11), by node z_c11It is put into node set V In, and it is transferred to step (5-10), otherwise it is transferred to step (5-11)；

(5-10) judges node z_c11It whether is start node collection S_CIn element, if it is d_C=curDist will be saved Point z_c11Node collection Set is added_CIn, it is transferred to step (5-11), otherwise enters step (5-11)；

(5-11) judges node z_c11It whether is element in start node collection S, if it is d_S=curDist, it will Node z_c11Node collection Set is added_SIn, it is transferred to step (5-12), otherwise enters step (5-12)；

(5-12) counter c11=c11+1, enters step (5-8)；

(5-13) judges d_CWhether it is equal to curDist, is if it is transferred to step (5-14), otherwise enters step (5- 15)；

(5-14) judges d_SWhether curDist, the use that if it is start node collection S is sampled at this are equal toWhereinIt is the total degree that node z is activated by competitive influence, Infset_CIt (u) is node collection Set_CThe set for the competitive influence that interior joint is propagated, and is transferred to step (5-30), otherwise start node Collection S uses A what this was sampled_c10=0, enter step (5-30)；

(5-15) judges d_SWhether curDist is equal to, if it is start node collection S uses A what this was sampled_c10= 1, it is transferred to step (5-30), otherwise enters step (5-16)；

(5-16) if node set V is not empty set, and d_SNot equal to curDist, and d_CIt, then will section not equal to curDist All nodes in point set V are put into node set R, are transferred to step (5-17), are otherwise transferred to step (5-27)；

(5-17) be arranged counter c12=1, currently with node z distance curDist=curDist+1；

(5-18) judges the node total number whether c12 value is greater than in node set V, if yes then enter step (5-26), Otherwise return step (5-19)；

(5-19) selects the c12 node z from node set V_c12, and counter c13=1 is set；

(5-20) judges whether c13 value is greater than node z_c12Enter neighbor node sum, if it is, being transferred to step (5- 25), otherwise it is transferred to step (5-21)；

(5-21) judges node z_c13Whether not in node set R, and according to node z_c12C13 enter neighbor node z_c13Activate node z_c12Probability predicate node z_c13Activate node z_c12Whether succeed, if it is, by node z_c12Activation section Point set NA (w_c2) in element be put into node z_c13Activate node collection NA (w_c3) in, by node z_c13It is put into node set V_nextIn, And it is transferred to step (5-22), otherwise it is transferred to step (5-24)；

(5-22) judges node z_c13It whether is start node collection S_CIn element, if it is d_C=curDist will be saved Point z_c13Node collection Set is added_CIn, it is transferred to step (5-23), otherwise enters step (5-23)；

(5-23) judges node z_c13It whether is element in start node collection S, if it is d_S=curDist will be saved Point z_c13Node collection Set is added_SIn, it is transferred to step (5-24), otherwise enters step (5-24)；

(5-24) counter c13=c13+1, and return step (5-20)；

(5-25) counter c12=c12+1, and return step (5-18)；

(5-26) uses node set V_nextIn all nodes replacement node set V in all nodes, and return step (5-16)；

(5-27) judges d_CWhether it is equal to curDist, is if it is transferred to step (5-28), otherwise enters step (5- 29)；

(5-28) judges d_SWhether curDist, the use that if it is start node collection S is sampled at this are equal toWhereinFor set S interior joint activation node collection and The size of collection, and it is transferred to step (5-30), otherwise start node collection S uses A what this was sampled_c10=0, enter step (5- 30)；

(5-29) judges whether to be equal to curDist, and if it is start node collection S uses A what this was sampled_c10=1, It is transferred to step (5-30), otherwise enters step (5-30)；

(5-30) returns to the use A of this sampling_c10。

It is another aspect of this invention to provide that providing a kind of start node selecting system for realizing maximizing influence, wrap It includes:

First module for constructing influence diagram according to established propagation model, and generates just according to the influence diagram of building Beginning node collection S_C, for propagating other influences for having competitive relation；

Second module, for obtaining threshold value T using SSA algorithm according to preset precision parameter ε and δ₀, to the first module The influence diagram of acquisition carries out T₀Secondary T reverse using sampling, and that all samplings are obtained₀A sample is put into sample setIn；

Third module, the sample set for being returned according to the second moduleK use is iteratively selected from influence diagram The node of edge Income Maximum obtains the Biased estimator of the use of start node collection S as start node collectionIts In 0 < k < n；

Counter c10=0 is arranged in 4th module, for indicating to execute step (5) inversely using the number of sampling, and sets Set the total using SUM of the start node collection S of third module acquisition₂=0；

5th module, for inversely calculate the start node collection S that third module obtains using sampling and exist to influence diagram What is obtained in this sampling uses A_c10, and the total of start node collection S for updating the acquisition of third module uses SUM₂=SUM₂+A_c10；

6th module, for judging whether counter c10 is less than threshold value T₂And the total of start node collection S uses SUM₂Whether Less than threshold value T₃, c10=c10+1 is if it is set, and returns to the 5th module, otherwise exports the use of start node collection S Biased estimatorSubsequently into the 7th module；

7th module, the Biased estimator of the start node collection S for judging the acquisition of third moduleWith the 6th module The unbiased esti-mator of the start node collection S of acquisitionWhether meetIf it is satisfied, then directly defeated Out otherwise start node collection S updates the sampling number T that the second module uses as a result, process terminates₀=2*T₀, and return to the Two modules.

In general, through the invention it is contemplated above technical scheme is compared with the prior art, can obtain down and show Beneficial effect:

(1) propagation model of consciousness proposed by the present invention of comforming, it is contemplated that a variety of competitive relations for influencing to propagate, and for the first time It is used as decision-making foundation during conformity behavior is introduced using influencing, to enhance the reasonability and science of propagation model Property, the influence value of acquisition is accurate, therefore the start node of final choice is ideal enough；

(2) present invention for consciousness of comforming propagation model design it is reverse using the method for sampling, solve conventional inverse The problem of influencing sampling are not suitable for the method for sampling, so that inversely wider using the scope of application of the method for sampling more；

(3) incrementally more new node uses income for start node selection method of the invention, to reduce a large amount of It computes repeatedly, improves computational efficiency.

Detailed description of the invention

Fig. 1 is the flow chart for the start node choosing method that the present invention realizes maximizing influence；

Fig. 2 is the state transition diagram of the propagation model interior joint for the consciousness of comforming；

Fig. 3 shows an example of the propagation model of consciousness of the invention of comforming.

Specific embodiment

In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right The present invention is further elaborated.It should be appreciated that the specific embodiments described herein are merely illustrative of the present invention, and It is not used in the restriction present invention.As long as in addition, technical characteristic involved in the various embodiments of the present invention described below Not constituting a conflict with each other can be combined with each other.

For the above-mentioned deficiency mentioned in background technique, present invention firstly provides a kind of propagation model of consciousness of comforming, For the propagation design of multiple competitive influences, the conformity behavior of people is introduced into communication process, for select finally to use It influences；Then, based on the propagation model for the consciousness of comforming, we have proposed the use maximization problems comformed；Finally, for comforming The propagation model design of consciousness inversely uses the method for sampling and start node selection method, and calculates and have in conjunction with SSA frame The start node set of quality assurance.

The technical term occurred in the present invention is explained in detail and is illustrated first below:

Comform: also referred to as conformity behavior refers to the personal influence by extraneous crowd behaviour, and in the consciousness of oneself, judge, recognize The behavior for according with public opinion or majority is shown in knowledge.

Using: the section for referring to the expectation of the quantity in social networks using certain node influenced or being influenced using certain The quantity of point.

That comforms uses function: for the scenes with emulative influence while propagation multiple in social networks, passing Broadcast probability function defined in modelThat comforms is met using probability function: 0≤h (u, I_i) ≤ 1 andTo provide probability ginseng using one such influence for the node selection in social networks Number, and node receive certain influence number it is more, using this kind influence probability it is bigger, embody node selection obey The behavior comformed.

Enter neighbor node: the information between the node in social networks, which is propagated, has directionality, therefore, the section in influence diagram Side between point also has direction.For arbitrary node v, if there is a directed edge (u, v), indicate that node u is directed toward node V, then, node u is that node v enters neighbor node.

Neighbor node out: with it is upper similarly, for arbitrary node u, if there is a directed edge (u, v), indicate that node u refers to To node v, then, node v is the neighbor node out of node u.

The propagation model for consciousness of comforming: which specify propagation rule of a variety of influences in influence diagram.Each section among these There are two types of behaviors for point: activating and uses, controls mainly activation probability function p (e) of both action and the use comformed is general Rate function h (u, I).

Activate probability function p (e) in the present invention there are three types of set-up mode, unified probability is arranged, and in-degree inverse weight is set Set selected with three values one setting.Unify probability set-up mode, identical activation probability is arranged in each edge in network, is commonly 0.1.The setting of in-degree inverse weight, by while activation probability be set as this while terminal in-degree inverse, i.e. p (u, v)=1/ | N_in(v) |, wherein side e=(u, v), v are the terminal on side, N_in(v) all of the side of v are directed to and enter neighbours.Three values select one Setting is that the activation probability for being set as side is randomly chosen from { 0.1,0.01,0.001 } three values.

That comforms is defined using the following h of probability function (u, I):

Wherein, h (u, I_i) indicate that node u uses I_iProbability, Freq (I_i) indicate that u receives influence I_iNumber.In reality Border uses following formula in calculating:

NA(u,I_i) indicate to have activated u and have propagated I to it_iEnter neighbours.

That comforms is met using probability function: 0≤h (u, I_i)≤1 and

As shown in Figure 1, the present invention realize maximizing influence start node choosing method the following steps are included:

(1) influence diagram (Influence graph) is constructed according to established propagation model, and according to the influence diagram of building Generate start node collection S_C, for propagating other influences (i.e. competitive influence) for having competitive relation；

Social networks is modeled as oriented influence diagram G=(V, E, p), and wherein V indicates the set of node, and E indicates oriented The set on side, p indicate the function for the activation probability that all sides are preassigned, and each edge has a pre-assigned activation general Rate p (u, v) ∈ [0,1] indicates the probability of node u activation node v.

For start node collection S_CFor, it is assumed that there is q >=a kind influence in social networks, every kind influences all to correspond at the beginning of one The start node collection of beginning node collection, different influences can be overlapped, i.e. the start node that a node can be used as multiple influences. Start node collection S_CCan be arranged by obtaining true start node from the network of reality, can also taking human as pass through inspiration The method setting (being such as arranged according to the out-degree of node) of formula.For example, being every kind of competitive influence as the number q=2 of competitive influence 50 start nodes are set, and it is the common start node of two kinds of competitive influences that maximum 25 nodes of out-degree, which may be selected, then again It is followed successively by two influences and distributes the maximum node of out-degree in remaining node, each node distributes a kind of influence.

Fig. 2 illustrates three kinds of state changes of the communication process interior joint of propagation model, and node is initially unactivated state, Become state of activation if by neighbor node successful activation, if un-activation will do not kept still by neighbor node successful activation State.When the node being activated receives the influence that one or more nodes are propagated, it will select and apply one of shadow It rings, state can become having used, and remains to communication process and terminate.

The communication process of propagation model is further illustrated below in conjunction with Fig. 3.In Fig. 3, circle indicates node, line with the arrow Indicate oriented side, the influence that digital representation this node in circle receives.Fan-shaped expression in circle is using corresponding shadow Loud probability, the line segment with the arrow of overstriking indicate the side for being try to activation, and dotted line indicates activation failure.

1) moment 0, different influences activates respective start node, to start communication process.Different influences may be selected Identical node is as start node.

In Fig. 3, t=0 moment, tri- nodes of a, b and c are activated as start node, and a, c are affected 1 and 4 respectively and swash Living, it is influence 2,3 and 4 respectively that node b is activated by three kinds of influences.

2) in moment t, the node being activated in t-1 becomes state of activation at this moment, and attempt activation not by The neighbor node of activation, if activation trial and success, the node being activated will receive activation the having an impact of node (same not The node being activated will likely be activated by the node of multiple states of activation, so that a variety of influences are received, it is same to influence May repeatedly be received by the same node), if failed, not to be activated node holding unactivated state is attempted in activation. After this, node in state of activation is by according to the probability defined using function comformed, using its received influence One of, and ultimately become adoption status.

That comforms uses function h (u, I_i) define probability it is as follows:

Wherein, h (u, I_i) indicate node u using influence I_iProbability,It is the set of had an impact composition, Freq (I_i) Indicating that node u is received influences I_iNumber.Following formula is used in actually calculating:

NA(u,I_i) indicate to have activated node u and have propagated to it to influence I_iNeighbor node.

That comforms is met using probability function: 0≤h (u, I_i)≤1 andIn Fig. 3, the t=1 moment, A, tri- nodes of b and c become state of activation, and attempt activation tri- unactivated nodes of d, e and f respectively.Node c activates e Failure, node e are activated by two nodes of a and b, have received four kinds of influences of two nodes.Finally, tri- nodes of d, e and f will One kind is selected in the influence received from it respectively.Its interior joint e receives four kinds of influences, and every kind of influence receives only one It is secondary, so the probability used is all 1/4.In Fig. 3, at the t=2 moment, because node e is in, adoption status, node d cannot swash E living.Node g is activated by three nodes simultaneously, receives four kinds of influences, but is influenced 1 and 4 and had received twice, remaining receives one It is secondary.So it is higher using the probability for influencing 1 and 4, it is 2/6, remaining two kinds influences use probability for 1/6.

3) communication process termination when not new node is activated.

In Fig. 3, not new node can be activated after the t=2 moment, and communication process terminates.

(2) according to preset precision parameter ε and δ, threshold value T is obtained using SSA algorithm₀, to the influence diagram obtained in (1) into Row T₀Secondary T reverse using sampling, and that all samplings are obtained₀A sample is put into sample setIn, wherein 0 < ε <, 1,0 < δ < 1；

Specifically, threshold value T₀Calculation formula beWherein, function

N indicates the node total number in influence diagram,

In this step, sample refers to the data that single is inversely obtained using sampling, specifically includes: node set R, The set { d (u) | u ∈ R } of distance between each node w and other nodes, each node in node set R in node set R Activate set { NA (u) | u ∈ R }, node w and the start node collection S of node collection_CDistance d_CWith node w by start node collection S_C The total degree Time for the influence activation that interior joint is propagated_C。

Single includes inversely following sub-step using the process of sampling in this step:

Counter c1=c1+1, and return step (2-4) is arranged in (2-7)；

Counter c2=1 and current distance curDist=curDist+1 is arranged in (2-9)；

Because influence diagram is probability graph, activating and using has randomness, so, using the quantity of certain node influenced Also there is randomness.Therefore, multiple influences are in probability directed networks in communication process, definition use function f (S) for use by Expectation of the node collection S as the quantity of certain node influenced of start node collection.Because calculate start node collection S uses f It (S) is NP-Hard problem, so, f (S) is used using the sample estimation sampled in step (2), the calculation formula of use is as follows:

Using the income size of the measurement start node for being desirable to more rationally science of number of nodes.

In this step, iteratively select the k node using edge Income Maximum as start node from influence diagram Detailed process is as follows for collection:

Counter c5=1 is arranged in (3-2)；

Counter c8=1 is arranged in (3-12)；

Counter c9=1 is arranged in (3-15)；

(3-17) more new node w_c9In R_c7On be adopted as node w_c9In R_c7On use subtract node w_c4In R_c7On Using more new node w_c9In sample setOn be always adopted as node w_c9In sample setOn it is total using subtracting node w_c4In R_c7On use.

(3-18) more new node w_c9In sample setOn be always adopted as node w_c9In sample setOn total use Subtract node w_c9In R_c7On use, more new node w_c9In R_c7On be adopted asWhereinFor swashing for set S interior joint The size of the union of movable joint point set, i.e., start node collection S is in R_c7On activation total degree, and more new node w again_c9In sample SetOn be always adopted as node w_c9In sample setOn it is total using add node w_c9In R_c7Upper updated use.

Sequence of (3-19) the more new node wc9 in big root heap.

(3-22) judges SUM₁Whether threshold value T is greater than₁, if it is, entering step (4), otherwise updating makes in step (2) Sampling number T₀=2*T₀, and return to (2).According to SSA algorithm, threshold value T₁Calculation formula beWhereinε₂=ε₃=(1-1/e)^-1ε/2, remaining change It measures identical with step (2).

Since there is nonnegativity, monotone nondecreasing and submodularity using function f (S).So being calculated using greedy strategy The sampling of S outWhereinIt is in sample setUpper optimal start node collection S+'s adopts Sample.

In the above process, respectively sample setIn sample R in all node u for sample R calculate use, so Add up use of each node in different sample R afterwards, and each node is in sample set in acquisition influence diagramOn always adopt With (3-1 to 3-6).Then according to node sample setOn total use, big root heap is established, according to node in sample set On always use size descending arrangement, heap top be the maximum node of total adopted value.Heap top node is selected, i.e., using maximum section Point, as new start node.Due to the overlapping of the range used between node, it is in same with the new start node chosen The use of node in sample R will be reduced, and need to be updated the income of these nodes and adjust in big heap of these nodes Position.It cyclically selects heap top node and updates the income of other nodes, until number of nodes is k (3-7 to 3-21).

(5) inversely using sampling, calculate the start node collection S obtained in step (3) in this sampling to influence diagram Middle acquisition uses A_c10, and update the total using SUM of the start node collection S obtained in step (3)₂=SUM₂+A_c10。

Influence diagram is carried out inversely using sampling in this step, and calculates the start node collection S obtained in step (3) at this What is obtained in secondary sampling uses A_c10Process include following sub-step:

Counter c11=1, current distance curDist=curDist+1 is arranged in (5-7)；

(5-12) counter c11=c11+1, enters step (5-8)；

(5-19) selects the c12 node z from node set V_c12, and counter c13=1 is set；

(5-23) judges node z_c13It whether is element in start node collection S, if it is d_S=curDist, it will Node z_c13Node collection Set is added_SIn, it is transferred to step (5-24), otherwise enters step (5-24)；

(5-24) counter c13=c13+1, and return step (5-20)；

(5-25) counter c12=c12+1, and return step (5-18)；

(5-30) returns to the use A of this sampling_c10。

According to SSA algorithm, threshold value T₂Calculation formula isThreshold value T₃Calculation formula isVariable is identical with described in step (2) and (3-22) in formula.

(7) Biased estimator of the start node collection S obtained in judgment step (3)It is first with being obtained in step (6) The unbiased esti-mator of beginning node collection SWhether meetIt is saved if it is satisfied, then directly output is initial As a result, process terminates, process terminates point set S, otherwise updates sampling number T used in step (2)₀=2*T₀, and return Step (2).

To sum up, the propagation model of consciousness proposed by the present invention of comforming, it is contemplated that a variety of competitive relations for influencing to propagate, And for the first time by conformity behavior introduce using influence during be used as decision-making foundation, thus enhance propagation model reasonability and It is scientific.The method of sampling is used for the reverse of propagation model design for the consciousness of comforming, solves the reverse method of sampling of propagation It is not suitable for the problem of influencing sampling more, has widened the scope of application of such methods.Start node selection method incrementally updates Node uses income, largely computes repeatedly to reduce, improves computational efficiency.

As it will be easily appreciated by one skilled in the art that the foregoing is merely illustrative of the preferred embodiments of the present invention, not to The limitation present invention, any modifications, equivalent substitutions and improvements made within the spirit and principles of the present invention should all include Within protection scope of the present invention.

Claims

1. a kind of start node choosing method for realizing maximizing influence, which comprises the following steps:

(1) influence diagram is constructed according to established propagation model, and start node collection S is generated according to the influence diagram of building_C, it is used for Propagate other influences for having competitive relation；

(2) according to preset precision parameter ε and δ, threshold value T is obtained using SSA algorithm₀, T is carried out to the influence diagram obtained in (1)₀ Secondary T reverse using sampling, and that all samplings are obtained₀A sample is put into sample setIn；

(3) according to the sample set returned in step (2)Iteratively k is selected to use edge Income Maximum from influence diagram Node as start node collection S, and obtain the Biased estimator of the use of start node collection SWherein 0 < k < n；

(4) counter c10=0 is set, and the total of the start node collection S obtained in setting steps (3) uses SUM₂=0；

(5) influence diagram inversely calculated the start node collection S obtained in step (3) using sampling and obtained in this sampling What is obtained uses A_c10, and update the total using SUM of the start node collection S obtained in step (3)₂=SUM₂+A_c10；

(6) judge whether counter c10 is less than threshold value T₂And the total of start node collection S uses SUM₂Whether threshold value T is less than₃If It is that c10=c10+1 is set, and return step (5) otherwise exportsAnd (7) are entered step, wherein n table Show the node total number in influence diagram；

(7) Biased estimator of the start node collection S obtained in judgment step (3)With the initial section obtained in step (6) The unbiased esti-mator of point set SWhether meetIf it is satisfied, then directly exporting start node collection Otherwise S updates sampling number T used in step (2) as a result, process terminates₀=2*T₀, and return step (2), wherein ε₁Indicate weight.

2. start node choosing method according to claim 1, which is characterized in that influence diagram be represented as G=(V, E, P), wherein the set of V expression node, E indicate the set of directed edge, and p indicates the letter for the activation probability that all sides are preassigned Number, each edge have pre-assigned activation Probability p (u, a v) ∈ [0,1], indicate the probability of node u activation node v.

3. start node choosing method according to claim 1, which is characterized in that the communication process of propagation model is as follows:

Firstly, different influences activates respective start node, at 0 moment to start communication process.Different influences may be selected Identical node is as start node；

Then, in t moment, become state of activation at this moment in the node that the t-1 moment is activated, and attempt activation not by The neighbor node of activation, if activation trial and success, the node being activated will receive being had an impact for activation node, if activation Attempt failed, then the node not being activated keeps unactivated state, and after this, the node in state of activation will be according to That comforms uses probability defined in function, using one of its received influence, and ultimately becomes adoption status.

That comforms uses function h (u, I_i) define probability it is as follows:

Wherein, h (u, I_i) indicate node u using influence I_iProbability,It is the set of had an impact composition, NA (u, I_i) indicate Having activated node u and having propagated to it influences I_iNeighbor node；

Finally, communication process termination when not new node is activated.

4. start node choosing method according to claim 1, which is characterized in that

Sample in step (2) is the data that single is inversely obtained using sampling, and including every in node set R, node set R The set { d (u) | u ∈ R } of distance between a node w and other nodes, the activation node collection of each node in node set R Set { NA (u) | u ∈ R }, node w and start node collection S_CDistance d_CWith node w by start node collection S_CInterior joint is propagated Influence activation total degree Time_C。

5. start node choosing method as claimed in any of claims 1 to 4, which is characterized in that single in step (2) The secondary reverse process using sampling includes following sub-step:

(2-1) is randomly chosen a node w from influence diagram, and node w and itself distance d (w is arranged_c1)=0, w is put into In node set R, and start node set S is set_CWith node w distance d_CVariable curDist=0, curDist is arranged in=∞ Indicate currently processed node at a distance from node w, abbreviation current distance；

(2-2) judges whether node w is start node collection S_CIn element, if it is, record node w swashed by competitive influence Total degree Time living_CWith start node set S_CWith node w distance d_c=0, it is transferred to step (2-17), is otherwise entered step (2-3)；

Counter c1=1, current distance curDist=curDist+1 is arranged in (2-3), judges whether c1 value is greater than entering for node w Neighbor node sum, if it is, being transferred to step (2-17), is otherwise transferred to step (2-4)；

(2-4) enters neighbor node w according to c1 of node w_c1Activate the probability predicate node w of node w_c1Whether activate node w Success, if it is, record node w_c1With node w distance d (w_c1)=curDist, by node w_c1It is put into it and activates node collection NA(w_c1), by node w_c1It is put into node set V, and is transferred to step (2-5), be otherwise transferred to step (2-6)；

(2-5) judges node w_c1It whether is start node collection S_CIn element, if it is by start node set S_CWith node The distance d of w_CIt is set as current distance curDist, and is transferred to step (2-6), is otherwise transferred to step (2-6)；

(2-6) judge c1 value whether be equal to node w enter neighbor node sum, if it is, being transferred to step (2-8), otherwise turn Enter step (2-7)；

Counter c1=c1+1, and return step (2-4) is arranged in (2-7)；

(2-8) judges node set V and start node collection S_CWhether have whether intersection or node set V are empty set, if so, It then calculates and records node w by node set V and S_CThe total degree Time for the competitive influence activation that intersection interior joint is propagated_C, and It is transferred to step (2-17), otherwise all nodes in node set V are put into node set R, and is transferred to step (2-9)；

Counter c2=1 and current distance curDist=curDist+1 is arranged in (2-9)；

(2-10) selects the c2 node w from node set V_c2, and counter c3=1 is set, judge that the value of counter c3 is It is no to be greater than node w_c2Enter neighbor node sum, if it is, being transferred to step (2-14), be otherwise transferred to step (2-11)；

(2-11) judges node w_c3Whether not in node set R, and according to node w_c2C3 enter neighbor node w_c3Activation Node w_c2Probability predicate node w_c3Activate node w_c2Whether succeed, if it is, setting node w_c3With node w distance d (w_c3) it is equal to curDist, by node w_c2Activation node collection NA (w_c2) in element be put into node w_c3Activate node collection NA (w_c3) in, by node w_c3It is put into node set V_nextIn, and it is transferred to step (2-12), otherwise it is transferred to step (2-13)；

(2-12) judges node w_c3It whether is start node collection S_CIn element, if it is be arranged start node set S_CWith section The distance d of point w_CEqual to curDist, and it is transferred to step (2-13), is otherwise directly transferred to step (2-13)；

(2-13) judges whether the value of counter c3 is greater than node w_c2Enter neighbor node sum, if it is, being transferred to step (2- 14) counter c3=c3+1, and return step (2-11), are otherwise set；

(2-14) judges whether counter c2 value is equal to the node total number in node set V, if yes then enter step (2-15), Counter c2=c2+1, and return step (2-10) are otherwise set；

(2-15) judges node set V_nextWith start node collection S_CWhether intersection or node set V are had_nextIt whether is empty set, If it is, obtaining node w by node set V_nextWith S_CThe total degree for the competitive influence activation that intersection interior joint is propagated Time_C, and it is transferred to step (2-17), otherwise it is transferred to step (2-16)；

(2-16) uses node set V_nextIn all nodes replacement node set V in all nodes, and will be in node set V All nodes be put into node set R, and return step (2-8)；

(2-17) obtain node set R, in node set R the distance between each node and node w set d (u) | u ∈ R }, in node set R the activation node collection of each node set { NA (u) | u ∈ R }, node w and start node collection S_CAway from From d_CAnd node w is by start node collection S_CThe total degree Time for the influence activation that interior joint is propagated_C。

6. start node choosing method as claimed in any of claims 1 to 5, which is characterized in that in step (3) repeatedly It includes following sub-step that generation ground, which selects the k node using edge Income Maximum as the process of start node collection:

(3-1) settingCounter c4=1 is set, and each node is in sample set in setting influence diagramOn always adopt Initial value is 0, and start node collection S is in sample setOn use summation SUM₁=0；

Counter c5=1 is arranged in (3-2)；

(3-4) is from sample R_c5The c6 node w of middle taking-up_c6, judge in R_c5Interior joint w_c6With node w distance d (w_c6) whether small In d_C, if it is, setting node w_c6In R_c5On be adopted as 1, node w_c6It is total using plus 1, be then transferred to step (3-5)； Otherwise, node w is set_c6In R_c5On be adopted asUpdate w_c6In sample setOn be always adopted as Node w_c6In sample setOn it is total using addingThen it is transferred to step (3-5)；Wherein NA (w_c6) And Time_CIt is to respectively indicate sample R_c5In activation node collection and start node collection S_CThe influence that interior joint is propagated activates sample R_c5In node w total degree.

(3-5) judges whether counter c6 is less than | R_c5|, if it is, setting c6=c6+1, then return step (3-4), no Then it is transferred to step (3-6)；

(3-6) judges whether counter c5 is less thanIf it is, setting c5=c5+1, and return step (3-3), otherwise turn Enter step (3-7)；

(3-8) judges counter C₄Whether preset threshold k is less than, if it is, taking outOn always use maximum node w_c4, by node w_c4Start node collection S is added, SUM is set₁=SUM₁+A(w_c4), wherein A (w_c4) it is node w_c4?On it is total Using, and the deletion of node w from big root heap_c4, and step (3-9) is gone to, otherwise obtain start node set S has with what it was used Estimation partiallyThen it is transferred to step (3-22).

(3-9) is from sample setMiddle take out all includes node w_c4Sample, formed sample setAnd counter is set C7=1；

(3-10) takes outIn the c7 sample R_c7, and judgement sample R_c7Whether it is marked as determining use, if it is turns To step (3-21), step (3-11) is otherwise gone to；

(3-11) judges w_c4With sample R_c7In node w distance d (w_c4) whether it is less than R_c7In start node set S_CWith section The distance d of point w_C, if it is marker samples R_c7It is used to determine, and goes to step (3-12), otherwise go to step (3-15)；

Counter c8=1 is arranged in (3-12)；

(3-13) takes out sample R_c7In the c8 node, update w_c8In sample setOn be always adopted as node w_c8In sample This setOn it is total using subtracting node w_c8In R_c7On use, and more new node w_c8Sequence in big root heap；

(3-14) judges whether counter c8 is less than | R_c7|, if it is, setting c8=c8+1, and step (3-13) is gone to, it is no Then, step (3-21) is gone to；

Counter c9=1 is arranged in (3-15)；

(3-16) takes out sample R_c7In the c9 node w_c9, judge w_c9With sample R_c7In node w distance d (w_c9) whether Less than R_c7In start node set S_CWith node w distance d_C, step (3-17) is if it is gone to, step is otherwise gone to (3-18)；

(3-17) more new node w_c9In R_c7On be adopted as node w_c9In R_c7On use subtract node w_c4In R_c7On use, More new node w_c9In sample setOn be always adopted as node w_c9In sample setOn it is total using subtracting node w_c4In R_c7 On use；

(3-18) more new node w_c9In sample setOn be always adopted as node w_c9In sample setOn it is total using subtracting Node w_c9In R_c7On use, more new node w_c9In R_c7On be adopted asWhereinFor swashing for set S interior joint The size of the union of movable joint point set, i.e., start node collection S is in R_c7On activation total degree, and more new node w again_c9In sample SetOn be always adopted as node w_c9In sample setOn it is total using add node w_c9In R_c7Upper updated use；

Sequence of (3-19) the more new node wc9 in big root heap；

(3-20) judges whether counter c9 is less than | R_c7|, if it is, setting c9=c9+1, and step (3-16) is gone to, it is no Then, step (3-21) is gone to；

(3-21) judges whether counter c7 is less thanIf it is, setting c7=c7+1, and step (3-10) is gone to, it is no Then, step (3-8) is gone to；

(3-22) judges SUM₁Whether threshold value T is greater than₁, if it is, entering step (4), otherwise update used in step (2) Sampling number T₀=2*T₀, and return to (2).

7. start node choosing method as claimed in any of claims 1 to 6, which is characterized in that start node collection S Using f (S) be using following formula obtain:

Wherein, S is start node collection, and n is nodes sum, A_iIt is the use income of i-th sampling,It is practical sample This number,It isIn all samples average use.

8. start node choosing method as claimed in any of claims 1 to 7, which is characterized in that right in step (5) Influence diagram carries out inversely using sampling, and calculates the use that the start node collection S obtained in step (3) is obtained in this sampling A_c10Process include following sub-step:

(5-1) is randomly chosen a node z from influence diagram, and node z is put into node set R, start node collection S_CWith section The distance d of point z_C=∞, start node collection S and node z distance d_S=∞, current distance curDist=0, start node collection S A is used in this sampling_c10=0；

(5-2) judges whether node z is start node collection S_CIn element, if it is d_C=curDist is transferred to step (5- 3) (5-3), is otherwise entered step；

(5-3) judges whether node z is element in start node collection S, if it is d_S=curDist is transferred to step (5- 4) (5-4), is otherwise entered step；

(5-6) judges whether to be equal to curDist, and if it is start node collection S uses A what this was sampled_c10=1, it is transferred to step Suddenly (5-30) otherwise enters step (5-7)；

Counter c11=1, current distance curDist=curDist+1 is arranged in (5-7)；

(5-8) judge c11 value whether be greater than node z enter neighbor node sum, if it is, being transferred to step (5-13), otherwise It is transferred to step (5-9)；

(5-9) enters neighbor node z according to c11 of node z_c11Activate the probability predicate node z of node z_c11Activating node z is No success, if it is, by node z_c11It is put into it and activates node collection NA (z_c11), by node z_c11It is put into node set V, and It is transferred to step (5-10), is otherwise transferred to step (5-11)；

(5-10) judges node z_c11It whether is start node collection S_CIn element, if it is d_C=curDist, by node z_c11Node collection Set is added_CIn, it is transferred to step (5-11), otherwise enters step (5-11)；

(5-11) judges node z_c11It whether is element in start node collection S, if it is d_S=curDist, by node z_c11Node collection Set is added_SIn, it is transferred to step (5-12), otherwise enters step (5-12)；

(5-12) counter c11=c11+1, enters step (5-8)；

(5-13) judges d_CWhether it is equal to curDist, is if it is transferred to step (5-14), otherwise enters step (5-15)；

(5-15) judges d_SWhether curDist is equal to, if it is start node collection S uses A what this was sampled_c10=1, it is transferred to Step (5-30), otherwise enters step (5-16)；

(5-16) if node set V is not empty set, and d_SNot equal to curDist, and d_CNot equal to curDist, then by node collection All nodes closed in V are put into node set R, are transferred to step (5-17), are otherwise transferred to step (5-27)；

(5-19) selects the c12 node z from node set V_c12, and counter c13=1 is set；

(5-20) judges whether c13 value is greater than node z_c12Enter neighbor node sum, it is no if it is, be transferred to step (5-25) Then it is transferred to step (5-21)；

(5-21) judges node z_c13Whether not in node set R, and according to node z_c12C13 enter neighbor node z_c13 Activate node z_c12Probability predicate node z_c13Activate node z_c12Whether succeed, if it is, by node z_c12Activation node Collect NA (w_c2) in element be put into node z_c13Activate node collection NA (w_c3) in, by node z_c13It is put into node set V_nextIn, and It is transferred to step (5-22), is otherwise transferred to step (5-24)；

(5-22) judges node z_c13It whether is start node collection S_CIn element, if it is d_C=curDist, by node z_c13Node collection Set is added_CIn, it is transferred to step (5-23), otherwise enters step (5-23)；

(5-23) judges node z_c13It whether is element in start node collection S, if it is d_S=curDist, by node z_c13Node collection Set is added_SIn, it is transferred to step (5-24), otherwise enters step (5-24)；

(5-24) counter c13=c13+1, and return step (5-20)；

(5-25) counter c12=c12+1, and return step (5-18)；

(5-26) uses node set V_nextIn all nodes replacement node set V in all nodes, and return step (5- 16)；

(5-27) judges d_CWhether it is equal to curDist, is if it is transferred to step (5-28), otherwise enters step (5-29)；

(5-30) returns to the use A of this sampling_c10。

9. a kind of start node selecting system for realizing maximizing influence characterized by comprising

First module for constructing influence diagram according to established propagation model, and generates initial section according to the influence diagram of building Point set S_C, for propagating other influences for having competitive relation；

Second module, for obtaining threshold value T using SSA algorithm according to preset precision parameter ε and δ₀, the first module is obtained Influence diagram carries out T₀Secondary T reverse using sampling, and that all samplings are obtained₀A sample is put into sample setIn；

Third module, the sample set for being returned according to the second moduleIteratively k are selected to use edge from influence diagram The node of Income Maximum obtains the Biased estimator of the use of start node collection S as start node collectionWherein 0 < k<n；

4th module, for counter c10=0 to be arranged, and the total of start node collection S that the acquisition of third module is arranged uses SUM₂ =0；

5th module, for, inversely using sampling, calculating the start node collection S of third module acquisition at this to influence diagram progress What is obtained in sampling uses A_c10, and the total of start node collection S for updating the acquisition of third module uses SUM₂=SUM₂+A_c10；

6th module, for judging whether counter c10 is less than threshold value T₂And the total of start node collection S uses SUM₂Whether it is less than Threshold value T₃, c10=c10+1 is if it is set, and returns to the 5th module, otherwise export the use of start node collection S has partially EstimationSubsequently into the 7th module, wherein n indicates the node total number in influence diagram；

7th module, the Biased estimator of the start node collection S for judging the acquisition of third moduleIt is obtained with the 6th module Start node collection S unbiased esti-matorWhether meetIf it is satisfied, then directly output is first Otherwise beginning node collection S updates the sampling number T that the second module uses as a result, process terminates₀=2*T₀, and return to the second mould Block, wherein ε₁Indicate precision parameter.