CN106682770B

CN106682770B - Dynamic microblog forwarding behavior prediction system and method based on friend circle

Info

Publication number: CN106682770B
Application number: CN201611151738.3A
Authority: CN
Inventors: 柳靓云; 肖云鹏; 杜江; 刘宴兵; 张克毅; 李茜曦; 李晓娟; 宋晨光
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2016-12-14
Filing date: 2016-12-14
Publication date: 2020-08-04
Anticipated expiration: 2036-12-14
Also published as: CN106682770A

Abstract

The invention discloses a dynamic microblog forwarding behavior prediction system and method based on a friend circle, and belongs to the field of social network information analysis.

Description

Dynamic microblog forwarding behavior prediction system and method based on friend circle

Technical Field

The invention relates to the field of social network information analysis, in particular to a dynamic microblog forwarding behavior prediction model constructed according to social network user behavior analysis.

Background

With the popularization of the Web 2.0 concept and the growing maturity of related technologies, social networking sites such as Twitter, Facebook and Sina Weibo have come to have a great influence on people's lives. People update their status or post broadcasts on these sites to show their daily lives, express their thoughts, or share information with friends. Social networking sites provide a very convenient platform for users to communicate with each other and to express opinions and views. Modeling and predicting user behavior on social networking sites has important social significance and application value in fields such as security and business, and has gradually attracted researchers' attention in recent years.

Existing research on predicting forwarding behavior includes text-based analysis, user-influence-based analysis, network-structure-based analysis and the like. Text-based analysis mainly uses a probabilistic topic model to analyze texts and predicts users' forwarding behavior from the similarity between text topics and user interests, for example by constructing a topic-based model of who will participate in a network community. User-influence-based analysis builds prediction models on the social influence of users in the social network.

A user's information forwarding behavior results from the combined action of multiple factors. The prior art, however, does not account for this complexity: it predicts forwarding behavior from a single aspect only, so the prediction results are inaccurate and the importance of each feature influencing user behavior cannot be evaluated.

Disclosure of Invention

The invention addresses the problems existing in the prior art. Aiming at problems in information propagation such as the dynamic characteristics of the network, the characterization of user behavior and the evaluation of the importance of user features, it provides a dynamic microblog forwarding behavior prediction system and method based on a friend circle, which can effectively estimate whether a message will be forwarded and can find, as early as possible, microblogs that may cause large-scale outbreaks. The technical scheme of the invention is as follows:

A dynamic microblog forwarding behavior prediction system based on a friend circle comprises a user behavior data source obtaining module, an attribute extraction module, a model building module and a prediction analysis module. The user behavior data source obtaining module is used for obtaining user relationship and user behavior data in a social network and taking the fans of the user who posts a microblog (the posting user, also rendered as "text user") as candidate users (also rendered as "alternative users"). The model building module is used for building a microblog forwarding behavior prediction model for the candidate user, in which the forwarding behavior is mainly determined by three parameters, namely the interest difference τ between the candidate user and the friend, the activity s of the candidate user in the period when the microblog is posted, and the network influence r of the friend, and for fitting the model parameters. The prediction analysis module is used for predicting whether the candidate user will forward the microblog according to the fitted parameters and the posting situation of the posting user at any time t.

Further, the attribute extraction module extracts a user interest vector according to the difference of interests among users, including: obtaining the attention list of each user by using the attention behavior attributes of the users, and defining the interest vector of user v as

$$I(v) = [e_{v,1}, e_{v,2}, \ldots, e_{v,|E_v|}]$$

where $e_{v,u}$ ($u = 1, 2, \ldots, |E_v|$) denotes a user in the attention list of user v, i.e., a user that v is interested in, and $|E_v|$ is the total number of users in the attention list of user v.

Further, the attribute extraction module extracts a user state vector for the activity of the candidate user, including: acquiring the microblog publishing activity and the microblog forwarding activity of each user within a period of time by using the interaction behavior attribute and the time attribute of the user, and defining the activity state vector of user v as

$$L(v,t) = (x_{v,t,1}, x_{v,t,2})$$

where $x_{v,t,1}$ represents the microblog publishing activity of user v on time slice t and $x_{v,t,2}$ represents the microblog forwarding activity of user v on time slice t. Both are computed from the number of microblogs published by user v in time slice t, the number of microblogs forwarded in time slice t, and the average number of microblogs published by user v per day.

Further, the attribute extraction module extracts a user feature vector for the influence of the posting user, including: obtaining the out-degree, the in-degree and the local clustering coefficient of each user node by using the network topology attribute, and defining the influence feature vector of user v as

$$F(v) = (d_{v,1}, d_{v,2}, d_{v,3})$$

where $d_{v,1}$ is the number of fans of user v, $d_{v,2}$ is the number of friends of user v, and $d_{v,3}$ is the local clustering coefficient of user v, computed from $Ng_v$, the set of neighboring nodes of node v, and $edg_{ij}$, the connections between those neighboring nodes.

The microblog forwarding behavior prediction model works as follows. For the interest difference among users, user interest vectors are extracted from user behavior and user relationship information, and all users are trained with an LDA model to obtain the users' interest topic distribution. For the activity of the candidate user, user state vectors on each time slice are extracted from user behavior and time information; because the elements of the user state vector are continuous values, the LDA model is first improved with a Gaussian distribution, and all users are then trained with the improved LDA model to obtain the users' active state distribution on each time slice. For the influence of the posting user, user feature vectors are extracted from network structure information; as with the state vector, the LDA model is improved with a Gaussian distribution and all users are trained with the improved model to obtain the users' network role distribution. Finally, the whole prediction model is trained on the users' historical forwarding data, according to whether the interests of the users are consistent, the active state of the candidate user on each time slice and the network role of the posting user, to obtain the users' forwarding behavior distribution.

Further, obtaining the users' interest topic distribution with the microblog forwarding behavior prediction model further includes: on the basis of the user relationship network, additionally using the interaction behaviors among users to weight the interest vector I(v) of each user, which gives the weighted user interest vector

$$[w_{v,1}, w_{v,2}, \ldots, w_{v,N_v}]$$

where $w_{v,n}$ ($n = 1, 2, \ldots, N_v$) is the interaction object of the n-th interaction behavior of user v and $N_v$ is the total number of interactions of user v. Training all users with an LDA model then yields the users' interest topic distribution.

Further, obtaining the users' active state distribution on each time slice further includes: because the publishing activity $x_{v,t,1}$ and the forwarding activity $x_{v,t,2}$ of a user are continuous variables, the LDA model is improved with a Gaussian distribution, so that the values of the publishing activity and the forwarding activity each obey a different Gaussian distribution:

$$p(x_{v,t,m} \mid s) = \frac{1}{\sqrt{2\pi}\,\sigma_{s,m}} \exp\!\left(-\frac{(x_{v,t,m} - \mu_{s,m})^2}{2\sigma_{s,m}^2}\right)$$

where $x_{v,t,m}$ is the m-th attribute value of user v on time slice t, and $\mu_{s,m}$ and $\sigma_{s,m}$ are respectively the mean and the standard deviation of the m-th attribute when the user's active state is s.

Further, by a time slicing method, each day is cut into 4 time periods starting from 0:00, i.e. t = 1, 2, 3, 4, and the active state of the user is divided into three levels, namely very active, generally active and inactive. Training all users with the improved LDA model then yields the users' active state distribution on each time slice.

Further, the user nodes are divided into three role types based on the network topology, namely opinion leaders, information propagators and common users. After the LDA model is improved with a Gaussian distribution, all users are trained with the improved model to obtain the users' network role distribution.

A dynamic microblog forwarding behavior prediction method based on a friend circle, using the above system, comprises the following steps:

acquiring user relationship and user behavior data in a social network, and taking the fans of the posting user as candidate users; obtaining three user vectors, from the three aspects of interest difference among users, activity of the candidate user and influence of the posting user, as the input of the prediction model;

constructing a microblog forwarding behavior prediction model, and fitting the model parameters;

and inputting the fitted parameters and the posting situation of the posting user at any time t into the prediction model to predict whether the candidate user will forward the microblog.

The invention has the following advantages and beneficial effects:

The invention provides a dynamic microblog forwarding behavior prediction method based on a friend circle. First, in view of the diversity of a single user's interest, activity and influence, the basic idea and method of the LDA topic model, which can handle "one word with multiple meanings and multiple words with one meaning", are used to model and analyze user behavior and obtain the topic distributions related to user behavior. Second, considering that the elements of the user state vector and the user feature vector are continuous values, the LDA model is improved with a Gaussian distribution to discover the user's activity and influence. Finally, in view of the change of user activity over time, time discretization and time slicing are used to propose an improved-LDA dynamic microblog forwarding behavior prediction model, which dynamically monitors user activity and improves the accuracy of the prediction model.

The invention provides a dynamic microblog forwarding behavior prediction method based on a friend circle, aiming at the problems of network dynamic characteristics, user behavior characterization, user characteristic importance evaluation and the like in information transmission, and the user forwarding behavior can be accurately predicted. According to the prediction result, whether the message can be forwarded or not and the forwarding scale of the message can be effectively estimated, the microblog which possibly causes large-scale outbreak can be found as soon as possible, and the method has great significance for microblog burstiness detection and microblog influence evaluation.

Drawings

FIG. 1 is a flowchart of a method for predicting a dynamic microblog forwarding behavior based on a friend circle according to an embodiment of the present invention;

FIG. 2 is a block diagram of a predictive model of the present invention;

FIG. 3 is a flow chart of the predictive model of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be described in detail and clearly with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention.

The technical solution of the present invention for solving the above technical problems is as follows.

Because information propagation in a social network is mainly driven by interest difference, user historical behavior and network structure, the invention starts from three aspects, namely user interest, activity and influence. It uses the basic idea and method of the LDA topic model to model and analyze user behavior and obtain the topic distributions related to user behavior. To handle the continuous variables present in the user attributes, it improves the LDA model with a Gaussian distribution to discover the user's activity and influence. Finally, in view of the change of user activity over time, it uses time discretization and time slicing to propose an improved-LDA dynamic microblog forwarding behavior prediction model, so that user activity can be dynamically monitored, the user's forwarding behavior can be accurately predicted, and the key factors influencing user forwarding can be found.

Specifically, a social relationship network $G = (V, E, Y)$ is given, where V represents all users in the network and $|V| = N$ is the number of users, E represents the relationships among all users and is an N × N matrix, and Y represents the series of past behaviors of the users, with $|Y| = I$ the total number of user behavior records. A probabilistic generative model is designed that analyzes each user using the user relationship and user behavior information in the social network, taking the influence of timeliness into account. Through 4 probabilistic generative processes, the interest distribution, the activity level distribution, the network role distribution and the forwarding behavior distribution of each user are obtained, and the user's forwarding behavior towards the microblogs of the users he or she follows within a period of time is predicted from these 4 distributions.

Fig. 1 shows a general flow chart of the present invention, which mainly includes: the system comprises a data acquisition module, an attribute extraction module, a model construction module and a prediction analysis module.

The detailed implementation of the present invention is described in detail below.

S1: Acquire the data source. The acquired data comprises the user attention (follow) relationship network and the behavior information of all users in the network; the user behavior comprises the microblogs published and forwarded by the users and the times at which they were published and forwarded. Specifically, the following method can be used (the data can also be obtained by conventional methods in the prior art):

S11: Acquire the raw data. The user attention relationship network and the past behavior data of all users in the network are acquired. The raw data can be obtained through a public social network API or by directly downloading an existing data source, and can be supplemented by methods such as web crawling.

S12: Perform simple data cleaning, such as deleting duplicate data and cleaning up invalid nodes. Simple cleaning makes most of the data usable for analysis.

S13: Time-slice the data and determine the attributes of the user on each time slice. The user attributes here specifically refer to the user's publishing activity and forwarding activity. Because a user's forwarding behavior is closely related to the user's daily schedule, each day is sliced into time periods of a preset length (such as 6 hours) according to the users' daily schedule. For a given time period t, the user's active state in that period is determined from the user attributes, so as to predict whether the user will forward a friend's microblog.
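As an illustration of the time slicing in S13, the following minimal sketch maps a timestamp to a time slice index t, assuming slices of a fixed preset length (6 hours by default) that start at 0:00. The function names `time_slice_index` and `count_by_slice` and the sample timestamps are hypothetical and are not part of the invention.

```python
from collections import Counter
from datetime import datetime


def time_slice_index(timestamp: datetime, slice_hours: int = 6) -> int:
    """Return the 1-based time slice t that a timestamp falls into.

    Assumes the day is cut into equal slices starting at 0:00, so
    slice_hours must divide 24 evenly (6 hours gives t = 1, 2, 3, 4).
    """
    if slice_hours <= 0 or 24 % slice_hours != 0:
        raise ValueError("slice_hours must be a positive divisor of 24")
    return timestamp.hour // slice_hours + 1


def count_by_slice(timestamps, slice_hours: int = 6) -> Counter:
    """Count how many events (e.g. posts by one user) fall into each slice."""
    return Counter(time_slice_index(ts, slice_hours) for ts in timestamps)


# Illustrative data only: three posts by one user on the same day.
posts = [
    datetime(2016, 12, 14, 1, 30),
    datetime(2016, 12, 14, 7, 5),
    datetime(2016, 12, 14, 23, 59),
]
slice_counts = count_by_slice(posts)
```

Per-slice counts of this kind are the raw input from which the publishing and forwarding activity on each time slice can then be derived.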

S2: Extract the relevant attributes. Considering that forwarding behavior in a social network is mainly driven by three aspects, namely interest difference, user historical behavior and network structure, the invention extracts the relevant attributes, such as attention behavior attributes, interaction behavior attributes, time attributes and network structure attributes, from the three aspects of user interest, activity and influence. These attributes may be adjusted as appropriate to the characteristics of the data.

After all the attributes of the three aspects have been extracted, the corresponding user vectors are obtained as follows.

S21: Extract the user interest vector. Considering that a user follows the users he or she is interested in, the attention list of each user is obtained by using the attention behavior attributes of the users, and the interest vector of user v is defined as:

$$I(v) = [e_{v,1}, e_{v,2}, \ldots, e_{v,|E_v|}]$$

where $e_{v,u}$ ($u = 1, 2, \ldots, |E_v|$) denotes a user in the attention list of user v, i.e., a user that v is interested in, and $|E_v|$ is the total number of users in that list. For example, if the users in the attention list of user a are b, c, d and e, then $I(a) = [b, c, d, e]$.

S22: Extract the user state vector. According to the users' daily schedule, each day is sliced into time periods of a preset length (such as 6 hours). The microblog publishing activity and the microblog forwarding activity of each user on each time slice are obtained by using the interaction behavior attribute and the time attribute of the users, and the state vector of user v is defined as:

$$L(v,t) = (x_{v,t,1}, x_{v,t,2})$$

where $x_{v,t,1}$ represents the microblog publishing activity of user v on time slice t and $x_{v,t,2}$ represents the microblog forwarding activity of user v on time slice t. Both are computed from the number of microblogs published by user v in time slice t, the number of microblogs forwarded in time slice t, and the average number of microblogs published by user v per day. For example, suppose user a publishes 3 microblogs on the 1st time slice, 2 of which are forwarded microblogs, and publishes 5 microblogs per day on average; the state vector L(a, 1) of user a is then computed from the counts 3, 2 and 5.
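The exact normalization used for the two activity values is not recoverable from the text, so the following is only a sketch under an explicit assumption: each per-slice count is divided by the user's average daily number of published microblogs. The function name `activity_state_vector` is hypothetical.

```python
def activity_state_vector(published, forwarded, daily_average):
    """Return (publishing activity, forwarding activity) for one time slice.

    ASSUMPTION: both activities are per-slice counts normalized by the
    user's average number of microblogs published per day. The source
    names these three quantities but does not show the formula.
    """
    if daily_average <= 0:
        raise ValueError("daily_average must be positive")
    return (published / daily_average, forwarded / daily_average)


# Counts from the example: 3 published in slice 1, 2 of them forwarded,
# 5 microblogs per day on average.
state_a = activity_state_vector(published=3, forwarded=2, daily_average=5)
```

Normalizing by the daily average makes the values comparable across users who post at very different rates, which is one plausible reason for including the average in the definition.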

S23: Extract the user feature vector. Because the position of a user node in the network has a great influence on information propagation, the out-degree, the in-degree and the local clustering coefficient of each user node are obtained by using the network topology attribute, and the feature vector of user v is defined as:

$$F(v) = (d_{v,1}, d_{v,2}, d_{v,3})$$

where $d_{v,1}$ is the number of fans of user v, $d_{v,2}$ is the number of friends of user v, and $d_{v,3}$ is the local clustering coefficient of user v, computed from $Ng_v$, the set of neighboring nodes of node v, and $edg_{ij}$, the connections between those neighboring nodes. For example, if user a has 30 fans and 20 friends, 40 neighbor nodes in total, and 200 connecting edges among those neighbor nodes, then the feature vector of user a is $F(a) = (30, 20, d_{a,3})$, with $d_{a,3}$ obtained from the 40 neighbors and the 200 edges among them.
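The local clustering coefficient can be sketched as follows. The text does not show which convention is used, so the code offers both common ones as an assumption: for an undirected neighborhood the number of possible edges among k neighbors is k(k-1)/2, and for a directed one it is k(k-1). The function names are hypothetical.

```python
def local_clustering_coefficient(num_neighbors, num_edges, directed=False):
    """Ratio of existing to possible edges among a node's neighbors.

    ASSUMPTION: the standard definition is used. With k neighbors there
    are k*(k-1)/2 possible undirected edges, or k*(k-1) directed ones.
    """
    k = num_neighbors
    if k < 2:
        return 0.0  # fewer than two neighbors: no edge is possible
    possible = k * (k - 1) if directed else k * (k - 1) / 2
    return num_edges / possible


def influence_feature_vector(fans, friends, num_neighbors, num_edges,
                             directed=False):
    """Build (number of fans, number of friends, clustering coefficient)."""
    coefficient = local_clustering_coefficient(num_neighbors, num_edges, directed)
    return (fans, friends, coefficient)


# Numbers from the example: 30 fans, 20 friends, 40 neighbors, 200 edges.
feature_a = influence_feature_vector(30, 20, 40, 200)
```

Under the undirected convention the example gives 200 / 780, and under the directed convention 200 / 1560; which of the two the invention intends is not stated.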

S3: Establish the prediction model. Fig. 2 shows a block diagram of the prediction model of the present invention. Whether a candidate user will forward a friend's microblog is mainly determined by the interest difference τ between the candidate user and the friend, the activity s of the candidate user in the period when the microblog is posted, and the network influence r of the friend.

The prediction model predicts whether a candidate user will forward a given microblog of a friend as follows.

For the interest difference, the interest vectors I(v) of the users are extracted from user behavior and user relationship information, and all users are trained with an LDA model to obtain the users' interest community distribution Φ, where $\phi_v$ represents the interest community probability distribution of user v.

For user activity, the state vectors L(v, t) of the users on each time slice are extracted from user behavior and time information. Because the elements of the user state vector are continuous values, the LDA model is first improved with a Gaussian distribution, and all users are then trained with the improved LDA model to obtain the users' active state distribution on each time slice, $\theta^{(t)}$, where $\theta_v^{(t)}$ represents the active state probability distribution of user v on time slice t.

For the network influence of the users, the feature vectors F(v) of the users are extracted from network structure information. As with the user state vector, the LDA model is improved with a Gaussian distribution and all users are trained with the improved LDA model to obtain the users' network role distribution η, where $\eta_v$ represents the network role probability distribution of user v.

Finally, the whole prediction model is trained on the users' interest community distribution Φ, the users' active state distribution on each time slice $\theta^{(t)}$, the users' network role distribution η and the users' historical forwarding data Y, to obtain the users' forwarding behavior distribution ρ, where $\rho_{\tau,s,r}$ represents the probability that the candidate user forwards the microblog when the interest difference among the users is τ, the candidate user is in active state s and the posting user plays network role r, and $1 - \rho_{\tau,s,r}$ is the probability of not forwarding. The solution of the model, and how to predict the forwarding behavior of the candidate users on the respective time slices, are described in detail in the following section.

FIG. 3 is a flow chart of the predictive model of the present invention.

S31: Obtain the users' interest community distribution.

A friend relationship only indicates that interaction between users is possible; it tends to be static and cannot truly reflect the strength of information interaction between users. In order to find active interest communities, the interaction behaviors among users are additionally used on the basis of the user relationship network to weight the interest vector I(v) of each user, where the interaction behavior specifically refers to forwarding behavior. The resulting weighted interest vector of user v is:

$$[w_{v,1}, w_{v,2}, \ldots, w_{v,N_v}]$$

where $w_{v,n}$ ($n = 1, 2, \ldots, N_v$) is the interaction object of the n-th interaction behavior of user v and $N_v$ is the total number of interactions of user v. For example, if user a interacts with user b 2 times and with user c 4 times, the weighted interest vector of user a is [b, b, c, c, c, c].
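The interaction weighting can be sketched as repeating each followed user once per interaction, so that frequently forwarded friends carry more weight when the vector is fed to a topic model. The function name `weighted_interest_vector` and the dictionary input format are assumptions made for illustration.

```python
def weighted_interest_vector(interaction_counts):
    """Expand {friend: number of interactions} into a weighted interest vector.

    Each friend appears once per interaction, so the vector length equals
    the user's total number of interactions N_v.
    """
    vector = []
    for friend, count in interaction_counts.items():
        vector.extend([friend] * count)
    return vector


# Example from the text: user a interacts with b twice and with c four times.
weighted_a = weighted_interest_vector({"b": 2, "c": 4})
```

Here `weighted_a` has length 6, the total number of interactions of user a; a followed user with no interactions would not appear at all, which is how the weighting favors active relationships over static ones.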

Given C as the number of interest communities, all users are trained with an LDA model. The generative process is as follows:

for each user v:

1. sample an edge distribution $\xi \sim \mathrm{Dir}(\lambda)$, where λ is the parameter of the Dirichlet distribution;

2. sample a user interest community distribution $\phi_v \sim \mathrm{Dir}(\alpha)$, where α is the parameter of the Dirichlet distribution;

3. for each edge $e_{v,i}$ of the user:

1) sample an interest community $z_{v,i}$ from $\phi_v$;

2) sample an edge $e_{v,i}$ from $\xi_{z_{v,i}}$.

Here $\phi_v$ represents the interest community distribution of user v, and $\xi_c$ represents the edge distribution of interest community c.

In this probabilistic generative model, modeling user behavior amounts to computing the users' interest community distribution Φ and the edge distribution ξ of the interest communities. Φ and ξ are solved by Gibbs sampling; at each iteration they are estimated from the interaction counts, which in the standard smoothed-count form of LDA reads:

$$\phi_{v,c} = \frac{n_{v,c} + \alpha}{|N_v| + C\alpha}, \qquad \xi_{c,e} = \frac{n_{c,e} + \lambda}{n_c + |E|\lambda}$$

where $\phi_{v,c}$ is the probability of user v being in interest community c, C is the total number of interest communities, $n_{v,c}$ is the number of times user v interacts with followed users in interest community c, and $|N_v|$ is the total number of interactions between user v and his or her friends; $\xi_{c,e}$ is the probability of user e appearing in interest community c, $|E|$ is the total number of edges in the network, $n_{c,e}$ is the number of interactions of user e in interest community c, and $n_c$ is the total number of interactions in interest community c.
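A minimal sketch of the count-based estimate used inside a Gibbs iteration follows. It assumes the standard Dirichlet-smoothed form of LDA, in which a probability is (count + prior) / (total + size × prior); that the invention uses exactly this form is an assumption. The function name `smoothed_distribution` and the sample counts are hypothetical.

```python
def smoothed_distribution(counts, prior):
    """Dirichlet-smoothed estimate: (n_k + prior) / (sum(n) + K * prior).

    ASSUMPTION: this is the standard LDA estimator. It applies both to a
    user's counts per interest community (prior = alpha) and to a
    community's counts per edge target (prior = lambda).
    """
    if prior <= 0:
        raise ValueError("prior must be positive")
    total = sum(counts)
    size = len(counts)
    return [(n + prior) / (total + size * prior) for n in counts]


# Illustrative counts: one user's interactions spread over C = 3 communities.
phi_v = smoothed_distribution([4, 2, 0], prior=0.5)

# Illustrative counts: interactions of 4 users inside one community.
xi_c = smoothed_distribution([3, 1, 0, 0], prior=0.1)
```

The prior keeps every probability strictly positive, so a community in which a user has no recorded interaction still receives a small non-zero weight.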

S32: Obtain the users' active state distribution on each time slice.

A user's forwarding behavior is closely related to the user's daily schedule. Each user has relatively fixed hours for going online; during those hours the user is active and is likely to post and reply, while at other times the user rarely takes part in spreading topics. Therefore, by the time slicing method, each day is cut into 4 periods (t = 1, 2, 3, 4) starting from 0:00, which discretizes the vector data in time. Second, because the publishing activity $x_{v,t,1}$ and the forwarding activity $x_{v,t,2}$ of a user are continuous variables, the LDA model is improved with a Gaussian distribution, so that the values of the publishing activity and the forwarding activity each obey a different Gaussian distribution:

$$p(x_{v,t,m} \mid s) = \frac{1}{\sqrt{2\pi}\,\sigma_{s,m}} \exp\!\left(-\frac{(x_{v,t,m} - \mu_{s,m})^2}{2\sigma_{s,m}^2}\right)$$

The invention sets the number of active state levels to S = 3, namely very active, generally active and inactive, and trains all users with the improved LDA model. The generative process is as follows:

for each user v:

1. sample the distribution of the user's active state on time slice t, $\theta_v^{(t)} \sim \mathrm{Dir}(\beta)$, where β is the parameter of the Dirichlet distribution;

2. sample an active level s from $\theta_v^{(t)}$;

3. for each attribute m of user v:

1) sample an attribute value $x_{v,t,m} \sim N(\mu_{s,m}, \sigma_{s,m})$.

Here $\theta_v^{(t)}$ represents the active state distribution of user v on time slice t.

In this probabilistic generative model, modeling the user state attributes amounts to computing the users' active state distribution on each time slice, $\theta^{(t)}$, and the Gaussian distribution N(μ, σ) that each attribute value of the user obeys. $\theta^{(t)}$, μ and σ are solved with the EM algorithm, in which each iteration has two steps:

E-step: update $\theta_{v,s}^{(t)}$.

M-step: update $\mu_{s,m}$ and $\sigma_{s,m}$.

Here $\theta_{v,s}^{(t)}$ is the probability that user v is in active state s on time slice t, S is the number of state levels, M is the number of user state attributes, $x_{v,t,m}$ is the m-th attribute value of user v on time slice t, and $\mu_{s,m}$ and $\sigma_{s,m}$ are respectively the mean and the standard deviation of the m-th attribute when the user's active state is s.
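The update formulas of the two EM steps are not shown in the text. The sketch below therefore assumes the usual EM updates for a mixture of independent Gaussians: the E-step computes a posterior over the S states from the per-attribute densities, and the M-step recomputes each mean and standard deviation as a posterior-weighted statistic. All function names, and the `min_sigma` floor, are hypothetical.

```python
import math


def gaussian_pdf(x, mu, sigma):
    """Density of a univariate Gaussian with mean mu and std deviation sigma."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def e_step(x, prior, mu, sigma):
    """Posterior over the S states for one attribute vector x of length M.

    ASSUMPTION: attributes are independent given the state, so the state
    likelihood is the product of the per-attribute Gaussian densities.
    """
    weights = []
    for s, p in enumerate(prior):
        w = p
        for m, value in enumerate(x):
            w *= gaussian_pdf(value, mu[s][m], sigma[s][m])
        weights.append(w)
    total = sum(weights)
    if total == 0.0:
        return list(prior)  # fall back to the prior if all densities underflow
    return [w / total for w in weights]


def m_step(samples, posteriors, min_sigma=1e-3):
    """Posterior-weighted mean and standard deviation per state and attribute."""
    num_states = len(posteriors[0])
    num_attrs = len(samples[0])
    mu = [[0.0] * num_attrs for _ in range(num_states)]
    sigma = [[min_sigma] * num_attrs for _ in range(num_states)]
    for s in range(num_states):
        weight = sum(p[s] for p in posteriors)
        if weight == 0.0:
            continue  # no sample assigned to this state; keep defaults
        for m in range(num_attrs):
            mean = sum(p[s] * x[m] for p, x in zip(posteriors, samples)) / weight
            var = sum(p[s] * (x[m] - mean) ** 2
                      for p, x in zip(posteriors, samples)) / weight
            mu[s][m] = mean
            sigma[s][m] = max(math.sqrt(var), min_sigma)  # avoid zero spread
    return mu, sigma


# One observation that sits exactly on the mean of state 0.
posterior = e_step([0.6, 0.4], [1 / 3] * 3,
                   mu=[[0.6, 0.4], [0.2, 0.1], [0.0, 0.0]],
                   sigma=[[0.1, 0.1]] * 3)
```

Alternating `e_step` over all (user, time slice) samples with `m_step` until the parameters stop changing gives a basic fitting loop; the same two functions apply to the role attributes by substituting the feature vector for the state vector.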

S33: Obtain the users' network role distribution.

The position of a node in the network, and the influence that results from it, have a significant effect on information propagation. Based on the network topology, the invention divides the user nodes into R = 3 role types, namely opinion leaders, information propagators and common users. Opinion leaders have a higher in-degree, and information propagators have a higher out-degree.

Similarly, because the role attributes contain continuous variables, the LDA model is improved with a Gaussian distribution and all users are trained with the improved LDA model. The generative process is as follows:

for each user v:

1. sample a user network role distribution $\eta_v$ from a Dirichlet distribution;

2. sample a network role r from $\eta_v$;

3. for each role attribute h of user v:

1) sample a role attribute value $d_{v,h} \sim N(\mu'_{r,h}, \sigma'_{r,h})$.

Here $\eta_v$ represents the network role distribution of user v.

In this probabilistic generative model, modeling the user role attributes amounts to computing the users' network role distribution η and the Gaussian distribution N(μ', σ') that each role attribute value of the user obeys. η, μ' and σ' are solved with the EM algorithm, in which each iteration has two steps:

E-step: update $\eta_{v,r}$.

M-step: update $\mu'_{r,h}$ and $\sigma'_{r,h}$.

Here $\eta_{v,r}$ is the probability that user v plays network role r, R is the number of network roles, H is the number of user role attributes, $d_{v,h}$ is the h-th attribute value of user v, and $\mu'_{r,h}$ and $\sigma'_{r,h}$ are respectively the mean and the standard deviation of the h-th attribute when the user plays network role r.

S34: Obtain the users' forwarding behavior distribution.

The forwarding behavior distribution is generated from the users' interest community distribution Φ, the users' active state distribution on each time slice $\theta^{(t)}$ and the users' network role distribution η. The generative process is as follows:

for each user forwarding behavior $y_i$:

1. sample a user forwarding behavior distribution $\rho \sim \mathrm{Dir}(\gamma)$, where γ is the parameter of the Dirichlet distribution;

2. sample an interest community $z_v$ for the candidate user v;

3. sample an interest community $z_u$ for the posting user u;

4. sample an active state s for the candidate user v;

5. sample a network role r for the posting user u;

6. sample a user forwarding behavior $y_i$ according to $\rho_{\tau,s,r}$.

Here $\rho_{\tau,s,r}$ represents the users' forwarding behavior distribution, i.e. the probability of forwarding, and $1 - \rho_{\tau,s,r}$ is the probability of not forwarding. τ is an indicator function defined as follows:

$$\tau = \begin{cases} 1, & z_u = z_v \\ 0, & z_u \neq z_v \end{cases}$$

where $z_u$ and $z_v$ respectively represent the interest communities of users u and v. τ = 1 indicates consistent interests, and τ = 0 indicates inconsistent interests.

In this probabilistic generative model, modeling the user's forwarding behavior amounts to computing the users' forwarding behavior distribution ρ. ρ is solved by Gibbs sampling, and at each iteration $\rho_{\tau,s,r}$ is estimated from the behavior counts, where $n_{i,\tau,s,r}$ is the number of user behaviors with $y_i = 1$ (forwarding) or $y_i = 0$ (not forwarding) observed when the interest difference is τ, the active state of the candidate user is s and the posting user plays network role r; I is the total number of user behaviors, including non-forwarding behaviors; M is the number of user state attributes, and H is the number of user role attributes.
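The estimation formula itself is not recoverable from the text. As a sketch only, the code below assumes a symmetric Dirichlet-smoothed ratio of forwarding counts to all counts for each combination of interest consistency, active state and network role. The record format `(tau, s, r, y)` and all names are hypothetical.

```python
from collections import defaultdict


def interest_indicator(z_u, z_v):
    """Indicator tau: 1 if both users share an interest community, else 0."""
    return 1 if z_u == z_v else 0


def estimate_forwarding_distribution(records, gamma=1.0):
    """Estimate the forwarding probability for each (tau, s, r) combination.

    ASSUMPTION: a symmetric Dirichlet prior gamma over the two outcomes,
    giving (n_forward + gamma) / (n_forward + n_not + 2 * gamma).
    records: iterable of (tau, s, r, y) with y = 1 (forward) or 0 (not).
    """
    counts = defaultdict(lambda: [0, 0])  # [not forwarded, forwarded]
    for tau, s, r, y in records:
        counts[(tau, s, r)][y] += 1
    return {
        key: (n[1] + gamma) / (n[0] + n[1] + 2 * gamma)
        for key, n in counts.items()
    }


# Illustrative history: three behaviors under (tau=1, s=0, r=0), one under
# (tau=0, s=2, r=2).
history = [
    (interest_indicator(3, 3), 0, 0, 1),
    (interest_indicator(3, 3), 0, 0, 1),
    (interest_indicator(3, 3), 0, 0, 0),
    (interest_indicator(1, 2), 2, 2, 0),
]
rho = estimate_forwarding_distribution(history, gamma=1.0)
```

Comparing the estimated values across combinations is one way to read off which factor matters most, for example how much the probability changes between τ = 1 and τ = 0 with s and r held fixed.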

S4: phi and theta obtained by fitting^(t)、η、

Calculating the forwarding probability of any microblog of the friends of the user according to the fitted prediction model

The prediction result can be obtained. And analyzing which friends' microblogs can be forwarded by the user and key factors influencing the microblogs forwarded by the user according to the predicted result.
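The formula for the forwarding probability is missing from the text. One plausible reading, shown here purely as an assumption, is to marginalize the fitted forwarding probability over the latent interest consistency, active state and network role, treating them as independent. The function name `forwarding_probability` and the dictionary layout of `rho` are hypothetical.

```python
def forwarding_probability(phi_u, phi_v, theta_v_t, eta_u, rho):
    """Probability that candidate user v forwards a microblog of poster u.

    ASSUMPTION: latent variables are independent, so the result is the sum
    over (tau, s, r) of P(tau) * P(s) * P(r) * rho[(tau, s, r)], where
    P(tau = 1) is the chance that u and v fall in the same community.
    """
    p_same = sum(pu * pv for pu, pv in zip(phi_u, phi_v))
    p_tau = {1: p_same, 0: 1.0 - p_same}
    probability = 0.0
    for tau, p_t in p_tau.items():
        for s, p_s in enumerate(theta_v_t):
            for r, p_r in enumerate(eta_u):
                probability += p_t * p_s * p_r * rho[(tau, s, r)]
    return probability


# Illustrative fitted values: forwarding is likely (0.9) only when interests
# match, the candidate is in state 0 and the poster plays role 1.
rho_example = {(tau, s, r): 0.1
               for tau in (0, 1) for s in range(3) for r in range(3)}
rho_example[(1, 0, 1)] = 0.9

p_forward = forwarding_probability(
    phi_u=[1.0, 0.0], phi_v=[1.0, 0.0],
    theta_v_t=[1.0, 0.0, 0.0], eta_u=[0.0, 1.0, 0.0],
    rho=rho_example,
)
```

A candidate user can then be predicted to forward when this probability exceeds a chosen threshold; the threshold is a design choice that the text does not specify.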

According to the method, first, in view of the diversity of a single user's interest, activity and influence, the basic idea and method of the LDA topic model, which can handle "one word with multiple meanings and multiple words with one meaning", are used to model and analyze user behavior and obtain the topic distributions related to user behavior. Second, considering that the elements of the user state vector and the user feature vector are continuous values, the LDA model is improved with a Gaussian distribution to discover the user's activity and influence. Finally, in view of the change of user activity over time, time discretization and time slicing are used to propose an improved-LDA dynamic microblog forwarding behavior prediction model, which dynamically monitors user activity, accurately predicts the user's forwarding behavior, and analyzes the key factors influencing user forwarding.

The above examples are to be construed as merely illustrative and not limitative of the remainder of the disclosure. After reading the description of the invention, the skilled person can make various changes or modifications to the invention, and these equivalent changes and modifications also fall into the scope of the invention defined by the claims.

Claims

1. A dynamic microblog forwarding behavior prediction system based on a friend circle, comprising a user behavior data source acquisition module, an attribute extraction module, a model building module and a prediction analysis module, wherein the user behavior data source acquisition module is used for acquiring user relationship data and user behavior data in a social network and taking followers of the posting user as candidate users; the model building module is used for building a microblog forwarding behavior prediction model for the candidate users, in which the forwarding behavior is mainly determined by the interest difference τ between the candidate user and the candidate user's friends, the activity s of the candidate user during the posting period, and the network influence r of the candidate user's friends, and for fitting the model parameters; the prediction analysis module is used for predicting whether the candidate user will forward a microblog according to the parameters obtained after fitting and the user's microblog posting situation at any time t;

the attribute extraction module extracts a user interest vector for the interest difference among users, comprising: obtaining the follow list of each user from the users' follow behavior attributes, and defining the interest vector of the user v as

$I(v) = (e_{v,1}, e_{v,2}, \ldots, e_{v,|E_v|})$

wherein $e_{v,u}$ denotes a user in the follow list in whom user v is interested, $u = 1, 2, \ldots, |E_v|$, and $|E_v|$ is the total number of users in the follow list of user v;

the attribute extraction module extracts a user state vector for the activity of the candidate user, comprising: acquiring each user's microblog posting activity and microblog forwarding activity within a period of time from the users' interaction behavior attributes and time attributes, and defining the activity state vector of the user v as

wherein

represents the microblog posting activity of the user v on the time slice t,

and

respectively represent the number of microblogs posted by the user v in the time slice t, the number of microblogs forwarded, and the average number of microblogs posted by the user v per day;

the attribute extraction module extracts a user feature vector for the influence of the posting user, comprising: obtaining the out-degree, in-degree and local clustering coefficient of each user node from the network topology attributes, and defining the influence feature vector of the user v as

representing the local clustering coefficient of user v, wherein $Ng_v$ is the set of neighboring nodes of node v and $edg_{ij}$ is a connection between its neighboring nodes;

the microblog forwarding behavior prediction model, in terms of the interest difference among users, extracts user interest vectors from the user behavior and user relationship information, and trains all users using an LDA model to obtain the interest topic distribution of the users; in terms of the activity of the candidate users, extracts the user state vector on each time slice from the user behavior and time information, improves LDA with Gaussian distributions to handle the continuous elements in the user state vector, and trains all users using the improved LDA model to obtain the activity state distribution of the users on each time slice; in terms of the influence of the posting user, extracts the feature vectors of the users from the network structure information, likewise improves LDA with Gaussian distributions, and trains all users using the improved LDA model to obtain the network role distribution of the users; and finally obtains multiple forwarding behavior prediction items according to whether the interests of the users are consistent, the activity state of the candidate users on each time slice, the network role of the active users, and the forwarding data of all users;

the acquiring of the interest topic distribution of the users by the microblog forwarding behavior prediction model further comprises: on the basis of the user relationship network, further using the interaction behaviors among users to weight the interest vector I(v) of each user, obtaining the weighted user interest vector

wherein $w_{v,n}$ is indexed by $n = 1, 2, \ldots, N_v$, and $N_v$ is the total number of interactions of the user v; all users are trained using the LDA model, so that the interest topic distribution of the users is obtained;

the acquiring of the activity state distribution of the users on each time slice further comprises: the posting activity $x_{v,t,1}$ and the forwarding activity $x_{v,t,2}$ of a user are continuous variables, so the LDA model is improved using Gaussian distributions such that the values of the posting activity and the forwarding activity each obey a different Gaussian distribution:

$x_{v,t,m} \mid s \sim \mathcal{N}(\mu_{s,m}, \sigma_{s,m}^2)$

wherein $x_{v,t,m}$ represents the m-th attribute value of user v on time slice t, and $\mu_{s,m}$ and $\sigma_{s,m}$ are respectively the mean and standard deviation of the m-th attribute when the user activity state is s;

by a time slicing method, each day is cut into 4 time intervals starting from 0:00, namely t = 1, 2, 3, 4; the activity state of the user is divided into three levels, namely very active, generally active and inactive; all users are trained using the improved LDA model, and the activity state distribution of the users on each time slice is obtained;

based on the network topology, user nodes are divided into three role types, namely opinion leaders, information propagators and common users; similarly, the LDA model is improved using Gaussian distributions, and all users are trained with the improved model to obtain the network role distribution of the users.

2. A dynamic microblog forwarding behavior prediction method based on a friend circle of the system of claim 1,

the method is characterized by comprising the following steps:

and inputting the parameters obtained after fitting and the user's microblog posting situation at any time t into the prediction model to predict whether the candidate user will forward the microblog.