Detailed Description
The invention is further described with reference to the following examples.
Referring to fig. 1 and fig. 2, a user behavior analysis system facing a social network in this embodiment includes a modeling module 1, a forwarding behavior analysis module 2, and a user behavior analysis module 3, where the modeling module 1 is configured to establish a social network model, the forwarding behavior analysis module 2 is configured to analyze forwarding behaviors of users, the user behavior analysis module 3 is configured to analyze user behaviors according to the forwarding behaviors of the users, and the modeling module 1 establishes the social network model in the following manner: and representing the social network model as a binary group E ═ U, B, wherein U represents a user node set, B represents an edge set, and if a user U and a user v in the user node set concern each other, edges (U, v) exist between the user U and the user v, and the user U and the user v are adjacent nodes to each other.
According to the method and the device, user behavior analysis of the social network is achieved, specifically, the social network is modeled based on the mutual attention relationship, a large number of junk users in the social network can be effectively eliminated, and accuracy and reliability of follow-up analysis are improved.
Preferably, the forwarding behavior analysis module 2 includes a forwarding probability calculation unit 21, a forwarding index determination unit 22, and a forwarding behavior analysis unit 23, where the forwarding probability calculation unit 21 is configured to calculate a probability that the user publication information is forwarded, the forwarding index determination unit 22 is configured to determine a forwarding index of the user according to the probability that the user publication information is forwarded, and the forwarding behavior analysis unit 23 is configured to analyze the user forwarding behavior according to the forwarding index.
Preferably, the calculating the probability of forwarding the published information of the user specifically includes:
(1) representing all the adjacent node sets of the user u by L (u), and if the user w exists, so that w belongs to L (u) and w belongs to L (v), the user v is the close adjacent node of the user u by La(u) represents the set of all close neighbors, if user w is not present, such that w ∈ L (u) and w ∈ L (v), then user v is the loose neighbor of user u, with Lb(u) represents a set of all loose neighbor nodes;
(2) calculating the probability that the user is forwarded by the adjacent nodes:
in the formula, Pu(L (u)) represents the probability that user u is forwarded by its neighboring nodes, m (u) represents the number of messages posted by user u, ru(v) Indicating the number of messages issued by user v and forwarded by user u, tu(v) Representing the number of the users v forwarding the published messages of the users u within the set time limitL (u) represents the number of nodes adjacent to user u;
calculating the probability that the user is forwarded by the close adjacent node:
in the formula, Pu(La(u)) represents the probability that user u is forwarded by its immediately adjacent node, | La(u) | represents the number of closely adjacent nodes of user u;
calculating the probability that the user is forwarded by the loose adjacent nodes:
in the formula, Pu(Lb(u)) represents the probability that user u is forwarded by its loose neighbors, | Lb(u) | represents the number of loose neighboring nodes for user u.
The preferred embodiment considers the timeliness of the message when calculating the forwarding probability of the user message, is beneficial to improving the instantaneity of the user forwarding behavior analysis, calculates the forwarding probabilities of the close adjacent node and the loose adjacent node respectively, and is convenient for obtaining the relationship between the user forwarding behavior and the user attention relationship.
Preferably, the forwarding index determining unit 22 includes a first forwarding index determining subunit, a second forwarding index determining subunit and a forwarding index determining subunit, where the first forwarding index determining subunit is configured to determine a first forwarding index of the user, the second forwarding index determining subunit is configured to determine a second forwarding index of the user, and the forwarding index determining subunit is configured to determine the forwarding index of the user according to the first forwarding index and the second forwarding index.
The first forwarding index is obtained by the following formula:
in the formula, DYuA first forwarding index representing user u;
the second forwarding index is obtained in the following manner:
(1) for user u and its close neighbor nodes v and w, affinity is defined to reflect the degree of affinity between close neighbor nodes:
in the formula, Tu(v, w) represents the intimacy between nodes v and w, rw(v) Indicating the number of messages forwarded by user v and published by user w, rv(w) represents the number of users w forwarding user v published messages;
(2) calculating the activity of the user:
Hu=(u)×m(u)
in the formula, HuRepresenting the activeness of the user u, and a (u) representing the average daily published message number of the user u;
(3) calculating a second forwarding index:
in the formula, DEuA second forwarding index representing user u;
the forwarding index is determined using the following equation:
in the formula (ZF)uRepresenting the forwarding index of user u.
In the preferred embodiment, the forwarding index is determined by adopting the first forwarding index and the second forwarding index, so that a more scientific and reasonable forwarding index is obtained, and a favorable guarantee is provided for the subsequent forwarding behavior analysis, so that the accuracy and the scientificity of the user behavior analysis are guaranteed.
Preferably, the analyzing the user behavior forwarding according to the forwarding index specifically includes: the larger the forwarding index of the user is, the higher the probability that the user is forwarded is, for the users with the same forwarding index and the larger the second forwarding index is, the higher the probability that the user is forwarded is, and for the users with the same second forwarding index and the larger the first forwarding index is, the higher the probability that the user is forwarded is;
the analyzing the user behavior according to the user forwarding behavior specifically comprises: the higher the probability that a user is forwarded, the greater the impact of the user in the network.
In the preferred embodiment, the forwarding index is used for analyzing the user forwarding behavior, and the forwarding behavior is used for analyzing the user behavior, so that the user behavior analysis of the social network is realized.
The user behavior analysis system oriented to the social network is adopted to analyze the user behavior, 5 social networks are selected and are compiled into the network 1, the network 2, the network 3, the network 4 and the network 5, the user behavior in the networks is analyzed, and the analysis time and the analysis accuracy of the user behavior are counted, so that compared with the existing user behavior analysis system, the beneficial effects are shown in the following table:
|
reduced analysis time
|
Analytical accuracy improvement
|
Network 1
|
23%
|
21%
|
Network 2
|
25%
|
20%
|
Network 3
|
24%
|
25%
|
Network 4
|
26%
|
22%
|
Network 5
|
24%
|
23% |
Finally, it should be noted that the above embodiments are only used for illustrating the technical solutions of the present invention, and not for limiting the protection scope of the present invention, although the present invention is described in detail with reference to the preferred embodiments, it should be understood by those skilled in the art that modifications or equivalent substitutions can be made on the technical solutions of the present invention without departing from the spirit and scope of the technical solutions of the present invention.