CN110147911B

CN110147911B - Social influence prediction model and prediction method based on content perception

Info

Publication number: CN110147911B
Application number: CN201910452492.0A
Authority: CN
Inventors: 黄晶; 杨博; 段明月; 左祥麟
Original assignee: Jilin University
Current assignee: Jilin University
Priority date: 2019-05-28
Filing date: 2019-05-28
Publication date: 2022-09-09
Anticipated expiration: 2039-05-28
Also published as: CN110147911A

Abstract

The invention discloses a social influence prediction model and a prediction method based on content perception, which relate to the field of social network analysis and comprise the following steps: and performing feature representation on network nodes by using a graph convolution network, representing social media content corresponding to each node by using a long-short term memory network (LSTM), fusing feature vector representations of the nodes and the social media content by using an RN model combined with the social media content, performing inference calculation, and comparing the obtained output with a real label so as to optimize a target function. The invention uses the graph convolution network to carry out feature representation on the network node, on one hand, the attribute information of the node can be better integrated, and on the other hand, the invention can also predict the new node in the network; meanwhile, content information of social media is considered during modeling, positive and negative samples are divided according to social media content, and the problem of deep research caused by sparse observation data is effectively solved.

Description

Social influence prediction model and prediction method based on content perception

Technical Field

The invention relates to the field of social network analysis, in particular to a social influence prediction model and a prediction method based on content perception.

Background

The social influence is embodied by social activities among users and is represented by the phenomenon that the behaviors, ideas and the like of the users are influenced by others and changed. Online social network influence analysis mainly involves three aspects of content: 1. the identification of the influence is realized, and how to distinguish and link the influence and related elements from complicated factors is identified; 2. measuring social influence, and designing and selecting a measuring method which has certain universality and can fully discover characteristics of a social network; 3. dynamic propagation of social influence.

At present, when network nodes are represented, node attribute information cannot be integrated, new nodes in a network cannot be predicted, meanwhile, the existing modeling prediction on social influence is generally based on a neural network, content information of social media is not considered, the existing observation data is generally divided into positive and negative samples by taking a user as a main body, the observation data is sparse, and effective analysis is difficult to perform.

Disclosure of Invention

The embodiment of the invention provides a social influence prediction model and a prediction method based on content perception, which are used for solving the problems in the prior art.

A social influence prediction model and a prediction method based on content perception comprise the following steps:

an input layer for inputting information, the input information comprising: user network structure information, user network attribute information and user social media content information;

a GCN layer for extracting the user network structure information and the user network attribute information inputted from the input layer;

the Embedding layer is used for storing the D-dimensional Embedding representation of the user extracted from the GCN layer;

an LSTM layer for representing the user social media content input by the input layer using an LSTM algorithm;

the RN layer is used for putting the Embedding representation output by the Embedding layer and the social media content Embedding representation output by the LSTM layer into an RN model, and deducing whether the current node is influenced by the neighbor nodes under the given social media content by utilizing the RN model;

the output layer is used for representing the result of model prediction in a two-dimensional mode when the content of the given social media is influenced by the neighbor nodes of the given social media;

further, still include: and the Grountreth layer is used for comparing the result predicted by the model with the Grountreth layer so as to optimize the objective function.

Further, in the input layer, each instance information in the input layer is an r-proxy network composed of n users.

Further, in the GCN layer, the GCN layer receives as input m user network configuration information and m n × f user network attribute information.

Further, in the GCN layer, the user network structure information is calculated from an adjacency matrix a of the network, and the GCN layer instantiates the adjacency matrix a of the network and a user network attribute information matrix H as static matrices closely related to a standardized graph laplacian:

wherein,

a is the adjacency matrix of the network graph G, I is the identity matrix,

the diagonal matrix of the network node value is obtained, and after A (G) is obtained, the matrix and a user network attribute information matrix H are used as the input of each GCN layer together for extracting the network node characteristics.

Further, in the LSTM layer, the LSTM layer receives as input m p × q user social media content information, p referring to the number of words in the social media content information, and q referring to the initial dimension of each word.

A prediction method based on the social influence prediction model based on content perception specifically comprises the following steps:

firstly, sampling to obtain a sub-network with a fixed size, and taking the sub-network as a proxy network of each r-ego network, wherein the proxy network comprises user network structure information, user network attribute information and user social media content information;

and step two, the agent network in the step one is put into a social influence prediction model based on content perception for prediction, and a corresponding prediction result is obtained.

Further, still include:

and step three, comparing the output result in the step two with a real result, calculating loss and minimizing a negative log-likelihood function.

The invention has the beneficial effects that: the invention uses the graph convolution network to carry out feature representation on the network node, on one hand, the attribute information of the node can be better integrated, and on the other hand, the invention can also predict a new node in the network; meanwhile, content information of social media is considered during modeling, positive and negative samples are divided according to social media content, and the problem of deep research caused by sparse observation data is effectively solved.

Drawings

Fig. 1 is a schematic diagram of a social influence prediction algorithm model framework based on content awareness for a social influence prediction model and a prediction method based on content awareness according to an embodiment of the present invention;

fig. 2 is a schematic diagram of an RN model framework of a social influence prediction model and a prediction method based on content awareness according to an embodiment of the present invention;

fig. 3 is a schematic diagram of 1-neighbor/2-neighbor of a social influence prediction model and a prediction method based on content awareness according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention are clearly and completely described below with reference to the drawings in the embodiments of the present invention, but it should be understood that the scope of the present invention is not limited by the specific embodiments.

Referring to fig. 1-3, the present invention provides a social influence prediction model based on content perception, which is first defined as follows:

definition 1: r-neighbors (r-neighbors) and r-ego network

Given a social network G ═ (V, E), where G is a static and undirected network, V denotes the set of users in the social network, and E denotes the set of relationships of the users. For a given central user v, its r-neighbors are defined as

Where d (u, v) refers to the distance between user u and user vThe shortest distance in terms of hop count. As shown in FIG. 1, the dark node is taken as the center node, and the 1-neighbor and 2-neighbor of the dark node are shown respectively. The r-ego network of user v is composed of

A network of nodes of

And (4) showing.

Definition 2: r-proxy network

Given a central user v, the present invention extracts its r-ego network using a Breadth-First-Search (BFS) method. Since different users have different neighbor sizes, and the social network data with different sizes are not suitable for most deep learning models, N nodes are sampled from the r-neighbor network of the user v as a representative, which is called an r-proxy network.

Definition 3: social behavior (Social Action)

Social behavior refers to social actions performed by users in a social network on social media content, such as, for example, actions of following, like, forwarding, voting, etc. Given a behavior log A, A contains a series of triples

Indicating that user u is at time

I-actions have been done on a certain message or book. Observing the state of user v based on time can characterize it as a binary variable,

wherein, t represents the time of day,

indicating that user v produced some behavior on the ith social media content before time t,

it indicates that the user did not act on the ith social media content.

Definition 4: social media content (Social content)

Social media content refers in the present invention to objects in a social network that a user generates behavior, such as a log, a tweet, a story, and the like. Given social media content C and user v, the present invention is concerned with the impact that nodes in the r-proxy network of user v have on social media content C.

Problem definition: local Social Influence (Social affinity Locality)

Given the r-ego network of node v, the behavioral state of users in the r-ego network, and the specific social media content, the local social impact models the probability of node v behavioral state under these conditions. I.e. given an r-ego network

r-ego behavioral states of users in a network

Social media content C and time interval Δ t, local social impact aims to quantify the probability that node v produces C behavior:

assume that there are N instances, each instance is a triple (v, a, t), where v refers to the user, a refers to social behavior, and t refers to a timestamp. For such a triplet (v, a, t), we know the r-ego network-where v is located

r-ego behavioral states of users in a network

And social media content —C, this problem can be expressed as a binary graph classification problem, which can be solved by minimizing the following negative log-likelihood objective (model parameters Θ):

it should be noted that in this work it is assumed that Δ t is sufficiently large.

A social influence prediction model based on content awareness comprising:

an input layer: for inputting information, the inputted information comprising: user network structure information, user network attribute information and user social media content information;

each instance in the input is an r-proxy network consisting of n users, wherein n refers to the number of nodes in the r-proxy network, and the r-proxy network comprises three aspects of information: network structure information, network attribute information (specific content-see attribute information collection section), and social media content information.

The GCN receives m n × n network structure information and m n × f network attribute information as input, where m refers to the size of batch training;

the long-short term memory network LSTM accepts as input m p × q social media content information, where p refers to the number of words in the social media content information and q refers to the initial dimension of each word.

A GCN layer: for extracting user network structure information and user network attribute information inputted from the input layer;

given a user instance, the goal of the GCN layer is to learn a feature mapping function on the instance graph. For the algorithm presented here, a 3-layer cascaded GCN model is used in this section, with two parts for each layer input: example graph structure information and example graph network attribute information. The structure information of the example graph is calculated from the adjacency matrix of the network.

Given the adjacency matrix a and the user-network-attribute-information matrix H of the example graph, the GCN instantiates this as a static matrix closely related to the normalized graph laplacian:

herein, the

A is the adjacency matrix of the network graph G, I is the identity matrix,

is a diagonal matrix of network node values. And after A (G) is obtained, the matrix and the attribute information matrix H are used as the input of each GCN layer for extracting the network node characteristics.

Embedding layer: d dimension Embedding representation for storing n users in the instance extracted from GCN layer, which is D value is 128.

LSTM layer: the system comprises a display module, a display module and a display module, wherein the display module is used for representing user social media content corresponding to each node input by an input layer by using an LSTM algorithm;

given a piece of social media content information, the content information is first searched for the word Embedding representation in the content information (300-dimensional word Embedding representation is used herein), and then the Embedding matrix of the words is used as the input of the LSTM algorithm. For social media content herein, its LSTM output dimension is 128.

RN layer: after GCN, each node has its own Embedding representation, after LSTM represents, each social media content also has its own Embedding representation, the node Embedding representation output by the Embedding layer and the social media content Embedding representation output by the LSTM layer are put into an RN model, and the RN model is used for deducing whether the current node is influenced by the neighbor node under the given social media content;

as shown in fig. 2, the first column of objects is set as the neighborhood of the central user in the proxy network, the second column of objects is the central user itself, and the third column represents social media content, and the mathematical expression of the problem is as follows:

where o refers to the user's Embedding representation, c _i Refers to a time series representation of the ith social media content,

it is referred to whether the neighbors of the central user have acted on the social media content i before time t. Since it is assumed herein that the behavior of the central user is only influenced by active neighbor users, the state (node, social media content) matrix and the historical behavior state of the user are subjected to inner product operation in the RN model, i.e. it is determined whether each user plays a role in the following reasoning process according to its state. When in use

It means that node u has generated behavior before time t for the ith social media content, otherwise it has not generated behavior.

The specific description of the RN model algorithm in conjunction with social media content is as follows:

input, wherein the Embedding of nodes in the agent network represents O, the time sequence of social media content represents C and the historical state S of users in the agent network

Output the outcome of the prediction of social influence

An output layer: outputting the result predicted by the two-dimensional representation model;

group treuth: and comparing the result of model prediction with the GrountTruth so as to optimize the objective function.

The content-aware social influence model algorithm is described in detail as follows:

input: an adjacency matrix A of a proxy network where self-users are located, an attribute information matrix H of the users, social media contents C and historical behavior states S of self-user neighbors in the proxy network

Output: results of nodes being affected

A prediction method of a social influence prediction model based on content perception specifically comprises the following steps:

firstly, sampling to obtain a sub-network with a fixed size, and using the sub-network as a proxy network of each r-ego network, wherein the proxy network comprises user network structure information, user network attribute information and user social media content information;

given a central user v, the text extracts its r-ego network using a Breadth-First-Search (BFS) method.

Since different users have different neighbor sizes, such different size network data is not suitable for most deep learning models. In order to solve this problem,

a sub-network of fixed size is sampled from the r-ego network of user v as a proxy network for the r-ego network.

There are studies that show that people are more likely to be affected by active neighbors than inactive neighbors, and therefore, more attention is paid to the active r-neighbors of user v in this text in choosing the neighbor portion.

The method comprises the following steps: all active r-neighbors of user v are taken out, and then M, which is the number of users of the proxy network-the number of active users, is randomly taken out of the remaining inactive r-neighbors. By using

Indicating ego the proxy network where the user is located,

representing the state of behaviour of the user v neighbours in the proxy network, the purpose of this equationThe objective function may be redefined as:

Evaluation indexes are as follows: to test the effectiveness of the social influence prediction method presented herein, the values of precision, recall, and F1 were used in the experimental section to evaluate the experimental results, respectively.

The precision ratio is as follows: the precision ratio is a measurement standard of the proportion of the data correctly predicted as the positive sample by the evaluation classifier to the data correctly predicted as the positive sample, and the mathematical expression of the precision ratio is as follows:

the TP is the condition that the positive sample is divided into the positive samples by the classifier and the positive samples are correctly divided by the corresponding classifier; and the FP is the condition that the classifier divides the negative samples into the negative samples and the corresponding classifier divides the negative samples correctly.

And (4) recall rate: a scale criterion for evaluating the positive samples divided by the classifier into positive samples is mathematically expressed as follows:

FN here is the case where the classifier divides positive samples into negative samples.

F1: an index for measuring the accuracy of the two classification models gives consideration to both the accuracy rate and the recall rate in the evaluation standard, and is a comprehensive evaluation index. The mathematical expression of the F1 evaluation index is as follows:

and (3) comparison algorithm: to verify the validity of the model proposed herein, the DeepInf-GCN algorithm, an algorithm published in 2018 on KDD that also uses deep neural networks for social impact prediction, was used herein as a comparison algorithm.

Experimental results and analysis: in order to check the effectiveness of the social influence prediction method proposed herein, data of 10 social media contents are found out on the DIGG dataset, respectively, so as to verify the content-aware social influence prediction algorithm proposed herein, and a deepnf-GCN algorithm is selected as a comparison algorithm. The evaluation criteria of the experiment were accuracy, recall and F1 values, respectively. In order to verify the effectiveness of the model comprehensively and accurately, the results are verified in two groups of experiments.

The first set of experiments: content data for 10 different social media were extracted from the Digg dataset and, assuming that their propagation patterns were the same, trained together and predicted separately.

Table 1 comparison of data sets experimental results for different contents in Digg (assuming the same propagation pattern) (%)

The test results of the content-aware social influence prediction method and the deep-GCN algorithm on 10 data sets are shown in table 1, and the experimental results are analyzed with respect to the performances of the two algorithms on each data set in terms of three evaluation indexes, namely precision, recall and F1 values. Given the same propagation pattern of these data, these data sets are trained together and predicted separately herein. From the aspect of the performance of 10 data sets on the precision rate, the algorithm provided by the invention only has no good performance of a comparison algorithm on two data sets, namely Content4 and Content10, and is superior to or equal to the comparison algorithm on the other eight data sets; from the aspect of recall rate, the algorithm provided by the invention is weaker than the comparison algorithm only on the Content4 data set, and the performance on other nine data sets is better than or equal to the comparison algorithm; from the value of F1, the algorithm presented herein is weaker than the comparison algorithm only on two data sets, Content4 and Content10, and performs better than or equal to the comparison algorithm on the other eight data sets.

Table 2 is a comparison of the relative gains of the two algorithms on each evaluation index.

Table 2 relative gain comparison (%)% of the two algorithms on each evaluation index for the first set of experiments

The second set of experiments: the data sets are still the data sets with 10 different contents, and the data sets are trained and predicted respectively on the assumption that the propagation modes of the data sets are different.

Table 3 comparison of the results of the experiments on the data sets of different contents of Digg (assuming different propagation patterns) (%)

The results of the second set of experiments are shown in table 3. Given the different propagation modes of these data sets, the force-of-influence parameters to be captured for different data sets will also be different. From this point of view, the above 10 data sets were also compared with their respective predicted results after training. The results of 10 data sets were analyzed: firstly, on the precision rate, the results of the algorithm proposed herein on 7 data sets are superior to the comparison algorithm; secondly, the performance of the algorithm provided by the method is superior to or equal to that of a comparison algorithm on the recall ratio except for results on two data sets, namely Content6 and Content 10; finally, for the composite index value of F1, the algorithm presented herein outperforms the comparison algorithm in all 10 data sets, and the highest relative gain value is 16.67%.

Table 4 is a comparison of the relative gains of the two algorithms on each index.

TABLE 4 relative gain comparison (%)

The effectiveness of the algorithm provided by the invention can be verified by combining the results of the two groups of experiments, and in addition, the algorithm provided by the invention also solves the problem that the prediction result cannot be explained.

In summary, the invention uses the graph convolution network to perform feature representation on the network node, so that on one hand, the attribute information of the node can be better integrated, and on the other hand, the invention can predict a new node in the network; meanwhile, content information of social media is considered during modeling, positive and negative samples are divided according to social media content, and the problem of deep research caused by sparse observation data is effectively solved.

The above disclosure is only one specific embodiment of the present invention, however, the present invention is not limited thereto, and any modifications that can be made by those skilled in the art should fall within the protection scope of the present invention.

Claims

1. A social influence prediction model based on content awareness, comprising:

a GCN layer for extracting the user network structure information and user network attribute information inputted from the input layer;

the LSTM layer is used for outputting the user social media content input by the input layer into an Embedding representation of the social media content by using an LSTM algorithm;

the RN layer is used for putting the Embedding representation output by the Embedding layer and the social media content Embedding representation output by the LSTM layer into the RN model, and deducing whether the current node is influenced by the neighbor node under the given social media content by utilizing the RN model;

the mathematical expression of the RN model is as follows:

refers to whether a neighbor of the central user has made an overaction on social media content i before time t, v refers to the central user, u refers to the neighbor of v, Σ refers to the sum, g _θ A three-layer perceptron network is shown,

representing a two-tiered perceptron network;

and the output layer is used for representing the result of model prediction in a two-dimensional form under the influence of the neighbor nodes of the social media content.

2. The content-awareness-based social influence prediction model of claim 1, further comprising: and the Grountreth layer is used for comparing the result predicted by the model with the Grountreth layer so as to optimize the objective function.

3. The content-awareness-based social influence prediction model of claim 1, wherein in the input layer, each instance of information in the input layer is an r-agent network consisting of n users;

the r-proxy network definition is that N nodes are sampled from an r-neighbor network of a user v as a representative;

the r-neighbor is defined as

Where d (u, v) refers to the shortest distance between user u and user v in terms of hop count.

4. The content-aware-based social influence prediction model of claim 1, wherein the GCN layer receives as input m user network structure information and m n x f user network attribute information.

5. The content-awareness-based social influence prediction model according to claim 1, wherein the user network structure information is calculated from an adjacency matrix a of the network in the GCN layer, and the GCN layer instantiates the adjacency matrix a of the network and the user network attribute information matrix H as static matrices closely related to a normalized graph laplacian:

wherein,

a is the adjacency matrix of the network graph G, I is the identity matrix,

is a diagonal matrix of network node values, after obtaining A (G), the matrix is summedThe user network attribute information matrix H together serves as an input to each GCN layer for extracting network node characteristics.

6. The content-aware-based social influence prediction model of claim 1, wherein in the LSTM layer, the LSTM layer receives m p x q of user social media content information as input, p referring to the number of words in the social media content information, and q referring to an initial dimension of each word.

7. A content-awareness-based social influence prediction model prediction method according to claim 3, specifically comprising the following steps:

the r-ego network is defined as being composed of

A network of nodes in (1);

8. The prediction method of claim 7, further comprising: