CN113254864A

CN113254864A - Dynamic subgraph generation method and dispute detection method based on node characteristics and reply path

Info

Publication number: CN113254864A
Application number: CN202110478862.5A
Authority: CN
Inventors: 曹娟; 钟雷; 王政嘉; 盛强; 谢添; 徐朝喜
Original assignee: Hangzhou Zhongke Ruijian Technology Co ltd; Institute Of Digital Economy Industry Institute Of Computing Technology Chinese Academy Of Sciences
Current assignee: Hangzhou Zhongke Ruijian Technology Co ltd; Institute Of Digital Economy Industry Institute Of Computing Technology Chinese Academy Of Sciences
Priority date: 2021-04-29
Filing date: 2021-04-29
Publication date: 2021-08-13
Anticipated expiration: 2041-04-29
Also published as: CN113254864B

Abstract

The invention relates to a dynamic subgraph generation method and a dispute detection method based on node characteristics and reply paths, S1, constructing a path matrix P and a path length matrix S based on a 'post-comment' graph G, wherein the path matrix P records all paths from each node in the graph G to terminal nodes, and the terminal nodes comprise post nodes in the graph G and comment nodes without replies; the path length matrix S records the length of each path in the path matrix; s2, calculating to obtain a path Laplacian matrix L based on the path matrix P and the path length matrix S; s3, calculating and obtaining the expression of the current node perception path information based on the Laplacian matrix L of the path and the content characteristics of the nodes in the graph G; and S4, based on the similarity between the current node and all nodes on the corresponding path, reserving the most important part of nodes on each path, wherein the important nodes on all paths form a subgraph corresponding to the current node, and the nodes in the subgraph are local discussions related to the current node.

Description

Dynamic subgraph generation method and dispute detection method based on node characteristics and reply path

Technical Field

The invention relates to a dynamic subgraph generation method and a dispute detection method based on node characteristics and a reply path. The method is suitable for the field of social media platform disputeness detection.

Background

Social media platforms have become an important platform for people to express opinions. People share, comment on social media and have led to intense discussions among parts of posts, indicating disputes among the participating people, which reflect public sentiment and focus. A controversial post has controversial content and the expressed idea or opinion can cause controversy in the reply.

The task of post-level dispute detection is to automatically determine whether a post is disputed. The task is helpful for evaluating the influence of bipolar differentiation and events of human viewpoints, and also provides a reference for news topic selection. The controversial detection plays an important role in mining public emotions of social media, and has become a research hotspot of people in recent years.

The existing dispute detection method comprises the steps of firstly, constructing a graph structure through a post-comment tree, wherein nodes in the graph represent posts or comments, and edges represent a reply relationship between the nodes; the graph convolutional neural network is then used to learn the representation of the nodes in the graph and to use the average information of posts and comments for dispute detection. The method cannot update the node expression by using local discussions related to the node, cannot model a dispute mode, and cannot pay attention to information related to the post.

In an actual scenario, the disputeness of a post is often reflected in Local Discussion (LD) of a post, which refers to the Discussion content related to the current node, and is embodied as a subgraph in a "post-comment tree" graph. There are some local discussions that are disputed and some that are not, we call those local discussions as local disputes (localargumentations). While some discussions in posts are related to posts and some are off-topic, we call those local disputes related to posts as Key local disputes (KLA). Posts with critical local disputes are likely to be controversial posts, so finding a critical local dispute will help in the dispute detection of posts.

FIG. 1 shows a dispute post discussing "vacation" and "accompanying vacation" of a microblog platform, (a) shows the content of the post and comments, and the standpoint of the comment content is divided into 4 aspects of "support", "objection", "neutral" and "irrelevant"; (b) the displayed is a reply relation graph of posts-comments. The partial discussion of the presence of posts has been indicated by dashed circles, such as LD 1-LD 3 in the figure. In these local discussions, there is a debate among LD2 and LD3, and it belongs to discussions related to posts, so LD2 and LD3 belong to key local debates, i.e., KLA1 and KLA2 in the figure. Based on the observation of FIG. 1, the dispute detection with critical local disputes can be performed in two steps: (1) first, the local discussion present in the post is found. (2) The local discussions in which the posts are related and most likely to be disputed are found for dispute detection.

Currently, post-based dispute detection is mainly based on web pages and social media, while web-based research mostly focuses on wikipedia, mainly by using specific features to classify: such as number of modifications, edit history, and dispute tags; more work has been focused on performing disputes detection of social media, some of which use linguistic features to detect such as topics, emotions, and other indicators, some of which are emphasized or Twitter specific; still other efforts use structural features in post-comment graphs for detection, such as propagated or local features, nodularity features, and the like.

The dispute detection is carried out by learning node expressions in post-comment graphs by using the graph convolution neural network, and two main defects exist: (1) the graph convolution neural network only focuses on first-order neighbor information of the node, and cannot directly learn high-order information, and therefore, the local discussion of the node cannot be utilized to learn node information. (2) The use of the average information of posts and comments for dispute detection does not allow modeling of dispute patterns in the data and does not allow for the attention to discussion information related to the posts.

Graph neural networks have been successful in many areas, and widely used graph neural networks include GCN, GraphSage, GAT, and GIN, among others. However, the networks can only aggregate first-order neighbor information of the nodes to update the node expression, and high-order neighbor information can be indirectly learned by using a multilayer network, but experiments show that the model performance is greatly influenced due to the over-smoothing problem.

At present, some works based on a graph neural network and capable of directly learning high-order node information exist, for example, a shortest path is generated for each node by using an attention mechanism, and information updating is performed on each node by using the path information, but the work only focuses on the relationship between node pairs and cannot focus on the whole information of a local subgraph; or, calculating the shortest path length between the nodes as the intimacy between the nodes, and selecting the TopK node with the closest distance as the subgraph corresponding to the node according to the intimacy, but the method only uses the structural information in the graph, and ignores the important node characteristics; or, all paths related to the node are listed which are less than a certain threshold value, and a sub-graph corresponding to the node is generated by enumerating part of the paths.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: aiming at the existing problems, a dynamic subgraph generation method and a dispute detection method based on node characteristics and a reply path are provided.

The technical scheme adopted by the invention is as follows: a dynamic subgraph generation method based on node features and reply paths is characterized in that:

s1, constructing a path matrix P and a path length matrix S based on the 'post-comment' graph G, wherein the path matrix P records all paths from each node in the graph G to terminal nodes, and the terminal nodes comprise post nodes in the graph G and comment nodes without replies; the path length matrix S records the length of each path in the path matrix;

s2, calculating to obtain a path Laplacian matrix L based on the path matrix P and the path length matrix S;

s3, calculating and obtaining the expression of the current node perception path information based on the Laplacian matrix L of the path and the content characteristics of the nodes in the graph G;

and S4, based on the similarity between the current node and all nodes on the corresponding path, reserving the most important part of nodes on each path, wherein the important nodes on all paths form a subgraph corresponding to the current node, and the nodes in the subgraph are local discussions related to the current node.

2. The method for dynamic subgraph generation based on node features and reply paths according to claim 1, wherein the step S1 comprises:

s11, constructing a 'post-comment' graph G ═ V, E according to the reply relation, wherein V is a set of nodes and comprises post nodes and comment nodes; e represents the reply relationship between the nodes, including the connecting edges between the posts and the comments and the connecting edges between the comments and the comments;

s12, constructing a path matrix P based on the graph G, wherein the path matrix P belongs to R^m*mRecording m paths in the graph G, and taking all paths from each node to the terminal node in the graph G;

s13, constructing a path length matrix S based on the graph G, wherein the path length matrix S belongs to R^m*mThe element value on the diagonal of the ith row of the matrix represents the length of the ith path in the path matrix P.

Step S2 includes: the difference of the path matrix P and the path length matrix S is used to define a path laplacian matrix: l ═ S-P.

Step S3 includes:

s31, calculating a normalization form of the path Laplace matrix L:

L′＝I-S^-1P

wherein I is an identity matrix of M;

s32, calculating the expression of the sensing path information of the central node i based on the matrix L', wherein the expression matrix Q of the central node belongs to R^m*dThe calculation is as follows:

Q＝L′H

wherein the matrix H ∈ R^m*dAnd recording the d-dimensional expression vector of each central node in the path matrix.

Step S4 includes:

calculating a correlation matrix between the nodes based on the matrices Q and H:

R＝QW_sH^T

wherein W_s∈R^d*dIs a learnable matrix; each row in the matrix R represents the correlation degree of the central node and all other nodes;

filtering out nodes on a path corresponding to the central node by using the path matrix P, and calculating a normalized correlation value between the central node and the nodes on the corresponding path by using a Softmax function according to a line;

R′＝Softmax(P⊙R)

wherein |, represents the product of the corresponding elements in the matrix;

for a path with the node i as the center, accumulating the correlation values on the path from the node i along the path, and cutting off the rest nodes when the accumulated correlation values are larger than a threshold value theta;

the collection of all the truncated paths becomes the subgraph corresponding to the central node i, which is recorded as SG_iAnd local discussion information corresponding to the central node i is recorded in the subgraph.

Updating the expression of the node by utilizing the node information in the subgraph based on the classical GNN model, wherein the expression of the node i in the l-th layer is

It further comprisesThe new rule is:

wherein g is an aggregation function, different aggregation functions being used in different GNN models; σ is a nonlinear activation function; b^(l)Is a bias vector.

In the GCN, the number of bits in the GCN,

wherein W^(l)Is a learnable parameter matrix.

A method of dispute detection, comprising:

A. adopting differences among node expressions to model a dispute mode in a subgraph generated by the dynamic subgraph generation method of any one of claims 1-6;

B. the sub-graphs are re-weighted using a post-directed attention mechanism to capture post-related disputes.

The step A comprises the following steps:

for node i in reply to node x, the difference between node expressions is calculated

Using the fully connected layer to learn these differences;

summing all differences in the subgraph to obtain an expression vector of the subgraph, wherein a calculation formula is as follows:

wherein

Is a matrix of parameters that can be learned,

is offsetA matrix of entries.

The individual subgraphs are reweighted using the post-directed attention mechanism, which is calculated as follows:

wherein h is_pAn expression representing a post node; SG represents all subgraph sets in the "post-comment" graph;

is the weight of attention mechanism, representing sub-graph SG_iRelevance to the post.

The result of the weighted summation is learned by using a full link layer, and finally whether the post is controversial or not is judged, and a loss function uses cross entropy:

wherein

A real tag representing the ith post;

the probability of dispute for each ith post predicted by the model; and N is the size of the batch during training.

The invention has the beneficial effects that: the invention provides a method for mining key local disputes to perform disputed detection based on dynamic subgraph generation, which mainly comprises two parts of dynamic subgraph generation and key local dispute mining.

The dynamic subgraph generation can dynamically generate subgraphs corresponding to each node based on the content of the node and the characteristics of the reply structure, namely relevant local discussion, each node can use relevant local discussion information to express and update, and the method can be integrated into different graph neural networks in a plug-in mode to improve the detection performance of the graph neural networks.

The key local dispute mining can model a dispute mode in discussion and excavate disputes related to the content of the posts for dispute detection.

The method of the invention can deal with irrelevant information in the data and can provide certain model interpretability (the local dispute which is most concerned by the model is probably the reason for dispute of the posts).

Drawings

FIG. 1 shows a dispute post discussing "vacation" and "coss" on the microblog platform.

Fig. 2 is a model structure diagram of the embodiment.

Detailed Description

The embodiment is a method for mining key local disputes to perform dispute detection based on a dynamic subgraph generation method, and comprises a dynamic subgraph generation method and a local dispute mining method based on node characteristics and reply paths.

A subgraph centered around a node (e.g., node i) should include local discussions associated with it that exist in the reply path of the center node i. In this embodiment, the dynamic subgraph generation method first calculates the correlation between each node in the reply path and the central node i, and then truncates the path to remove irrelevant nodes. The collection of all the truncated paths constitutes the subgraph corresponding to node i, i.e. the relevant local discussion.

The common GCN model uses an adjacency matrix, a degree matrix, and a Laplace matrix to model the interactivity of the node and the first-order neighbor nodes, in this example, a path matrix, a path length matrix, and a path Laplace matrix to model the interactivity of the node and the higher-order neighbor nodes.

The dynamic subgraph generation method in the embodiment comprises the following steps:

s1, constructing a path matrix P and a path length matrix S based on the 'post-comment' graph G, wherein the path matrix P records all paths from each node in the graph G to terminal nodes, and the terminal nodes comprise post nodes in the graph G and comment nodes without replies; the path length matrix S records the length of each path in the path matrix.

S11, for each post, firstly, constructing a 'post-comment' graph G (V, E) according to a reply relationship, wherein V is a set of nodes and comprises post nodes and comment nodes, and obtaining initial expressions of the nodes based on texts in the nodes by using a BERT model; e represents the reply relationship between nodes, including the connecting edge between the posts and the comments and the connecting edge between the comments and the comments.

S12, constructing a path matrix P based on the graph G, wherein the path matrix P belongs to R^m*mM paths in the graph G are recorded, each row represents a path, and for the node i, the relevant paths include a path from bottom to top (from the node i to the post node) and all paths from top to bottom (from the node i to the comment node without reply). For example: three paths from node P are recorded in the first 3 rows of the matrix, where 1 represents the corresponding node on the path and 0 represents the corresponding node off the path.

For each node, firstly recording all paths from bottom to top, then recording all paths from top to bottom, and recording the paths corresponding to all nodes according to the breadth-first traversal.

It should be noted that for different paths from the same node, the nodes of the overlapping portions of these paths do not occupy the same column. E.g. for a path P-C from node P₁-C_1-1And P-C₁-C_1-2C in the path₁The nodes are in different columns (as shown by the circles on the path matrix in fig. 2). In order to make the path matrix P a square matrix, each node occupies the same number of columns and rows.

S12, constructing a path length matrix S based on the graph G, wherein the path length matrix S belongs to R^m*mIt is a diagonal matrix, and the element value on the diagonal of the ith row of the matrix represents the length of the ith path (except for the central node) in the matrix P.

S2, calculating a path Laplace matrix L based on the path matrix P and the path length matrix S, and defining the path Laplace matrix by adopting the difference between the path matrix and the path length matrix: l ═ S-P.

s31, calculating the normalization form of the path Laplace matrix L, which is

L′＝I-S^-1P

Wherein I is an identity matrix of M.

S32, calculating the expression of the sensing path information of the central node i based on the matrix L', and assuming that the matrix H belongs to R^m*dD-dimensional expression vectors of each central node in the path matrix are recorded (note that part of rows in the matrix H correspond to the same central node, so that the corresponding vectors are also the same), and the expression matrix Q epsilon R of the central node^m*dCan be calculated as:

Q＝L′H

where each row of elements in Q represents a representation of a central node of the perceptual path information.

S41, calculating a correlation matrix between the nodes based on the matrix Q and the matrix H:

R＝QW_sH^T

wherein W_s∈R^d*dIs a learnable matrix; each row in the matrix R represents the relevance of the central node to all other nodes, even if some of the nodes are not on the path corresponding to the central node.

S42, filtering out the nodes on the corresponding paths of the central node by using the path matrix P, and calculating the normalized correlation value between the central node and the nodes on the corresponding paths by using a Softmax function according to the rows:

R′＝Softmax(P⊙R)

wherein |, represents the product of the corresponding elements in the matrix.

S43, for the path centered on the node i, accumulating the correlation values on the path from the node i along the path, and when the accumulated correlation values are larger than the threshold θ, truncating the remaining nodes. The collection of all the truncated paths becomes the subgraph corresponding to the central node, and is recorded as SG_iAnd partial discussion information corresponding to the node i is recorded in the subgraph.

In this embodiment, after obtaining the subgraph corresponding to each node, the expression of the central node is updated by using the node information in the subgraph. The example is still based on the classical GNN model, but uses the nodes in the subgraph instead of the first-order neighbor nodes, assuming that the expression of node i at layer 1 is

The update rule is as follows:

wherein g is an aggregation function, different aggregation functions being used in different GNN models; in the GCN, the number of bits in the GCN,

wherein W^(l)Is a learnable parameter matrix; σ is a nonlinear activation function; b^(l)Is a bias vector.

The local dispute mining method in the embodiment comprises the following steps:

in a debate local discussion, there are always many discussion nodes with opposite views, and differences between the expression of these nodes may be apparent. Therefore, the present embodiment uses the differences between node expressions to model the dispute patterns in the subgraph generated by the dynamic subgraph generation method in this example.

And use a full linkAnd (3) learning the differences by layer connection, and finally summing all the differences in the subgraph to obtain an expression vector of the subgraph, wherein the calculation formula is as follows:

wherein

Is a matrix of parameters that can be learned,

is a matrix of bias terms.

In order to make the model focus on the information related to the post for dispute detection, the embodiment uses the attention mechanism guided by the post information to re-weight each sub-graph so as to capture the disputes related to the post, and the calculation process is as follows:

wherein h is_pRepresenting the expression of the post node, and SG representing all subgraph sets in a 'post-comment' graph;

wherein

A real tag representing the ith post;

The present embodiment also provides a storage medium on which a computer program capable of being executed by a processor is stored, and the computer program when executed implements the steps of the dynamic subgraph generation method or the dispute detection method in the present embodiment.

The present embodiment also provides a computer device having a memory and a processor, where the memory stores a computer program capable of being executed by the processor, and the computer program when executed implements the steps of the dynamic subgraph generation method or the dispute detection method in the present embodiment.

Claims

1. A dynamic subgraph generation method based on node features and reply paths is characterized in that:

3. The method for dynamic subgraph generation based on node features and reply paths according to claim 1, wherein the step S2 comprises: the difference of the path matrix P and the path length matrix S is used to define a path laplacian matrix: l ═ S-P.

4. The method for dynamic subgraph generation based on node features and reply paths according to claim 1, wherein the step S3 comprises:

s31, calculating a normalization form of the path Laplace matrix L:

L'＝I-S^-1P

wherein I is an identity matrix of M;

based on the matrix L', calculating the expression of the sensing path information of the central node i, wherein the expression matrix Q of the central node belongs to R^m*dThe calculation is as follows:

Q＝L′H

s32, where the matrix H ∈ R^m*dAnd recording the d-dimensional expression vector of each central node in the path matrix.

5. The method for dynamic subgraph generation based on node features and reply paths according to claim 4, wherein the step S4 comprises:

R＝QW_sH^T

s42, filtering out nodes on the path corresponding to the central node by using the path matrix P, and calculating a normalized correlation value between the central node and the nodes on the corresponding path by using a Softmax function according to the rows;

R'＝Softmax(P⊙R)

wherein |, represents the product of the corresponding elements in the matrix;

s43, for the path with the node i as the center, accumulating the correlation value on the path from the node i along the path, and cutting off the rest nodes when the accumulated correlation value is larger than the threshold value theta;

6. The method of dynamic subgraph generation based on node features and reply paths according to claim 1, characterized in that: updating the expression of the node by utilizing the node information in the subgraph based on the classical GNN model, wherein the expression of the node i in the l-th layer is

The update rule is as follows:

wherein g is an aggregation function, different aggregation functions being used in different GNN models; activation of sigma being non-linearA function; b^(l)Is a bias vector.

7. The dynamic subgraph generation method based on node features and reply paths according to claim 6, characterized in that:

in the GCN, the number of bits in the GCN,

wherein W^(l)Is a learnable parameter matrix.

8. A method of dispute detection, comprising:

A. adopting differences among node expressions to model a dispute mode in a subgraph generated by the dynamic subgraph generation method of any one of claims 1-7;

9. The dispute detection method according to claim 8, wherein the step a comprises:

Using the fully connected layer to learn these differences;

wherein

Is a matrix of parameters that can be learned,

is a matrix of bias terms.

10. The dispute detection method according to claim 8, wherein the post-directed attention mechanism is used to re-weight each sub-graph, and the calculation is as follows:

is the weight of attention mechanism, representing sub-graph SG_iRelevancy to a post;

wherein

A real tag representing the ith post;