CN112711951A

CN112711951A - Induction consciousness-based false news interpretability detection system and method

Info

Publication number: CN112711951A
Application number: CN202110008758.XA
Authority: CN
Inventors: 饶元; 吴连伟; 孙菱; 贺王卜; 兰玉乾; 丁毅
Original assignee: Xian Jiaotong University
Current assignee: Xian Jiaotong University
Priority date: 2021-01-05
Filing date: 2021-01-05
Publication date: 2021-04-27

Abstract

The invention discloses a false news interpretability detection system and method based on induction consciousness, which are used for discovering a valuable comment subchain of questioning news by exploring the evolution rule and the hierarchical structure characteristics of a place in a comment thread and overcoming the defects of relevance mining and mutual influence deficiency among comments in the prior art. According to the method, interdisciplinary knowledge is combined with a neural network model, and semantic association mining and mutual influence mining among comment nodes in a false news comment tree are explored; the bottom-up evolutionary tree network considers two social psychology theories to research the evolution rule of the ground in the comment clues so as to strengthen the valuable comments of the questioned news; the top-down coordination tree network provided by the invention coordinates the information absorption between the father node and the brother node, enhances the hierarchical structure of the comment, fuses the comment with the characteristics of the evolutionary tree network to obtain a valuable comment, highlights the child chain of the comment, and enhances the interpretability of the verification result.

Description

Induction consciousness-based false news interpretability detection system and method

Technical Field

The invention relates to a system and a method for detecting false news interpretability based on perceptual evolution of inductive consciousness and a coordination tree network.

Background

The development of social media opens a convenient door to the production and dissemination of false news. The curated and packaged fake news is more likely to attract the attention and consumption of the audience than is true and reliable news content. The world health organization has always defined this problem as an information epidemic, which calls fake news to be more quickly and easily disseminated than the virus itself. Therefore, how to detect and analyze the fake news and take intervention measures to reduce the spread of the fake news becomes one of the urgent problems in the current social media field.

A typical false news detection method usually extracts text features of news content, such as semantic, emotion, and genre, and performs binary classification by supervised learning (e.g., CNN and RNN). Due to the abundance of information content in social media, multi-modal semantic information such as images and videos and various context information appear. Researchers have also studied user portrayal features and forward propagation features. Furthermore, user reviews have proven useful as supplementary semantics to improve the performance of false news detection, especially where the standpoint of reviews is widely utilized. Currently, a common approach is to construct a multi-task learning model between false news detection and review standpoint detection to capture common features between the two tasks, thereby improving model performance. However, although effective, the multitask learning method for detecting false news based on the comment standpoint often treats comments in the comment tree as independent individuals, and lacks effective exploration of the correlation between comments because the subsequent comment semantics and posture may be affected or interfered by current or previous comments. In particular, they ignore the relationships between comments in the comment conversation thread, including the hierarchy of comments and the evolutionary process of the standpoint of comments.

Disclosure of Invention

The invention aims to solve the problems in the prior art and provides a false news interpretability detection system and method based on induction consciousness. The invention not only improves the false news detection performance, but also provides the transparency of the detection process and the interpretability of the detection result.

In order to achieve the purpose, the invention adopts the following technical scheme to realize the purpose:

a false news interpretability detection method based on induction consciousness comprises the following steps:

step 1, learning each tweet t by utilizing Bi-GRU_iAnd using the last step to conceal the layer vector

Representing the tweet;

step 2, establishing a bottom-up evolution tree recursive network, gradually combining child nodes to father nodes, and finally forming a vector representing the whole comment thread; traversing the whole comment conversation thread from bottom to top in a reverse time sequence to obtain a comment sequence containing a plurality of comment subchains;

step 3, considering the relevance between a single comment position and the comment subchain where the single comment position is located, and evaluating the relation between the current node and the previous node by means of social psychology knowledge, namely, a utilization superposition theory and a conservative deviation theory;

step 4, constructing a significance scoring function s (-) and a composite vector p (-) to calculate the merging degree between the ith node and the previous (i-1) nodes in one comment subchain; the scoring function s (-) is used for measuring the degree of the ith node in the comment subchain needing to be strengthened, and the synthetic vector p (-) represents the fusion degree of the two nodes;

and 5, refining the significance score based on two social psychology theories:

1) probability τ of two adjacent nodes merging:

τ(a_i，a_i-1)＝a_i ^TSa_i-1 (1)

2) semantic differences between nodes and their children chains

Wherein the content of the first and second substances,

representing the evaluation of semantic difference of the ith node and the previous (i-1) node;

mean semantics representing the preceding (i-1) node;

3) review the differences between the sites φ:

where φ (-) represents the difference between the review standpoint of the ith node and the previous (i-1) node; d_iA position vector representing the ith node,

an average vector representing the previous (i-1) nodes;

4) timeliness of the nodes ψ:

ψ(t_i)＝gt_i ^-α-1 (4)

wherein, t_iIs the interval between the ith node and the first node in the sub-chain of comments; g and alpha are both learnable parameters;

step 6, significance scoring:

step 7, calculating a synthetic vector p (·):

p(i，i-1)＝s(i，i-1)m(i，i-1) (7)

step 8, merging all nodes in the whole comment subchain as follows:

step 9, utilizing TreeLSTM to realize a composite function m (·); TreeLSTM generates a hidden state vector h' and, given two input vectors a_iAnd a_jThe following cell state vector c' yields:

c′＝c′_i⊙f_i+c′_j⊙f_j+x⊙u (10)

h′＝o′+tanh(c′) (11)

wherein U and b are trainable parameters, which indicate a dot product operation;

step 10, traversing the whole comment sequence containing a plurality of comment subchains, and after evolution and combination, highlighting valuable comments of questioning news in the comment threads through the evolution tree network, wherein the final learnable expression is marked as BE.

Step 11, constructing a top-down coordination tree network to coordinate information absorption between a father node and a brother node;

step 12, updating the coordination tree network:

subscripts in, fo, hf, c and o are respectively used for an input gate, a temporary forgetting gate, a hierarchical forgetting gate, a cell unit and an output; vector in_t，fo_tAnd hf_tRespectively representing the weight of the newly captured information, memorizing old information from brother nodes, and memorizing old information from father nodes;

representing a hadamard product; if it is not

Then h_p(t)And c_p(t)Is set to an initial state value;

step 13, after traversing, obtaining the representation of the whole conversation thread, and recording the representation as TC;

step 14, integrating the comment of the questioning news acquired by the evolvement tree network and the hierarchical structure information of the comment acquired by the coordination tree network to obtain a valuable comment subchain of the questioning news:

F＝[BE；TC] (15)。

the invention further improves the following steps:

the specific method of the step 11 is as follows:

traversing the entire annotation tree from top to bottom in time order, assuming that π (t) represents the parent of t, k (t) represents its children, p (t) is the previous sibling at the same level in time, and s (t) represents its successor at the same level.

A induction awareness based false news interpretability detection system, comprising:

an input embedding module to learn a contextual semantic hidden layer representation of a conversation;

an evolutionary tree network for capturing valuable reviews that challenge news;

the coordination tree network is used for amplifying the comment subchains and explaining verification results;

and the task learning module integrates the position evolution semantics obtained by the evolution tree and the comment hierarchical structure semantics captured by the coordination tree network and inputs the position evolution semantics and the comment hierarchical structure semantics into Softmax to predict probability distribution.

A induction awareness based false news interpretability detection terminal device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, the processor when executing the computer program implementing the steps of the method as described above.

A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method as described above.

Compared with the prior art, the invention has the following beneficial effects:

the invention discloses a false news interpretability detection method based on induction consciousness perception evolution and coordination tree network, which discovers a valuable comment subchain of questioning news by exploring the evolution rule and the hierarchical structure characteristics of a place in a comment thread, and overcomes the defects of relevance mining and mutual influence deficiency among comments in the prior art.

The invention provides a mode of combining interdisciplinary knowledge and a neural network model for the first time, semantic association mining and mutual influence mining among comment nodes in a false news comment tree are explored, and theoretical support of interpretable false news detection is effectively provided; the bottom-up evolutionary tree network considers two social psychology theories to research the evolution rule of the ground in the comment clues so as to strengthen the valuable comments of the questioned news; the top-down coordination tree network provided by the invention coordinates the information absorption between the father node and the brother node, enhances the hierarchical structure of the comment, fuses the comment with the characteristics of the evolutionary tree network to obtain a valuable comment, highlights the child chain of the comment, and enhances the interpretability of the verification result; extensive experiments on both real data sets confirmed the better performance of the present invention compared to the most advanced models.

Drawings

In order to more clearly explain the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention, and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.

FIG. 1 is an architectural diagram of the present invention;

FIG. 2 is a graph of experimental performance of the present invention under two data sets, RumourEval and PHEME;

FIG. 3 is a graph comparing the separation performance of different modules of the present invention under both RumourEval and PHEME data sets.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.

Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.

In the description of the embodiments of the present invention, it should be noted that if the terms "upper", "lower", "horizontal", "inner", etc. are used for indicating the orientation or positional relationship based on the orientation or positional relationship shown in the drawings or the orientation or positional relationship which is usually arranged when the product of the present invention is used, the description is merely for convenience and simplicity, and the indication or suggestion that the referred device or element must have a specific orientation, be constructed and operated in a specific orientation, and thus, cannot be understood as limiting the present invention. Furthermore, the terms "first," "second," and the like are used merely to distinguish one description from another, and are not to be construed as indicating or implying relative importance.

Furthermore, the term "horizontal", if present, does not mean that the component is required to be absolutely horizontal, but may be slightly inclined. For example, "horizontal" merely means that the direction is more horizontal than "vertical" and does not mean that the structure must be perfectly horizontal, but may be slightly inclined.

In the description of the embodiments of the present invention, it should be further noted that unless otherwise explicitly stated or limited, the terms "disposed," "mounted," "connected," and "connected" should be interpreted broadly, and may be, for example, fixedly connected, detachably connected, or integrally connected; can be mechanically or electrically connected; they may be connected directly or indirectly through intervening media, or they may be interconnected between two elements. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to specific situations.

The invention is described in further detail below with reference to the accompanying drawings:

referring to fig. 1, an embodiment of the present invention discloses a false news interpretability detection system based on induction awareness, including:

an input embedding module: considering an embedded representation of a real Twitter dialog tree process, the sequence is expanded in time sequence and Bi-GRU is adopted to learn the semantic hidden layer representation of the context.

Evolving the tree network from bottom to top: in order to explore the evolution law of the comment position, a low-upward evolutionary tree network is established, and two social psychology theories including availability superposition and conservative bias are integrated to capture valuable comments which question news.

Top-down coordination tree network: in order to strengthen the hierarchical relationship of the comment threads, a top-down coordination tree network is provided to adjust the absorption of information semantics between a father node and a brother node, so as to enlarge the comment child chain and explain the verification result.

A task learning module: integrating the vertical evolution semantics obtained by the evolution tree with the comment hierarchical structure semantics captured by the coordination tree network, and inputting the integrated vertical evolution semantics into Softmax to predict probability distribution.

The embodiment of the invention discloses a false news interpretability detection method based on induction consciousness, which comprises the following steps:

stage 0: data initialization

Step 0: consider a Twitter conversation process C that includes a source tweet₁(original rumor news) and a series of responsive commentary tweets in chronological order C ═ t₂，t₃，...，t_|C|H, each of which pushes text t_iExpressing the position of source tweet, including four positions of support, negation, inquiry and comment. Indicates the tweet t_iHas a stand characteristic of d_i。

Step 1: learning each tweet t using Bi-GRU_iAnd using the last step to conceal the layer vector

To represent the tweet.

Stage 1: bottom-up evolution tree network

Step 2: in order to highlight valuable questioning news comments and better express the semantics of the news in the comment thread, a bottom-up evolutionary tree recursive network (BETN) is established to gradually merge child nodes to parent nodes, and finally a vector is formed to represent the whole comment thread. First, the whole comment conversation thread is traversed from bottom to top in a reverse chronological order, so that a comment sequence containing many comment subchains is obtained.

And step 3: note that the creation of the ith comment position may be affected by the previous i-1 comment position in the comment subchain. The invention deeply considers the relevance between a single comment position and the comment subchain where the single comment position is located, and evaluates the relationship between the current node (the ith comment) and the previous node by means of social and psychological knowledge, namely the availability superposition theory and the conservative deviation theory.

And 4, step 4: the invention constructs a significance scoring function s (-) and a composite vector p (-) to calculate the merging degree between the ith node and the previous (i-1) nodes in a comment subchain. s (-) is responsible for measuring the degree to which the ith node in the comment subchain needs reinforcement, and p (-) represents the fusion degree of the two nodes.

And 5: based on these two social psychology theories, the significance score is mainly refined from the following aspects:

1) probability of two adjacent nodes merging. By utilizing a learnable parameter matrix S, the invention realizes the merging probability of two adjacent nodes by means of a bilinear function, thereby evaluating the enhancement degree of the node semantics.

τ(a_i，a_i-1)＝a_i ^TSa_i-1 (1)

2) Semantic differences of nodes and their children chains.

For evaluating the semantic difference between the ith node and the previous (i-1) node. The larger the difference, the more prominent the semantic information of the ith node.

Wherein the content of the first and second substances,

representing the average semantics of the preceding (i-1) node.

3) Comment on the differences between the stands. φ (-) is used to investigate the difference between the review standpoint of the ith node and the previous (i-1) node. The larger the difference, the more obvious the i-th comment from the standpoint.

Wherein d is_iA position vector representing the ith node,

represents the average vector of the previous (i-1) nodes.

4) And (4) timeliness of the nodes. The invention applies power law to slightly reinforce the latest comments to highlight the latest possible rumor splitting sound.

ψ(t_i)＝gt_i ^-α-1 (4)

5) Wherein, t_iIs the interval between the ith node and the first node in the child chain of comments. Both g and α are learnable parameters.

Step 6: finally, the saliency score may be described as:

and 7: the resultant vector p (-) can be computed as:

p(i，i-1)＝s(i，i-1)m(i，i-1) (7)

and 8: the merged results of all nodes in the whole sub-chain of comments are represented as:

and step 9: the invention utilizes TreeLSTM to realize the composite function m (·). TreeLSTM generates a hidden state vector h' and, given two input vectors a_iAnd a_jCell state vector c' of (c).

c′＝c′_i⊙f_i+c′_j⊙f_j+x⊙u (10)

h′＝o′+tanh(c′) (11)

Where U and b are trainable parameters, which indicate a dot product operation. In particular, h here denotes in the framework diagram of the invention

Step 10: traversing the whole comment sequence containing a plurality of comment subchains in the manner, after evolution and combination, the evolution tree network highlights valuable comments of questioning news in the comment threads, and finally the learnable representation is marked as BE (namely the valuable comments are marked as BE)

)。

And (2) stage: top-down coordination tree network

Step 11: considering that the hierarchy of the comment structure is not effectively considered in the bottom-up evolution tree network, the invention constructs a top-down coordination tree network (TCTN) to coordinate the information absorption between the father node and the brother node and strengthen the hierarchical relationship. Specifically, the entire annotation tree is first traversed from top to bottom in chronological order, assuming that π (t) represents the parent of t, k (t) represents its child, p (t) is the previous sibling at the same level in time, and s (t) representsIts successor at the same level. In the top-down coordination tree network of the present architecture diagram, with node t₂For example, π (t) can be obtained₂)＝t₁，k(t₂)＝t₅，

And s (t)₂)＝t₃. If with the node t₃For example, π (t) can be obtained₂)＝t₁，

p(t₃)＝t₂And s (t)₃)＝t₄。

Step 12: the present invention provides for a top-down coordinated tree network update process using subscripts in, fo, hf, c, and o for input gate, temporal forgetting gate, hierarchical forgetting gate, cell unit, and output, respectively. Vector in_t，fo_tAnd hf_tWeights representing newly captured information, old information from siblings, and old information from parents, respectively, are memorized.

Representing a hadamard product. If it is not

Then h_p(t)And c_p(t)Is set to an initial state value. In particular, h_tIn the architecture diagram corresponding to the invention

Step 13: finally, after traversing, we get a representation of the entire conversation thread as TC (i.e., h)_|C|)。

Step 14: according to the invention, the comments of the questioning news acquired from the bottom-up evolution tree network and the hierarchical structure information of the comments acquired from the top-down coordination tree network are integrated, so that a valuable comment subchain of the questioning news is obtained.

F＝[BE；TC] (15)

Stage 3: task learning

Step 15: at the top of the TreeLSTM units of both nets, the bottom-up and top-down state vectors are concatenated and passed to the softmax layer to predict the probability distribution and cross entropy error minimization for the training samples with the true label y:

p＝softmax(W_pF+b_p) (16)

loss＝-∑ylogp (17)

the method is suitable for social network environments and can be used in social media network environments with rich hierarchical comment features.

The device provided by the embodiment of the invention. The embodiment comprises the following steps: a processor, a memory, and a computer program stored in the memory and executable on the processor. The processor realizes the steps of the above-mentioned method embodiments when executing the computer program. Alternatively, the processor implements the functions of the modules/units in the above device embodiments when executing the computer program.

The computer program may be partitioned into one or more modules/units that are stored in the memory and executed by the processor to implement the invention.

The terminal device can be a desktop computer, a notebook, a palm computer, a cloud server and other computing devices. The terminal device may include, but is not limited to, a processor, a memory.

The processor may be a Central Processing Unit (CPU), other general purpose processor, a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), an off-the-shelf programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, etc.

The memory may be used for storing the computer programs and/or modules, and the processor may implement various functions of the terminal device by executing or executing the computer programs and/or modules stored in the memory and calling data stored in the memory.

The terminal device integrated modules/units, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer readable storage medium. Based on such understanding, all or part of the flow of the method according to the embodiments of the present invention may also be implemented by a computer program, which may be stored in a computer-readable storage medium, and when the computer program is executed by a processor, the steps of the method embodiments may be implemented. Wherein the computer program comprises computer program code, which may be in the form of source code, object code, an executable file or some intermediate form, etc. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, usb disk, removable hard disk, magnetic disk, optical disk, computer memory, Read-only memory (ROM), Random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution medium, etc. It should be noted that the computer readable medium may contain content that is subject to appropriate increase or decrease as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media does not include electrical carrier signals and telecommunications signals as is required by legislation and patent practice.

The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes will occur to those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A false news interpretability detection method based on induction consciousness is characterized by comprising the following steps:

Representing the tweet;

and 5, refining the significance score based on two social psychology theories:

1) probability τ of two adjacent nodes merging:

τ(a_i，a_i-1)＝a_i ^TSa_i-1 (1)

2) semantic differences between nodes and their children chains

Wherein the content of the first and second substances,

mean semantics representing the preceding (i-1) node;

3) review the differences between the sites φ:

an average vector representing the previous (i-1) nodes;

4) timeliness of the nodes ψ:

ψ(t_i)＝gt_i ^-α-1 (4)

step 6, significance scoring:

step 7, calculating a synthetic vector p (·):

p(i，i-1)＝s(i，i-1)m(i，i-1) (7)

step 8, merging all nodes in the whole comment subchain as follows:

c′＝c′_i⊙f_i+c′_j⊙f_j+x⊙u (10)

h′＝o′+tanh(c′) (11)

step 12, updating the coordination tree network:

representing a hadamard product; if it is not

Then h_p(t)And c_p(t)Is set to an initial state value;

F＝[BE；TC] (15)。

2. the induction consciousness-based false news interpretability detection method of claim 1, wherein the specific method of the step 11 is as follows:

3. A induction awareness based false news interpretability detection system, comprising:

4. A induction awareness based false news interpretability detection terminal device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor when executing the computer program implements the steps of the method according to claim 1 or 2.

5. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to claim 1 or 2.