WO2023050670A1 - False information detection method and system, computer device, and readable storage medium - Google Patents

False information detection method and system, computer device, and readable storage medium Download PDF

Info

Publication number
WO2023050670A1
WO2023050670A1 PCT/CN2022/074411 CN2022074411W WO2023050670A1 WO 2023050670 A1 WO2023050670 A1 WO 2023050670A1 CN 2022074411 W CN2022074411 W CN 2022074411W WO 2023050670 A1 WO2023050670 A1 WO 2023050670A1
Authority
WO
WIPO (PCT)
Prior art keywords
detected
model
data
information
reply
Prior art date
Application number
PCT/CN2022/074411
Other languages
French (fr)
Chinese (zh)
Inventor
舒畅
陈又新
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2023050670A1 publication Critical patent/WO2023050670A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Definitions

  • the embodiments of the present application relate to the technical field of data processing, and in particular to a false information detection method, system, computer equipment, and readable storage medium.
  • Network platforms such as Twitter, WeChat, Weibo, and Tieba are full of false information, followed by a large number of forwarding and replying. Although there is an algorithm for identifying false information at present, it does not meet the timeliness requirements and cannot quickly identify false information.
  • the purpose of the embodiment of the present application is to provide a
  • the false information detection method, system, computer equipment and readable storage medium are used to solve the problem of insufficient real-time detection of false information.
  • an embodiment of the present application provides a false information detection method, including:
  • Acquiring data to be detected wherein the data to be detected includes detection source information and response information to be detected corresponding to the source information to be detected at the current moment;
  • the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained.
  • the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
  • an embodiment of the present application provides a false information detection system, including:
  • An acquisition module configured to acquire data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
  • a vectorization module configured to perform vectorization processing on the data to be detected, to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
  • An encoding module configured to encode the feature vector of the source to be detected and the feature vector of the reply to be detected through the encoding layer in the pre-trained false information classification model, to obtain the source to be detected corresponding to the information of the source to be detected A feature code and a feature code of the reply to be detected corresponding to the reply information to be detected;
  • the judging module is used to classify and pre-judge the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model, so as to determine whether the data to be detected needs to be classified;
  • a classification module configured to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected through the classification layer of the false information classification model if it is determined that the data to be detected needs to be classified.
  • the detection data is classified to obtain a first classification result corresponding to the data to be detected.
  • an embodiment of the present application provides a computer device, the computer device includes a memory and a processor, the memory stores computer-readable instructions that can run on the processor, and the processor The following steps are also performed when the computer readable instructions are executed:
  • Acquiring data to be detected wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
  • the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained.
  • the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
  • an embodiment of the present application provides a computer-readable storage medium, the computer-readable storage medium stores computer-readable instructions, and the computer-readable instructions can be executed by at least one processor to causing the at least one processor to perform the following steps:
  • Acquiring data to be detected wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
  • the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained.
  • the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
  • the false information detection method, system, computer equipment, and readable storage medium provided in the embodiments of the present application obtain the source information to be detected and the corresponding reply information to be detected at the current moment, and perform the detection on the source information to be detected and the corresponding reply information to be detected.
  • Vectorization processing and encoding processing after encoding and inputting the source information to be detected and its corresponding reply information to be detected into the pre-trained deep reinforcement learning model, whether to classify the source information to be detected and its corresponding reply information to be detected
  • Real-time detection if it is determined that the data to be detected needs to be classified and processed, then the feature code of the data to be detected is classified and judged, and the reply information to be detected in the data to be detected is not obtained again for classification processing, which improves the efficiency of false information detection.
  • FIG. 1 is a flow chart of Embodiment 1 of the false information detection method of the present application.
  • FIG. 2 is a schematic diagram of the modules of Embodiment 2 of the false information detection system of the present application.
  • FIG. 3 is a schematic diagram of the hardware structure of Embodiment 3 of the computer device of the present application.
  • FIG. 1 shows a flow chart of steps of a false information detection method according to Embodiment 1 of the present application. It can be understood that the flowchart in this method embodiment is not used to limit the sequence of execution steps. An exemplary description is given below taking the computer device 2 as the execution subject. details as follows.
  • Step S100 Obtain data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment.
  • the source information to be detected is the source information that needs to be detected for false information
  • the reply information to be detected is information that replies to the source information. Since the source information to be detected and the reply information to be detected are obtained in real time, the source information to be detected and the reply information to be detected have a time sequence.
  • Step S102 performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected.
  • the data to be detected is vectorized through the preset vectorization model to obtain the source feature vector to be detected corresponding to the source information to be detected and the reply feature vector to be detected corresponding to the reply information to be detected.
  • the vectorization model is visual (Bert +ImageNet) model.
  • step S102 includes:
  • Step S1021. Obtain first text data in the source information to be detected, and perform vectorization processing on the first text data through a first vectorization model to obtain a first feature vector.
  • Step S1022. Obtain the first image data in the source information to be detected, and perform vectorization processing on the first image data through a second vectorization model to obtain a second feature vector.
  • Step S1023, splicing the first feature vector and the second feature vector to obtain a source feature vector to be detected corresponding to the source information to be detected.
  • the source information to be detected includes text data and chart data
  • vectorization processing is performed through the first vectorization model and the second vectorization model, wherein the first vectorization model is a visual Bert model, and the second vectorization model is a visual Bert model.
  • the vectorized model is a visual ImageNet model.
  • the visual ImageNet model is a model for vectorizing images. This model is obtained after vectorization training on the ImageNet image library, and the accuracy of the model is more accurate.
  • the processed first feature vector and the second feature vector are then concatenated to obtain the source feature vector to be detected corresponding to the source information to be detected.
  • step S102 includes:
  • Step S102A Obtain second text data in the reply information to be detected, and perform vectorization processing on the second text data through the first vectorization model to obtain a third feature vector.
  • Step S102B Obtain the second picture data in the reply information to be detected, and perform vectorization processing on the second picture data through the second vectorization model to obtain a fourth feature vector.
  • Step S102C concatenating the third feature vector and the fourth feature vector to obtain a feature vector of a reply to be detected corresponding to the reply information to be detected.
  • the reply information to be detected includes text data and graph data
  • vectorization processing is performed through the first vectorization model and the second vectorization model, wherein the first vectorization model is a visual Bert model, and the second vectorization model is a visual Bert model.
  • the vectorized model is the visual ImageNet model.
  • the processed third feature vector and the fourth feature vector are then concatenated to obtain the feature vector of the reply to be detected corresponding to the reply information to be detected.
  • Step S104 Encoding the source feature vector to be detected and the reply feature vector to be detected through the encoding layer in the pre-trained false information classification model to obtain the source feature code to be detected corresponding to the source information to be detected and the feature code of the reply to be detected corresponding to the reply to be detected information.
  • the false information classification model includes an encoding layer.
  • the LSTM model Since the LSTM model has a memory function, the data to be detected at the current time and before the current time are stored in the LSTM model. If the reply information to be detected at the latest time is obtained, it only needs to encode the reply information to be detected at the latest time.
  • Step S106 perform classification pre-judgment on the to-be-detected source feature code and the to-be-detected reply feature code through the trained deep reinforcement learning model, so as to determine whether to classify the to-be-detected data.
  • the false information classification model is a neural network model.
  • the deep reinforcement learning model is the Dueling-DQN (Deep Q Network) network model, which can predict the input source feature codes to be detected and the reply feature codes to be detected, and predict whether the input data to be detected at the current moment can be followed up. classification processing.
  • the Dueling-DQN network model includes state (state), reward (Reward), and behavior (Action).
  • Q is Q(s, a) that is, in the state of s at a certain moment (s ⁇ S), take action a(a ⁇ A)
  • the environment will feedback the corresponding reward r according to the action of the agent, if the agent judges that the current state is enough for the classifier to classify with high confidence, then stop reading the reply and let the classifier do the classification; Otherwise continue reading the next reply.
  • Step S108 if it is determined that the data to be detected needs to be classified, the data to be detected is classified according to the source feature code to be detected and the reply feature code to be detected through the classification layer of the false information classification model Perform classification to obtain a first classification result corresponding to the data to be detected.
  • the false information classification model is provided with a Classifier classifier, that is, a softmax loss function, which performs classification processing on the data to be detected to obtain a corresponding first classification result.
  • the first classification result is used to indicate whether the data to be detected is false information.
  • the classifier may be a binary classification classifier, and the first classification result output is 0 or 1, 0 indicating that it is not false information, and 1 indicating that it is false information.
  • step S108 it also includes:
  • step S102 to step S106 are repeated to perform vectorization and encoding processing on the reply information to be detected at the next moment, and then input into the deep Strengthen the pre-judgment in the school model until it is determined that the data to be tested needs to be classified.
  • the deep enhanced school model performs classification pre-judgment, it is necessary to predict all the data to be tested.
  • the data to be detected includes the source information to be detected, the reply information to be detected before the current moment, the reply information to be detected at the current moment, and the reply information to be detected at the next moment; if it is still determined that the data to be detected does not need to be classified, Then obtain the reply data to be detected at the next moment.
  • the training steps of the deep reinforcement learning model include:
  • each training sample set includes the first feature code of the sample source information and the second feature code of the sample reply information corresponding to the sample source information at different time step values, wherein the sample source information corresponds to The time step value of is smaller than the time step value of the corresponding sample reply information.
  • the first feature encoding and the second feature encoding in the training sample set are obtained by vectorizing and re-encoding the sample source information and the sample reply information.
  • the second feature code corresponding to each time step value includes the first feature code corresponding to the sample source information and the feature codes of all sample reply information at the current moment, so that the enhanced model can be classified according to the first feature code or the second feature code preprocessing.
  • the feature encoding corresponding to each time step value is classified.
  • the enhanced model pre-judges the feature encoding of each time step. Based on the Q value judgment in the enhanced model Dueling-DQN, the larger the Q value, the greater the possibility of classification processing.
  • the Q value is greater than the preset threshold When , it means that the classification process is carried out. If the Q value calculated by the current enhanced model is less than the preset threshold, continue to obtain the feature code of the sample reply information, generate the next second feature code, and input it into the enhanced model for judgment.
  • the model outputs the second classification result of each training sample set. If the enhanced model judges to stop inputting the second feature code into the enhanced model, it means that the enhanced model judges that the current second feature code can be classified into false information, and inputs it into the false information classification model for classification processing. Wherein, if the enhanced model can determine that it is not necessary to read the second feature code of the sample reply information through the first feature code of the sample source information, then input the first feature code into the false information classification model to perform false information classification prediction, Get the second classification result.
  • the real classification results of the training sample set are obtained in advance and associated with the training sample set.
  • the model parameters of the reinforcement model are updated according to the first update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
  • the method further includes:
  • the method further includes:
  • If it is determined to continue to input the second feature code into the enhanced model, update the reward and punishment value of the enhanced model to obtain a third updated reward and punishment value, and calculate the loss of the enhanced model according to the third updated reward and punishment value function to obtain a third update function; update the model parameters of the enhanced model according to the third update function to obtain an updated enhanced model; encode the first feature code and each second feature code in each training sample set Input into the updated enhanced model in sequence according to the time step value, and judge whether to stop inputting the second feature code into the enhanced model through the updated enhanced model, until it is determined to stop inputting the second feature code to the enhanced model. If you choose to continue to read the reply, give a small penalty r t 0.05 to get the third update reward and punishment value, so as to limit the Agent to keep increasing the reply.
  • the whole detection process is divided into two modules, one is the classification module (false information classification model), and the other is the control module (deep reinforcement learning model).
  • the classification module false information classification model
  • the control module deep reinforcement learning model
  • the reply information is encoded, and after encoding, the encoded information will be input to the control module.
  • the control module is a deep reinforcement learning model, which will judge the action of the input encoded information, and judge whether to stop or continue to obtain the next reply information to be detected.
  • the false information classification module will be triggered to classify the current status information (feature code) to determine whether it is false information; if the action is judged to be continued, the classification module will not be triggered to classify, thereby allowing the next
  • the reply information is input into LSTM for encoding, and the latest encoded information is input to the control module to judge the action again, and so on. Because LSTM is a network with a cyclic neural network structure, it has the ability to encode historical information as a whole. When it is judged to continue, it only needs to obtain the next reply information to be detected.
  • the deep reinforcement learning model will judge the action on the feature encoding of the overall information every time.
  • the classifier When training the deep reinforcement learning model, once the action is judged to be stopped, the classifier will be triggered to start classification, so the classification result of the classifier will be divided into right and wrong. If the classification is correct, the control module will be rewarded. If the classification is wrong, the control module is penalized. If the action is always judged as continuing, the control model will also get a penalty, but this penalty will be very small.
  • FIG. 2 shows a schematic diagram of the program modules of Embodiment 2 of the false information detection system of the present application.
  • the false information detection system 20 may include or be divided into one or more program modules, and one or more program modules are stored in a storage medium and executed by one or more processors to complete
  • the program module referred to in the embodiment of this application refers to a series of computer program instruction segments capable of completing specific functions, which is more suitable for describing the execution process of the false information detection system 20 in the storage medium than the program itself.
  • the following description will specifically introduce the functions of each program module of the present embodiment:
  • the acquisition module 200 is configured to acquire data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment.
  • the vectorization module 202 is configured to perform vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected.
  • the vectorization module 202 is also used for:
  • the vectorization module 202 is also used for:
  • the encoding module 204 is configured to encode the source feature vector to be detected and the reply feature vector to be detected through the encoding layer in the pre-trained false information classification model to obtain the source information to be detected corresponding to the source information to be detected The source feature code and the feature code of the reply to be detected corresponding to the reply information to be detected.
  • the judging module 206 is configured to perform classification pre-judgment on the to-be-detected source feature code and the to-be-detected reply feature code through the trained deep reinforcement learning model, so as to determine whether to classify the to-be-detected data.
  • the classification module 208 is configured to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected through the classification layer of the false information classification model if it is determined that the data to be detected needs to be classified.
  • the data to be detected is classified to obtain a first classification result corresponding to the data to be detected.
  • the classification module 208 is also used for:
  • the computer device 2 is a device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions.
  • the computer device 2 may be a rack server, a blade server, a tower server or a cabinet server (including an independent server, or a server cluster composed of multiple servers) and the like.
  • the server can be an independent server, or it can provide cloud services, cloud database, cloud computing, cloud function, cloud storage, network service, cloud communication, middleware service, domain name service, security service, content delivery network (Content Delivery Network, CDN), and cloud servers for basic cloud computing services such as big data and artificial intelligence platforms.
  • the computer device 2 at least includes, but is not limited to, a memory 21 , a processor 22 , a network interface 23 , and a false information detection system 20 that can communicate with each other through a system bus. in:
  • the memory 21 includes at least one type of computer-readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory ( RAM), static random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, etc.
  • the memory 21 may be an internal storage unit of the computer device 2 , such as a hard disk or memory of the computer device 2 .
  • the memory 21 can also be an external storage device of the computer device 2, such as a plug-in hard disk equipped on the computer device 2, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, flash memory card (Flash Card), etc.
  • the storage 21 may also include both the internal storage unit of the computer device 2 and its external storage device.
  • the memory 21 is generally used to store the operating system and various application software installed in the computer device 2, such as the program code of the false information detection system 20 in the second embodiment.
  • the memory 21 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 22 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips.
  • the processor 22 is generally used to control the overall operation of the computer device 2 .
  • the processor 22 is used to run the program codes stored in the memory 21 or process data, for example, to run the false information detection system 20, so as to realize the false information detection method in the first embodiment.
  • the network interface 23 may include a wireless network interface or a wired network interface, and the network interface 23 is generally used to establish a communication connection between the server 2 and other electronic devices.
  • the network interface 23 is used to connect the server 2 with an external terminal through a network, and establish a data transmission channel and a communication connection between the server 2 and an external terminal.
  • the network can be an enterprise intranet (Intranet), Internet (Internet), Global System of Mobile communication (Global System of Mobile communication, GSM), wideband code division multiple access (Wideband Code Division Multiple Access, WCDMA), 4G network, 5G Internet, Bluetooth (Bluetooth), Wi-Fi and other wireless or wired networks.
  • FIG. 3 only shows computer device 2 having components 20-23, but it should be understood that implementation of all of the illustrated components is not required and that more or fewer components may instead be implemented.
  • the false information detection system 20 stored in the memory 21 can also be divided into one or more program modules, and the one or more program modules are stored in the memory 21 and are controlled by one or more program modules. Executed by multiple processors (the processor 22 in this embodiment) to complete the application.
  • FIG. 2 shows a schematic diagram of program modules for realizing the second embodiment of the false information detection system 20.
  • the false information detection system 20 can be divided into the acquisition module 200, the vectorization module 202 , the encoding module 204 , the judging module 206 and the classifying module 208 .
  • the program module referred to in this application refers to a series of computer program instruction segments capable of completing specific functions, which is more suitable than a program to describe the execution process of the false information detection system 20 in the computer device 2 .
  • the specific functions of the program modules 200-208 have been described in detail in the second embodiment, and will not be repeated here.
  • This embodiment also provides a computer-readable storage medium, such as flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), only Read memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, server, App application store, etc., on which computer programs are stored, The corresponding functions are realized when the program is executed by the processor.
  • the computer-readable storage medium may be non-volatile or volatile.
  • the computer-readable storage medium of this embodiment is used for a computer program, and when executed by a processor, the following steps are implemented:
  • Acquiring data to be detected wherein the data to be detected includes detection source information and response information to be detected corresponding to the source information to be detected at the current moment;
  • the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained.
  • the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.

Abstract

The present application discloses a false information detection method, comprising: obtaining data to be detected comprising source information to be detected and reply information to be detected of said source information at the current time; performing vectorization processing on said data to obtain a source feature vector to be detected corresponding to said source information and a reply feature vector to be detected corresponding to said reply information; performing encoding processing on the feature vector of said data by means of the encoding layer of a pre-trained false information classification model, so as to obtain the feature code of said data; performing classification pre-determination on the feature code of said data by means of a trained deep reinforcement learning model, so as to determine whether said data needs to be classified; and if yes, classifying said data by means of the classification layer of the false information classification model according to a source feature code to be detected and a reply feature code to be detected, so as to obtain a first classification result corresponding to said data. The present application can perform real-time detection on said data.

Description

虚假信息检测方法、系统、计算机设备及可读存储介质False information detection method, system, computer equipment and readable storage medium
本申请要求于2021年09月30日提交中国专利局、申请号为202111156357.5,发明名称为“虚假信息检测方法、系统、计算机设备及可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application submitted to the China Patent Office on September 30, 2021 with the application number 202111156357.5 and the title of the invention is "False Information Detection Method, System, Computer Equipment, and Readable Storage Medium", the entire content of which Incorporated in this application by reference.
技术领域technical field
本申请实施例涉及数据处理技术领域,尤其涉及一种虚假信息检测方法、系统、计算机设备及可读存储介质。The embodiments of the present application relate to the technical field of data processing, and in particular to a false information detection method, system, computer equipment, and readable storage medium.
背景技术Background technique
随着互联网与自媒体行业的迅速发展,人们每天都要接收与发出不计其数的信息,进入了信息爆炸的时代,这些信息无时无刻不在影响着人们的生活。然而,发明人发现,正如人们在传统的口头交流中那样,互联网所传递的信息并不是完全真实与可信的。在铺天盖地的信息中总会包含那么一些虚假的、对人的认知、思想与行为产生误导的信息,这便是网络谣言,即虚假信息。With the rapid development of the Internet and the self-media industry, people have to receive and send countless information every day, entering the era of information explosion, which affects people's lives all the time. However, the inventors have found that, just like people in traditional oral communication, the information delivered by the Internet is not completely true and credible. The overwhelming information always contains some false information that misleads people's cognition, thinking and behavior. This is Internet rumors, that is, false information.
网络平台如推特、微信、微博和贴吧等充斥大量虚假信息,随之而来的是大量的转发和回复等等。目前虽然有识别是否虚假信息的算法,但不满足时效性要求,无法快速的识别出虚假信息。Network platforms such as Twitter, WeChat, Weibo, and Tieba are full of false information, followed by a large number of forwarding and replying. Although there is an algorithm for identifying false information at present, it does not meet the timeliness requirements and cannot quickly identify false information.
发明内容Contents of the invention
有鉴于此,本申请实施例的目的是提供一种In view of this, the purpose of the embodiment of the present application is to provide a
虚假信息检测方法、系统、计算机设备及可读存储介质,用以解决虚假信息检测不够实时性的问题。The false information detection method, system, computer equipment and readable storage medium are used to solve the problem of insufficient real-time detection of false information.
为实现上述目的,本申请实施例提供了一种虚假信息检测方法,包括:In order to achieve the above purpose, an embodiment of the present application provides a false information detection method, including:
获取待检测数据,其中,所述待检测数据包括检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;Acquiring data to be detected, wherein the data to be detected includes detection source information and response information to be detected corresponding to the source information to be detected at the current moment;
对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;Performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及 所述待检测回复信息对应的待检测回复特征编码;Through the encoding layer in the pre-trained false information classification model, the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained. The feature code of the response to be detected corresponding to the response to be detected;
通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及Perform classification pre-judgment on the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model to determine whether the data to be detected needs to be classified; and
若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。If it is determined that the data to be detected needs to be classified, the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
为实现上述目的,本申请实施例提供了一种虚假信息检测系统,包括:In order to achieve the above purpose, an embodiment of the present application provides a false information detection system, including:
获取模块,用于获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;An acquisition module, configured to acquire data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
向量化模块,用于对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;A vectorization module, configured to perform vectorization processing on the data to be detected, to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
编码模块,用于通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码;An encoding module, configured to encode the feature vector of the source to be detected and the feature vector of the reply to be detected through the encoding layer in the pre-trained false information classification model, to obtain the source to be detected corresponding to the information of the source to be detected A feature code and a feature code of the reply to be detected corresponding to the reply information to be detected;
判断模块,用于通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及The judging module is used to classify and pre-judge the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model, so as to determine whether the data to be detected needs to be classified; and
分类模块,用于若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。A classification module, configured to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected through the classification layer of the false information classification model if it is determined that the data to be detected needs to be classified. The detection data is classified to obtain a first classification result corresponding to the data to be detected.
为实现上述目的,本申请实施例提供了一种计算机设备,所述计算机设备包括存储器、处理器,所述存储器上存储有可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时还执行以下步骤:To achieve the above purpose, an embodiment of the present application provides a computer device, the computer device includes a memory and a processor, the memory stores computer-readable instructions that can run on the processor, and the processor The following steps are also performed when the computer readable instructions are executed:
获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;Acquiring data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;Performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码;Through the encoding layer in the pre-trained false information classification model, the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained. The feature code of the response to be detected corresponding to the response to be detected;
通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及Perform classification pre-judgment on the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model to determine whether the data to be detected needs to be classified; and
若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。If it is determined that the data to be detected needs to be classified, the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
为实现上述目的,本申请实施例提供了一种计算机可读存储介质,所述计算机可读存储介质内存储有计算机可读指令,所述计算机可读指令可被至少一个处理器所执行,以使所述至少一个处理器执行以下步骤:In order to achieve the above object, an embodiment of the present application provides a computer-readable storage medium, the computer-readable storage medium stores computer-readable instructions, and the computer-readable instructions can be executed by at least one processor to causing the at least one processor to perform the following steps:
获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;Acquiring data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;Performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码;Through the encoding layer in the pre-trained false information classification model, the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained. The feature code of the response to be detected corresponding to the response to be detected;
通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及Perform classification pre-judgment on the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model to determine whether the data to be detected needs to be classified; and
若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。If it is determined that the data to be detected needs to be classified, the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
本申请实施例提供的虚假信息检测方法、系统、计算机设备及可读存储介质,获取待检测源信息以及在当前时刻对应的待检测回复信息,对待检测源信息以及其对应的待检测回复信息进行向量化处理和编码处理,在将特征待检测源信息以及其对应的待检测回复信息编码输入至预先训练的深度强化学习模型中,对待检测源信息以及其对应的待检测回复信息是否进行分类处理的判断,当进行分类处理时,通过虚假信息分类模型待检测源信息以及其对应的待检测回复信息进行分类,得到对应的分类信息,以实现对待检测源信息以及其对应的待检测回复信息的实时检测,若判定出需要对待检测数据进行分类处理,则对待检测数据的特征编码进行分类判断,不要再次获取待检测数据中的待检测回复信息进行分类处理,提高了虚假信息检测的效率。The false information detection method, system, computer equipment, and readable storage medium provided in the embodiments of the present application obtain the source information to be detected and the corresponding reply information to be detected at the current moment, and perform the detection on the source information to be detected and the corresponding reply information to be detected. Vectorization processing and encoding processing, after encoding and inputting the source information to be detected and its corresponding reply information to be detected into the pre-trained deep reinforcement learning model, whether to classify the source information to be detected and its corresponding reply information to be detected When performing classification processing, classify the source information to be detected and its corresponding reply information to be detected through the false information classification model to obtain the corresponding classification information, so as to realize the classification of the source information to be detected and its corresponding reply information to be detected Real-time detection, if it is determined that the data to be detected needs to be classified and processed, then the feature code of the data to be detected is classified and judged, and the reply information to be detected in the data to be detected is not obtained again for classification processing, which improves the efficiency of false information detection.
附图说明Description of drawings
图1为本申请虚假信息检测方法实施例一的流程图。FIG. 1 is a flow chart of Embodiment 1 of the false information detection method of the present application.
图2为本申请虚假信息检测系统实施例二的模块示意图。FIG. 2 is a schematic diagram of the modules of Embodiment 2 of the false information detection system of the present application.
图3为本申请计算机设备实施例三的硬件结构示意图。FIG. 3 is a schematic diagram of the hardware structure of Embodiment 3 of the computer device of the present application.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.
实施例一Embodiment one
参阅图1,示出了本申请实施例一之虚假信息检测方法的步骤流程图。可以理解,本方法实施例中的流程图不用于对执行步骤的顺序进行限定。下面以计算机设备2为执行主体进行示例性描述。具体如下。Referring to FIG. 1 , it shows a flow chart of steps of a false information detection method according to Embodiment 1 of the present application. It can be understood that the flowchart in this method embodiment is not used to limit the sequence of execution steps. An exemplary description is given below taking the computer device 2 as the execution subject. details as follows.
步骤S100、获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息。Step S100. Obtain data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment.
具体地,为了达到对检测虚假信息的实时性,实时监测并获取待检测数据,待检测源信息为需要进行虚假信息检测的源头信息,待检测回复信息为对该源头信息进行回复的信息。由于是实时获取待检测源信息以及待检测回复信息,因此,待检测源信息以及待检测回复信息具有时序性。Specifically, in order to achieve real-time detection of false information, real-time monitoring and acquisition of data to be detected, the source information to be detected is the source information that needs to be detected for false information, and the reply information to be detected is information that replies to the source information. Since the source information to be detected and the reply information to be detected are obtained in real time, the source information to be detected and the reply information to be detected have a time sequence.
步骤S102、对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量。Step S102 , performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected.
具体地,通过预设的向量化模型对待检测数据进行向量化处理,得到待检测源信息对应的待检测源特征向量以及待检测回复信息对应的待检测回复特征向量,向量化模型为visual(Bert+ImageNet)模型。Specifically, the data to be detected is vectorized through the preset vectorization model to obtain the source feature vector to be detected corresponding to the source information to be detected and the reply feature vector to be detected corresponding to the reply information to be detected. The vectorization model is visual (Bert +ImageNet) model.
示例性地,所述步骤S102包括:Exemplarily, the step S102 includes:
步骤S1021、获取所述待检测源信息中的第一文本数据,将所述第一文本数据通过第一向量化模型进行向量化处理,得到第一特征向量。步骤S1022、获取所述待检测源信息中的第一图片数据,将所述第一图片数据通过第二向量化模型进行向量化处理,得到第二特征向量。步骤S1023、将所述第一特征向量与所述第二特征向量进行拼接,得到所述待检测源信息对应的待检测源特征向量。Step S1021. Obtain first text data in the source information to be detected, and perform vectorization processing on the first text data through a first vectorization model to obtain a first feature vector. Step S1022. Obtain the first image data in the source information to be detected, and perform vectorization processing on the first image data through a second vectorization model to obtain a second feature vector. Step S1023, splicing the first feature vector and the second feature vector to obtain a source feature vector to be detected corresponding to the source information to be detected.
具体地,当待检测源信息中包括有文本数据以及图表数据时,分别通过第一向量化模 型与第二向量化模型进行向量化处理,其中,第一向量化模型为visual Bert模型,第二向量化模型为visual ImageNet模型。visual ImageNet模型为对图像进行向量化处理的模型,该模型通过对ImageNet图像库进行向量化训练后得到,模型的精确度更加准确。再将处理后的第一特征向量以及第二特征向量进行拼接,得到待检测源信息对应的待检测源特征向量。Specifically, when the source information to be detected includes text data and chart data, vectorization processing is performed through the first vectorization model and the second vectorization model, wherein the first vectorization model is a visual Bert model, and the second vectorization model is a visual Bert model. The vectorized model is a visual ImageNet model. The visual ImageNet model is a model for vectorizing images. This model is obtained after vectorization training on the ImageNet image library, and the accuracy of the model is more accurate. The processed first feature vector and the second feature vector are then concatenated to obtain the source feature vector to be detected corresponding to the source information to be detected.
示例性地,所述步骤S102包括:Exemplarily, the step S102 includes:
步骤S102A、获取所述待检测回复信息中的第二文本数据,将所述第二文本数据通过所述第一向量化模型进行向量化处理,得到第三特征向量。步骤S102B、获取所述待检测回复信息中的第二图片数据,将所述第二图片数据通过所述第二向量化模型进行向量化处理,得到第四特征向量。步骤S102C、将所述第三特征向量与所述第四特征向量进行拼接,得到所述待检测回复信息对应的待检测回复特征向量。Step S102A. Obtain second text data in the reply information to be detected, and perform vectorization processing on the second text data through the first vectorization model to obtain a third feature vector. Step S102B. Obtain the second picture data in the reply information to be detected, and perform vectorization processing on the second picture data through the second vectorization model to obtain a fourth feature vector. Step S102C, concatenating the third feature vector and the fourth feature vector to obtain a feature vector of a reply to be detected corresponding to the reply information to be detected.
具体地,当待检测回复信息中包括有文本数据以及图表数据时,分别通过第一向量化模型与第二向量化模型进行向量化处理,其中,第一向量化模型为visual Bert模型,第二向量化模型为visual ImageNet模型。再将处理后的第三特征向量以及第四特征向量进行拼接,得到待检测回复信息对应的待检测回复特征向量。Specifically, when the reply information to be detected includes text data and graph data, vectorization processing is performed through the first vectorization model and the second vectorization model, wherein the first vectorization model is a visual Bert model, and the second vectorization model is a visual Bert model. The vectorized model is the visual ImageNet model. The processed third feature vector and the fourth feature vector are then concatenated to obtain the feature vector of the reply to be detected corresponding to the reply information to be detected.
步骤S104、通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码。Step S104: Encoding the source feature vector to be detected and the reply feature vector to be detected through the encoding layer in the pre-trained false information classification model to obtain the source feature code to be detected corresponding to the source information to be detected and the feature code of the reply to be detected corresponding to the reply to be detected information.
具体地,虚假信息分类模型包括编码层,编码层为LSTM(Long Short-Term Memory,长短期记忆)神经网络模型的结构,可以对待检测源特征向量以及待检测回复特征向量进行编码处理,得到每一时间步的状态向量States t=LSTM(feature t),每一回复信息为一个时间步t,状态向量用于描述当前的环境。LSTM模型的输入是每个回复文本,LSTM的输出是这个文本的向量特征,这个向量代表这个文本的信息,其中t=0时表示源头的新闻/推特/微博/帖子。由于LSTM模型具有记忆功能,当前时刻以及当前时刻之前的待检测数据存储于LSTM模型之中,若获取到最新时刻的待检测回复信息,只需对最新时刻的待检测回复信息进行编码处理。 Specifically, the false information classification model includes an encoding layer. The encoding layer is a structure of an LSTM (Long Short-Term Memory, long-short-term memory) neural network model, which can encode the source feature vector to be detected and the reply feature vector to be detected, and obtain each The state vector States t of a time step = LSTM(feature t ), each reply message is a time step t, and the state vector is used to describe the current environment. The input of the LSTM model is each reply text, and the output of the LSTM is the vector feature of the text, which represents the information of the text, where t=0 indicates the source news/twitter/microblog/post. Since the LSTM model has a memory function, the data to be detected at the current time and before the current time are stored in the LSTM model. If the reply information to be detected at the latest time is obtained, it only needs to encode the reply information to be detected at the latest time.
步骤S106、通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理。所述虚假信息分类模型为神经网络模型。Step S106 , perform classification pre-judgment on the to-be-detected source feature code and the to-be-detected reply feature code through the trained deep reinforcement learning model, so as to determine whether to classify the to-be-detected data. The false information classification model is a neural network model.
具体地,深度强化学习模型为Dueling-DQN(Deep Q Network)网络模型,可以对输入的待检测源特征编码以及待检测回复特征编码进行预测,预测当前时刻输入的待检测数据是否可以进行后续的分类处理。通过深度强化学习模型,可以减少后续数据处理的步骤, 提高虚假信息识别效率。当深度强化学习模型预测出当前时刻可以进行分类处理时,启动虚假信息分类模型对待检测数据进行分类处理。Dueling-DQN网络模型包括状态(state)、奖励(Reward)、行为(Action),Q即为Q(s,a)就是在某一时刻的s状态下(s∈S),采取动作a(a∈A)动作能够获得收益的期望,环境会根据agent的动作反馈相应的回报reward r,如果Agent判断当前状态足够分类器进行高置信度的分类,则停止读取回复,让分类器做分类;否则继续读取下一个回复。动作选择时加入一定的随机性,即有∈=1%的概率随机选择一种动作k=random(K)。Specifically, the deep reinforcement learning model is the Dueling-DQN (Deep Q Network) network model, which can predict the input source feature codes to be detected and the reply feature codes to be detected, and predict whether the input data to be detected at the current moment can be followed up. classification processing. Through the deep reinforcement learning model, the steps of subsequent data processing can be reduced, and the efficiency of false information identification can be improved. When the deep reinforcement learning model predicts that classification processing can be performed at the current moment, the false information classification model is started to classify the data to be detected. The Dueling-DQN network model includes state (state), reward (Reward), and behavior (Action). Q is Q(s, a) that is, in the state of s at a certain moment (s∈S), take action a(a ∈A) The expectation that the action can gain benefits, the environment will feedback the corresponding reward r according to the action of the agent, if the agent judges that the current state is enough for the classifier to classify with high confidence, then stop reading the reply and let the classifier do the classification; Otherwise continue reading the next reply. A certain amount of randomness is added to the action selection, that is, an action k=random(K) is randomly selected with a probability of ∈=1%.
步骤S108、若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。Step S108, if it is determined that the data to be detected needs to be classified, the data to be detected is classified according to the source feature code to be detected and the reply feature code to be detected through the classification layer of the false information classification model Perform classification to obtain a first classification result corresponding to the data to be detected.
具体地,虚假信息分类模型中设有Classifier分类器,即softmax损失函数,对待检测数据进行分类处理,得到相应的第一分类结果。第一分类结果用于表示待检测数据是否为虚假信息,该分类器可以为二分类分类器,输出的第一分类结果为0或1,0表示不是虚假信息,1表示是虚假信息。针对新闻、推特、微博、帖子、微信文章和群聊的虚假信息假资讯,可以在回复信息的传播量增长前,及时更快速地判断出是否是虚假信息假资讯,并让用户及早做出删除屏蔽等舆情控制措施。Specifically, the false information classification model is provided with a Classifier classifier, that is, a softmax loss function, which performs classification processing on the data to be detected to obtain a corresponding first classification result. The first classification result is used to indicate whether the data to be detected is false information. The classifier may be a binary classification classifier, and the first classification result output is 0 or 1, 0 indicating that it is not false information, and 1 indicating that it is false information. For false information and false information in news, tweets, Weibo, posts, WeChat articles, and group chats, it is possible to judge whether it is false information and false information in a timely and rapid manner before the spread of reply information increases, and allow users to act as soon as possible. Public opinion control measures such as deletion and blocking have been introduced.
示例性地,所述步骤S108之后,还包括:Exemplarily, after the step S108, it also includes:
若判定出不需要对所述待检测数据进行分类处理,则返回执行所述获取待检测数据的步骤。If it is determined that there is no need to classify the data to be detected, return to the step of obtaining the data to be detected.
具体地,若判断出不能进行分类处理,则获取下一时刻的待检测回复信息,并重复步骤S102至步骤S106,对下一时刻的待检测回复信息进行向量化和编码处理,再输入至深度强化学校模型中进行预判断,直至判定出需要对待检测数据进行分类处理。当深度强化学校模型进行分类预判断时,需要对所有的待检测数据进行预测。即,待检测数据包括待检测源信息、当前时刻之前的待检测回复信息、当前时刻的待检测回复信息、下一时刻的待检测回复信息;若还是判定出不需要对待检测数据进行分类处理,再获取下一时刻的待检测回复数据。Specifically, if it is judged that the classification process cannot be performed, the reply information to be detected at the next moment is obtained, and step S102 to step S106 are repeated to perform vectorization and encoding processing on the reply information to be detected at the next moment, and then input into the deep Strengthen the pre-judgment in the school model until it is determined that the data to be tested needs to be classified. When the deep enhanced school model performs classification pre-judgment, it is necessary to predict all the data to be tested. That is, the data to be detected includes the source information to be detected, the reply information to be detected before the current moment, the reply information to be detected at the current moment, and the reply information to be detected at the next moment; if it is still determined that the data to be detected does not need to be classified, Then obtain the reply data to be detected at the next moment.
示例性地,所述深度强化学习模型的训练步骤包括:Exemplarily, the training steps of the deep reinforcement learning model include:
获取多个训练样本集,每一个训练样本集包括样本源信息的第一特征编码及所述样本源信息在不同的时间步值对应的样本回复信息的第二特征编码,其中,样本源信息对应的时间步值小于对应的样本回复信息的时间步值。训练样本集中的第一特征编码以及第二特征编码是:将样本源信息以及样本回复信息进行向量化处理再编码后得到的。每一个时间 步值对应的第二特征编码包括样本源信息对应的第一特征编码以及当前时刻所有的样本回复信息的特征编码,以便于强化模型根据该第一特征编码或者第二特征编码进行分类预处理。Obtain multiple training sample sets, each training sample set includes the first feature code of the sample source information and the second feature code of the sample reply information corresponding to the sample source information at different time step values, wherein the sample source information corresponds to The time step value of is smaller than the time step value of the corresponding sample reply information. The first feature encoding and the second feature encoding in the training sample set are obtained by vectorizing and re-encoding the sample source information and the sample reply information. The second feature code corresponding to each time step value includes the first feature code corresponding to the sample source information and the feature codes of all sample reply information at the current moment, so that the enhanced model can be classified according to the first feature code or the second feature code preprocessing.
将每一个训练样本集中的第一特征编码与各个第二特征编码按照时间步值大小依次输入至预设的强化模型中,并通过所述强化模型判断是否停止将第二特征编码输入至所述强化模型中。为了更好的对样本源信息以及样本源信息对应的样本回复信息进行检测,对每一个时间步值对应的特征编码进行分类处理。强化模型对每个时间步的特征编码都进行预判断,基于强化模型Dueling-DQN中的Q值判断,Q值越大,表示可以进行分类处理的可能性越大,当Q值大于预设阈值时,即表示进行分类处理。若当前强化模型计算的Q值小于预设阈值,则继续获取样本回复信息的特征编码,生成下一个第二特征编码,输入至强化模型中进行判断。Input the first feature code and each second feature code in each training sample set into the preset reinforcement model in sequence according to the size of the time step, and judge whether to stop inputting the second feature code into the in the enhanced model. In order to better detect the sample source information and the sample reply information corresponding to the sample source information, the feature encoding corresponding to each time step value is classified. The enhanced model pre-judges the feature encoding of each time step. Based on the Q value judgment in the enhanced model Dueling-DQN, the larger the Q value, the greater the possibility of classification processing. When the Q value is greater than the preset threshold When , it means that the classification process is carried out. If the Q value calculated by the current enhanced model is less than the preset threshold, continue to obtain the feature code of the sample reply information, generate the next second feature code, and input it into the enhanced model for judgment.
若判定出停止将第二特征编码输入至所述强化模型中,则将最后时刻输入至所述强化模型中的第二特征编码输入至所述虚假信息分类模型中,以通过所述虚假信息分类模型输出每一个训练样本集的第二分类结果。若强化模型判断出停止将第二特征编码输入至强化模型中,则表示强化模型判断出当前的第二特征编码可以进行虚假信息分类,将其输入至虚假信息分类模型中进行分类处理。其中,若强化模型通过样本源信息的第一特征编码即可判断出不需要读取样本回复信息的第二特征编码,则将第一特征编码输入至虚假信息分类模型中进行虚假信息分类预测,得到第二分类结果。If it is determined to stop inputting the second feature code into the enhanced model, then input the second feature code input into the enhanced model at the last moment into the false information classification model, so as to pass the false information classification The model outputs the second classification result of each training sample set. If the enhanced model judges to stop inputting the second feature code into the enhanced model, it means that the enhanced model judges that the current second feature code can be classified into false information, and inputs it into the false information classification model for classification processing. Wherein, if the enhanced model can determine that it is not necessary to read the second feature code of the sample reply information through the first feature code of the sample source information, then input the first feature code into the false information classification model to perform false information classification prediction, Get the second classification result.
判断所述第二分类结果与所述每一个训练样本集的真实分类结果是否相同。训练样本集的真实分类结果预先得到的,与训练样本集进行关联。Judging whether the second classification result is the same as the real classification result of each training sample set. The real classification results of the training sample set are obtained in advance and associated with the training sample set.
若不同,则更新所述强化模型的奖惩值,得到第一更新奖惩值,并根据所述第一更新奖惩值计算所述强化模型的损失函数,得到第一更新函数。根据所述第一更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。If not, update the reward and punishment value of the enhanced model to obtain a first updated reward and punishment value, and calculate the loss function of the enhanced model according to the first updated reward and punishment value to obtain a first updated function. The model parameters of the reinforcement model are updated according to the first update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
具体地,当虚假信息分类模型预测的结果不准确时,给与惩罚r t=-100,得到第一更新奖惩值,根据第一更新奖惩值计算损失函数
Figure PCTCN2022074411-appb-000001
进行反向传播更新网络权重。
Specifically, when the prediction result of the false information classification model is inaccurate, a penalty r t = -100 is given to obtain the first updated reward and punishment value, and the loss function is calculated according to the first updated reward and punishment value
Figure PCTCN2022074411-appb-000001
Perform backpropagation to update the network weights.
示例性地,所述判断所述第二分类结果与所述每一个训练样本集的真实分类结果是否相同之后,所述方法还包括:Exemplarily, after the judging whether the second classification result is the same as the true classification result of each training sample set, the method further includes:
若相同,则更新所述强化模型的奖惩值,得到第二更新奖惩值,并根据所述第二更新奖惩值计算所述强化模型的损失函数,得到第二更新函数;根据所述第二更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。如 果当前虚假信息分类器Classifier的输出结果和实际标签一致,则给予奖励r t=1+log(M),得到第二更新奖惩值,其中M表示Agent累计成功获得奖励(r>0)的次数。 If they are the same, update the reward and punishment value of the enhanced model to obtain a second updated reward and punishment value, and calculate the loss function of the enhanced model according to the second updated reward and punishment value to obtain a second update function; according to the second update The function updates the model parameters of the reinforcement model until the preset conditions are met, and a trained deep reinforcement learning model is obtained. If the output result of the current false information classifier Classifier is consistent with the actual label, a reward r t =1+log(M) will be given to obtain the second updated reward and punishment value, where M represents the cumulative number of times the Agent has successfully obtained rewards (r>0) .
示例性地,所述通过所述强化模型判断是否停止将第二特征编码输入至所述强化模型中之后,所述方法还包括:Exemplarily, after determining whether to stop inputting the second feature code into the enhanced model through the enhanced model, the method further includes:
若判定出继续将第二特征编码输入至所述强化模型中,则更新所述强化模型的奖惩值,得到第三更新奖惩值,并根据所述第三更新奖惩值计算所述强化模型的损失函数,得到第三更新函数;根据所述第三更新函数对所述强化模型的模型参数进行更新,得到更新后的强化模型;将每一个训练样本集中的第一特征编码与各个第二特征编码按照时间步值大小依次输入至更新后的强化模型中,并通过所述更新后的强化模型判断是否停止将第二特征编码输入至所述强化模型中,直至判定出停止将第二特征编码输入至所述强化模型。如果选择继续读取回复,则给予一个很小的惩罚r t=0.05,得到第三更新奖惩值,以限制Agent不断让回复增长。 If it is determined to continue to input the second feature code into the enhanced model, update the reward and punishment value of the enhanced model to obtain a third updated reward and punishment value, and calculate the loss of the enhanced model according to the third updated reward and punishment value function to obtain a third update function; update the model parameters of the enhanced model according to the third update function to obtain an updated enhanced model; encode the first feature code and each second feature code in each training sample set Input into the updated enhanced model in sequence according to the time step value, and judge whether to stop inputting the second feature code into the enhanced model through the updated enhanced model, until it is determined to stop inputting the second feature code to the enhanced model. If you choose to continue to read the reply, give a small penalty r t =0.05 to get the third update reward and punishment value, so as to limit the Agent to keep increasing the reply.
示例性地,为了全面的了解虚假信息检测的过程,通过以下实施例进行再一次的描述:Exemplarily, in order to fully understand the process of false information detection, the following examples are used to describe again:
整个检测过程分为两个模块,一个是分类模块(虚假信息分类模型),一个是控制模块(深度强化学习模型)。针对一个源头新闻/推特/微博/帖子(待检测源信息)都会按照时间顺序产生若干个待检测回复信息,每产生一个待检测回复信息的时候,都会使用虚假信息分类模块的LSTM对待检测回复信息进行编码,编码后会将编码信息输入给控制模块,控制模块是一个深度强化学习模型,会对输入的编码信息进行action的判断,判断是停止或者继续获取下一个待检测回复信息。如果判断action为停止,则会触发虚假信息分类模块对当前的状态信息(特征编码)进行分类,判断是否为虚假信息;如果判断action为继续,则不会触发分类模块进行分类,从而允许下一个回复的信息输入进LSTM进行编码,把最新的编码信息输入给控制模块,再次判断action,以此类推。因为LSTM是一个循环神经网络结构的网络,所以具有对历史信息整体编码的能力,当判断为继续时,只需获取下一个待检测回复信息。深度强化学习模型每次都会对整体信息的特征编码进行action的判断。The whole detection process is divided into two modules, one is the classification module (false information classification model), and the other is the control module (deep reinforcement learning model). For a source news/twitter/microblog/post (source information to be detected), several reply messages to be detected will be generated in chronological order, and each time a reply message to be detected is generated, the LSTM of the false information classification module will be used for detection The reply information is encoded, and after encoding, the encoded information will be input to the control module. The control module is a deep reinforcement learning model, which will judge the action of the input encoded information, and judge whether to stop or continue to obtain the next reply information to be detected. If it is judged that the action is to stop, the false information classification module will be triggered to classify the current status information (feature code) to determine whether it is false information; if the action is judged to be continued, the classification module will not be triggered to classify, thereby allowing the next The reply information is input into LSTM for encoding, and the latest encoded information is input to the control module to judge the action again, and so on. Because LSTM is a network with a cyclic neural network structure, it has the ability to encode historical information as a whole. When it is judged to continue, it only needs to obtain the next reply information to be detected. The deep reinforcement learning model will judge the action on the feature encoding of the overall information every time.
在训练深度强化学习模型的时候,action一旦被判断为停止,则触发分类器开始分类,所以分类器分类结果会出现分对和分错两种,如果是分类正确,则控制模块会得到奖励,如果分类错误,则控制模块得到惩罚。如果action一直判断为继续,则控制模型也会得到一个惩罚,只是这个惩罚会非常小。When training the deep reinforcement learning model, once the action is judged to be stopped, the classifier will be triggered to start classification, so the classification result of the classifier will be divided into right and wrong. If the classification is correct, the control module will be rewarded. If the classification is wrong, the control module is penalized. If the action is always judged as continuing, the control model will also get a penalty, but this penalty will be very small.
实施例二Embodiment two
请继续参阅图2,示出了本申请虚假信息检测系统实施例二的程序模块示意图。在本实 施例中,虚假信息检测系统20可以包括或被分割成一个或多个程序模块,一个或者多个程序模块被存储于存储介质中,并由一个或多个处理器所执行,以完成本申请,并可实现上述虚假信息检测方法。本申请实施例所称的程序模块是指能够完成特定功能的一系列计算机程序指令段,比程序本身更适合于描述虚假信息检测系统20在存储介质中的执行过程。以下描述将具体介绍本实施例各程序模块的功能:Please continue to refer to FIG. 2 , which shows a schematic diagram of the program modules of Embodiment 2 of the false information detection system of the present application. In this embodiment, the false information detection system 20 may include or be divided into one or more program modules, and one or more program modules are stored in a storage medium and executed by one or more processors to complete In this application, the above false information detection method can be realized. The program module referred to in the embodiment of this application refers to a series of computer program instruction segments capable of completing specific functions, which is more suitable for describing the execution process of the false information detection system 20 in the storage medium than the program itself. The following description will specifically introduce the functions of each program module of the present embodiment:
获取模块200,用于获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息。The acquisition module 200 is configured to acquire data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment.
向量化模块202,用于对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量。The vectorization module 202 is configured to perform vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected.
示例性地,所述向量化模块202还用于:Exemplarily, the vectorization module 202 is also used for:
获取所述待检测源信息中的第一文本数据,将所述第一文本数据通过第一向量化模型进行向量化处理,得到第一特征向量;获取所述待检测源信息中的第一图片数据,将所述第一图片数据通过第二向量化模型进行向量化处理,得到第二特征向量;将所述第一特征向量与所述第二特征向量进行拼接,得到所述待检测源信息对应的待检测源特征向量。Acquiring the first text data in the source information to be detected, and performing vectorization processing on the first text data through a first vectorization model to obtain a first feature vector; obtaining the first picture in the source information to be detected Data, the first image data is vectorized through the second vectorization model to obtain a second feature vector; the first feature vector and the second feature vector are spliced to obtain the source information to be detected The corresponding source feature vector to be detected.
示例性地,所述向量化模块202还用于:Exemplarily, the vectorization module 202 is also used for:
获取所述待检测回复信息中的第二文本数据,将所述第二文本数据通过所述第一向量化模型进行向量化处理,得到第三特征向量;获取所述待检测回复信息中的第二图片数据,将所述第二图片数据通过所述第二向量化模型进行向量化处理,得到第四特征向量;将所述第三特征向量与所述第四特征向量进行拼接,得到所述待检测回复信息对应的待检测回复特征向量。Obtaining the second text data in the reply information to be detected, and performing vectorization processing on the second text data through the first vectorization model to obtain a third feature vector; obtaining the first text data in the reply information to be detected Two picture data, performing vectorization processing on the second picture data through the second vectorization model to obtain a fourth feature vector; splicing the third feature vector and the fourth feature vector to obtain the described The feature vector of the response to be detected corresponding to the reply information to be detected.
编码模块204,用于通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码。The encoding module 204 is configured to encode the source feature vector to be detected and the reply feature vector to be detected through the encoding layer in the pre-trained false information classification model to obtain the source information to be detected corresponding to the source information to be detected The source feature code and the feature code of the reply to be detected corresponding to the reply information to be detected.
判断模块206,用于通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理。The judging module 206 is configured to perform classification pre-judgment on the to-be-detected source feature code and the to-be-detected reply feature code through the trained deep reinforcement learning model, so as to determine whether to classify the to-be-detected data.
分类模块208,用于若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。The classification module 208 is configured to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected through the classification layer of the false information classification model if it is determined that the data to be detected needs to be classified. The data to be detected is classified to obtain a first classification result corresponding to the data to be detected.
示例性地,所述分类模块208还用于:Exemplarily, the classification module 208 is also used for:
若判定出不需要对所述待检测数据进行分类处理,则返回执行所述获取待检测数据的步骤。If it is determined that there is no need to classify the data to be detected, return to the step of obtaining the data to be detected.
实施例三Embodiment three
参阅图3,是本申请实施例三之计算机设备的硬件架构示意图。本实施例中,所述计算机设备2是一种能够按照事先设定或者存储的指令,自动进行数值计算和/或信息处理的设备。该计算机设备2可以是机架式服务器、刀片式服务器、塔式服务器或机柜式服务器(包括独立的服务器,或者多个服务器所组成的服务器集群)等。服务器可以是独立的服务器,也可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、内容分发网络(Content Delivery Network,CDN)、以及大数据和人工智能平台等基础云计算服务的云服务器。如图3所示,所述计算机设备2至少包括,但不限于,可通过系统总线相互通信连接存储器21、处理器22、网络接口23、以及虚假信息检测系统20。其中:Referring to FIG. 3 , it is a schematic diagram of a hardware architecture of a computer device according to Embodiment 3 of the present application. In this embodiment, the computer device 2 is a device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions. The computer device 2 may be a rack server, a blade server, a tower server or a cabinet server (including an independent server, or a server cluster composed of multiple servers) and the like. The server can be an independent server, or it can provide cloud services, cloud database, cloud computing, cloud function, cloud storage, network service, cloud communication, middleware service, domain name service, security service, content delivery network (Content Delivery Network, CDN), and cloud servers for basic cloud computing services such as big data and artificial intelligence platforms. As shown in FIG. 3 , the computer device 2 at least includes, but is not limited to, a memory 21 , a processor 22 , a network interface 23 , and a false information detection system 20 that can communicate with each other through a system bus. in:
本实施例中,存储器21至少包括一种类型的计算机可读存储介质,所述可读存储介质包括闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘等。在一些实施例中,存储器21可以是计算机设备2的内部存储单元,例如该计算机设备2的硬盘或内存。在另一些实施例中,存储器21也可以是计算机设备2的外部存储设备,例如该计算机设备2上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。当然,存储器21还可以既包括计算机设备2的内部存储单元也包括其外部存储设备。本实施例中,存储器21通常用于存储安装于计算机设备2的操作系统和各类应用软件,例如实施例二的虚假信息检测系统20的程序代码等。此外,存储器21还可以用于暂时地存储已经输出或者将要输出的各类数据。In this embodiment, the memory 21 includes at least one type of computer-readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory ( RAM), static random access memory (SRAM), read only memory (ROM), electrically erasable programmable read only memory (EEPROM), programmable read only memory (PROM), magnetic memory, magnetic disk, optical disk, etc. In some embodiments, the memory 21 may be an internal storage unit of the computer device 2 , such as a hard disk or memory of the computer device 2 . In other embodiments, the memory 21 can also be an external storage device of the computer device 2, such as a plug-in hard disk equipped on the computer device 2, a smart memory card (Smart Media Card, SMC), a secure digital (Secure Digital, SD) card, flash memory card (Flash Card), etc. Of course, the storage 21 may also include both the internal storage unit of the computer device 2 and its external storage device. In this embodiment, the memory 21 is generally used to store the operating system and various application software installed in the computer device 2, such as the program code of the false information detection system 20 in the second embodiment. In addition, the memory 21 can also be used to temporarily store various types of data that have been output or will be output.
处理器22在一些实施例中可以是中央处理器(Central Processing Unit,CPU)、控制器、微控制器、微处理器、或其他数据处理芯片。该处理器22通常用于控制计算机设备2的总体操作。本实施例中,处理器22用于运行存储器21中存储的程序代码或者处理数据,例如运行虚假信息检测系统20,以实现实施例一的虚假信息检测方法。In some embodiments, the processor 22 may be a central processing unit (Central Processing Unit, CPU), a controller, a microcontroller, a microprocessor, or other data processing chips. The processor 22 is generally used to control the overall operation of the computer device 2 . In this embodiment, the processor 22 is used to run the program codes stored in the memory 21 or process data, for example, to run the false information detection system 20, so as to realize the false information detection method in the first embodiment.
所述网络接口23可包括无线网络接口或有线网络接口,该网络接口23通常用于在所述服务器2与其他电子装置之间建立通信连接。例如,所述网络接口23用于通过网络将所述服务器2与外部终端相连,在所述服务器2与外部终端之间的建立数据传输通道和通信连接等。所述网络可以是企业内部网(Intranet)、互联网(Internet)、全球移动通讯系统(Global System of Mobile communication,GSM)、宽带码分多址(Wideband Code Division Multiple Access, WCDMA)、4G网络、5G网络、蓝牙(Bluetooth)、Wi-Fi等无线或有线网络。The network interface 23 may include a wireless network interface or a wired network interface, and the network interface 23 is generally used to establish a communication connection between the server 2 and other electronic devices. For example, the network interface 23 is used to connect the server 2 with an external terminal through a network, and establish a data transmission channel and a communication connection between the server 2 and an external terminal. The network can be an enterprise intranet (Intranet), Internet (Internet), Global System of Mobile communication (Global System of Mobile communication, GSM), wideband code division multiple access (Wideband Code Division Multiple Access, WCDMA), 4G network, 5G Internet, Bluetooth (Bluetooth), Wi-Fi and other wireless or wired networks.
需要指出的是,图3仅示出了具有部件20-23的计算机设备2,但是应理解的是,并不要求实施所有示出的部件,可以替代的实施更多或者更少的部件。It should be noted that FIG. 3 only shows computer device 2 having components 20-23, but it should be understood that implementation of all of the illustrated components is not required and that more or fewer components may instead be implemented.
在本实施例中,存储于存储器21中的所述虚假信息检测系统20还可以被分割为一个或者多个程序模块,所述一个或者多个程序模块被存储于存储器21中,并由一个或多个处理器(本实施例为处理器22)所执行,以完成本申请。In this embodiment, the false information detection system 20 stored in the memory 21 can also be divided into one or more program modules, and the one or more program modules are stored in the memory 21 and are controlled by one or more program modules. Executed by multiple processors (the processor 22 in this embodiment) to complete the application.
例如,图2示出了所述实现虚假信息检测系统20实施例二的程序模块示意图,该实施例中,所述虚假信息检测系统20可以被划分为所述获取模块200、所述向量化模块202、所述编码模块204、所述判断模块206以及所述分类模块208。其中,本申请所称的程序模块是指能够完成特定功能的一系列计算机程序指令段,比程序更适合于描述所述虚假信息检测系统20在所述计算机设备2中的执行过程。所述程序模块200-208的具体功能在实施例二中已有详细描述,在此不再赘述。For example, FIG. 2 shows a schematic diagram of program modules for realizing the second embodiment of the false information detection system 20. In this embodiment, the false information detection system 20 can be divided into the acquisition module 200, the vectorization module 202 , the encoding module 204 , the judging module 206 and the classifying module 208 . Wherein, the program module referred to in this application refers to a series of computer program instruction segments capable of completing specific functions, which is more suitable than a program to describe the execution process of the false information detection system 20 in the computer device 2 . The specific functions of the program modules 200-208 have been described in detail in the second embodiment, and will not be repeated here.
实施例四Embodiment four
本实施例还提供一种计算机可读存储介质,如闪存、硬盘、多媒体卡、卡型存储器(例如,SD或DX存储器等)、随机访问存储器(RAM)、静态随机访问存储器(SRAM)、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM)、可编程只读存储器(PROM)、磁性存储器、磁盘、光盘、服务器、App应用商城等等,其上存储有计算机程序,程序被处理器执行时实现相应功能。所述计算机可读存储介质可以是非易失性,也可以是易失性。本实施例的计算机可读存储介质用于计算机程序,被处理器执行时实现以下步骤:This embodiment also provides a computer-readable storage medium, such as flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), only Read memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, server, App application store, etc., on which computer programs are stored, The corresponding functions are realized when the program is executed by the processor. The computer-readable storage medium may be non-volatile or volatile. The computer-readable storage medium of this embodiment is used for a computer program, and when executed by a processor, the following steps are implemented:
获取待检测数据,其中,所述待检测数据包括检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;Acquiring data to be detected, wherein the data to be detected includes detection source information and response information to be detected corresponding to the source information to be detected at the current moment;
对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;Performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码;Through the encoding layer in the pre-trained false information classification model, the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained. The feature code of the response to be detected corresponding to the response to be detected;
通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及Perform classification pre-judgment on the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model to determine whether the data to be detected needs to be classified; and
若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得 到所述待检测数据对应的第一分类结果。If it is determined that the data to be detected needs to be classified, the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
上述本申请实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the above embodiments of the present application are for description only, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。Through the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is better implementation.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利保护范围内。The above are only preferred embodiments of the present application, and are not intended to limit the patent scope of the present application. All equivalent structures or equivalent process transformations made by using the description of the application and the accompanying drawings are directly or indirectly used in other related technical fields. , are all included in the patent protection scope of the present application in the same way.

Claims (20)

  1. 一种虚假信息检测方法,其中,包括:A false information detection method, including:
    获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;Acquiring data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
    对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;Performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
    通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码;Through the encoding layer in the pre-trained false information classification model, the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained. The feature code of the response to be detected corresponding to the response to be detected;
    通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及Perform classification pre-judgment on the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model to determine whether the data to be detected needs to be classified; and
    若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。If it is determined that the data to be detected needs to be classified, the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
  2. 根据权利要求1所述的虚假信息检测方法,其中,所述通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理之后,还包括:The false information detection method according to claim 1, wherein, the trained deep reinforcement learning model performs classification pre-judgment on the source feature codes to be detected and the reply feature codes to be detected, so as to determine whether the After the data to be detected is classified and processed, it also includes:
    若判定出不需要对所述待检测数据进行分类处理,则返回执行所述获取待检测数据的步骤。If it is determined that there is no need to classify the data to be detected, return to the step of obtaining the data to be detected.
  3. 根据权利要求1所述的虚假信息检测方法,其中,所述对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量包括:The method for detecting false information according to claim 1, wherein the vectorization processing is performed on the data to be detected to obtain the source feature vector to be detected corresponding to the source information to be detected and the eigenvector corresponding to the reply information to be detected Reply feature vectors to be detected include:
    获取所述待检测源信息中的第一文本数据与第一图片数据;Acquiring the first text data and the first image data in the source information to be detected;
    将所述第一文本数据通过第一向量化模型进行向量化处理,得到第一特征向量;并,将所述第一图片数据通过第二向量化模型进行向量化处理,得到第二特征向量;performing vectorization processing on the first text data through a first vectorization model to obtain a first feature vector; and performing vectorization processing on the first image data through a second vectorization model to obtain a second feature vector;
    将所述第一特征向量与所述第二特征向量进行拼接,得到所述待检测源信息对应的待检测源特征向量;splicing the first feature vector and the second feature vector to obtain a source feature vector to be detected corresponding to the source information to be detected;
    获取所述待检测回复信息中的第二文本数据与第二图片数据;Acquiring the second text data and the second image data in the reply information to be detected;
    将所述第二文本数据通过所述第一向量化模型进行向量化处理,得到第三特征向量;并,将所述第二图片数据通过所述第二向量化模型进行向量化处理,得到第四特 征向量;及performing vectorization processing on the second text data through the first vectorization model to obtain a third feature vector; and performing vectorization processing on the second picture data through the second vectorization model to obtain a third feature vector Four eigenvectors; and
    将所述第三特征向量与所述第四特征向量进行拼接,得到所述待检测回复信息对应的待检测回复特征向量。The third feature vector and the fourth feature vector are spliced to obtain a feature vector of a reply to be detected corresponding to the reply information to be detected.
  4. 根据权利要求3所述的虚假信息检测方法,其中,所述虚假信息分类模型为神经网络模型。The false information detection method according to claim 3, wherein the false information classification model is a neural network model.
  5. 根据权利要求1所述的虚假信息检测方法,其中,所述深度强化学习模型的训练步骤包括:The false information detection method according to claim 1, wherein, the training step of the deep reinforcement learning model comprises:
    获取多个训练样本集,每一个训练样本集包括样本源信息的第一特征编码及所述样本源信息在不同的时间步值对应的样本回复信息的第二特征编码,其中,样本源信息对应的时间步值小于对应的样本回复信息的时间步值;Obtain multiple training sample sets, each training sample set includes the first feature code of the sample source information and the second feature code of the sample reply information corresponding to the sample source information at different time step values, wherein the sample source information corresponds to The time step value of is less than the time step value of the corresponding sample reply information;
    将每一个训练样本集中的第一特征编码与各个第二特征编码按照时间步值大小依次输入至预设的强化模型中,并通过所述强化模型判断是否停止将第二特征编码输入至所述强化模型中;Input the first feature code and each second feature code in each training sample set into the preset reinforcement model in sequence according to the size of the time step, and judge whether to stop inputting the second feature code into the In the reinforcement model;
    若判定出停止将第二特征编码输入至所述强化模型中,则将最后时刻输入至所述强化模型中的第二特征编码输入至所述虚假信息分类模型中,以通过所述虚假信息分类模型输出每一个训练样本集的第二分类结果;If it is determined to stop inputting the second feature code into the enhanced model, then input the second feature code input into the enhanced model at the last moment into the false information classification model, so as to pass the false information classification The model outputs the second classification result of each training sample set;
    判断所述第二分类结果与所述每一个训练样本集的真实分类结果是否相同;judging whether the second classification result is the same as the true classification result of each training sample set;
    若不同,则更新所述强化模型的奖惩值,得到第一更新奖惩值,并根据所述第一更新奖惩值计算所述强化模型的损失函数,得到第一更新函数;及If not, update the reward and punishment value of the enhanced model to obtain a first updated reward and punishment value, and calculate the loss function of the enhanced model according to the first updated reward and punishment value to obtain a first update function; and
    根据所述第一更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。The model parameters of the reinforcement model are updated according to the first update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
  6. 根据权利要求5所述的虚假信息检测方法,其中,所述判断所述第二分类结果与所述每一个训练样本集的真实分类结果是否相同之后,所述方法还包括:The method for detecting false information according to claim 5, wherein, after determining whether the second classification result is the same as the true classification result of each training sample set, the method further comprises:
    若相同,则更新所述强化模型的奖惩值,得到第二更新奖惩值,并根据所述第二更新奖惩值计算所述强化模型的损失函数,得到第二更新函数;及If they are the same, update the reward and punishment value of the enhanced model to obtain a second updated reward and punishment value, and calculate the loss function of the enhanced model according to the second updated reward and punishment value to obtain a second update function; and
    根据所述第二更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。The model parameters of the reinforcement model are updated according to the second update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
  7. 根据权利要求5所述的虚假信息检测方法,其中,所述通过所述强化模型判断是否停止将第二特征编码输入至所述强化模型中之后,所述方法还包括:The false information detection method according to claim 5, wherein, after determining whether to stop inputting the second feature code into the enhanced model through the enhanced model, the method further comprises:
    若判定出继续将第二特征编码输入至所述强化模型中,则更新所述强化模型的奖惩值,得到第三更新奖惩值,并根据所述第三更新奖惩值计算所述强化模型的损失函 数,得到第三更新函数;If it is determined to continue to input the second feature code into the enhanced model, update the reward and punishment value of the enhanced model to obtain a third updated reward and punishment value, and calculate the loss of the enhanced model according to the third updated reward and punishment value function to get the third update function;
    根据所述第三更新函数对所述强化模型的模型参数进行更新,得到更新后的强化模型;及updating model parameters of the enhanced model according to the third update function to obtain an updated enhanced model; and
    将每一个训练样本集中的第一特征编码与各个第二特征编码按照时间步值大小依次输入至更新后的强化模型中,并通过所述更新后的强化模型判断是否停止将第二特征编码输入至所述强化模型中,直至判定出停止将第二特征编码输入至所述强化模型。Input the first feature code and each second feature code in each training sample set into the updated enhanced model in sequence according to the size of the time step, and judge whether to stop inputting the second feature code through the updated enhanced model into the enhanced model until it is determined to stop inputting the second feature code into the enhanced model.
  8. 一种虚假信息检测系统,其中,包括:A false information detection system, including:
    获取模块,用于获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;An acquisition module, configured to acquire data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
    向量化模块,用于对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;A vectorization module, configured to perform vectorization processing on the data to be detected, to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
    编码模块,用于通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码;An encoding module, configured to encode the feature vector of the source to be detected and the feature vector of the reply to be detected through the encoding layer in the pre-trained false information classification model, to obtain the source to be detected corresponding to the information of the source to be detected A feature code and a feature code of the reply to be detected corresponding to the reply information to be detected;
    判断模块,用于通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及The judging module is used to classify and pre-judge the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model, so as to determine whether the data to be detected needs to be classified; and
    分类模块,用于若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。A classification module, configured to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected through the classification layer of the false information classification model if it is determined that the data to be detected needs to be classified. The detection data is classified to obtain a first classification result corresponding to the data to be detected.
  9. 一种计算机设备,其中,所述计算机设备包括存储器、处理器,所述存储器上存储有可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时还执行以下步骤:A computer device, wherein the computer device includes a memory and a processor, the memory stores computer-readable instructions operable on the processor, and the processor also executes the computer-readable instructions Perform the following steps:
    获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;Acquiring data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
    对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;Performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
    通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编码以及所述待检测回复信息对应的待检测回复特征编码;Through the encoding layer in the pre-trained false information classification model, the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained. The feature code of the response to be detected corresponding to the response to be detected;
    通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编 码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及Perform classification pre-judgment on the source feature code to be detected and the reply feature code to be detected by the trained deep reinforcement learning model to determine whether the data to be detected needs to be classified; and
    若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。If it is determined that the data to be detected needs to be classified, the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
  10. 根据权利要求9所述的计算机设备,其中,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 9, wherein said processor further performs the following steps when executing said computer readable instructions:
    若判定出不需要对所述待检测数据进行分类处理,则返回执行所述获取待检测数据的步骤。If it is determined that there is no need to classify the data to be detected, return to the step of obtaining the data to be detected.
  11. 根据权利要求9所述的计算机设备,其中,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 9, wherein said processor further performs the following steps when executing said computer readable instructions:
    获取所述待检测源信息中的第一文本数据与第一图片数据;Acquiring the first text data and the first image data in the source information to be detected;
    将所述第一文本数据通过第一向量化模型进行向量化处理,得到第一特征向量;并,将所述第一图片数据通过第二向量化模型进行向量化处理,得到第二特征向量;performing vectorization processing on the first text data through a first vectorization model to obtain a first feature vector; and performing vectorization processing on the first image data through a second vectorization model to obtain a second feature vector;
    将所述第一特征向量与所述第二特征向量进行拼接,得到所述待检测源信息对应的待检测源特征向量;splicing the first feature vector and the second feature vector to obtain a source feature vector to be detected corresponding to the source information to be detected;
    获取所述待检测回复信息中的第二文本数据与第二图片数据;Acquiring the second text data and the second image data in the reply information to be detected;
    将所述第二文本数据通过所述第一向量化模型进行向量化处理,得到第三特征向量;并,将所述第二图片数据通过所述第二向量化模型进行向量化处理,得到第四特征向量;及performing vectorization processing on the second text data through the first vectorization model to obtain a third feature vector; and performing vectorization processing on the second picture data through the second vectorization model to obtain a third feature vector Four eigenvectors; and
    将所述第三特征向量与所述第四特征向量进行拼接,得到所述待检测回复信息对应的待检测回复特征向量。The third feature vector and the fourth feature vector are spliced to obtain a feature vector of a reply to be detected corresponding to the reply information to be detected.
  12. 根据权利要求11所述的计算机设备,其中,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 11, wherein said processor further performs the following steps when executing said computer readable instructions:
    获取多个训练样本集,每一个训练样本集包括样本源信息的第一特征编码及所述样本源信息在不同的时间步值对应的样本回复信息的第二特征编码,其中,样本源信息对应的时间步值小于对应的样本回复信息的时间步值;Obtain multiple training sample sets, each training sample set includes the first feature code of the sample source information and the second feature code of the sample reply information corresponding to the sample source information at different time step values, wherein the sample source information corresponds to The time step value of is less than the time step value of the corresponding sample reply information;
    将每一个训练样本集中的第一特征编码与各个第二特征编码按照时间步值大小依次输入至预设的强化模型中,并通过所述强化模型判断是否停止将第二特征编码输入至所述强化模型中;Input the first feature code and each second feature code in each training sample set into the preset reinforcement model in sequence according to the size of the time step, and judge whether to stop inputting the second feature code into the In the reinforcement model;
    若判定出停止将第二特征编码输入至所述强化模型中,则将最后时刻输入至所述强化模型中的第二特征编码输入至所述虚假信息分类模型中,以通过所述虚假信息分 类模型输出每一个训练样本集的第二分类结果;If it is determined to stop inputting the second feature code into the enhanced model, then input the second feature code input into the enhanced model at the last moment into the false information classification model, so as to pass the false information classification The model outputs the second classification result of each training sample set;
    判断所述第二分类结果与所述每一个训练样本集的真实分类结果是否相同;judging whether the second classification result is the same as the true classification result of each training sample set;
    若不同,则更新所述强化模型的奖惩值,得到第一更新奖惩值,并根据所述第一更新奖惩值计算所述强化模型的损失函数,得到第一更新函数;及If not, update the reward and punishment value of the enhanced model to obtain a first updated reward and punishment value, and calculate the loss function of the enhanced model according to the first updated reward and punishment value to obtain a first update function; and
    根据所述第一更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。The model parameters of the reinforcement model are updated according to the first update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
  13. 根据权利要求12所述的计算机设备,其中,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 12, wherein said processor further performs the following steps when executing said computer readable instructions:
    若相同,则更新所述强化模型的奖惩值,得到第二更新奖惩值,并根据所述第二更新奖惩值计算所述强化模型的损失函数,得到第二更新函数;及If they are the same, update the reward and punishment value of the enhanced model to obtain a second updated reward and punishment value, and calculate the loss function of the enhanced model according to the second updated reward and punishment value to obtain a second update function; and
    根据所述第二更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。The model parameters of the reinforcement model are updated according to the second update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
  14. 根据权利要求13所述的计算机设备,其中,所述处理器执行所述计算机可读指令时还执行以下步骤:The computer device according to claim 13, wherein said processor, when executing said computer readable instructions, further performs the following steps:
    若判定出继续将第二特征编码输入至所述强化模型中,则更新所述强化模型的奖惩值,得到第三更新奖惩值,并根据所述第三更新奖惩值计算所述强化模型的损失函数,得到第三更新函数;If it is determined to continue to input the second feature code into the enhanced model, update the reward and punishment value of the enhanced model to obtain a third updated reward and punishment value, and calculate the loss of the enhanced model according to the third updated reward and punishment value function to get the third update function;
    根据所述第三更新函数对所述强化模型的模型参数进行更新,得到更新后的强化模型;及updating model parameters of the enhanced model according to the third update function to obtain an updated enhanced model; and
    将每一个训练样本集中的第一特征编码与各个第二特征编码按照时间步值大小依次输入至更新后的强化模型中,并通过所述更新后的强化模型判断是否停止将第二特征编码输入至所述强化模型中,直至判定出停止将第二特征编码输入至所述强化模型。Input the first feature code and each second feature code in each training sample set into the updated enhanced model in sequence according to the size of the time step, and judge whether to stop inputting the second feature code through the updated enhanced model into the enhanced model until it is determined to stop inputting the second feature code into the enhanced model.
  15. 一种计算机可读存储介质,其中,所述计算机可读存储介质内存储有计算机可读指令,所述计算机可读指令可被至少一个处理器所执行,以使所述至少一个处理器执行以下步骤:A computer-readable storage medium, wherein computer-readable instructions are stored in the computer-readable storage medium, and the computer-readable instructions can be executed by at least one processor, so that the at least one processor performs the following step:
    获取待检测数据,其中,所述待检测数据包括待检测源信息以及所述待检测源信息在当前时刻对应的待检测回复信息;Acquiring data to be detected, wherein the data to be detected includes source information to be detected and reply information to be detected corresponding to the source information to be detected at the current moment;
    对所述待检测数据进行向量化处理,得到所述待检测源信息对应的待检测源特征向量以及所述待检测回复信息对应的待检测回复特征向量;Performing vectorization processing on the data to be detected to obtain a source feature vector to be detected corresponding to the source information to be detected and a reply feature vector to be detected corresponding to the reply information to be detected;
    通过预先训练好的虚假信息分类模型中的编码层对所述待检测源特征向量以及所述待检测回复特征向量进行编码处理,得到所述待检测源信息对应的待检测源特征编 码以及所述待检测回复信息对应的待检测回复特征编码;Through the encoding layer in the pre-trained false information classification model, the source feature vector to be detected and the reply feature vector to be detected are encoded, and the source feature code to be detected corresponding to the source information to be detected and the source feature code to be detected are obtained. The feature code of the response to be detected corresponding to the response to be detected;
    通过训练好的深度强化学习模型对所述待检测源特征编码以及待检测回复特征编码进行分类预判断,以判定是否需要对所述待检测数据进行分类处理;及Perform classification pre-judgment on the source feature code to be detected and the reply feature code to be detected through the trained deep reinforcement learning model to determine whether the data to be detected needs to be classified; and
    若判定出需要对所述待检测数据进行分类处理,则通过所述虚假信息分类模型的分类层根据所述待检测源特征编码和所述待检测回复特征编码对所述待检测数据进行分类,得到所述待检测数据对应的第一分类结果。If it is determined that the data to be detected needs to be classified, the classification layer of the false information classification model is used to classify the data to be detected according to the source feature code to be detected and the reply feature code to be detected, A first classification result corresponding to the data to be detected is obtained.
  16. 根据权利要求15所述的计算机可读存储介质,其中,所述计算机可读指令可被至少一个处理器所执行,以使所述至少一个处理器还执行以下步骤:The computer-readable storage medium of claim 15, wherein the computer-readable instructions are executable by at least one processor, such that the at least one processor further performs the steps of:
    若判定出不需要对所述待检测数据进行分类处理,则返回执行所述获取待检测数据的步骤。If it is determined that there is no need to classify the data to be detected, return to the step of obtaining the data to be detected.
  17. 根据权利要求15所述的计算机可读存储介质,其中,所述计算机可读指令可被至少一个处理器所执行,以使所述至少一个处理器还执行以下步骤:The computer-readable storage medium of claim 15, wherein the computer-readable instructions are executable by at least one processor, such that the at least one processor further performs the steps of:
    获取所述待检测源信息中的第一文本数据与第一图片数据;Acquiring the first text data and the first image data in the source information to be detected;
    将所述第一文本数据通过第一向量化模型进行向量化处理,得到第一特征向量;并,将所述第一图片数据通过第二向量化模型进行向量化处理,得到第二特征向量;performing vectorization processing on the first text data through a first vectorization model to obtain a first feature vector; and performing vectorization processing on the first image data through a second vectorization model to obtain a second feature vector;
    将所述第一特征向量与所述第二特征向量进行拼接,得到所述待检测源信息对应的待检测源特征向量;splicing the first feature vector and the second feature vector to obtain a source feature vector to be detected corresponding to the source information to be detected;
    获取所述待检测回复信息中的第二文本数据与第二图片数据;Acquiring the second text data and the second image data in the reply information to be detected;
    将所述第二文本数据通过所述第一向量化模型进行向量化处理,得到第三特征向量;并,将所述第二图片数据通过所述第二向量化模型进行向量化处理,得到第四特征向量;及performing vectorization processing on the second text data through the first vectorization model to obtain a third feature vector; and performing vectorization processing on the second picture data through the second vectorization model to obtain a third feature vector Four eigenvectors; and
    将所述第三特征向量与所述第四特征向量进行拼接,得到所述待检测回复信息对应的待检测回复特征向量。The third feature vector and the fourth feature vector are spliced to obtain a feature vector of a reply to be detected corresponding to the reply information to be detected.
  18. 根据权利要求17所述的计算机可读存储介质,其中,所述计算机可读指令可被至少一个处理器所执行,以使所述至少一个处理器还执行以下步骤:The computer-readable storage medium of claim 17, wherein the computer-readable instructions are executable by at least one processor, such that the at least one processor further performs the steps of:
    获取多个训练样本集,每一个训练样本集包括样本源信息的第一特征编码及所述样本源信息在不同的时间步值对应的样本回复信息的第二特征编码,其中,样本源信息对应的时间步值小于对应的样本回复信息的时间步值;Obtain multiple training sample sets, each training sample set includes the first feature code of the sample source information and the second feature code of the sample reply information corresponding to the sample source information at different time step values, wherein the sample source information corresponds to The time step value of is less than the time step value of the corresponding sample reply information;
    将每一个训练样本集中的第一特征编码与各个第二特征编码按照时间步值大小依次输入至预设的强化模型中,并通过所述强化模型判断是否停止将第二特征编码输入至所述强化模型中;Input the first feature code and each second feature code in each training sample set into the preset reinforcement model in sequence according to the size of the time step, and judge whether to stop inputting the second feature code into the In the reinforcement model;
    若判定出停止将第二特征编码输入至所述强化模型中,则将最后时刻输入至所述强化模型中的第二特征编码输入至所述虚假信息分类模型中,以通过所述虚假信息分类模型输出每一个训练样本集的第二分类结果;If it is determined to stop inputting the second feature code into the enhanced model, then input the second feature code input into the enhanced model at the last moment into the false information classification model, so as to pass the false information classification The model outputs the second classification result of each training sample set;
    判断所述第二分类结果与所述每一个训练样本集的真实分类结果是否相同;judging whether the second classification result is the same as the true classification result of each training sample set;
    若不同,则更新所述强化模型的奖惩值,得到第一更新奖惩值,并根据所述第一更新奖惩值计算所述强化模型的损失函数,得到第一更新函数;及If not, update the reward and punishment value of the enhanced model to obtain a first updated reward and punishment value, and calculate the loss function of the enhanced model according to the first updated reward and punishment value to obtain a first update function; and
    根据所述第一更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。The model parameters of the reinforcement model are updated according to the first update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
  19. 根据权利要求18所述的计算机可读存储介质,其中,若相同,则更新所述强化模型的奖惩值,得到第二更新奖惩值,并根据所述第二更新奖惩值计算所述强化模型的损失函数,得到第二更新函数;及The computer-readable storage medium according to claim 18, wherein, if they are the same, update the reward and punishment value of the enhanced model to obtain a second updated reward and punishment value, and calculate the reward and punishment value of the enhanced model according to the second updated reward and punishment value A loss function to obtain a second update function; and
    根据所述第二更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。The model parameters of the reinforcement model are updated according to the second update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
  20. 根据权利要求18所述的计算机可读存储介质,其中,所述计算机可读指令可被至少一个处理器所执行,以使所述至少一个处理器还执行以下步骤:The computer-readable storage medium of claim 18, wherein the computer-readable instructions are executable by at least one processor, such that the at least one processor further performs the steps of:
    若相同,则更新所述强化模型的奖惩值,得到第二更新奖惩值,并根据所述第二更新奖惩值计算所述强化模型的损失函数,得到第二更新函数;及If they are the same, update the reward and punishment value of the enhanced model to obtain a second updated reward and punishment value, and calculate the loss function of the enhanced model according to the second updated reward and punishment value to obtain a second update function; and
    根据所述第二更新函数对所述强化模型的模型参数进行更新,直至满足预设条件,得到训练好的深度强化学习模型。The model parameters of the reinforcement model are updated according to the second update function until a preset condition is met, and a trained deep reinforcement learning model is obtained.
PCT/CN2022/074411 2021-09-30 2022-01-27 False information detection method and system, computer device, and readable storage medium WO2023050670A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111156357.5 2021-09-30
CN202111156357.5A CN113869431A (en) 2021-09-30 2021-09-30 False information detection method, system, computer device and readable storage medium

Publications (1)

Publication Number Publication Date
WO2023050670A1 true WO2023050670A1 (en) 2023-04-06

Family

ID=79000953

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/074411 WO2023050670A1 (en) 2021-09-30 2022-01-27 False information detection method and system, computer device, and readable storage medium

Country Status (2)

Country Link
CN (1) CN113869431A (en)
WO (1) WO2023050670A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113869431A (en) * 2021-09-30 2021-12-31 平安科技(深圳)有限公司 False information detection method, system, computer device and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111475649A (en) * 2020-04-02 2020-07-31 中国人民解放军国防科技大学 False news prediction method, system, device and medium based on deep learning
CN111950268A (en) * 2020-08-17 2020-11-17 珠海格力电器股份有限公司 Method, device and storage medium for detecting junk information
CN112685739A (en) * 2020-12-31 2021-04-20 卓尔智联(武汉)研究院有限公司 Malicious code detection method, data interaction method and related equipment
US11087086B2 (en) * 2019-07-12 2021-08-10 Adp, Llc Named-entity recognition through sequence of classification using a deep learning neural network
CN113869431A (en) * 2021-09-30 2021-12-31 平安科技(深圳)有限公司 False information detection method, system, computer device and readable storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110929025B (en) * 2018-09-17 2023-04-25 阿里巴巴集团控股有限公司 Junk text recognition method and device, computing equipment and readable storage medium
CN110188194B (en) * 2019-04-26 2020-12-01 哈尔滨工业大学(深圳) False news detection method and system based on multitask learning model
CN113032525A (en) * 2021-03-23 2021-06-25 深圳大学 False news detection method and device, electronic equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11087086B2 (en) * 2019-07-12 2021-08-10 Adp, Llc Named-entity recognition through sequence of classification using a deep learning neural network
CN111475649A (en) * 2020-04-02 2020-07-31 中国人民解放军国防科技大学 False news prediction method, system, device and medium based on deep learning
CN111950268A (en) * 2020-08-17 2020-11-17 珠海格力电器股份有限公司 Method, device and storage medium for detecting junk information
CN112685739A (en) * 2020-12-31 2021-04-20 卓尔智联(武汉)研究院有限公司 Malicious code detection method, data interaction method and related equipment
CN113869431A (en) * 2021-09-30 2021-12-31 平安科技(深圳)有限公司 False information detection method, system, computer device and readable storage medium

Also Published As

Publication number Publication date
CN113869431A (en) 2021-12-31

Similar Documents

Publication Publication Date Title
CN106992994B (en) Automatic monitoring method and system for cloud service
US10587632B1 (en) Neural network-based malware detection
US10645105B2 (en) Network attack detection method and device
CN110175851B (en) Cheating behavior detection method and device
CN110855648B (en) Early warning control method and device for network attack
US20200112578A1 (en) System and method for detecting bots based on anomaly detection of javascript or mobile app profile information
CN110737818B (en) Network release data processing method, device, computer equipment and storage medium
CN111368289B (en) Malicious software detection method and device
CN112231592A (en) Network community discovery method, device, equipment and storage medium based on graph
CN113328994A (en) Malicious domain name processing method, device, equipment and machine readable storage medium
WO2023050670A1 (en) False information detection method and system, computer device, and readable storage medium
CN112579462A (en) Test case acquisition method, system, equipment and computer readable storage medium
CN111783883A (en) Abnormal data detection method and device
US11200284B1 (en) Optimization of feature embeddings for deep learning models
CN110598115A (en) Sensitive webpage identification method and system based on artificial intelligence multi-engine
CN114048512A (en) Method and device for processing sensitive data
US9519864B1 (en) Method and system for identifying dependent components
CN114528908A (en) Network request data classification model training method, classification method and storage medium
CN114282218A (en) Attack detection method and device, electronic equipment and storage medium
CN113052509A (en) Model evaluation method, model evaluation apparatus, electronic device, and storage medium
US20220405585A1 (en) Training device, estimation device, training method, and training program
CN111639277A (en) Automated extraction method of machine learning sample set and computer-readable storage medium
CN115718696B (en) Source code cryptography misuse detection method and device, electronic equipment and storage medium
CN112905987B (en) Account identification method, device, server and storage medium
CN111815442B (en) Link prediction method and device and electronic equipment