CN117313709B

CN117313709B - Method for detecting generated text based on statistical information and pre-training language model

Info

Publication number: CN117313709B
Application number: CN202311614320.1A
Authority: CN
Inventors: 张勇东; 毛震东; 徐本峰; 张立成; 胡博; 郭子康
Original assignee: University of Science and Technology of China USTC
Current assignee: University of Science and Technology of China USTC
Priority date: 2023-11-29
Filing date: 2023-11-29
Publication date: 2024-03-29
Anticipated expiration: 2043-11-29
Also published as: CN117313709A

Abstract

The invention relates to the technical field of generated text detection and discloses a generated text detection method based on statistical information and a pre-trained language model, in which the class label of a text is detected by a detection model consisting of a statistical learning model, a deep learning model, and a dynamic fusion framework. The detection model is constructed as follows: construct the statistical learning model; construct the deep learning model; construct the dynamic fusion framework; then, based on a training dataset, train the detection model by computing a cross-entropy loss between the dynamically fused class label probability distribution and the true class label. The statistical learning model mitigates poor model transferability when labeled data across multiple domains is limited; the deep learning model removes the need for manually designed features and can extract more implicit features; and the dynamic fusion framework improves the model's transferability with only a small loss in detection performance.

Description

Method for detecting generated text based on statistical information and pre-training language model

Technical Field

The invention relates to the technical field of generated text detection, in particular to a generated text detection method based on statistical information and a pre-training language model.

Background

With the development of large language models, generated text has become increasingly similar to human writing. At the same time, this poses a serious security problem, because machine-generated text may be used maliciously to mislead people. Generated text detection systems, which aim to distinguish whether a text was written by a machine or by a human, have therefore become a research hotspot in natural language processing in recent years. Statistical learning models do not require a large amount of labeled data for training and transfer easily to new domains, but their detection accuracy tends to be low. Deep learning models extract features automatically, which avoids the inconvenience of manually designed rules and features and the dependence of results on them; they can extract more implicit features and achieve better detection performance. However, training these models requires a large amount of in-domain labeled data, and detection performance drops sharply when they are transferred to a new domain. In many real-world scenarios, obtaining high-quality labeled data in multiple domains is time-consuming and labor-intensive, so building a well-performing generated text detection system with limited resources and data is a significant challenge.

Given that statistical learning models transfer well but detect poorly, while deep learning models transfer poorly, the invention combines a statistical learning model with a deep learning model to address the poor detection of generated text across multiple domains.

Disclosure of Invention

To solve the above technical problem, the invention provides a generated text detection method based on statistical information and a pre-trained language model: statistical features such as perplexity and word frequency are obtained through a language model, deep features of the text are extracted through a deep learning model, probability calibration is applied separately to the predictions made from the statistical features and from the deep features, and finally a dynamically fused prediction for the generated text is produced.

In order to solve the technical problems, the invention adopts the following technical scheme:

A method for detecting generated text based on statistical information and a pre-trained language model detects the class label of a text through a detection model consisting of a statistical learning model, a deep learning model, and a dynamic fusion framework. The training data set used to train the detection model is denoted $D=\{x_1,\dots,x_N\}$, with the corresponding label set $Y=\{y_1,\dots,y_N\}$ and $y_i\in\mathcal{C}$, where $\mathcal{C}$ is the set of class labels, $N$ is the length of the training data set, and $y_i$ is the class label corresponding to $x_i$. A text $x_i$ is a word sequence $x_i=(w_{i,1},\dots,w_{i,n_i})$, where $w_{i,j}$ denotes the $j$-th word of the $i$-th text $x_i$ and $n_i$ is the length of the text $x_i$;

the construction method of the detection model comprises the following steps:

Step one, constructing a statistical learning model:

The statistical learning model adopts an autoregressive language model. The generation probability $p(w_{i,j}\mid w_{i,1},\dots,w_{i,j-1})$ of each word in the text to be detected is obtained through the autoregressive language model, and the numbers of words in the text that rank within the top ten, the top hundred, and the top thousand of the vocabulary are counted as $c_{10}$, $c_{100}$, and $c_{1000}$, respectively. Based on the generation probability of each word, the probability $P(x_i)$ of the text $x_i$ is calculated, and from $P(x_i)$ the perplexity $\mathrm{PPL}(x_i)$ of the text $x_i$ is calculated. $c_{10}$, $c_{100}$, $c_{1000}$, and the perplexity of the text are used as statistical features, and the class label probability distribution $\hat{y}^{s}$ of the text to be detected, predicted from the statistical features, is obtained through a logistic regression classifier;

Step two, constructing a deep learning model:

The deep learning model adopts an autoencoding language model. After the text to be detected is encoded by the autoencoding language model, the vector representation $h_{[\mathrm{CLS}]}$ of the text's start token [CLS] is taken as the semantic representation of the whole text, and the class label probability distribution $\hat{y}^{d}$ of the text to be detected, predicted from the deep encoded features, is then obtained through a fully connected network and a classifier network;

Step three, constructing a dynamic fusion framework:

Label smoothing is used to extend the value range of the original one-hot labels from $\{0,1\}$ to $\{\epsilon/(K-1),\,1-\epsilon\}$, where $\epsilon$ is a constant representing the degree of smoothing. After label smoothing, the true probability distribution $q(k)$ of the predicted class is:

$$q(k)=\begin{cases}1-\epsilon, & k=y\\[2pt] \dfrac{\epsilon}{K-1}, & k\neq y\end{cases};$$

where $k$ denotes a class label predicted by the statistical learning model or the deep learning model; because both models use the cross-entropy loss function, $k$ refers collectively to the class labels predicted by the two models (they may also be denoted separately for each model). $y$ is the true class label, and $K$ is the total number of class labels. The cross-entropy loss function of the detection model is:

$$\mathcal{L}=-\sum_{k=1}^{K}q(k)\log p(k)=(1-\epsilon)\,\mathcal{L}_{\mathrm{CE}}-\frac{\epsilon}{K-1}\sum_{k\neq y}\log p(k);$$

where $p(k)$ is the predicted probability of class $k$ and $\mathcal{L}_{\mathrm{CE}}=-\log p(y)$ is the original cross-entropy loss of the logistic regression classifier and the classifier network. Finally, dynamic fusion yields the dynamically fused class label probability distribution $\hat{y}$ predicted from the two kinds of features: $\hat{y}=\alpha\,\hat{y}^{s}+\beta\,\hat{y}^{d}$, where $\alpha$ and $\beta$ are weight parameters;

Step four, based on the training data set, the detection model is trained by computing the cross-entropy loss function $\mathcal{L}$ between $\hat{y}$ and $y$.

Further, in step one, the probability $P(x_i)$ of the text $x_i$ is calculated from the generation probability of each word as:

$$P(x_i)=\prod_{j=1}^{n_i}p(w_{i,j}\mid w_{i,1},\dots,w_{i,j-1});$$

where $p(\cdot\mid\cdot)$ denotes a conditional probability.

Further, in step one, the perplexity $\mathrm{PPL}(x_i)$ of the text $x_i$ is calculated from $P(x_i)$ as:

$$\mathrm{PPL}(x_i)=P(x_i)^{-\frac{1}{n_i}}.$$

Further, in step one, when $c_{10}$, $c_{100}$, $c_{1000}$, and the perplexity of the text are used as statistical features, the class label probability distribution $\hat{y}^{s}$ of the text predicted from the statistical features is obtained through the logistic regression classifier as:

$$\hat{y}^{s}=\mathrm{LR}\big([\,c_{10};\,c_{100};\,c_{1000};\,\mathrm{PPL}(x_i)\,]\big);$$

where $\mathrm{LR}$ is the logistic regression classifier and $[\,\cdot\,;\cdot\,]$ denotes the concatenation operation.

Further, in step two, when the vector representation $h_{[\mathrm{CLS}]}$ of the text's start token [CLS] is taken as the semantic representation of the whole text, the class label probability distribution $\hat{y}^{d}$ of the text to be detected, predicted from the deep encoded features, is obtained through the fully connected network and the classifier network as:

$$\hat{y}^{d}=\sigma\big(W\,h_{[\mathrm{CLS}]}+b\big);$$

where $\sigma$ is the activation function of the classifier network, $W$ is the fully connected network, and $b$ is a bias parameter.

Compared with the prior art, the invention has the beneficial technical effects that:

The detection model comprises a statistical learning model, a deep learning model, and a dynamic fusion framework. The statistical learning model provides statistical features, which effectively mitigates poor model transferability when labeled data across multiple domains is limited. The deep learning model removes the need for manually designed features and can extract more implicit features; the pre-trained language model, with its strong encoding capacity, provides latent inter-word correlation features and improves the detection performance of the model. The dynamic fusion framework, on the one hand, uses label smoothing to calibrate the model's probabilities so that the predicted probabilities approach the true probabilities; on the other hand, it combines the advantages of the statistical learning model and the deep learning model, greatly improving the model's transferability with only a small loss in detection performance and achieving good detection results in new domains, which gives the method broad application prospects.

Drawings

FIG. 1 is a schematic diagram of a detection model in an embodiment of the invention.

Detailed Description

A preferred embodiment of the present invention will be described in detail with reference to the accompanying drawings.

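In the present invention, the training data set is $D=\{x_1,\dots,x_N\}$ and the corresponding label set is $Y=\{y_1,\dots,y_N\}$ with $y_i\in\mathcal{C}$. The label set $\mathcal{C}$ contains two labels, one denoting human-written text and the other denoting machine-generated text, and $N$ is the length of the training data set. A text $x_i$ is a word sequence $x_i=(w_{i,1},\dots,w_{i,n_i})$, where $w_{i,j}$ denotes the $j$-th word of the $i$-th text $x_i$ and $n_i$ is the length of $x_i$. The goal of the task is to learn a function $f$ that predicts the correct class label $y_i$ from $x_i$.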

The detection model provided by the invention is shown in fig. 1, and comprises the following three parts: (1) statistical learning models; (2) a deep learning model; (3) dynamic fusion framework.

(1) Statistical learning model

The main body of the statistical learning model is an autoregressive language model such as GPT-2, because the generation process of an autoregressive language model better simulates the way humans produce text. Such models predict the next word or token from the previously generated words or tokens, thereby generating semantically coherent text step by step. Language models tend to sample words with a higher generation probability, whereas the words chosen by humans are more random. A language model such as GPT-2 is therefore used to obtain the generation probability $p(w_{i,j}\mid w_{i,1},\dots,w_{i,j-1})$ of each word, which is the predicted probability of the $j$-th word given the preceding $j-1$ words, and to count the numbers of words in the text that rank within the top ten, the top hundred, and the top thousand of the vocabulary, denoted $c_{10}$, $c_{100}$, and $c_{1000}$, respectively.

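First, the probability $P(x_i)$ of each text $x_i$ is calculated from the generation probability of each word:

$$P(x_i)=\prod_{j=1}^{n_i}p(w_{i,j}\mid w_{i,1},\dots,w_{i,j-1});$$

From this, the perplexity $\mathrm{PPL}(x_i)$ of each text is calculated:

$$\mathrm{PPL}(x_i)=P(x_i)^{-\frac{1}{n_i}};$$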
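As a minimal sketch of the two quantities above, the following Python snippet computes the text probability in log space and the perplexity from a list of per-word generation probabilities. It assumes perplexity is defined as the inverse geometric mean of the per-word probabilities (the usual definition); the function names and the sample probabilities are hypothetical, and in practice the probabilities would come from an autoregressive model such as GPT-2.

```python
import math


def text_log_probability(token_probs):
    """Log of the text probability: the sum of the log conditional word probabilities."""
    if not token_probs:
        raise ValueError("token_probs must not be empty")
    return sum(math.log(p) for p in token_probs)


def perplexity(token_probs):
    """Perplexity as the inverse geometric mean of the per-word probabilities."""
    n = len(token_probs)
    return math.exp(-text_log_probability(token_probs) / n)


# Hypothetical per-word probabilities for a three-word text.
sample_probs = [0.5, 0.25, 0.125]
sample_ppl = perplexity(sample_probs)
print(sample_ppl)
```

For the sample values the product of the probabilities is 1/64, so the perplexity is 4 up to floating-point rounding. Summing logarithms instead of multiplying probabilities avoids numerical underflow on long texts.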

The word-rank statistics and the perplexity of the text are used as statistical features, and the class label probability distribution $\hat{y}^{s}$ of the input text, predicted from the statistical features, is obtained through a logistic regression classifier:

$$\hat{y}^{s}=\mathrm{LR}\big([\,c_{10};\,c_{100};\,c_{1000};\,\mathrm{PPL}(x_i)\,]\big);$$

where $\mathrm{LR}$ is the logistic regression classifier and $[\,\cdot\,;\cdot\,]$ denotes the concatenation operation.
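The feature construction and the logistic regression step can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: each word's rank is assumed to be its 1-based position in the language model's predicted distribution, the three counts are assumed to be cumulative (a rank of 3 counts toward all three thresholds), and the weights and bias are hypothetical values standing in for a trained classifier.

```python
import math


def rank_counts(ranks, thresholds=(10, 100, 1000)):
    """Count the words whose 1-based rank falls within each threshold (cumulative)."""
    return [sum(1 for r in ranks if r <= t) for t in thresholds]


def statistical_features(ranks, ppl):
    """Concatenate the three rank counts with the text perplexity."""
    return [float(c) for c in rank_counts(ranks)] + [float(ppl)]


def logistic_regression(features, weights, bias):
    """Binary logistic regression returning [P(negative class), P(positive class)]."""
    z = sum(w * x for w, x in zip(weights, features)) + bias
    p_positive = 1.0 / (1.0 + math.exp(-z))
    return [1.0 - p_positive, p_positive]


# Hypothetical ranks of five words and a hypothetical perplexity value.
ranks = [1, 3, 12, 250, 4000]
features = statistical_features(ranks, ppl=4.0)

# Hypothetical parameters; a real classifier would learn these from training data.
dist = logistic_regression(features, weights=[0.2, 0.1, 0.05, -0.3], bias=-0.5)
print(features, dist)
```

Which of the two classes is treated as positive is a labeling convention chosen by the implementer.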

(2) Deep learning model

The main body of the deep learning model is an autoencoding language model such as BERT rather than an autoregressive language model, because autoencoding language models generally perform better on language understanding tasks. After the text is encoded by a language model such as BERT, the vector representation $h_{[\mathrm{CLS}]}$ of the text's start token [CLS] is taken as the semantic representation of the whole text, and the class label probability distribution $\hat{y}^{d}$ of the input text, predicted from the deep encoded features, is then obtained through a fully connected network and a classifier network:

$$\hat{y}^{d}=\sigma\big(W\,h_{[\mathrm{CLS}]}+b\big);$$

where $\sigma$ is the activation function of the classifier network, $W$ is the fully connected network, and $b$ is a bias parameter.
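A minimal sketch of this classification head is given below. It assumes the activation function is a softmax over two classes, which the text does not state explicitly, and it uses a toy three-dimensional [CLS] vector with hypothetical weights; a real encoder such as BERT-base produces a 768-dimensional vector and the parameters are learned.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_cls(h_cls, weight_rows, bias):
    """Apply one fully connected layer to the [CLS] vector, then softmax."""
    logits = [
        sum(w * h for w, h in zip(row, h_cls)) + b
        for row, b in zip(weight_rows, bias)
    ]
    return softmax(logits)


# Toy [CLS] representation and hypothetical layer parameters (one row per class).
h_cls = [0.2, -0.1, 0.4]
weight_rows = [[0.5, -0.3, 0.1], [-0.5, 0.3, -0.1]]
bias = [0.0, 0.1]

deep_dist = classify_cls(h_cls, weight_rows, bias)
print(deep_dist)
```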

(3) Dynamic fusion framework

In classification tasks, attention is usually paid only to whether the model output exceeds a certain threshold, not to how confident the model is. In generated text detection, however, the confidence measure is equally important. Model calibration aims to keep the predicted probability consistent with the true empirical probability, that is, to make the probability predicted by the model as close as possible to the true probability. The invention uses label smoothing to extend the value range of the original one-hot labels from $\{0,1\}$ to a larger range, namely $\{\epsilon/(K-1),\,1-\epsilon\}$, where $\epsilon$ is a small number indicating the degree of smoothing. After label smoothing, the true probability distribution $q(k)$ of the predicted class becomes:

$$q(k)=\begin{cases}1-\epsilon, & k=y\\[2pt] \dfrac{\epsilon}{K-1}, & k\neq y\end{cases}.$$

where $k$ denotes a class label predicted by the statistical learning model or the deep learning model, $y$ is the true class label, and $K$ is the total number of classes; in this embodiment $K=2$. The cross-entropy loss function $\mathcal{L}$ changes to:

$$\mathcal{L}=-\sum_{k=1}^{K}q(k)\log p(k)=(1-\epsilon)\,\mathcal{L}_{\mathrm{CE}}-\frac{\epsilon}{K-1}\sum_{k\neq y}\log p(k).$$

where $p(k)$ is the predicted probability of class $k$ and $\mathcal{L}_{\mathrm{CE}}=-\log p(y)$ is the original cross-entropy loss of the logistic regression classifier and the classifier network. Finally, dynamic fusion yields the final, dynamically fused class label probability distribution $\hat{y}$ predicted from the two kinds of features:

$$\hat{y}=\alpha\,\hat{y}^{s}+\beta\,\hat{y}^{d};$$

where $\alpha$ and $\beta$ control the weight of each input probability distribution; $\alpha$ and $\beta$ are adjusted to obtain the best result.
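The label smoothing step described in this part can be sketched as follows. The snippet assumes the common variant in which the true class receives probability one minus the smoothing constant and the remainder is split evenly over the other classes; another widespread variant spreads the smoothing mass uniformly over all classes. Function names and sample values are hypothetical.

```python
import math


def smooth_labels(true_label, num_classes, epsilon):
    """Smoothed target: 1 - epsilon on the true class, the rest shared by the others."""
    if num_classes < 2:
        raise ValueError("num_classes must be at least 2")
    off_value = epsilon / (num_classes - 1)
    return [1.0 - epsilon if k == true_label else off_value
            for k in range(num_classes)]


def cross_entropy(target, predicted):
    """Cross entropy between a target distribution and a predicted distribution."""
    return -sum(q * math.log(p) for q, p in zip(target, predicted) if q > 0)


predicted = [0.2, 0.8]                       # hypothetical model output
hard_target = [0.0, 1.0]                     # one-hot label for class 1
soft_target = smooth_labels(1, 2, 0.1)       # smoothed label with epsilon = 0.1

hard_loss = cross_entropy(hard_target, predicted)
soft_loss = cross_entropy(soft_target, predicted)
print(soft_target, hard_loss, soft_loss)
```

With a smoothed target the loss no longer approaches zero as the prediction approaches the one-hot label, which discourages over-confident outputs and is the usual motivation for using label smoothing as a calibration aid.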

Based on the training data set, the detection model is trained by computing the cross-entropy loss function $\mathcal{L}$ between $\hat{y}$ and $y$.
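The fusion of the two predicted distributions and the adjustment of the fusion weights can be illustrated with the sketch below. It assumes the fusion is a weighted sum of the two distributions, renormalized so that the result is again a distribution, and it assumes the two weights sum to one and are chosen by a small grid search on held-out data; these choices, the function names, and the sample data are all hypothetical.

```python
def fuse(p_stat, p_deep, alpha, beta):
    """Weighted sum of two class distributions, renormalized to sum to one."""
    mixed = [alpha * s + beta * d for s, d in zip(p_stat, p_deep)]
    total = sum(mixed)
    if total <= 0:
        raise ValueError("fusion weights must give a positive total")
    return [m / total for m in mixed]


def accuracy(pairs, labels, alpha):
    """Accuracy of the fused prediction when beta is set to 1 - alpha."""
    correct = 0
    for (p_stat, p_deep), y in zip(pairs, labels):
        fused = fuse(p_stat, p_deep, alpha, 1.0 - alpha)
        if fused.index(max(fused)) == y:
            correct += 1
    return correct / len(labels)


def select_alpha(pairs, labels, grid=(0.0, 0.25, 0.5, 0.75, 1.0)):
    """Pick the grid value with the highest accuracy (first one on ties)."""
    return max(grid, key=lambda a: accuracy(pairs, labels, a))


# Hypothetical held-out predictions: (statistical, deep) distributions per text.
pairs = [([0.9, 0.1], [0.4, 0.6]), ([0.3, 0.7], [0.6, 0.4])]
labels = [0, 1]

best_alpha = select_alpha(pairs, labels)
print(best_alpha, fuse(pairs[0][0], pairs[0][1], best_alpha, 1.0 - best_alpha))
```

In a new domain where the deep model degrades, a search of this kind would tend to shift weight toward the statistical distribution, which is the intuition behind combining the two models.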

It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Furthermore, it should be understood that although this specification is described in terms of embodiments, not every embodiment contains only a single independent technical solution. The specification is written this way for clarity only; those skilled in the art should treat it as a whole, and the technical solutions of the embodiments may be combined appropriately to form other embodiments that will be understood by those skilled in the art.

Claims

1. A method for detecting generated text based on statistical information and a pre-trained language model, which detects the class label of a text through a detection model consisting of a statistical learning model, a deep learning model, and a dynamic fusion framework; the training data set used to train the detection model is denoted $D=\{x_1,\dots,x_N\}$, with the corresponding label set $Y=\{y_1,\dots,y_N\}$ and $y_i\in\mathcal{C}$, where $\mathcal{C}$ is the set of class labels, $N$ is the length of the training data set, and $y_i$ is the class label corresponding to $x_i$; a text $x_i$ is a word sequence $x_i=(w_{i,1},\dots,w_{i,n_i})$, where $w_{i,j}$ denotes the $j$-th word of the $i$-th text $x_i$ and $n_i$ is the length of the text $x_i$;

the construction method of the detection model comprises the following steps:

step one, constructing a statistical learning model:

the statistical learning model adopts an autoregressive language model; the generation probability $p(w_{i,j}\mid w_{i,1},\dots,w_{i,j-1})$ of each word in the text to be detected is obtained through the autoregressive language model, and the numbers of words in the text that rank within the top ten, the top hundred, and the top thousand of the vocabulary are counted as $c_{10}$, $c_{100}$, and $c_{1000}$, respectively; based on the generation probability of each word, the probability $P(x_i)$ of the text $x_i$ is calculated, and from $P(x_i)$ the perplexity $\mathrm{PPL}(x_i)$ of the text $x_i$ is calculated; $c_{10}$, $c_{100}$, $c_{1000}$, and the perplexity of the text are used as statistical features, and the class label probability distribution $\hat{y}^{s}$ of the text to be detected, predicted from the statistical features, is obtained through a logistic regression classifier;

step two, constructing a deep learning model:

the deep learning model adopts an autoencoding language model; after the text to be detected is encoded by the autoencoding language model, the vector representation $h_{[\mathrm{CLS}]}$ of the text's start token [CLS] is taken as the semantic representation of the whole text, and the class label probability distribution $\hat{y}^{d}$ of the text to be detected, predicted from the deep encoded features, is then obtained through a fully connected network and a classifier network;

step three, constructing a dynamic fusion framework:

label smoothing is used to extend the value range of the original one-hot labels from $\{0,1\}$ to $\{\epsilon/(K-1),\,1-\epsilon\}$, where $\epsilon$ is a constant representing the degree of smoothing; after label smoothing, the true probability distribution $q(k)$ of the class labels is:

$$q(k)=\begin{cases}1-\epsilon, & k=y\\[2pt] \dfrac{\epsilon}{K-1}, & k\neq y\end{cases};$$

where $k$ denotes a class label predicted by the statistical learning model or the deep learning model, $y$ is the true class label, and $K$ is the total number of class labels; finally, dynamic fusion yields the dynamically fused class label probability distribution $\hat{y}$ predicted from the two kinds of features: $\hat{y}=\alpha\,\hat{y}^{s}+\beta\,\hat{y}^{d}$, where $\alpha$ and $\beta$ are weight parameters;

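step four, based on the training data set, the detection model is trained by computing the cross-entropy loss function $\mathcal{L}$ between $\hat{y}$ and $y$:

the cross-entropy loss function $\mathcal{L}$ of the detection model is:

$$\mathcal{L}=-\sum_{k=1}^{K}q(k)\log p(k)=(1-\epsilon)\,\mathcal{L}_{\mathrm{CE}}-\frac{\epsilon}{K-1}\sum_{k\neq y}\log p(k);$$

where $p(k)$ is the predicted probability of class $k$ and $\mathcal{L}_{\mathrm{CE}}=-\log p(y)$ is the original cross-entropy loss of the logistic regression classifier and the classifier network;

in step one, the perplexity $\mathrm{PPL}(x_i)$ of the text $x_i$ is calculated from $P(x_i)$ as:

$$\mathrm{PPL}(x_i)=P(x_i)^{-\frac{1}{n_i}}.$$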

2. The method for detecting generated text based on statistical information and a pre-trained language model according to claim 1, wherein in step one, the probability $P(x_i)$ of the text $x_i$ is calculated from the generation probability of each word as:

$$P(x_i)=\prod_{j=1}^{n_i}p(w_{i,j}\mid w_{i,1},\dots,w_{i,j-1});$$

where $p(\cdot\mid\cdot)$ denotes a conditional probability.

3. The method for detecting generated text based on statistical information and a pre-trained language model according to claim 1, wherein in step one, when $c_{10}$, $c_{100}$, $c_{1000}$, and the perplexity of the text are used as statistical features, the class label probability distribution $\hat{y}^{s}$ of the text predicted from the statistical features is obtained through the logistic regression classifier as:

$$\hat{y}^{s}=\mathrm{LR}\big([\,c_{10};\,c_{100};\,c_{1000};\,\mathrm{PPL}(x_i)\,]\big);$$

where $\mathrm{LR}$ is the logistic regression classifier and $[\,\cdot\,;\cdot\,]$ denotes the concatenation operation.

4. The method for detecting generated text based on statistical information and a pre-trained language model according to claim 1, wherein in step two, when the vector representation $h_{[\mathrm{CLS}]}$ of the text's start token [CLS] is taken as the semantic representation of the whole text, the class label probability distribution $\hat{y}^{d}$ of the text to be detected, predicted from the deep encoded features, is obtained through the fully connected network and the classifier network as:

$$\hat{y}^{d}=\sigma\big(W\,h_{[\mathrm{CLS}]}+b\big);$$