CN116206375A

CN116206375A - Face counterfeiting detection method based on double-layer twin network and sustainable learning

Info

Publication number: CN116206375A
Application number: CN202310474306.XA
Authority: CN
Inventors: 鲍志鹏; 周琪华; 周志立; 袁程胜; 潘文焱; 崔琦
Original assignee: Nanjing University of Information Science and Technology
Current assignee: Nanjing University of Information Science and Technology
Priority date: 2023-04-28
Filing date: 2023-04-28
Publication date: 2023-06-02
Anticipated expiration: 2043-04-28
Also published as: CN116206375B

Abstract

The invention discloses a face counterfeiting detection method based on a double-layer twin network and sustainable learning, which comprises the following steps: constructing an image training set for a continuous learning strategy; training the constructed double-layer twin network through a continuous learning strategy, wherein the double-layer twin network comprises: the supervised subnetwork for fast learning is applicable to the unsupervised subnetwork for slow learning and the memory module; the non-supervision sub-network extracts the characteristics through non-supervision learning and guides the supervised sub-network, and the supervised sub-network performs the supervised learning to extract the characteristics under the guidance of the non-supervision sub-network; the memory module is used for consolidating the learned knowledge; inputting the image to be detected into a trained detection model, and detecting the specific position of face counterfeiting in the image by dividing and detecting the image by the model. The invention can improve the accuracy of the human face fake detection model and simultaneously realize the prediction of specific fake positions; and the generalization performance of the face counterfeiting detection model is improved by using a continuous learning strategy.

Description

Face counterfeiting detection method based on double-layer twin network and sustainable learning

Technical Field

The invention belongs to the field of artificial intelligence security, and particularly relates to a face counterfeiting detection method based on a double-layer twin network and sustainable learning.

Background

Face deep forgery is a generated deep learning algorithm that can create or modify facial features of a face, and it is difficult for the human eye to distinguish between the true and false images from these modified false videos or images, so how to effectively detect face forgery is a problem that needs to be solved.

Currently, most face counterfeiting detection methods are based on deep learning, and the methods generally use an artificial neural network to extract features of a face image and identify tamper marks so as to detect whether the face image is counterfeit. However, the neural network model of many face fake detection methods based on deep learning generally has only a single network architecture, and such a network architecture model usually tends to extract features in a single dimension when training to extract features, but ignores extracting features from a multi-dimension perspective, so that the accuracy of detection is not high. Therefore, if the network model is decoupled into two sub-networks and the multi-dimensional features are interactively fused, the capability of overall feature extraction of the model can be improved, and the accuracy of model detection can be improved.

Although the detection method based on deep learning can obtain a good detection effect, most methods focus on improving accuracy, but neglect generalization of the detection method. Therefore, when the forgery technology to be detected is different from the forgery technology at the time of training, the accuracy of detection may be greatly reduced, mainly because of insufficient generalization of the detection model.

Although a few new methods for improving the generalization of detection have been proposed, there is still a great room for improvement of the generalization of the detection model. The continuous learning refers to continuous learning of the model in a plurality of different task projects, and the performance of the model on new knowledge is improved on the premise of not forgetting the learned knowledge to the maximum extent. Therefore, if the model is optimized based on continuous learning, the defect of the model in generalization can be well made up, and the model has good accuracy in the technology of detecting unknown face counterfeiting. On the other hand, many methods based on continuous learning perform iterative supervised learning on the data so that the obtained feature representations are prone to overfitting. This is because supervised learning is more prone to fit old features on known tasks, and thus during continuous learning, when new unknown tasks are learned, it is more prone to forget old tasks that were learned before. In general, when a model trained by supervised learning is used for a new unknown task, catastrophic forgetting is easy to occur, and accuracy is greatly reduced.

In real life, because the face counterfeiting modes are various, the influence of different face counterfeiting modes on the image is different, and the existing face counterfeiting detection generally only judges whether the face image is counterfeit or not, and rarely judges the specific counterfeiting position of the face counterfeiting image.

In summary, it can be obtained that the main defects and shortcomings of the existing face counterfeiting detection technology are as follows:

1) Most of the existing face fake detection network models are single network model structures, when the model structures extract features, the features with single dimension tend to be extracted, the features with multiple dimensions are easily ignored, and the detection accuracy is not high enough;

2) The existing face counterfeiting detection method has insufficient generalization performance, and the accuracy of prediction is not high when facing to unknown counterfeiting technology. In addition, the existing continuous learning-based method has the problems of disastrous forgetting and overfitting to a certain extent because most of the existing continuous learning-based methods only adopt supervised learning;

3) The existing face fake detection method generally only judges whether the face image is fake or not, and does not detect specific fake positions.

Disclosure of Invention

The invention aims to: the invention aims to provide a method capable of improving accuracy, generalization performance and detecting specific face counterfeiting

A human face fake detection method based on a double-layer twin network and sustainable learning of a position.

The technical scheme is as follows: the face counterfeiting detection method of the invention comprises the following steps:

s1, preprocessing the obtained huge number of face fake images, dividing an image dataset into different task datasets according to different fake technologies, and integrating all the task datasets to form a complete face fake image dataset;

s2, training a pre-constructed double-layer twin network based on a continuous learning strategy by taking a face counterfeiting image dataset as a training set to obtain a face counterfeiting detection model based on the double-layer twin network and the continuous learning;

s3, detecting a face image to be detected by adopting a face counterfeiting detection model, and predicting the counterfeiting probability of each pixel point in the image; if the forging probability is larger than the threshold value, judging that the pixel point is forged, and finally obtaining a predicted forging result image and a forging position corresponding to the image to be detected.

Further, in step S1, the implementation steps of the face counterfeit image dataset are as follows:

s11, selecting a public data set faceforensis++ and a face public data set CelebA of a face fake detection task;

the public data set comprises a large number of face fake videos, four face fake technologies of DeepFakes, face2Face, faceSwap and NeuralTextures are utilized, and a frame sampling mode is adopted to obtain face fake images according to the videos in the public data set;

the Face public data set comprises a large number of real Face images, four Face forging technologies of deep fakes, face2Face, faceSwap and NeuralTextures are utilized to forge the Face images in the Face public data set, and each image adopts a forging mode at random;

s12, uniformly adjusting all the obtained face fake images to be the same size, and dividing the face fake images into t task data sets according to different fake technologies

Wherein->

The dataset composed of individual forgery techniques is represented as

All task data sets are integrated together to form a complete face counterfeit image data set.

Further, in step S2, the dual-layer twin network mainly includes three parts: a supervised subnetwork for fast learning, an unsupervised subnetwork for slow learning, and a memory module;

the unsupervised sub-network fully learns the falsified trace features of the human face falsification in an unsupervised learning mode, fuses each layer of learned features to the supervised sub-network in a feature fusion mode to guide the learning of the supervised sub-network, calculates self-supervision loss by utilizing the current detection result of the supervised sub-network, and updates the weight parameters of the unsupervised sub-network;

the supervised subnetwork is trained under supervised learning with three losses including: the loss is calculated according to the unsupervised detection result of the unsupervised subnetwork; calculating the loss by using the supervised learning detection result of the supervised subnetwork under the non-supervision guidance; calculating KL divergence between the detection result of the current model on the memory sample and the recorded old detection result of the memory sample under the guidance of the unsupervised subnetwork;

the memory module consolidates the knowledge learned by the face counterfeiting detection model through continuous experience playback, a part of samples randomly extracted each time are copied into the memory module, and when the face counterfeiting detection model learns a new counterfeiting technology, sample data are continuously sampled from the memory module to train the face counterfeiting detection model.

Further, the unsupervised subnetwork includes a network for capturing counterfeit trace

Self-monitoring encoder->

And affine network->

Wherein a network of counterfeit marks is captured->

The method is composed of 6 convolution layers in an accumulation and stacking way, and the learning aim is to find a fake area in an image and to cover the fake area; self-supervision encoder->

Each block in the ResNet34 is utilized to replace the corresponding block in the encoder in the U-Net architecture in sequence; affine network->

The system consists of a full connection layer, a batch standardization layer and a ReLU activation function layer;

the supervised subnetwork comprises a supervised encoder

Decoder->

And a split detector->

Wherein, supervised encoder->

Self-supervision coder in the same structure as the non-supervision subnetwork>

Decoder->

Is composed of 6 convolution layers and residual structure, and the supervised coder in the supervised sub-network>

And decoder->

The residual structures are connected in the same dimension to form a U-Net structure; split detector->

Is composed of a full connection layer and a Sigmoid function layer.

Further, in step S2, with the face counterfeit image dataset as a training set, based on a continuous learning strategy, training the pre-built double-layer twin network includes the following steps:

s21, randomly taking a data set consisting of the t-th fake technology from the training set

The j th batch of (a) has label sample images, expressed as +.>

Copying a portion of the samples of the batch of sample images into a memory module, represented as

The method comprises the steps of carrying out a first treatment on the surface of the Wherein, recordThe memory module is a memory area for storing sample data when the double-layer twin network is constructed;

s22, regarding the first

Data set consisting of individual forgery techniques->

Sample image of lot j->

Duplicate into two copies, input the unsupervised subnetwork through two different routes separately:

route 1 is a direct input self-supervising encoder

By means of a self-supervising encoder->

Each layer of extracted features multiplies the same-dimensional features element by element in a feature fusion mode to guide a supervised encoder in a supervised sub-network>

Learning process of (a) and obtaining a prediction result +.>

The method comprises the steps of carrying out a first treatment on the surface of the Furthermore, the->

Sending into affine network->

Get the characteristics->

；

Route 2 is input to the network for capturing counterfeit trace

Finally, the characteristic->

；

At this time, the features are calculated by an unsupervised manner

and />

Barlow Tins self-supervision loss->

The expression is as follows:

wherein ,

in the formula ,

is a super-parameter weighting factor; />

Is->

And->

Is a cross-correlation matrix of (a); m and n represent->

And->

Index of dimensions on the two feature vectors; />

Representation matrix->

Meta-on mid-diagonalA prime value; />

Representation matrix->

The element values of the m-th row and the n-th column;

wherein ,

equivalent to two different amplified feature vectors +.>

Mth dimension and->

B is represented as an index of the current batch of samples;

s23, regarding the data set composed by the t-th fake technology

The%>

Batched label sample image->

Firstly, inputting the result into an unsupervised sub-network, and outputting the prediction result of the unsupervised sub-network +.>

Using the prediction result

Calculating a first segmentation loss function Dice loss->

Feature extraction of a constrained face counterfeiting detection model under unsupervised learning, and a first segmentation loss function Dice loss +.>

Expression of (2)The formula is as follows:

wherein y represents the true forging position of the mark;

then, sampling the sample data in the memory module to obtain the sample data

Predicting by using the supervised subnetwork guided by the unsupervised subnetwork to obtain the prediction result of the supervised subnetwork>

By using the prediction result->

Calculating a second division loss function Dice loss->

Feature extraction under supervised learning of a constrained face counterfeiting detection model, and second segmentation loss function Dice loss ++>

The expression of (2) is as follows:

representing the number of sample data sampled in the memory module, < >>

Representing the true counterfeit location of the sample data annotation of the sample;

prediction results through a supervised network

And learning the same sampleRecorded old prediction +.>

Calculating KL divergence between the two to obtain +.>

Loss:

wherein ,/>

Indicating the calculation of the KL divergence between the two inputs, and (2)>

Is the hyper-parametric degradation coefficient,/->

Indicates activation via the neural network Softmax layer, < ->

Is a super parameter temperature coefficient;

finally, supervised loss of face counterfeit detection model

The method comprises the following steps:

using supervised losses

And simultaneously, optimizing the weight parameters of the supervised sub-network and the unsupervised sub-network, and continuously iterating until the conditions are met, so as to finally obtain the optimal weight parameters.

Further, in step S2, the task data set is repeatedly selected based on the continuous learning strategy

Different ones of (a)Subtask data set composed of manufacturing technology +.>

And for continuously training the two-layer twin network until the task dataset is +.>

All the t subtask data sets are selected; finally, the optimal high generalization weight parameter is obtained.

Compared with the prior art, the invention has the following remarkable effects:

1. the double-layer twin network model designed by the invention consists of two sub-networks, wherein the two sub-networks respectively correspond to the extracted features under the unsupervised learning and the extracted features under the supervised learning, the two sub-networks interact, the features are fused, the model can acquire the multi-dimensional features, the feature extraction capability of the face counterfeiting image of the model is reasonably enhanced, and the accuracy of the face counterfeiting detection is effectively improved;

2. the invention uses a continuous learning strategy to train the double-layer twin network model, thereby effectively overcoming the defect of the existing method in the aspect of insufficient detection generalization performance; meanwhile, the feature is extracted by adding the unsupervised learning in the continuous learning, so that the problems of disastrous forgetting and overfitting caused by the fact that the existing continuous learning-based method only adopts the supervised learning mode are solved, and the generalization performance of the model face counterfeiting detection is further improved;

3. the invention adopts a special loss function for segmentation detection and a U-Net network structure, judges whether the image is forged or not, and can also segment the image to detect the specific position of face forging.

Drawings

FIG. 1 is a diagram of the whole structure in the training process of a face fake detection model;

FIG. 2 is a schematic diagram of an unsupervised subnetwork in a two-layer twin network of the present invention;

FIG. 3 is a schematic diagram of a supervised subnetwork in a two-tier twin network of the present invention;

fig. 4 is a schematic diagram of the interaction between two subnetworks in a two-layer twin network of the present invention.

Detailed Description

The invention is described in further detail below with reference to the drawings and the detailed description.

The human face counterfeiting detection method based on the double-layer twin network and sustainable learning effectively improves the accuracy of model detection by combining the supervised learning and the unsupervised learning. Meanwhile, the generalization capability of the detection model to unknown fake modes is greatly improved through a continuous learning strategy, and the unsupervised addition also relieves the problems of disastrous forgetting and overfitting in continuous learning. In addition, the invention can accurately detect the specific forged position.

As shown in fig. 1, the embodiment provides a face counterfeiting detection method based on a double-layer twin network and sustainable learning, and the specific details include the following steps:

step 1, constructing a training data set

Preprocessing the obtained huge number of face fake images, dividing the image data set into different task data sets according to different fake technologies, and integrating all the task data sets to form a complete face fake image data set so as to facilitate subsequent continuous learning.

In the embodiment, two data sets are selected as experimental data sets, namely a data set faceforensis++ which is most commonly used for Face forging and a Face public CelebA data set, wherein the faceforensis++ data set has a large number of Face forging videos, four Face forging technologies, namely deep fakes, face2Face, faceSwap and neuroalTextures, are utilized for sampling frames of the videos by using an opencv-python tool library package, and particularly 10 frames per second are utilized for sampling the videos; celebA data set containing a large number of real Face images, forging the Face images by using four Face forging technologies of deep fakes, face2Face, faceSwap and NeuralTextures, randomly adopting a forging mode for each image to ensure that the number of the images applied by the four forging modes is approximately the same, obtaining a huge number of Face forging images by the two modes, preprocessing the Face forging images, and carrying out the imagesThe dimensions of the images are scaled to 158 x 158; dividing the adjusted image into t task data sets according to different forging techniques

Wherein->

The dataset consisting of individual forgery techniques is denoted +.>

All are integrated together to form a large complete face counterfeit image dataset.

Preferably, t is 4 in the embodiment, and the data set is divided into 4 different tasks, each subtask data set contains 3 ten thousand face fake images, and the total data set contains 12 ten thousand face fake images for training and testing.

And 2, training a pre-constructed double-layer twin network based on a continuous learning strategy by taking the face fake image data set in the step 1 as a drive. The dual-layer twin network mainly comprises three parts: the system comprises a supervised subnetwork suitable for fast learning, an unsupervised subnetwork suitable for slow learning and a memory module.

The unsupervised sub-network fully learns the falsified trace features of the human face falsification in an unsupervised learning mode, fuses each layer of learned features to the supervised sub-network in a feature fusion mode to guide the learning of the supervised sub-network, and calculates self-supervision loss by utilizing the current detection result of the supervised sub-network so as to update the weight parameters of the unsupervised sub-network.

The supervised subnetwork uses three losses to train together under supervised learning, including A1) the unsupervised detection results of the unsupervised subnetwork to calculate losses; a2 Using the supervised learning detection results of the supervised subnetwork under the unsupervised guidance to calculate the loss; a3 Calculating KL divergence between the detection result of the current model on the memory sample under the guidance of the non-supervision subnetwork and the old detection result of the memory sample recorded before, combining the three losses, and realizing the feature extraction of the simultaneous constraint model, wherein the non-supervision subnetwork and the supervision subnetwork have complementary effects.

In addition, the memory module consolidates the knowledge learned by the face counterfeiting detection model through continuous experience playback, a part of samples randomly extracted each time are copied into the memory module, when the face counterfeiting detection model learns a new counterfeiting technology, sample data can be continuously sampled from the memory module, and the face counterfeiting detection model is trained, so that old knowledge learned before is consolidated. The weight parameters of the whole model are optimized through continuous iteration, so that the optimal weight parameters are finally obtained, and the common characteristics among the proper various face counterfeiting technologies can be extracted, so that the high generalization in face counterfeiting detection is realized.

Training a pre-constructed double-layer twin network by using the face fake image data set in the step 1, wherein the method comprises the following steps of:

step 21, randomly selecting a data set consisting of the t-th fake technology from the training image data set

The%>

Batched label sample image, denoted +.>

Copying part of the samples of the batch of sample images into a memory module of the model, denoted +.>

The memory module is a memory area for storing sample data when the double-layer twin network is constructed.

Preferably, the number of sample images per batch taken in this embodiment is 32, and the number of sample images copied into the memory module therein is 12.

Step 22, as shown in FIG. 2, begins to build an unsupervised subnetwork in the two-layer twinning network consisting of a network of captured forgery marks

Self-monitoring encoder->

And affine network->

The structure is that the network for capturing fake trace consists of 6 convolution layer accumulation stacks, and the self-supervision encoder is->

The network of the corresponding block in the encoder in the U-Net architecture is replaced by each block in the Resnet34 in turn, affine network ∈>

Consists of a full connection layer, a batch standardization layer and a ReLU activation function layer.

For the first

Data set consisting of individual forgery techniques->

The%>

Batch sample image +.>

Duplicate into two, walk two different routes to input the unsupervised subnetwork separately:

route 1 is a direct input self-supervising encoder

By means of a self-supervising encoder->

Each layer of extracted features are multiplied element by the same-dimensional features in a feature fusion mode, and a supervised encoder in a supervised sub-network is subjected to +.>

Guiding to obtain the predicted result +.>

Prediction result->

Sending into affine network->

Get the characteristics->

；

Another route 2 is through a network of capturing counterfeit traces

The network aims to find possible fake areas in the image as much as possible and to mask the fake areas, the specific route is the same as route 1, and finally the characteristic +.>

。

At this time, the same sample batch is routed through two different routes to obtain a pair of features

and />

The self-supervision loss of Barlow Twos of these two features can be calculated in an unsupervised manner>

The weight parameters of the unsupervised subnetwork are optimized, so that the unsupervised subnetwork can extract more proper counterfeit characteristics, and meanwhile, the unsupervised learning mode can reduce the overfitting of the model and can relieve the catastrophic forgetting in the continuous learning: />

wherein ,

in the formula (1), the components are as follows,

is a super-parameter weighting factor; />

Is->

And->

Is a cross-correlation matrix of (a); m and n represent->

And->

Index of dimensions on the two feature vectors; />

Representation matrix->

Element values on the middle diagonal; />

Representation matrix->

The element values of the m-th row and the n-th column. In the formula (2), ->

And is also equivalent to two different amplified feature vectors +>

Mth dimension/>

And b is represented as an index of the current batch of samples. In this embodiment super parameter +.>

Set to 0.5.

Step 23, as shown in FIG. 3, the construction of a supervised subnetwork in a two-layer twinning network is started, the supervised subnetwork being composed of supervised encoders

Decoder->

And a split detector->

The construction, wherein the supervision encoder->

Self-supervision coder in the same structure as the non-supervision subnetwork>

Decoder->

Is composed of multiple convolution layers and residual structure, and the supervised encoder in the supervised sub-network>

Decoder->

And the residual structures are connected in the same dimension to form a Unet structure. Split detector->

Is composed of a full connection layer and a Sigmoid function layer.

As shown in fig. 4, in a supervised subnetworkSupervised encoder

Self-supervising encoder with non-supervising sub-network

The feature fusion mechanism is arranged on the same-dimensional feature of each layer of block for extracting the features, the non-supervision subnetwork is connected with the supervised subnetwork in a unidirectional way, and the element-by-element multiplication can be carried out aiming at the same-dimensional feature, so that the purpose that the non-supervision subnetwork guides the features of the supervised subnetwork is achieved.

For any data set composed of t fake technology

The%>

Batched label sample image

By using the prediction result->

Calculating a first segmentation loss function Dice loss->

Feature extraction of constrained double-layer twin network under unsupervised learning, first segmentation loss function Dice loss ∈>

The expression of (2) is as follows:

wherein y represents the true forging position of the mark;

representing intersection operations, ++>

Representing a union operation.

Then sampling the sample data in the memory module to obtain sample data

By using the prediction result->

Calculating a second division loss function Dice loss->

Feature extraction of the constrained double-layer twin network under supervised learning, and sampling in the memory module, so that the supervised subnetwork can learn common features among various forging technologies better, and the second segmentation loss function price loss->

The expression of (2) is as follows: />

in the formula ,

representing the number of sample data sampled in the memory module, < >>

Representing the true counterfeit location of the sample data annotation of the sample; in this embodiment, the number of sample data sampled in the memory12.

Using prediction results from sampling from a memory module and then passing through a supervised network

And learning the old prediction result recorded at the same time of the sample +.>

Calculating KL divergence between the two to obtain loss +.>

The method can relieve the problem of catastrophic forgetting in continuous learning, and avoid completely different predicted results from the current predicted results when predicting old learned knowledge when learning new knowledge; using shortened KL divergence to enable predictive outcome of a supervised network>

And old prediction result->

The difference is kept small, and finally, the double-layer twin network can be forced to learn the common characteristics among various face counterfeiting technologies, so that the face counterfeiting detection model has excellent face counterfeiting detection generalization performance and KL divergence loss->

The expression of (2) is as follows:

in the formula ,

indicating the calculation of the KL divergence between the two, < >>

Is the hyper-parametric degradation coefficient,/->

Indicates activation via the neural network Softmax layer, < ->

Is a super-parametric temperature coefficient.

Preferably, in this embodiment, the superparameter

Setting to 10.0, super parameter +.>

Set to 2.0.

Eventually, the two-layer twinned network has a supervised penalty

Summing the three loss functions described above:

using supervised losses

And simultaneously, optimizing the weight parameters of the supervised sub-network and the unsupervised sub-network, and continuously iterating to finally obtain the optimal weight parameters.

Based on continuous learning strategy, task data sets are repeatedly selected continuously

Subtask data set composed of different forgery techniques +.>

And (5) all t subtask data sets are selected. Finally, the optimal high generalization weight parameter is obtained.

And 3, detecting the face image to be detected by using the face counterfeiting detection model trained in the step 2 and based on the double-layer twin network and sustainable learning, and obtaining the counterfeiting prediction probability of each pixel point in the image, if the probability is greater than a threshold value, judging that the pixel point is counterfeiting, and finally obtaining a predicted counterfeiting result image corresponding to the image to be detected, wherein the counterfeiting white pixel point position of the predicted result image in fig. 1 is the counterfeiting position predicted by the model.

Preferably, in this embodiment, the threshold is set to 0.5, i.e. if the prediction probability of a pixel is greater than 0.5, the pixel is determined to be counterfeit.

The above is a preferred embodiment of the present invention, and all changes made according to the technical solution of the present invention belong to the protection scope of the present invention when the generated functional effects do not exceed the scope of the technical solution of the present invention.

Claims

1. The human face counterfeiting detection method based on the double-layer twin network and sustainable learning is characterized by comprising the following steps of:

2. The face fake detection method based on the double-layer twin network and sustainable learning according to claim 1, wherein in step S1, the implementation steps of the face fake image dataset are as follows:

s12, uniformly adjusting all the obtained face fake images to the same size, and dividing the face fake images into different parts according to fake technologies

Personal task data set->

Wherein->

The dataset consisting of individual forgery techniques is denoted +.>

3. The face falsification detection method based on a double-layer twin network and sustainable learning according to claim 1, wherein in step S2, the double-layer twin network mainly comprises three parts: a supervised subnetwork for fast learning, an unsupervised subnetwork for slow learning, and a memory module;

4. A face falsification detection method based on a double-layer twin network and sustainable learning as claimed in claim 3 wherein the unsupervised subnetwork comprises a network capturing falsification trace

Self-monitoring encoder->

And affine network->

Wherein a network of counterfeit marks is captured->

the supervised subnetwork comprises a supervised encoder

Decoder->

And a split detector->

Wherein, supervised encoder->

Self-supervision coder in the same structure as the non-supervision subnetwork>

Decoder->

And decoder->

By a full connection layer and a Sigmoid function layerThe composition is formed.

5. The face fake detection method based on the double-layer twin network and the sustainable learning according to claim 4, wherein in step S2, the training of the pre-constructed double-layer twin network based on the sustainable learning strategy by using the face fake image dataset as a training set comprises the following steps:

s21, randomly taking the first part from the training set

Data set consisting of individual forgery techniques->

The%>

Batched label sample image, denoted +.>

The method comprises the steps of carrying out a first treatment on the surface of the The memory module is a memory area for storing sample data when the double-layer twin network is constructed;

s22, regarding the first

Data set consisting of individual forgery techniques->

The%>

Batch sample image +.>

Is duplicated into two parts, respectively by two different partsRoute input unsupervised subnetwork:

route 1 is a direct input self-supervising encoder

By means of a self-supervising encoder->

Learning process of (a) and obtaining a prediction result +.>

Sending into affine network->

Get the characteristics->

；

Route 2 is input to the network for capturing counterfeit trace

Finally, the characteristic->

；

At this time, the features are calculated by an unsupervised manner

and />

Barlow Tw of (A)ins self-supervision loss->

The expression is as follows:

wherein ,

in the formula ,

is a super-parameter weighting factor; />

Is->

And->

Is a cross-correlation matrix of (a); />

Representation->

And->

Index of dimensions on the two feature vectors; />

Representation matrix->

Element values on the middle diagonal; />

Representation matrix->

Middle->

Line, th->

Element values of columns;

wherein ,

equivalent to two different amplified feature vectors +.>

First->

Dimension and->

Is>

Sum of products of corresponding values of the dimensions, +.>

An index represented as a current batch of samples;

s23, for the first

Data set consisting of individual forgery techniques->

The%>

Batched label sample image->

By using the prediction result->

Calculating a first segmentation loss function Dice loss->

The expression of (2) is as follows:

wherein ,/>

Representing the true counterfeit location of the annotation;

then, sampling the sample data in the memory module to obtain the sample data

By using the prediction result->

Calculating a second division loss function Dice loss->

The expression of (2) is as follows:

representing the number of sample data sampled in the memory module, < >>

prediction results through a supervised network

And learning old prediction results recorded while the same sample

Calculating KL divergence between the two to obtain +.>

Loss:

wherein ,/>

Is the hyper-parametric degradation coefficient,/->

Indicates activation via the neural network Softmax layer, < ->

Is a super parameter temperature coefficient;

finally, supervised loss of face counterfeit detection model

The method comprises the following steps:

using supervised losses

6. The face forgery detection method based on the double-layer twin network and the sustainable learning as claimed in claim 4, wherein in step S2, the task data set is repeatedly selected based on the sustainable learning strategy

Subtask data set composed of different forgery techniques +.>

Middle->

Subtask data set completionThe part is selected; finally, the optimal high generalization weight parameter is obtained. />