CN114707572A - Deep learning sample testing method and device based on loss function sensitivity - Google Patents

Deep learning sample testing method and device based on loss function sensitivity

Info

Publication number
CN114707572A
Authority
CN
China
Prior art keywords: sample, sensitivity, loss function, drift, loss
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210173151.1A
Other languages
Chinese (zh)
Inventor
陈晋音
李晓豪
金海波
郑海斌
宣琦
倪洪杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT
Priority to CN202210173151.1A
Publication of CN114707572A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/004 Artificial life, i.e. computing arrangements simulating life
    • G06N3/006 Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Probability & Statistics with Applications (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a deep learning sample testing method and device based on loss function sensitivity. The method comprises: acquiring an image data set as clean samples and constructing a deep learning network; performing an adversarial attack on the image data set to obtain adversarial samples; screening to obtain a total sample set and dividing it into a training sample set and a test sample set; setting a loss function sensitivity function; establishing a particle swarm optimization model, calculating loss sensitivity drift values, iteratively updating them, and selecting two optimal drift sample sets, one for normal samples and one for adversarial samples; training a loss sensitivity classifier with the optimal drift sample sets; inputting the two optimal drift sample sets into the deep learning network for retraining to obtain a test model; and inputting the test sample set into the test model for iteration, searching for the optimal drift noise sample, calculating its loss function sensitivity drift value, and discriminating with the loss sensitivity classifier to complete the detection of the sample.

Description

Deep learning sample testing method and device based on loss function sensitivity
Technical Field
The invention belongs to the field of deep learning sample security, and particularly relates to a deep learning sample testing method and device based on loss function sensitivity.
Background
Deep learning is one of the directions of artificial intelligence that has attracted much attention in recent years. Deep Neural Networks (DNNs) have enjoyed great success in many applications, and with the arrival of massive data, neural networks achieve good performance on image classification tasks; the ImageNet data set, for example, contains millions of manually labeled images. Although Amazon and other large platforms label the class labels of large-scale images as carefully as possible, some errors are inevitable in the labeling process, and mislabeled samples easily damage the performance of a model trained on samples of the same type. Moreover, the ability of deep neural networks to memorize massive data with randomly assigned labels demonstrates their sensitivity to overfitting when training on noise samples. A DNN algorithm robust to noisy labels is therefore needed. Further, where samples are readily available but accurate labeling is expensive, such an algorithm can make a data set with many noisy labels more useful than a data set with fewer but more accurate labels.
At present, attacks on artificial intelligence are mainly divided into poisoning attacks and adversarial attacks. A poisoning attack is mainly carried out by mislabeling normal samples to form poisoned samples, which are then injected into the training data set, so that a backdoor trigger is embedded into the deep learning model during the training stage; the attack breaks out when a poisoned sample is input during the testing stage. An adversarial attack mainly adds perturbations imperceptible to the human eye to an original sample. Different from other attacks, an adversarial attack mainly takes place when the adversarial sample is constructed: the generated adversarial samples and normal samples are input into the machine learning model to deceive the recognition result. Adversarial attacks are divided into white-box attacks and black-box attacks.
However, existing neural network testing methods focus on the recognition accuracy of the model and its robustness against malicious samples, and existing sample testing methods can only identify a small number of samples or one specific type of sample, ignoring the risk that multiple types of malicious samples damage the model when it is deployed at the edge. Malicious samples of different types still have destructive effects on defended neural network models. When a neural network is applied in real life, for example in an autonomous driving scenario, malicious samples formed by different weather conditions or by contamination of targets such as pedestrians or road signs can attack the automatic discrimination model, bringing great risk of model errors; this potential risk cannot be ignored.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a deep learning sample testing method and device based on loss function sensitivity.
In order to achieve this purpose, the technical scheme of the invention is as follows. A deep learning sample testing method based on loss function sensitivity comprises the following steps:
(1) acquiring an image data set as a clean sample, and constructing a deep learning network;
(2) selecting an adversarial attack method, and performing an adversarial attack on the image data set obtained in step (1) to obtain adversarial samples; screening, and taking the screened clean samples and adversarial samples as a total sample set; dividing the total sample set into a training sample set and a test sample set;
(3) setting a loss function sensitivity function based on the loss function and the training sample set;
(4) establishing a particle swarm optimization model, calculating the loss sensitivity drift values of all samples, iteratively updating the loss sensitivity drift values, and selecting two optimal drift sample sets, one for normal samples and one for adversarial samples; training with the optimal drift sample sets to obtain a loss sensitivity classifier;
(5) inputting the two optimal drift sample sets obtained in step (4) into the deep learning network constructed in step (1) for retraining to obtain a test model; inputting the test sample set obtained in step (2) into the test model for iterative updating, finding the optimal drift noise sample corresponding to each test sample, calculating the loss function sensitivity drift value of the optimal drift noise sample, and inputting the drift value into the loss sensitivity classifier constructed in step (4) for discrimination, completing the detection of the sample.
Further, the image data set is a typical image data set such as MNIST, CIFAR10, ImageNet, GTSRB or CASIA.
Further, the deep learning network is a network such as VGG16, AlexNet, VGG11 or ResNet34.
Further, the adversarial attack method is one of FGSM, Boundary and MIFGSM.
Further, the step (3) is specifically:
3.1) constructing a loss function: setting the cross entropy function as a loss function, wherein the formula is as follows:
$$L = -\frac{1}{N}\sum_{i=1}^{N}\sum_{j=1}^{C} y_{ij}\,\log h_{ij}$$
where C represents the number of classification results; N denotes the total number of training samples; h is the final output of the network; and y_ij indicates whether sample i belongs to class j.
3.2) constructing a loss sensitivity function: the gradient function generated by differentiating the loss function with respect to the sample is used as the target function, and the L2 norm is used to reduce it to a scalar; the function is as follows:
$$Lf = \left\| \frac{\partial L}{\partial x} \right\|_2$$
New noise samples are generated by adding noise of different magnitudes to the training sample set, and the training sample set and the noise samples are simultaneously input into the deep learning model to generate loss function values; the offset between the loss function values of the training sample set and of the noise samples is used as the loss function sensitivity function:
$$Ls = Lf_{noise} - Lf_{nor}$$
where Lf_noise is the loss sensitivity value of the sample after adding noise, and Lf_nor is the loss sensitivity value of the initial sample without added noise.
Further, the step (4) specifically includes the following steps:
(4.1) adding random noise to the input sample, and initializing a particle swarm optimization model;
(4.2) sample loss sensitivity drift value calculation: calculating the loss function sensitivity of the initialized samples with added random noise through the loss function sensitivity function set in step (3), and then taking the difference with the loss function sensitivity of the input sample to obtain the drift values of the loss function sensitivities of different noise samples;
(4.3) updating the historical optimal position and the global optimal position: calculating the loss function sensitivity values of the particle swarm, comparing them with the historical optimal loss function sensitivities, and updating the historical optimal position p_best of each particle and the global optimal position g_best of the particle swarm;
(4.4) updating the velocity v_i and position x_i of each particle;
(4.5) selecting optimal drift samples: continuously updating and optimizing the loss function sensitivity of the input samples to generate noise samples with different loss sensitivities; during the iterative updating of the same input sample, recording the noise sample with the maximum loss function sensitivity corresponding to each sample, namely the optimal drift sample; the optimal drift samples generated by the normal samples and the adversarial samples are gathered to form two optimal drift sample sets, one for normal samples and one for adversarial samples;
(4.6) loss sensitivity classifier construction: recording the drift intervals of the different types of optimal drift samples to form a sample interval data set containing a normal-sample drift interval class and an adversarial-sample drift interval class, and training a binary classifier on this data set so that it can distinguish normal samples from adversarial samples according to loss sensitivity; this binary classifier serves as the loss sensitivity classifier.
Further, the step (5) specifically comprises:
(5.1) sorting the two optimal drift sample sets obtained in step (4) by the magnitude of their drift values, selecting the top-K samples and labels of each class, and inputting them into the deep learning model trained in step (1) for retraining, generating a test model;
(5.2) inputting the test sample set divided in step (2) into the test model to obtain prediction labels; inputting the test samples and the prediction labels into the particle swarm optimization model for iterative updating, finding the optimal drift noise sample corresponding to each test sample, calculating the loss function sensitivity drift value of the optimal drift noise sample, and inputting the drift value into the classifier of step (4) for discrimination, thereby inferring whether the sample is a normal sample or an adversarial sample and completing the detection of the sample.
A second aspect of the embodiments of the present invention provides a deep learning sample testing apparatus based on loss function sensitivity, comprising one or more processors configured to implement the above deep learning sample testing method based on loss function sensitivity.
A third aspect of the embodiments of the present invention provides a computer-readable storage medium on which a program is stored; when executed by a processor, the program implements the above deep learning sample testing method based on loss function sensitivity.
The invention has the beneficial effects that a security check can be performed on an input sample before it is fed to the deep learning model, testing whether the sample is malicious and whether it will disturb the decision strategy of the deep learning model, thereby avoiding serious consequences in practical applications and ensuring effectiveness in ordinary model training and sample testing.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a schematic view of the apparatus of the present invention.
Detailed Description
Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present invention; rather, they are merely examples of apparatus and methods consistent with certain aspects of the invention, as detailed in the appended claims.
The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in this specification and the appended claims, the singular forms "a", "an", and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any and all possible combinations of one or more of the associated listed items.
It is to be understood that although the terms first, second, third, etc. may be used herein to describe various information, the information should not be limited by these terms. These terms are only used to distinguish one type of information from another. For example, first information may also be referred to as second information, and similarly, second information may also be referred to as first information, without departing from the scope of the present invention. The word "if" as used herein may be interpreted as "when" or "upon" or "in response to a determination", depending on the context.
The method and apparatus for testing deep learning samples based on sensitivity of loss function according to the present invention will be described in detail with reference to the accompanying drawings. The features of the following examples and embodiments may be combined with each other without conflict.
Before a sample library is used for model training, a sample identification task needs to be performed on it. Specifically, clean samples of the same type and the corresponding adversarial samples are input into the model as test data sets, the loss sensitivity interval of each type of sample is found, and abnormal samples that fall outside the normal sample interval are removed, which ensures that malicious samples are removed while the main training task still converges. When samples of different attack types are input into the test model, the drift degrees of their loss functions with respect to the optimized model differ, and the type of a sample is inferred in reverse from the difference of the drift intervals. Clean samples X_clean and adversarial samples X_adv are input into the model as data sets, generating the corresponding loss function sensitivities. The generated loss sensitivities are then sorted and optimized with a particle swarm optimization algorithm, the Top-k samples of each class are selected and input into the original model for retraining, and a new discrimination model is generated. Unknown samples to be tested are input into the discrimination model, and the regenerated loss sensitivity is used to discriminate the samples.
Referring to FIG. 1, a deep learning malicious sample detection method based on loss function sensitivity according to an embodiment of the present invention includes the following steps:
1) data set selection:
A typical image data set such as MNIST, CIFAR10, ImageNet, GTSRB or CASIA is selected. The embodiment of the present invention takes the CIFAR10 data set as an example; CIFAR10 contains 60,000 images of size 32 × 32 in 10 classes, with 6,000 images per class.
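As an illustration, the CIFAR10 data set can be loaded with torchvision. This is a minimal sketch; the batch size is an assumed value, not one prescribed by the invention.

```python
# Illustrative sketch: loading the CIFAR10 data set with torchvision.
# The batch size is an assumption for illustration only.
import torch
import torchvision
import torchvision.transforms as transforms

transform = transforms.ToTensor()  # scales pixel values to [0, 1]

train_set = torchvision.datasets.CIFAR10(root="./data", train=True,
                                         download=True, transform=transform)
test_set = torchvision.datasets.CIFAR10(root="./data", train=False,
                                        download=True, transform=transform)

train_loader = torch.utils.data.DataLoader(train_set, batch_size=128, shuffle=True)
test_loader = torch.utils.data.DataLoader(test_set, batch_size=128, shuffle=False)
```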
2) Sample attack preparation:
2.1) Selecting a deep learning network: networks such as VGG16, AlexNet, VGG11 and ResNet34 can be selected; the embodiment of the present invention takes the VGG16 network as an example.
2.3) Selecting an adversarial attack method to generate adversarial samples: attacks such as FGSM, Boundary and MIFGSM can be selected. The embodiment of the invention takes FGSM and Boundary as examples, where FGSM is a white-box attack and Boundary is a black-box attack. Both are untargeted attacks, i.e. random perturbations are added and only the original label needs to be changed.
2.4) Sample screening and combination: the embodiment of the invention screens the original clean samples and the adversarial samples, first eliminating samples that the model cannot classify correctly and adversarial samples whose attack was unsuccessful. The two screened sample types are then kept as separate classes to form a total sample set. From each class in the total sample set, 4000 samples are selected to form a training sample set. Finally, 1000 samples per class, different from the training samples, are selected from the total sample set to form a test sample set for the final performance test. A minimal sketch of the attack and screening steps is given below.
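The following sketch illustrates FGSM generation and the screening step under the setup above. The perturbation budget epsilon and the function names fgsm and screen are illustrative assumptions; the Boundary attack is omitted for brevity.

```python
# Illustrative sketch of FGSM adversarial sample generation and screening:
# clean samples the model misclassifies and adversarial samples whose attack
# failed are discarded. epsilon is an assumed perturbation budget.
import torch
import torch.nn.functional as F

def fgsm(model, x, y, epsilon=8 / 255):
    """One-step FGSM: perturb x along the sign of the input gradient."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    loss.backward()
    x_adv = x + epsilon * x.grad.sign()
    return x_adv.clamp(0.0, 1.0).detach()

def screen(model, x, y, x_adv):
    """Keep pairs where the clean sample is classified correctly and the
    adversarial sample successfully changes the prediction."""
    with torch.no_grad():
        clean_ok = model(x).argmax(1) == y        # model is right on clean input
        attack_ok = model(x_adv).argmax(1) != y   # attack flips the label
    keep = clean_ok & attack_ok
    return x[keep], y[keep], x_adv[keep]
```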
3) Setting a loss function sensitivity function:
3.1) constructing the loss function: ideally, there is a clean data set
$$D = \{(x_i, y_i)\}_{i=1}^{N}$$
where x_i is a point in the feature space and y_i is its class label. The function of the classifier is to map the feature space to class labels, f: X → R^C. In the embodiment of the present invention, the output layer of the DNN model is set as the softmax layer, and the cross entropy function is set as the loss function, so the loss function is:
$$L = -\frac{1}{N}\sum_{i=1}^{N}\sum_{j=1}^{C} y_{ij}\,\log h_{ij}$$
wherein C represents the number of classification results; n represents the total number of training samples and h is the final output of the network.
3.2) constructing the loss sensitivity function: first, in order to further improve the sensitivity of the loss function to the sample, the gradient function generated by differentiating the loss function with respect to the sample is used as the target function, and it is reduced to a scalar with the L2 norm; the function Lf is as follows:
$$Lf = \left\| \frac{\partial L}{\partial x} \right\|_2$$
New noise samples are generated by adding noise of different magnitudes to the training sample set, and the training sample set and the noise samples are simultaneously input into the target model to generate loss function values. The offset between the loss function values of the training sample set and of the noise samples is used as the loss function sensitivity function Ls:
$$Ls = Lf_{noise} - Lf_{nor}$$
where Lf_noise is the loss sensitivity value of the sample after adding noise, and Lf_nor is the loss sensitivity value of the initial sample without added noise.
When noise is added to the sample, noise addition along the gradient direction is selected, and the noise addition process is as follows:
$$x' = x + s \cdot grad$$
where s is the iteration step size.
When changing the pixel value of the image, the following requirements are satisfied:
$$\|x' - x\|_0 \le \alpha \cdot \mathrm{size}(x), \qquad \|x' - x\|_\infty \le \beta$$
where x represents the original image; x' represents the image after changing pixel values; the L0 norm bounds the maximum number of altered pixels; the L∞ norm bounds the maximum change of any pixel value; size(x) is the number of pixels in image x; and 0 < α, β < 1.
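The following minimal sketch shows one way to compute the loss sensitivity Lf and the drift Ls for a batch of samples. The step size s is an assumed value, and the clamp to [0, 1] assumes pixel values normalized to that range.

```python
# Illustrative sketch of the loss sensitivity Lf (L2 norm of the input
# gradient of the loss) and the sensitivity drift Ls between a noise sample
# and the original sample. The step size s is an assumption.
import torch
import torch.nn.functional as F

def loss_sensitivity(model, x, y):
    """Lf(x) = || dL/dx ||_2 per sample, for the cross-entropy loss.
    reduction='sum' keeps each sample's gradient unscaled by the batch size."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y, reduction="sum")
    grad = torch.autograd.grad(loss, x)[0]
    return grad.flatten(1).norm(p=2, dim=1), grad

def sensitivity_drift(model, x, y, s=0.01):
    """Ls = Lf_noise - Lf_nor, adding noise along the gradient direction."""
    lf_nor, grad = loss_sensitivity(model, x, y)
    x_noise = (x + s * grad).clamp(0.0, 1.0)   # x' = x + s * grad
    lf_noise, _ = loss_sensitivity(model, x_noise, y)
    return lf_noise - lf_nor
```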
4) Particle swarm optimization (PSO) model building
4.1) initialization of particle swarm optimization model
In the embodiment of the invention, when each sample is input into the particle swarm optimization model, 20 initialization samples with different added random noise are constructed, and each initialization sample is taken as a particle, giving the 20 particles of the PSO model. The RGB values of all pixel points of each input sample are used as the position matrix x_i of a particle, and the variation matrix of the RGB values is used as its velocity matrix v_i. The model also maintains the current iteration number g, the maximum iteration number G_k, the current inertia weight factor ω_g, the historical optimal position p_best of the i-th particle, and the global optimal position g_best found by the particle swarm.
4.2) sample loss sensitivity value calculation
The loss function sensitivities of the 20 initialization samples with added random noise are calculated through the loss function sensitivity function set in step 3), and the drift values of the loss function sensitivities of the different noise samples are then obtained by taking the difference with the loss function sensitivity of the input sample.
4.3) Calculating the loss function sensitivity values of the particle swarm, comparing them with the historical optimal loss function sensitivities, and updating the historical optimal position p_best of each particle and the global optimal position g_best of the particle swarm.
4.4) Updating the velocity v_i and position x_i of the particle swarm. An inertia factor is adopted in the update of the particle velocities and positions: a larger inertia factor gives stronger global search ability, and a smaller one gives stronger local search ability. The calculation formulas are as follows:
$$\omega(g) = (\omega_{ini} - \omega_{end}) \cdot \frac{G_k - g}{G_k} + \omega_{end}$$
$$v_i = \omega(g) \cdot v_i + c_1 \cdot rand() \cdot (p_{best,i} - x_i) + c_2 \cdot rand() \cdot (g_{best,i} - x_i)$$
$$x_i = x_i + v_i$$
where ω_ini is the initial weight factor value, ω_end is the final weight factor value, c_1 and c_2 are the initialized learning factors, and rand() is a system-generated random number in (0, 1). A minimal sketch of this update is given below.
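The sketch below implements the update rule of 4.4) directly. The inertia weights and learning factors are common default values, assumed here for illustration.

```python
# Illustrative sketch of the PSO update of 4.4): linearly decaying inertia
# weight, then velocity and position updates. Hyper-parameter values are
# assumptions for illustration.
import numpy as np

def inertia(g, G_k, w_ini=0.9, w_end=0.4):
    """omega(g) = (w_ini - w_end) * (G_k - g) / G_k + w_end"""
    return (w_ini - w_end) * (G_k - g) / G_k + w_end

def pso_step(x, v, p_best, g_best, g, G_k, c1=2.0, c2=2.0):
    """One velocity/position update for all particles.
    x, v, p_best: arrays of shape (n_particles, dim); g_best: shape (dim,)."""
    w = inertia(g, G_k)
    r1 = np.random.rand(*x.shape)
    r2 = np.random.rand(*x.shape)
    v = w * v + c1 * r1 * (p_best - x) + c2 * r2 * (g_best - x)
    x = x + v
    return x, v
```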
4.5) Optimal drift sample set selection
The loss function sensitivity of the input samples is continuously updated and optimized in step 4.4) to generate noise samples with different loss sensitivities; during the iterative updating of the same input sample, the noise sample with the maximum loss function sensitivity corresponding to each sample, namely the optimal drift sample, is recorded. The optimal drift samples generated by the normal samples and the adversarial samples are collected to form two optimal drift sample sets, one for normal samples and one for adversarial samples.
4.6) loss sensitivity classifier construction
The drift intervals of the different types of optimal drift samples are recorded to form a sample interval data set containing a normal-sample drift interval class and an adversarial-sample drift interval class, and a binary classifier is trained on this data set so that it can distinguish normal samples from adversarial samples according to loss sensitivity. This binary classifier serves as the loss sensitivity classifier; a minimal sketch follows.
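In the sketch below, logistic regression stands in for the binary classifier, which the description does not pin down, and the drift arrays are placeholder data; labels are 0 for normal and 1 for adversarial.

```python
# Illustrative sketch of the loss sensitivity classifier of 4.6): a binary
# classifier trained on optimal drift values. Logistic regression is one
# possible choice; the invention does not prescribe a specific classifier.
import numpy as np
from sklearn.linear_model import LogisticRegression

# Placeholder drift values standing in for the outputs of step 4.5).
drift_normal = np.random.rand(4000)        # assumed data for illustration
drift_adv = 1.0 + np.random.rand(4000)     # assumed data for illustration

X = np.concatenate([drift_normal, drift_adv]).reshape(-1, 1)
y = np.concatenate([np.zeros_like(drift_normal), np.ones_like(drift_adv)])

clf = LogisticRegression().fit(X, y)       # the loss sensitivity classifier

# At test time, a sample's optimal drift value is classified directly:
print(clf.predict(np.array([[0.3]])))      # 0 -> normal, 1 -> adversarial
```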
5) Sample testing
5.1) model retraining
In order to increase the loss function discrimination between the two types of samples, the two optimal drift sample sets of 4.5) are sorted by the magnitude of their drift values, the top-K samples and labels of each class are selected, and they are fed into the deep learning model trained in step 1) (the VGG16 model in this embodiment) for retraining, generating a test model for the subsequent test data set.
5.2) testing of samples
Inputting each sample of the test sample set of 2.4) into the test model of 5.1) yields a prediction label. The test sample and the prediction label are input into the particle swarm optimization model (built in step 4) for iterative updating, the optimal drift noise sample corresponding to the test sample is found, the loss function sensitivity drift value of this optimal drift noise sample is calculated, and the drift value is input into the classifier constructed in 4.6) for discrimination, thereby inferring whether the sample is a normal sample or an adversarial sample and completing the detection of the sample. A minimal end-to-end sketch is given below.
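In this sketch, top_k_drift and detect are illustrative names, pso_optimal_drift is an assumed helper standing in for the PSO search of step 4 (not shown), clf is the classifier from the previous sketch, and K is an assumed value.

```python
# Illustrative end-to-end sketch of 5.1)-5.2): select the top-K largest-drift
# samples per class for retraining, then detect a test sample by computing
# its optimal drift value and classifying it.
import torch

def top_k_drift(samples, labels, drifts, K=500):
    """5.1): keep the K samples with the largest drift values per set."""
    idx = torch.argsort(drifts, descending=True)[:K]
    return samples[idx], labels[idx]

def detect(test_model, clf, x):
    """5.2): predict a label, search the optimal drift noise sample with the
    PSO model of step 4, and classify the resulting drift value."""
    with torch.no_grad():
        y_pred = test_model(x).argmax(1)               # prediction label
    drift = pso_optimal_drift(test_model, x, y_pred)   # assumed PSO search helper
    drift = drift.detach().cpu().numpy().reshape(-1, 1)
    return clf.predict(drift)                          # 0 = normal, 1 = adversarial
```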
Corresponding to the embodiment of the deep learning sample testing method based on the loss function sensitivity, the invention also provides an embodiment of a deep learning sample testing device based on the loss function sensitivity.
Referring to fig. 2, an embodiment of the present invention provides a deep learning sample testing apparatus based on a sensitivity of a loss function, which includes one or more processors, and is configured to implement the deep learning sample testing method based on a sensitivity of a loss function in the foregoing embodiment.
The embodiment of the deep learning sample testing apparatus based on loss function sensitivity can be applied to any device with data processing capability, such as a computer. The apparatus embodiment may be implemented by software, by hardware, or by a combination of both. Taking a software implementation as an example, the apparatus is formed, as a logical device, by the processor of the device reading the corresponding computer program instructions from non-volatile memory into memory and running them. In terms of hardware, FIG. 2 shows a hardware structure diagram of a device with data processing capability on which the deep learning sample testing apparatus based on loss function sensitivity is located; besides the processor, memory, network interface and non-volatile memory shown in FIG. 2, the device may also include other hardware according to its actual function, which is not described again here.
The implementation process of the functions and actions of each unit in the above device is specifically described in the implementation process of the corresponding step in the above method, and is not described herein again.
For the device embodiments, since they substantially correspond to the method embodiments, reference may be made to the partial description of the method embodiments for relevant points. The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the scheme of the invention. One of ordinary skill in the art can understand and implement it without inventive effort.
An embodiment of the present invention further provides a computer-readable storage medium, on which a program is stored, where the program, when executed by a processor, implements the deep learning sample testing method based on the sensitivity of the loss function in the foregoing embodiments.
The computer-readable storage medium may be an internal storage unit of any device with data processing capability described in the foregoing embodiments, such as a hard disk or a memory. It may also be an external storage device of such a device, such as a plug-in hard disk, a Smart Media Card (SMC), an SD card or a Flash memory card (Flash Card) provided on the device. Further, the computer-readable storage medium may include both an internal storage unit and an external storage device of a device with data processing capability. The computer-readable storage medium is used to store the computer program and the other programs and data required by the device, and may also be used to temporarily store data that has been or will be output.
The above embodiments are only used for illustrating the design idea and features of the present invention, and the purpose of the present invention is to enable those skilled in the art to understand the content of the present invention and implement the present invention accordingly, and the protection scope of the present invention is not limited to the above embodiments. Therefore, all equivalent changes and modifications made in accordance with the principles and concepts disclosed herein are intended to be included within the scope of the present invention.

Claims (9)

1. A deep learning sample testing method based on loss function sensitivity is characterized by comprising the following steps:
(1) acquiring an image data set as a clean sample, and constructing a deep learning network;
(2) selecting an adversarial attack method, and performing an adversarial attack on the image data set obtained in step (1) to obtain adversarial samples; screening, and taking the screened clean samples and adversarial samples as a total sample set; dividing the total sample set into a training sample set and a test sample set;
(3) setting a loss function sensitivity function based on the loss function and the training sample set;
(4) establishing a particle swarm optimization model, calculating the loss sensitivity drift values of all samples, iteratively updating the loss sensitivity drift values, and selecting two optimal drift sample sets, one for normal samples and one for adversarial samples; training with the optimal drift sample sets to obtain a loss sensitivity classifier;
(5) inputting the two optimal drift sample sets obtained in step (4) into the deep learning network constructed in step (1) for retraining to obtain a test model; inputting the test sample set obtained in step (2) into the test model for iterative updating, finding the optimal drift noise sample corresponding to each test sample, calculating the loss function sensitivity drift value of the optimal drift noise sample, and inputting the drift value into the loss sensitivity classifier constructed in step (4) for discrimination, completing the detection of the sample.
2. The method for testing deep learning samples based on loss function sensitivity as claimed in claim 1, wherein the image data set is a typical image data set such as MNIST, CIFAR10, ImageNet, GTSRB or CASIA.
3. The method for testing deep learning samples based on loss function sensitivity as claimed in claim 1, wherein the deep learning network is a network such as VGG16, AlexNet, VGG11 or ResNet34.
4. The method for testing deep learning samples based on loss function sensitivity as claimed in claim 1, wherein the adversarial attack method is one of FGSM, Boundary and MIFGSM.
5. The method for testing deep learning samples based on sensitivity to loss functions as claimed in claim 1, wherein the step (3) is specifically:
3.1) constructing a loss function: setting the cross entropy function as a loss function, and the formula is as follows:
$$L = -\frac{1}{N}\sum_{i=1}^{N}\sum_{j=1}^{C} y_{ij}\,\log h_{ij}$$
where C represents the number of classification results; N denotes the total number of training samples; h is the final output of the network; and y_ij indicates whether sample i belongs to class j.
3.2) constructing a loss sensitivity function: the gradient function generated by differentiating the loss function with respect to the sample is used as the target function, and the L2 norm is used to reduce it to a scalar; the function is as follows:
$$Lf = \left\| \frac{\partial L}{\partial x} \right\|_2$$
New noise samples are generated by adding noise of different magnitudes to the training sample set, and the training sample set and the noise samples are simultaneously input into the deep learning model to generate loss function values; the offset between the loss function values of the training sample set and of the noise samples is used as the loss function sensitivity function:
$$Ls = Lf_{noise} - Lf_{nor}$$
where Lf_noise is the loss sensitivity value of the sample after adding noise, and Lf_nor is the loss sensitivity value of the initial sample without added noise.
6. The deep learning sample testing method based on loss function sensitivity as claimed in claim 1, wherein the step (4) comprises the following steps:
(4.1) adding random noise to the input sample, and initializing a particle swarm optimization model;
(4.2) sample loss sensitivity drift value calculation: calculating the loss function sensitivity of the initialized samples with added random noise through the loss function sensitivity function set in step (3), and then taking the difference with the loss function sensitivity of the input sample to obtain the drift values of the loss function sensitivities of different noise samples;
(4.3) updating the historical optimal position and the global optimal position: calculating the loss function sensitivity values of the particle swarm, comparing them with the historical optimal loss function sensitivities, and updating the historical optimal position p_best of each particle and the global optimal position g_best of the particle swarm;
(4.4) updating the velocity v_i and position x_i of each particle;
(4.5) selecting optimal drift samples: continuously updating and optimizing the loss function sensitivity of the input samples to generate noise samples with different loss sensitivities; during the iterative updating of the same input sample, recording the noise sample with the maximum loss function sensitivity corresponding to each sample, namely the optimal drift sample; the optimal drift samples generated by the normal samples and the adversarial samples are collected to form two optimal drift sample sets, one for normal samples and one for adversarial samples;
(4.6) loss sensitivity classifier construction: recording the drift intervals of the different types of optimal drift samples to form a sample interval data set containing a normal-sample drift interval class and an adversarial-sample drift interval class, and training a binary classifier on this data set so that it can distinguish normal samples from adversarial samples according to loss sensitivity; this binary classifier serves as the loss sensitivity classifier.
7. The method for testing deep learning samples based on sensitivity to loss function as claimed in claim 1, wherein the step (5) is specifically as follows:
(5.1) sorting the two optimal drift sample sets obtained in step (4) by the magnitude of their drift values, selecting the top-K samples and labels of each class, and inputting them into the deep learning model trained in step (1) for retraining, generating a test model;
(5.2) inputting the test sample set divided in step (2) into the test model to obtain prediction labels; inputting the test samples and the prediction labels into the particle swarm optimization model for iterative updating, finding the optimal drift noise sample corresponding to each test sample, calculating the loss function sensitivity drift value of the optimal drift noise sample, and inputting the drift value into step (4) for discrimination, thereby inferring whether the sample is a normal sample or an adversarial sample and completing the detection of the sample.
8. A deep learning sample testing device based on loss function sensitivity, comprising one or more processors for implementing the deep learning sample testing method based on loss function sensitivity according to any one of claims 1 to 7.
9. A computer-readable storage medium, on which a program is stored, which, when being executed by a processor, is configured to carry out the method for testing a deep learning sample based on sensitivity to a loss function of any one of claims 1 to 7.
CN202210173151.1A 2022-02-24 2022-02-24 Deep learning sample testing method and device based on loss function sensitivity Pending CN114707572A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210173151.1A CN114707572A (en) 2022-02-24 2022-02-24 Deep learning sample testing method and device based on loss function sensitivity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210173151.1A CN114707572A (en) 2022-02-24 2022-02-24 Deep learning sample testing method and device based on loss function sensitivity

Publications (1)

Publication Number Publication Date
CN114707572A (en) 2022-07-05

Family

ID=82166950

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210173151.1A Pending CN114707572A (en) 2022-02-24 2022-02-24 Deep learning sample testing method and device based on loss function sensitivity

Country Status (1)

Country Link
CN (1) CN114707572A (en)


Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116032557A (en) * 2022-12-09 2023-04-28 清华大学 Method and device for updating deep learning model in network security anomaly detection
CN116032557B (en) * 2022-12-09 2024-07-02 清华大学 Method and device for updating deep learning model in network security anomaly detection
CN115797711A (en) * 2023-02-20 2023-03-14 泉州装备制造研究所 Confrontation sample improved classification method based on reconstruction model
CN115797711B (en) * 2023-02-20 2023-04-21 泉州装备制造研究所 Improved classification method for countermeasure sample based on reconstruction model
WO2024178581A1 (en) * 2023-02-28 2024-09-06 华为技术有限公司 Data processing method and apparatus, and storage medium and program product


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination