CN111950635B - Robust feature learning method based on layered feature alignment - Google Patents

Robust feature learning method based on layered feature alignment

Info

Publication number
CN111950635B
CN111950635B (application CN202010809932.6A)
Authority
CN
China
Prior art keywords
features
feature
sample
neural network
different
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010809932.6A
Other languages
Chinese (zh)
Other versions
CN111950635A (en)
Inventor
张笑钦
王金鑫
赵丽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wenzhou University
Original Assignee
Wenzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wenzhou University filed Critical Wenzhou University
Priority to CN202010809932.6A priority Critical patent/CN111950635B/en
Publication of CN111950635A publication Critical patent/CN111950635A/en
Application granted granted Critical
Publication of CN111950635B publication Critical patent/CN111950635B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/24 Classification techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition
    • G06F 18/20 Analysing
    • G06F 18/22 Matching criteria, e.g. proximity measures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/04 Architecture, e.g. interconnection topology
    • G06N 3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 Computing arrangements based on biological models
    • G06N 3/02 Neural networks
    • G06N 3/08 Learning methods
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V 10/00 Arrangements for image or video recognition or understanding
    • G06V 10/40 Extraction of image or video features
    • G06V 10/46 Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V 10/462 Salient features, e.g. scale invariant feature transforms [SIFT]
    • G06V 10/464 Salient features, e.g. scale invariant feature transforms [SIFT] using a plurality of salient features, e.g. bag-of-words [BoW] representations
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02T CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T 10/00 Road transport of goods or passengers
    • Y02T 10/10 Internal combustion engine [ICE] based vehicles
    • Y02T 10/40 Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Multimedia (AREA)
  • Medical Informatics (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a robust feature learning method based on layered feature alignment, which comprises the following steps: using a deep convolutional neural network to hierarchically extract depth features from input samples in different domains; for the extracted layered features, constraining the channel and spatial relations of the features through a graph convolutional neural network so that the model learns richer feature representations; using the Wasserstein distance based on optimal transport theory to accurately measure the difference between the sample feature representations of different domains; and using the differences between the layered features extracted from different-domain samples as part of the model loss function to help the model learn more robust features, thereby improving the adversarial robustness of the deep neural network model. According to the technical scheme, the deep network model can learn robust features and avoid being compromised by adversarial attack methods, so that a safe and reliable deep learning system is obtained.

Description

Robust feature learning method based on layered feature alignment
Technical Field
The invention relates to the technical field of robust machine learning, in particular to a robust feature learning method based on layered feature alignment.
Background
In recent years, deep convolutional neural networks have made breakthroughs in many computer vision tasks such as image classification and object detection. However, researchers have found that these deep convolutional neural networks are susceptible to being fooled by specially crafted adversarial samples whose perturbations are not readily perceived by the human eye. These adversarial samples, generated by adversarial attack methods, pose serious challenges for systems with high requirements for safety and stability, including autonomous driving systems, medical diagnostic systems, security systems, and the like. In addition, if a deep network model changes its prediction with high confidence when given a slightly perturbed sample as input, it can be concluded that the model has learned neither the task-relevant intrinsic properties of the input samples nor robust visual concepts from them. Thus, designing a deep network model that is sufficiently robust to adversarial perturbations is critical for safe and reliable computer vision applications.
In recent research, a variety of adversarial defense mechanisms have been proposed to counter different adversarial attack methods. These defense mechanisms can be roughly divided into two categories. The first category mainly applies various pre-processing operations to the input image to counter adversarial attacks. Dziugaite et al. and Das et al. use JPEG image compression as an adversarial defense; these methods use discrete Fourier transforms in the input image domain to suppress adversarial noise. However, the methods based on JPEG image compression have not fully achieved the goal of removing adversarial noise. Making full use of the strong representation capability of generative adversarial networks, Samangouei et al. proposed Defense-GAN to defend against various adversarial attacks; this method removes adversarial noise by regenerating an image sufficiently similar to the input image. Mustafa et al. proposed using image super-resolution as a defense: with a deep super-resolution network as a mapping function, the method maps samples from the adversarial domain back to the normal domain, thereby removing adversarial noise, and finally feeds the mapped images into an image recognition system for normal recognition. The second category of adversarial defense improves the robustness of the model mainly by modifying the training process or the network structure so that it can handle adversarial perturbations. Adversarial training is an effective means of improving model robustness by adding specially crafted adversarial samples to the training data. Goodfellow et al. train the network model by augmenting the clean samples with adversarial samples generated using the FGSM (Fast Gradient Sign Method) attack. Madry et al. used min-max optimization to conduct adversarial training, using the PGD (Projected Gradient Descent) attack method to generate adversarial samples. Ensemble adversarial training is another adversarial defense method, which uses adversarial samples generated from a variety of different deep networks as training data to optimize model parameters. In addition, Song et al. used a domain-adaptation approach to train the network model in order to improve the generalization ability of the deep model to adversarial samples.
Although the above methods have made good progress in improving the adversarial robustness of deep convolutional neural networks, limited by the poor generalization performance of the models, they often fail to achieve satisfactory results against different kinds of white-box attack methods.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a robust feature learning method based on layered feature alignment, which enables a model based on a deep convolutional neural network to learn more robust image features through a layered feature alignment operation, thereby solving the problem in the prior art that the model generalizes poorly to adversarial samples from different domains, and providing effective reliability and safety guarantees for the deployment and application of deep model systems.
In order to achieve the above purpose, the present invention provides the following technical solution: a robust feature learning method based on hierarchical feature alignment, comprising the following steps:
(1) Extracting depth features at different layers from samples in different domains using a deep convolutional neural network;
(2) For the extracted layered features, constraining the channel and spatial relations of the features through a graph convolutional neural network, so that the model learns richer feature representations;
(3) Using the Wasserstein distance based on optimal transport theory to accurately measure the difference between sample feature representations in different domains;
(4) Using the differences between layered features extracted from different-domain samples as part of the model loss function to help the model learn more robust features, thereby improving the adversarial robustness of the deep neural network model.
Preferably, the image samples of different domains include normal-domain image samples and adversarial-domain image samples.
Preferably, in step (1), a ResNet-110 network structure is used to extract features from the image; the ResNet-110 network structure is divided into 4 different structural levels, and after a normal sample or an adversarial sample is input, image features of different scales and different degrees of abstraction are extracted by the convolutional structure at these 4 structural levels during the forward pass of the network.
Preferably, in step (2), a graph convolution operation is performed using two one-dimensional convolutions, the graph convolution operation being formulated as follows:
formula (1):
GCN(f)=Conv1D[Conv1D(f)]
in formula (1), GCN(·) denotes the graph convolutional neural network and f denotes the feature vector after dimensionality reduction, which is the input of the graph convolution operation; Conv1D(·) denotes a one-dimensional convolution operation, and feature extraction is performed using two one-dimensional graph convolution operations in different directions, which, after sufficient end-to-end training, enhance the representation of the relationships between different regions in the feature.
Preferably, in step (3), X denotes the features at a certain layer extracted from normal-domain samples using the deep neural network, and Y denotes the features at the same layer extracted from adversarial-domain samples using the same deep neural network; the optimal transport distance between the two feature distributions X and Y is formulated as follows:
formula (2):
W_c(P_X, P_Y) := inf_{Γ ∈ P(X~P_X, Y~P_Y)} E_{(X,Y)~Γ}[c(X, Y)]
Equation (2) is the definition of the Wasserstein distance (earth mover's distance). The symbol := indicates a definition, assigning the calculation result on the right to the notation on the left; P_X and P_Y denote the marginal distributions of the features X and Y respectively, P(X~P_X, Y~P_Y) denotes the set of joint distributions of X and Y, and c(X, Y) is any measurable cost function that measures the distance between X and Y; E_{(X,Y)~Γ} denotes the mathematical expectation under the joint distribution Γ, and inf denotes the infimum of this expectation. Therefore, W_c(P_X, P_Y) is defined as follows: given the measurable cost function c and taking the marginal distributions P_X and P_Y as input, among all transport plans the one with the smallest X-to-Y distance is called the optimal transport plan, and the calculated distance value is the required optimal transport distance.
Preferably, step (4) specifically includes: for the feature representations hierarchically extracted from the normal-domain image samples and the adversarial-domain image samples, after processing with the graph convolution, the differences between them are calculated using the Wasserstein distance; the Wasserstein distances between the adversarial-sample feature representations and the normal-sample feature representations at the different levels are added to the final loss function used to optimize the network parameters, and through sufficient end-to-end training the network model gradually learns a more robust feature representation by means of feature alignment;
the final loss function is shown below:
equation (3):
min_θ  L_CE(F(x_clean; θ), y_true) + L_CE(F(x_adv; θ), y_true) + λ Σ_l W_c( GCN(LC(f_l^clean)), GCN(LC(f_l^adv)) )
In formula (3), F denotes the deep neural network used for image classification, and θ denotes the parameters of the deep neural network, learned during end-to-end training of the network; L_CE denotes the cross-entropy loss function, and the cross-entropy losses of the normal samples and the corresponding adversarial samples are calculated simultaneously, so that the network can successfully classify both normal samples and adversarial samples; x_clean denotes a normal sample, x_adv denotes an adversarial sample, and y_true denotes the correct label of the data; f_l^clean and f_l^adv denote the image feature representations extracted from the normal sample and the adversarial sample at the l-th level of the deep neural network F, respectively, with l = 1, 2 or l = 1, 2, 3, 4; LC denotes the linear combination of features; λ denotes the relative weight between the multiple loss terms. When training the model with the training set, the final loss function shown in formula (3) is used to calculate the classification errors and the differences between sample features of different domains, and a stochastic gradient descent algorithm is then used to optimize the model parameters of the network according to these errors, finally finding the optimal model parameters.
The invention has the following advantages: compared with the prior art, the invention provides a novel hierarchical feature alignment method from the perspective of domain adaptation, so that the deep convolutional neural network can learn robust feature representations from adversarial samples; and, in order to better let the model learn robust feature representations while the similarity between adversarial-sample features and normal-sample features is progressively improved through the model network structure, the invention provides a Wasserstein distance method based on optimal transport theory to measure the difference between adversarial-sample features and normal-sample features.
The method provided by the invention can effectively improve the generalization ability of the deep-convolutional-neural-network-based model to samples from different adversarial domains, and provides an effective defense even against white-box attacks that are difficult for existing methods to handle;
Through the layered feature alignment operation, the deep-convolutional-neural-network-based model can learn more robust image features, thereby solving the problem in the prior art that the model generalizes poorly to adversarial samples from different domains, and providing effective reliability and safety guarantees for the deployment and application of deep model systems.
The invention is further described below with reference to the drawings and specific examples.
Drawings
FIG. 1 is a flow chart of an embodiment of the present invention;
FIG. 2 is a schematic diagram of the adversarial defense model structure according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of the robust feature learning process on adversarial samples according to an embodiment of the present invention;
FIG. 4 is a schematic visualization of the decision space of normal samples and adversarial samples, respectively, on three exemplary classification datasets according to an embodiment of the present invention.
Detailed Description
Referring to figs. 1, 2 and 3, the robust feature learning method based on hierarchical feature alignment disclosed by the invention comprises the following steps:
(1) Extracting depth features at different layers from samples in different domains using a deep convolutional neural network;
(2) For the extracted layered features, constraining the channel and spatial relations of the features through a graph convolutional neural network, so that the model learns richer feature representations;
(3) Using the Wasserstein distance based on optimal transport theory to accurately measure the difference between sample feature representations in different domains;
(4) Using the differences between layered features extracted from different-domain samples as part of the model loss function to help the model learn more robust features, thereby improving the adversarial robustness of the deep neural network model.
In step (1), the specific process is to use the PGD (Projected Gradient Descent) attack method to generate, from the image samples in the normal domain, corresponding adversarial-domain image samples with different degrees of perturbation. For the normal-domain image samples and the adversarial-domain image samples, the method uses a deep convolutional neural network to extract multi-level features of the images. To make the extracted features more representative, we divide the deep network into multiple levels according to its structure. Since the present invention proposes a framework, we use different "levels" to refer to these partitioned structural stages used for feature extraction, rather than explicitly specifying particular network layers; the structural partitioning criterion proposed by the present invention is shown in fig. 2.
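As an illustration of this step, the following is a minimal PyTorch-style sketch of PGD adversarial sample generation under an L∞ constraint; the function name, step size alpha, perturbation budget epsilon, and number of iterations are illustrative assumptions and are not values specified by the patent.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x_clean, y_true, epsilon=8/255, alpha=2/255, num_steps=10):
    """Generate adversarial-domain samples from normal-domain samples with PGD
    (illustrative sketch; the hyper-parameters are assumptions, not from the patent)."""
    # Start from a random point inside the epsilon ball around the clean sample.
    x_adv = x_clean + torch.empty_like(x_clean).uniform_(-epsilon, epsilon)
    x_adv = torch.clamp(x_adv, 0.0, 1.0).detach()

    for _ in range(num_steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y_true)
        grad = torch.autograd.grad(loss, x_adv)[0]
        # Ascend along the sign of the gradient, then project back into the epsilon ball.
        x_adv = x_adv.detach() + alpha * grad.sign()
        x_adv = torch.min(torch.max(x_adv, x_clean - epsilon), x_clean + epsilon)
        x_adv = torch.clamp(x_adv, 0.0, 1.0)

    return x_adv.detach()
```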
Taking the structure shown in fig. 2 as an example, we use a ResNet-110 network structure ("ResNet-110" here refers to a residual network with 110 layers) for feature extraction from images, and we divide it into 4 different structural levels. When forward inference is performed on the network after a normal sample or an adversarial sample is input, image features of different scales and different degrees of abstraction are extracted by the convolutional structure at the 4 different levels.
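A minimal sketch of such hierarchical (multi-level) feature extraction is given below; how ResNet-110 is grouped into the 4 levels of fig. 2 is not spelled out here, so the `stages` argument and its grouping are illustrative assumptions.

```python
import torch.nn as nn

class HierarchicalFeatureExtractor(nn.Module):
    """Returns the intermediate feature maps of 4 structural levels of a backbone
    (a sketch; the actual ResNet-110 partitioning follows fig. 2 of the patent)."""
    def __init__(self, stages):
        super().__init__()
        # `stages` is a list of 4 sub-modules, each one structural level of the backbone.
        self.stages = nn.ModuleList(stages)

    def forward(self, x):
        features = []
        out = x
        for stage in self.stages:
            out = stage(out)      # forward pass through one structural level
            features.append(out)  # feature map at this level, used later for alignment
        return features           # [f_1, f_2, f_3, f_4]
```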
In order to better perform the feature alignment operation during model training, the invention proposes to first train the model on clean samples, and then to train the model with the hierarchical-feature-alignment-based robust feature learning process using the clean samples together with the corresponding adversarial samples.
In step (2), image features are extracted at different levels using the deep convolutional neural network, and the extracted features at the different levels are then processed with the graph convolutional neural network so that the network learns richer image feature representations. Graph convolution can better capture the relationships between different regions of the depth features from a global perspective, and at the same time imposes a stronger constraint on the features during the subsequent feature alignment. Since the distance between feature vectors is computed when calculating the optimal transport Wasserstein distance, the tensor-form image features are turned into feature vectors; and, to speed up the computation of the distance metric, a series of feature selection and dimensionality reduction operations is applied.
To reduce the complexity of dimensionality reduction, we use a representative linear combination (LC) of features to process the extracted features along both the channel and feature-node dimensions.
After dimensionality reduction, we perform a graph convolution operation using two one-dimensional convolutions, which can be formulated as follows:
formula (1):
GCN(f)=Conv1D[Conv1D(f)]
In formula (1), GCN(·) denotes the graph convolutional neural network and f denotes the feature vector after dimensionality reduction, which is the input of the graph convolution operation; Conv1D(·) denotes a one-dimensional convolution operation, and here we use two one-dimensional graph convolution operations in different directions for feature extraction. After sufficient end-to-end training, the graph convolution operation can enhance the representational capacity for the relationships between different regions in the feature.
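A minimal sketch of this graph convolution block, built from two one-dimensional convolutions applied in different directions as in formula (1), is shown below; the channel and node sizes, the kernel size, and the use of a transpose to switch the convolution direction are illustrative assumptions.

```python
import torch
import torch.nn as nn

class GraphConvBlock(nn.Module):
    """GCN(f) = Conv1D[Conv1D(f)]: two 1-D convolutions in different directions
    (a sketch of formula (1); the dimensions are illustrative assumptions)."""
    def __init__(self, num_nodes, num_channels):
        super().__init__()
        # First 1-D convolution mixes information across the node dimension.
        self.conv_nodes = nn.Conv1d(num_nodes, num_nodes, kernel_size=1)
        # Second 1-D convolution mixes information across the channel dimension.
        self.conv_channels = nn.Conv1d(num_channels, num_channels, kernel_size=1)

    def forward(self, f):
        # f: (batch, num_channels, num_nodes), the feature after dimensionality reduction.
        out = self.conv_nodes(f.transpose(1, 2))        # convolve along one direction
        out = self.conv_channels(out.transpose(1, 2))   # convolve along the other direction
        return out                                      # same shape as the input f
```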
The specific details are shown in fig. 2. In addition, for the features extracted from normal samples and from adversarial samples at the different network structural levels, the present invention processes the features with the graph convolution before calculating the Wasserstein distance between them. As shown in fig. 2, when the ResNet-110 structure is used, the Wasserstein distance is calculated at 4 different locations to measure the differences between the features of the different-domain samples.
In step (3), hierarchical image feature extraction is performed as in step (1), and feature selection, dimensionality reduction and graph convolution are performed as in step (2); the differences between samples from different domains are then calculated using the optimal transport Wasserstein distance with a regularization term. This step performs the feature alignment operation between the features of samples from different domains, aligning the layered features of the adversarial samples to the layered features of the normal samples, so that the neural network model is sufficiently robust.
In this embodiment, we use X and Y to represent sets of feature vectors from two different distributions. More specifically, X represents the features at a certain level extracted from normal-domain samples using the deep neural network, and Y represents the features at the same level extracted from adversarial-domain samples using the same deep neural network. The optimal transport distance between the two feature distributions X and Y can be formulated as follows:
formula (2):
W_c(P_X, P_Y) := inf_{Γ ∈ P(X~P_X, Y~P_Y)} E_{(X,Y)~Γ}[c(X, Y)]
Equation (2) is the definition of the Wasserstein distance (earth mover's distance). The symbol := indicates a definition: the calculation result on the right is assigned to the notation on the left. In the formula, P_X and P_Y denote the marginal distributions of the features X and Y respectively, and P(X~P_X, Y~P_Y) denotes the set of joint distributions of X and Y; c(X, Y) is any measurable cost function that measures the distance between X and Y. E_{(X,Y)~Γ} denotes the mathematical expectation under the joint distribution Γ, and inf denotes the infimum of this expectation. Thus W_c(P_X, P_Y) is defined as follows: given the measurable cost function c and taking the marginal distributions P_X and P_Y as input, among all transport plans the one with the smallest X-to-Y distance is called the optimal transport plan, and the calculated distance value is the optimal transport distance required here.
In this embodiment, a cost function c(x, y) between feature vectors is used to measure their distance, so the Wasserstein distance is computed over pairs of feature vectors. In practical applications, it can be discretized into the following form:
W_c(P_X, P_Y) = min_P <P, C>
where <P, C> denotes the Hadamard (element-wise) product of the matrices P and C summed over all entries; P and C are both two-dimensional matrices, so <P, C> is the sum of the products of the elements at each corresponding position of P and C, and min denotes the optimization problem of finding the minimum over the transport matrices P. Because the computational cost of this method rises rapidly as the amount of data increases, the method improves the algorithm with entropy regularization and optimizes it with the Sinkhorn iterative algorithm. The entropy regularization term for the matrix P is:
H(P) = − Σ_{i,j} P_{ij} log P_{ij}
Thus, a regularized optimal transport Wasserstein distance is obtained:
W_c^ε(P_X, P_Y) = min_P <P, C> − ε·H(P)
where ε balances how closely the regularized problem approximates the original problem: as ε tends to 0, the regularized problem reduces to the original problem; in the present invention, ε = 0.1. Furthermore, since this problem is a convex optimization problem, it has a unique solution. In addition, the Wasserstein distance is used in the present invention to measure the difference between the intermediate feature representations extracted from normal samples and adversarial samples using the deep convolutional neural network.
In addition, we use the Sinkhorn iterative algorithm when optimizing the optimal transport distance.
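A minimal sketch of the entropy-regularized optimal transport (Sinkhorn) computation between two sets of feature vectors follows; the squared Euclidean cost, the uniform marginals, and the iteration count are illustrative assumptions rather than choices stated in the patent, apart from the regularization weight ε = 0.1.

```python
import torch

def sinkhorn_distance(X, Y, eps=0.1, num_iters=100):
    """Entropy-regularized Wasserstein distance between feature sets X (n, d) and Y (m, d).
    Sketch only: the cost matrix and the uniform marginals are assumptions."""
    # Pairwise cost matrix C (here: squared Euclidean distance, an assumed choice of c).
    C = torch.cdist(X, Y, p=2) ** 2
    n, m = C.shape
    mu = torch.full((n,), 1.0 / n, device=C.device)   # marginal of X (uniform)
    nu = torch.full((m,), 1.0 / m, device=C.device)   # marginal of Y (uniform)

    K = torch.exp(-C / eps)          # Gibbs kernel of the regularized problem
    u = torch.ones_like(mu)
    v = torch.ones_like(nu)
    for _ in range(num_iters):       # Sinkhorn iterations: alternate marginal scalings
        u = mu / (K @ v + 1e-9)
        v = nu / (K.t() @ u + 1e-9)

    P = torch.diag(u) @ K @ torch.diag(v)   # transport plan
    return torch.sum(P * C)                 # <P, C>: the transport cost used as the distance
```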
In step (4), the specific process is as follows: the Wasserstein distances between the adversarial-sample feature representations and the normal-sample feature representations at the different levels are added to the final loss function used to optimize the network parameters, and through sufficient end-to-end training the network model gradually learns a more robust feature representation by means of feature alignment.
The final loss function is shown in the following formula:
formula (3):
min_θ  L_CE(F(x_clean; θ), y_true) + L_CE(F(x_adv; θ), y_true) + λ Σ_l W_c( GCN(LC(f_l^clean)), GCN(LC(f_l^adv)) )
In formula (3), F denotes the deep neural network used for image classification, and θ denotes the parameters of the deep neural network, learned during end-to-end training of the network; L_CE denotes the cross-entropy loss function, and the cross-entropy losses of the normal samples and the corresponding adversarial samples are calculated simultaneously, so that the network can successfully classify both normal samples and adversarial samples; x_clean denotes a normal sample, x_adv denotes an adversarial sample, and y_true denotes the correct label of the data; f_l^clean and f_l^adv denote the image feature representations extracted from the normal sample and the adversarial sample at the l-th level of the deep neural network F, respectively, with l = 1, 2 or l = 1, 2, 3, 4; LC denotes the linear combination of features; λ denotes the relative weight between the multiple loss terms. When training the model with the training set, the final loss function shown in formula (3) is used to calculate the classification errors and the differences between sample features of different domains, and a stochastic gradient descent algorithm is then used to optimize the model parameters of the network according to these errors, finally finding the optimal model parameters.
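Assuming the building blocks sketched above (PGD generation, hierarchical extraction, the graph convolution block, and the `sinkhorn_distance` helper) are available, the following sketch shows how the loss of formula (3) could be assembled for one training batch; the flattening of feature maps into vectors as a stand-in for the LC dimensionality reduction, and the default value of λ, are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def hierarchical_alignment_loss(classifier, extractor, gcn_blocks,
                                x_clean, x_adv, y_true, lam=1.0):
    """Formula (3) sketch: cross-entropy on clean and adversarial samples plus lambda times
    the per-level Wasserstein distances between processed features (assumed helpers)."""
    # Cross-entropy terms on both domains, so the network classifies both correctly.
    loss = F.cross_entropy(classifier(x_clean), y_true)
    loss = loss + F.cross_entropy(classifier(x_adv), y_true)

    # Hierarchical features from the 4 structural levels for both domains.
    feats_clean = extractor(x_clean)
    feats_adv = extractor(x_adv)

    align_loss = 0.0
    for f_c, f_a, gcn in zip(feats_clean, feats_adv, gcn_blocks):
        # Flatten each feature map to vectors (stand-in for the LC dimensionality reduction).
        v_c = gcn(f_c.flatten(2)).flatten(1)
        v_a = gcn(f_a.flatten(2)).flatten(1)
        align_loss = align_loss + sinkhorn_distance(v_c, v_a)

    return loss + lam * align_loss
```

The returned value would then be minimized over θ with stochastic gradient descent, as described above.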
The embodiment of the invention has the following beneficial effects:
Compared with the prior art, the invention provides a novel hierarchical feature alignment method from the perspective of domain adaptation, so that the deep convolutional neural network can learn robust feature representations from adversarial samples; and, in order to better let the model learn robust feature representations while the similarity between adversarial-sample features and normal-sample features is progressively improved through the model network structure, the invention provides a Wasserstein distance method based on optimal transport theory to measure the difference between adversarial-sample features and normal-sample features.
The method provided by the invention can effectively improve the generalization ability of the deep-convolutional-neural-network-based model to samples from different adversarial domains, and provides an effective defense even against white-box attacks that are difficult for existing methods to handle;
Through the layered feature alignment operation, the deep-convolutional-neural-network-based model can learn more robust image features, thereby solving the problem in the prior art that the model generalizes poorly to adversarial samples from different domains, and providing effective reliability and safety guarantees for the deployment and application of deep model systems.
Those of ordinary skill in the art will appreciate that all or a portion of the steps in implementing the methods of the above embodiments may be implemented by a program to instruct related hardware, where the program may be stored in a computer readable storage medium, such as ROM/RAM, a magnetic disk, an optical disk, etc.
The foregoing embodiments are provided to further explain the present invention and are not to be construed as limiting its scope; insubstantial modifications and adjustments made by those skilled in the art in light of the above disclosure still fall within the protection scope of the present invention.

Claims (4)

1. A robust feature learning method based on hierarchical feature alignment, characterized in that the method comprises the following steps:
(1) Extracting depth features at different layers from image samples in different domains using a deep convolutional neural network;
(2) For the extracted layered features, constraining the channel and spatial relations of the features through a graph convolutional neural network, so that the model learns richer feature representations;
(3) Using the Wasserstein distance based on optimal transport theory to accurately measure the difference between sample feature representations in different domains;
(4) Using the differences between layered features extracted from different-domain samples as part of the model loss function to help the model learn more robust features, thereby improving the adversarial robustness of the deep neural network model;
in step (2), a graph convolution operation is performed using two one-dimensional convolutions, formulated as:
GCN(f)=Conv1D[Conv1D(f)]
wherein, in the formula, GCN(·) denotes the graph convolutional neural network and f denotes the feature vector after dimensionality reduction, which is the input of the graph convolution operation; Conv1D(·) denotes a one-dimensional convolution operation, feature extraction is performed using two one-dimensional graph convolution operations in different directions, and after sufficient end-to-end training the graph convolution operation enhances the representational capacity for the relationships between different regions in the feature;
in step (3), X denotes the features at a certain layer extracted from normal-domain samples using the deep neural network, and Y denotes the features at the same layer extracted from adversarial-domain samples using the same deep neural network; the optimal transport distance between the two feature distributions X and Y is formulated as follows:
W_c(P_X, P_Y) := inf_{Γ ∈ P(X~P_X, Y~P_Y)} E_{(X,Y)~Γ}[c(X, Y)]
wherein the symbol := indicates a definition, assigning the calculation result on the right to the notation on the left; P_X and P_Y denote the marginal distributions of the features X and Y respectively, P(X~P_X, Y~P_Y) denotes the set of joint distributions of X and Y, and c(X, Y) is any measurable cost function that measures the distance between X and Y; E_{(X,Y)~Γ} denotes the mathematical expectation under the joint distribution Γ, and inf denotes the infimum of this expectation; therefore, W_c(P_X, P_Y) is defined as follows: given the measurable cost function c and taking the marginal distributions P_X and P_Y as input, among all transport plans the one with the smallest X-to-Y distance is called the optimal transport plan, and the calculated distance value is the required optimal transport distance.
2. The robust feature learning method based on hierarchical feature alignment according to claim 1, characterized in that: the image samples of different domains include normal-domain image samples and adversarial-domain image samples.
3. The robust feature learning method based on hierarchical feature alignment according to claim 2, characterized in that: in step (1), a ResNet-110 network structure is used to extract features from the image and is divided into 4 different structural levels; after a normal sample or an adversarial sample is input, image features of different scales and different degrees of abstraction are extracted by the convolutional structure at the 4 different structural levels during the forward pass of the network.
4. The robust feature learning method based on hierarchical feature alignment according to claim 3, characterized in that: step (4) specifically includes: for the feature representations hierarchically extracted from the normal-domain image samples and the adversarial-domain image samples, after processing with the graph convolution, the differences between them are calculated using the Wasserstein distance; the Wasserstein distances between the adversarial-sample feature representations and the normal-sample feature representations at the different levels are added to the final loss function used to optimize the network parameters, and through sufficient end-to-end training the network model gradually learns a more robust feature representation by means of feature alignment;
the final loss function is shown in the following formula:
min_θ  L_CE(F(x_clean; θ), y_true) + L_CE(F(x_adv; θ), y_true) + λ Σ_l W_c( GCN(LC(f_l^clean)), GCN(LC(f_l^adv)) )
wherein, in the formula, F denotes the deep neural network used for image classification, and θ denotes the parameters of the deep neural network, learned during end-to-end training of the network; L_CE denotes the cross-entropy loss function, and the cross-entropy losses of the normal samples and the corresponding adversarial samples are calculated simultaneously, so that the network can successfully classify both normal samples and adversarial samples; x_clean denotes a normal sample, x_adv denotes an adversarial sample, and y_true denotes the correct label of the data; f_l^clean and f_l^adv denote the image feature representations extracted from the normal sample and the adversarial sample at the l-th level of the deep neural network F, respectively, with l = 1, 2 or l = 1, 2, 3, 4; LC denotes the linear combination of features; λ denotes the relative weight between the multiple loss terms; when training the model with the training set, the final loss function calculates the classification errors and the differences between sample features of different domains, and a stochastic gradient descent algorithm is then used to optimize the model parameters of the network according to these errors, finally finding the optimal model parameters.
CN202010809932.6A 2020-08-12 2020-08-12 Robust feature learning method based on layered feature alignment Active CN111950635B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010809932.6A CN111950635B (en) 2020-08-12 2020-08-12 Robust feature learning method based on layered feature alignment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010809932.6A CN111950635B (en) 2020-08-12 2020-08-12 Robust feature learning method based on layered feature alignment

Publications (2)

Publication Number Publication Date
CN111950635A CN111950635A (en) 2020-11-17
CN111950635B true CN111950635B (en) 2023-08-25

Family

ID=73331806

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010809932.6A Active CN111950635B (en) 2020-08-12 2020-08-12 Robust feature learning method based on layered feature alignment

Country Status (1)

Country Link
CN (1) CN111950635B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112465019B (en) * 2020-11-26 2022-12-27 重庆邮电大学 Countermeasure sample generation and countermeasure defense method based on disturbance
CN113436073B (en) * 2021-06-29 2023-04-07 中山大学 Real image super-resolution robust method and device based on frequency domain

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018028255A1 (en) * 2016-08-11 2018-02-15 深圳市未来媒体技术研究院 Image saliency detection method based on adversarial network
CN110674866A (en) * 2019-09-23 2020-01-10 兰州理工大学 Method for detecting X-ray breast lesion images by using transfer learning characteristic pyramid network
CN110728219A (en) * 2019-09-29 2020-01-24 天津大学 3D face generation method based on multi-column multi-scale graph convolution neural network
CN110738622A (en) * 2019-10-17 2020-01-31 温州大学 Lightweight neural network single image defogging method based on multi-scale convolution
CN111126258A (en) * 2019-12-23 2020-05-08 深圳市华尊科技股份有限公司 Image recognition method and related device
CN111178504A (en) * 2019-12-17 2020-05-19 西安电子科技大学 Information processing method and system of robust compression model based on deep neural network
CN111242227A (en) * 2020-01-16 2020-06-05 天津师范大学 Multi-modal foundation cloud identification method based on heterogeneous depth features
CN111476200A (en) * 2020-04-27 2020-07-31 华东师范大学 Face de-identification generation method based on generation of confrontation network

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10698063B2 (en) * 2017-11-01 2020-06-30 Siemens Healthcare Gmbh Motion artifact reduction of magnetic resonance images with an adversarial trained network
US11030485B2 (en) * 2018-03-30 2021-06-08 Arizona Board Of Regents On Behalf Of Arizona State University Systems and methods for feature transformation, correction and regeneration for robust sensing, transmission, computer vision, recognition and classification
US11481617B2 (en) * 2019-01-22 2022-10-25 Adobe Inc. Generating trained neural networks with increased robustness against adversarial attacks

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018028255A1 (en) * 2016-08-11 2018-02-15 深圳市未来媒体技术研究院 Image saliency detection method based on adversarial network
CN110674866A (en) * 2019-09-23 2020-01-10 兰州理工大学 Method for detecting X-ray breast lesion images by using transfer learning characteristic pyramid network
CN110728219A (en) * 2019-09-29 2020-01-24 天津大学 3D face generation method based on multi-column multi-scale graph convolution neural network
CN110738622A (en) * 2019-10-17 2020-01-31 温州大学 Lightweight neural network single image defogging method based on multi-scale convolution
CN111178504A (en) * 2019-12-17 2020-05-19 西安电子科技大学 Information processing method and system of robust compression model based on deep neural network
CN111126258A (en) * 2019-12-23 2020-05-08 深圳市华尊科技股份有限公司 Image recognition method and related device
CN111242227A (en) * 2020-01-16 2020-06-05 天津师范大学 Multi-modal foundation cloud identification method based on heterogeneous depth features
CN111476200A (en) * 2020-04-27 2020-07-31 华东师范大学 Face de-identification generation method based on generation of confrontation network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Multi-channel image inpainting method based on convolutional adversarial networks; Fan Baojie et al.; Computer Applications and Software; Vol. 37, No. 7; pp. 176-179, 265 *

Also Published As

Publication number Publication date
CN111950635A (en) 2020-11-17

Similar Documents

Publication Publication Date Title
CN110941794B (en) Challenge attack defense method based on general inverse disturbance defense matrix
Esmaeili et al. Fast-at: Fast automatic thumbnail generation using deep neural networks
CN107844743A (en) A kind of image multi-subtitle automatic generation method based on multiple dimensioned layering residual error network
CN111462210B (en) Monocular line feature map construction method based on epipolar constraint
CN111950635B (en) Robust feature learning method based on layered feature alignment
CN107871105A (en) Face authentication method and device
CN111310582A (en) Turbulence degradation image semantic segmentation method based on boundary perception and counterstudy
Elaskily et al. Deep learning based algorithm (ConvLSTM) for copy move forgery detection
CN110309835B (en) Image local feature extraction method and device
CN105184260A (en) Image characteristic extraction method, pedestrian detection method and device
CN114255403A (en) Optical remote sensing image data processing method and system based on deep learning
CN111400572A (en) Content safety monitoring system and method for realizing image feature recognition based on convolutional neural network
CN112381733B (en) Image recovery-oriented multi-scale neural network structure searching method and network application
Liu et al. Target recognition of sport athletes based on deep learning and convolutional neural network
CN111091129A (en) Image salient region extraction method based on multi-color characteristic manifold sorting
CN114399630A (en) Countercheck sample generation method based on belief attack and significant area disturbance limitation
CN112329764A (en) Infrared dim target detection method based on TV-L1 model
CN116912804A (en) Efficient anchor-frame-free 3-D target detection and tracking method and model
CN113935396A (en) Manifold theory-based method and related device for resisting sample attack
CN117152486A (en) Image countermeasure sample detection method based on interpretability
Liu et al. Iris double recognition based on modified evolutionary neural network
CN116109649A (en) 3D point cloud instance segmentation method based on semantic error correction
Jafrasteh et al. Generative adversarial networks as a novel approach for tectonic fault and fracture extraction in high resolution satellite and airborne optical images
CN113920159B (en) Infrared air small and medium target tracking method based on full convolution twin network
CN113487506A (en) Countermeasure sample defense method, device and system based on attention denoising

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant