CN111950635A - Robust feature learning method based on hierarchical feature alignment - Google Patents

Robust feature learning method based on hierarchical feature alignment

Info

Publication number
CN111950635A
CN111950635A (application CN202010809932.6A; granted as CN111950635B)
Authority
CN
China
Prior art keywords
feature
sample
features
model
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010809932.6A
Other languages
Chinese (zh)
Other versions
CN111950635B (en
Inventor
张笑钦
王金鑫
赵丽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wenzhou University
Original Assignee
Wenzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wenzhou University filed Critical Wenzhou University
Priority to CN202010809932.6A priority Critical patent/CN111950635B/en
Publication of CN111950635A publication Critical patent/CN111950635A/en
Application granted granted Critical
Publication of CN111950635B publication Critical patent/CN111950635B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • G06V10/464Salient features, e.g. scale invariant feature transforms [SIFT] using a plurality of salient features, e.g. bag-of-words [BoW] representations
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Abstract

The invention discloses a robust feature learning method based on hierarchical feature alignment, which comprises the following steps: hierarchically extracting depth features from input samples of different domains using a deep convolutional neural network; for the extracted hierarchical features, constraining the channel and spatial relations of the features through a graph convolutional neural network, so that the model learns richer feature representations; accurately measuring the difference between the feature representations of samples from different domains using the Wasserstein distance based on optimal transport theory; and using the differences between the hierarchical features extracted from samples of different domains as part of the model loss function to help the model learn more robust features, thereby improving the robustness of the deep neural network model. Through this technical scheme, the deep network model can learn robust features and avoid being compromised by adversarial attack methods, yielding a safe and reliable deep learning system.

Description

Robust feature learning method based on hierarchical feature alignment
Technical Field
The invention relates to the technical field of robust machine learning, in particular to a robust feature learning method based on hierarchical feature alignment.
Background
In recent years, deep convolutional neural networks have achieved breakthroughs in many computer vision tasks, such as image classification and object detection. However, researchers have found that these networks are vulnerable to being fooled by specially crafted adversarial perturbations that are imperceptible to the human eye. Adversarial examples generated by adversarial attack methods pose serious challenges to systems with high safety and stability requirements, including autonomous driving systems, medical diagnosis systems, and security systems. Moreover, if a deep network model changes its prediction with high confidence when given an input with only a small perturbation, one can conclude that the model never learned the task-relevant intrinsic attributes of the input samples and cannot learn robust visual concepts from them. Designing deep network models that are sufficiently robust to adversarial perturbations is therefore crucial for safe and reliable computer vision applications.
In recent research, a variety of defense mechanisms have been proposed to counter different adversarial attack methods. These defenses fall roughly into two categories. The first category applies various forms of pre-processing to the input image to overcome the attack. Dziugaite et al. and Das et al. use JPEG image compression as an adversarial defense; these methods apply the discrete cosine transform in the input-image domain to suppress adversarial noise, but JPEG-compression-based methods are far from removing it completely. Exploiting the strong representational power of generative adversarial networks, Samangouei et al. proposed the Defense-GAN method to defend against a variety of attacks; it removes adversarial noise by regenerating an image sufficiently similar to the input. Mustafa et al. proposed image super-resolution as a defense: a deep super-resolution network serves as a mapping function that moves samples from the adversarial domain back to the normal domain, removing the adversarial noise before the mapped image is fed to the image recognition system for normal recognition. The second category improves model robustness against adversarial perturbations by modifying the training process or the network structure. Adversarial training is an effective means of improving robustness; it augments the training data with specially crafted adversarial examples. Goodfellow et al. trained network models by adding to the clean samples adversarial examples generated with the FGSM (Fast Gradient Sign Method) attack.
Madry et al. performed adversarial training with a min-max optimization formulation, generating adversarial examples using the PGD (Projected Gradient Descent) attack method. Ensemble adversarial training is another defense method: it optimizes model parameters using adversarial examples generated from several different deep networks as training data. In addition, to improve the generalization of deep models to adversarial examples, Song et al. trained network models using a domain adaptation method.
Although the above methods have made good progress in improving the robustness of deep convolutional neural networks, they often fail to achieve satisfactory results against different kinds of white-box attacks, limited by the poor generalization performance of the models.
Disclosure of Invention
Aiming at the defects of the prior art, the invention provides a robust feature learning method based on hierarchical feature alignment, which enables a model based on a deep convolutional neural network to obtain more robust image features through the operation of hierarchical feature alignment, thereby solving the prior-art problem of limited model generalization to adversarial examples from different domains and providing effective reliability and safety guarantees for the deployment and application of deep model systems.
In order to achieve the above purpose, the invention provides the following technical scheme: a robust feature learning method based on hierarchical feature alignment, comprising the following steps:
(1) extracting depth features at different levels from samples of different domains using a deep convolutional neural network;
(2) for the extracted hierarchical features, constraining the channel and spatial relations of the features through a graph convolutional neural network, so that the model learns richer feature representations;
(3) accurately measuring the difference between the feature representations of samples from different domains using the Wasserstein distance based on optimal transport theory;
(4) using the differences between the hierarchical features extracted from samples of different domains as part of the model loss function to help the model learn more robust features, thereby improving the robustness of the deep neural network model.
Preferably, the image samples of different domains include normal-domain image samples and adversarial-domain image samples.
Preferably, in step (1), feature extraction is performed with a ResNet-110 network structure divided into 4 structural levels; after a normal or adversarial sample is input, during forward inference the network extracts image features of different scales and different degrees of abstraction using the convolutional structures at these 4 levels.
Preferably, in step (2), the graph convolution operation is performed using two one-dimensional convolutions, and is formulated as follows:
formula (1):
GCN(f)=Conv1D[Conv1D(f)]
In formula (1), GCN(·) denotes the graph convolutional neural network and f denotes the dimension-reduced feature vector that serves as the input to the graph convolution; Conv1D(·) denotes a one-dimensional convolution. Two one-dimensional graph convolutions in different directions are used for feature extraction; after sufficient end-to-end training, this enhances the representation of the relations between different regions of a feature.
Preferably, in step (3), X denotes a feature at a certain level extracted from a normal-domain sample by the deep neural network, and Y denotes the feature at the same level extracted from an adversarial-domain sample by the same network; the optimal transport distance between the two feature distributions X and Y is formulated as follows:
formula (2):
W_c(P_X, P_Y) := \inf_{P(X \sim P_X,\, Y \sim P_Y)} \mathbb{E}_{(X,Y) \sim P}\,[\,c(X, Y)\,]
Equation (2) is the definition of the Wasserstein distance. The symbol := indicates a definition: the computation on the right defines the quantity on the left. P_X and P_Y denote the marginal distributions of the features X and Y, and P(X ∼ P_X, Y ∼ P_Y) denotes a joint distribution of X and Y with those marginals; c(X, Y) is an arbitrary measurable cost function measuring the distance between X and Y. E_{(X,Y)∼P} denotes the mathematical expectation under the joint distribution, and inf denotes the infimum of that expectation. Thus, given a measurable cost function c, W_c(P_X, P_Y) takes the marginal distributions P_X and P_Y as input; among all transport plans, the one for which the expected distance from X to Y is smallest is called the optimal transport plan, and the resulting distance value is the required optimal transport distance.
Preferably, step (4) specifically comprises: extracting feature representations hierarchically from the normal-domain and adversarial-domain image samples; after graph convolution processing, computing the difference between them with the Wasserstein distance; adding the Wasserstein distances between the adversarial and normal feature representations at the different levels to the final loss function used to optimize the network parameters; and, through sufficient end-to-end training, letting the network model gradually learn a more robust feature representation by means of feature alignment;
the final loss function is shown below:
formula (3):
\mathcal{L}(\theta) = L_{CE}(F(x_{clean}; \theta), y_{true}) + L_{CE}(F(x_{adv}; \theta), y_{true}) + \lambda \sum_{l} W_c\big( GCN(LC(f_{clean}^{l})),\, GCN(LC(f_{adv}^{l})) \big)
In formula (3), F denotes the deep neural network used for image classification and θ its parameters, learned during end-to-end training. L_CE denotes the cross-entropy loss function, computed for both the normal sample and the corresponding adversarial sample so that the network can successfully classify both. x_clean denotes a normal sample, x_adv an adversarial sample, and y_true the correct label of the data. f_clean^l and f_adv^l denote the image feature representations extracted from the normal and adversarial samples at the l-th level of the deep neural network F, with l = 1, 2 or l = 1, 2, 3, 4. LC denotes the linear combination of features, and λ the relative weight between the loss terms. When training the model on the training set, the final loss function of formula (3) computes the classification error and the differences between the features of samples from different domains, and the model parameters of the network are then optimized by stochastic gradient descent according to this error, finally finding the optimal model parameters.
The invention has the following advantages: compared with the prior art, the invention provides a novel hierarchical feature alignment method from the perspective of domain adaptation, so that a deep convolutional neural network can learn robust feature representations from adversarial samples. Because the similarity between adversarial-sample features and normal-sample features increases progressively along the network structure of the model, and in order to better let the model learn robust feature representations, the invention measures the difference between adversarial and normal features with a Wasserstein distance based on optimal transport theory.
The proposed method can effectively improve the generalization of deep-convolutional-network-based models to samples from different adversarial domains, and can provide effective defense even under white-box attacks that existing methods handle poorly;
Through the operation of hierarchical feature alignment, the model based on the deep convolutional neural network can obtain more robust image features, solving the prior-art problem of limited generalization to adversarial examples from different domains and providing effective reliability and safety guarantees for the deployment and application of deep model systems.
The invention is further described below with reference to the drawings and specific embodiments.
Drawings
FIG. 1 is a flow chart of an embodiment of the present invention;
FIG. 2 is a schematic diagram of the model structure proposed for adversarial defense according to an embodiment of the present invention;
FIG. 3 is a diagram illustrating the robust feature learning process on adversarial samples according to an embodiment of the present invention;
FIG. 4 is a schematic diagram visualizing the decision space for normal and adversarial samples on three typical classification datasets according to an embodiment of the present invention.
Detailed Description
Referring to fig. 1, fig. 2 and fig. 3, the robust feature learning method based on hierarchical feature alignment disclosed by the present invention comprises the following steps:
(1) extracting depth features at different levels from samples of different domains using a deep convolutional neural network;
(2) for the extracted hierarchical features, constraining the channel and spatial relations of the features through a graph convolutional neural network, so that the model learns richer feature representations;
(3) accurately measuring the difference between the feature representations of samples from different domains using the Wasserstein distance based on optimal transport theory;
(4) using the differences between the hierarchical features extracted from samples of different domains as part of the model loss function to help the model learn more robust features, thereby improving the robustness of the deep neural network model.
In step (1), the PGD (Projected Gradient Descent) attack method is used to generate, from the normal-domain image samples, corresponding adversarial-domain image samples with different degrees of perturbation. For both normal-domain and adversarial-domain image samples, a deep convolutional neural network is used to extract multi-level image features. To make the extracted features more representative, the deep network is divided into multiple levels according to its structure. Since the invention proposes a framework, different "layers" here denote the divided structural levels used for feature extraction rather than specific layers of the network; the proposed division into structural levels is shown in fig. 2.
Taking the structure shown in fig. 2 as an example, the ResNet-110 network structure ("ResNet-110" here refers to a residual network with 110 layers) is used for feature extraction and divided into 4 structural levels. After a normal or adversarial sample is input, during forward inference the network extracts image features of different scales and different degrees of abstraction using the convolutional structures at these 4 levels.
For model training, in order to better perform the feature alignment operation, the invention proposes to first train the model on clean samples, and then use the clean samples together with the corresponding adversarial samples to jointly train the robust feature learning process based on hierarchical feature alignment.
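The adversarial-sample generation of step (1) can be sketched as follows. This is a minimal illustration of the PGD update rule on a toy differentiable loss, not the patent's implementation: the perturbation budget, step size, iteration count, and the stand-in gradient function are all illustrative assumptions.

```python
import numpy as np

def pgd_attack(x, grad_fn, eps=0.03, alpha=0.01, steps=10):
    """Projected Gradient Descent: repeatedly take signed gradient steps
    and project back into the L-infinity ball of radius eps around x."""
    x_adv = x.copy()
    for _ in range(steps):
        g = grad_fn(x_adv)                        # gradient of the loss w.r.t. the input
        x_adv = x_adv + alpha * np.sign(g)        # ascent step on the loss
        x_adv = np.clip(x_adv, x - eps, x + eps)  # project into the eps-ball
        x_adv = np.clip(x_adv, 0.0, 1.0)          # keep a valid image range
    return x_adv

# Toy stand-in: "loss" = 0.5 * ||x - t||^2 for a target t, so grad = x - t.
t = np.full(4, 0.8)
x = np.full(4, 0.5)
x_adv = pgd_attack(x, lambda z: z - t)
```

Since the toy gradient is uniformly negative, every step moves the sample downward by `alpha` until the projection pins it at `x - eps`.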
In step (2), after image features have been extracted at the different levels by the deep convolutional neural network, the extracted features at each level are processed with a graph convolutional neural network so that the network learns richer image feature representations. Graph convolution can better capture the relations between different regions of a deep feature from a global perspective, and can also impose stronger constraints on the features for the subsequent feature alignment. Because computing the optimal transport Wasserstein distance requires distances between feature vectors, the tensor-form image features are converted into feature-vector form; and, to speed up the distance computation, a series of feature selection and dimensionality reduction operations are applied.
To reduce the complexity of the dimensionality reduction, a representative linear combination of the features is used to process the extracted features along both the channel and feature-node dimensions. After dimensionality reduction, the graph convolution operation is performed using two one-dimensional convolutions, which can be formulated as:
formula (1):
GCN(f)=Conv1D[Conv1D(f)]
In formula (1), GCN(·) denotes the graph convolutional neural network and f denotes the dimension-reduced feature vector that serves as the input to the graph convolution; Conv1D(·) denotes a one-dimensional convolution, and feature extraction is performed with two one-dimensional graph convolutions in different directions. After sufficient end-to-end training, the graph convolution operation can enhance the representation of the relations between different regions of a feature.
Specific details are shown in fig. 2. In addition, for the features extracted from normal samples and from adversarial samples at the different structural levels, the invention processes the features with graph convolution before computing the Wasserstein distance between them. As shown in fig. 2, with the ResNet-110 structure, Wasserstein distances are computed at 4 different locations to measure the difference between sample features from the two domains.
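The pipeline of this step — pooling a feature tensor to a vector, reducing it by a linear combination (the LC operation), and applying formula (1)'s two one-dimensional convolutions — can be sketched as follows. The shapes, the random weights, and the use of a reversed signal to stand in for a "different direction" are illustrative assumptions, not details taken from the patent.

```python
import numpy as np

def conv1d(x, w):
    """Valid-mode 1D convolution of signal x with kernel w."""
    k = len(w)
    return np.array([np.dot(x[i:i + k], w) for i in range(len(x) - k + 1)])

def gcn_sketch(feat, W_lc, w1, w2):
    """GCN(f) = Conv1D[Conv1D(f)] applied to a dimension-reduced feature.
    feat:  (C, H, W) feature tensor from one structural level
    W_lc:  (C, d) linear-combination weights for dimensionality reduction
    w1,w2: kernels for the two passes; the second is applied to the
           reversed signal as a stand-in for a 'different direction'."""
    f = feat.reshape(feat.shape[0], -1).mean(axis=1)  # pool spatial dims -> (C,)
    f = W_lc.T @ f                                    # LC: reduce C channels to d
    h = conv1d(f, w1)                                 # first 1D graph convolution
    return conv1d(h[::-1], w2)                        # second, different direction

rng = np.random.default_rng(0)
feat = rng.standard_normal((16, 8, 8))
out = gcn_sketch(feat, rng.standard_normal((16, 6)), np.ones(3) / 3, np.ones(3) / 3)
```

In a trained model `W_lc`, `w1`, and `w2` would be learned end-to-end; here they are fixed so the shapes can be followed: 16 channels pool to a 16-vector, LC reduces it to 6, and the two valid-mode convolutions shrink it to 4 and then 2 entries.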
In step (3), after the hierarchical image features have been extracted in step (1) and the feature selection, dimensionality reduction, and graph convolution operations of step (2) have been applied, the difference between samples from different domains is computed with the regularized optimal transport Wasserstein distance. To perform feature alignment between the features of samples from different domains, this step aligns the hierarchical features of the adversarial samples to those of the normal samples, so that the neural model becomes sufficiently robust.
In this embodiment, X and Y denote sets of feature vectors from two different distributions; more specifically, X denotes a feature at a certain level extracted from a normal-domain sample with the deep neural network, and Y denotes the feature at the same level extracted from an adversarial-domain sample with the same network. The optimal transport distance between the two feature distributions X and Y can be formulated as follows:
formula (2):
W_c(P_X, P_Y) := \inf_{P(X \sim P_X,\, Y \sim P_Y)} \mathbb{E}_{(X,Y) \sim P}\,[\,c(X, Y)\,]
Equation (2) is the definition of the Wasserstein distance. The symbol := indicates a definition: we define the computation on the right as the quantity on the left. In the formula, P_X and P_Y denote the marginal distributions of the features X and Y, and P(X ∼ P_X, Y ∼ P_Y) denotes a joint distribution of X and Y with those marginals. c(X, Y) is an arbitrary measurable cost function measuring the distance between X and Y. Further, E_{(X,Y)∼P} denotes the mathematical expectation under the joint distribution, and inf denotes the infimum of that expectation. Thus, given a measurable cost function c, W_c(P_X, P_Y) takes the marginal distributions P_X and P_Y as input; among all transport plans, the one for which the expected distance from X to Y is smallest is called the optimal transport plan, and the computed distance value is the optimal transport distance required here.
In this embodiment, the ℓ2 (Euclidean) distance between feature vectors is used as the cost function, c(x, y) = \|x - y\|_2. Thus, the formula can be expressed as follows:
W(P_X, P_Y) = \inf_{P(X \sim P_X,\, Y \sim P_Y)} \mathbb{E}_{(X,Y) \sim P}\,[\, \|X - Y\|_2 \,]
in practical application, it can be discretized into the form of the following formula:
W(P_X, P_Y) = \min_{P} \langle P, C \rangle
where ⟨·, ·⟩ denotes the Hadamard (element-wise) inner product between the matrices P and C: since P and C are both two-dimensional matrices, it is the sum of the products of the elements at each corresponding position of P and C, and min denotes the optimization problem of computing the minimum. Because the computational cost of this method rises rapidly with the amount of data, the invention improves the algorithm with entropy regularization and optimizes with the Sinkhorn iterative algorithm. The entropy regularization term for the matrix P is given by the following equation:
H(P) = -\sum_{i,j} P_{ij} \log P_{ij}
Therefore, a regularized optimal transport Wasserstein distance calculation method can be obtained:
W_\epsilon(P_X, P_Y) = \min_{P} \langle P, C \rangle - \epsilon H(P)
In the formula, ε balances the degree of approximation between the regularized problem and the original problem; as ε approaches 0, the regularized problem converges to the original problem, and in the invention ε = 0.1. Furthermore, since the regularized problem is strictly convex, it has a unique solution. In addition, in the invention, the Wasserstein distance is used to measure the difference between the intermediate feature representations extracted with the deep convolutional neural network from the normal samples and from the adversarial samples.
In addition, the Sinkhorn iterative algorithm is chosen to optimize the optimal transport distance.
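A minimal sketch of the entropy-regularized optimal transport distance solved by Sinkhorn iterations, with ε = 0.1 as in the invention. The uniform marginals, the iteration count, and the ℓ2 cost matrix are assumptions of this sketch rather than details from the patent.

```python
import numpy as np

def sinkhorn_distance(X, Y, eps=0.1, n_iter=200):
    """Entropy-regularized optimal transport between two feature sets.
    X: (n, d) features from normal-domain samples
    Y: (m, d) features from adversarial-domain samples
    Returns <P, C>, the regularized Wasserstein distance estimate."""
    n, m = len(X), len(Y)
    # Cost matrix of pairwise Euclidean distances.
    C = np.linalg.norm(X[:, None, :] - Y[None, :, :], axis=2)
    K = np.exp(-C / eps)                             # Gibbs kernel
    a, b = np.full(n, 1.0 / n), np.full(m, 1.0 / m)  # uniform marginals
    u, v = np.ones(n), np.ones(m)
    for _ in range(n_iter):                          # Sinkhorn fixed-point updates
        u = a / (K @ v)
        v = b / (K.T @ u)
    P = u[:, None] * K * v[None, :]                  # transport plan
    return float(np.sum(P * C))

rng = np.random.default_rng(1)
X = rng.standard_normal((5, 3))
d_self = sinkhorn_distance(X, X)         # a set compared with itself: small
d_shift = sinkhorn_distance(X, X + 5.0)  # a shifted copy: much larger
```

Aligning adversarial features to normal features in the loss amounts to driving a quantity like `d_shift` toward `d_self`.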
In step (4), the Wasserstein distances between the adversarial and normal feature representations at the different levels are added to the final loss function used to optimize the network parameters; through sufficient end-to-end training, the network model gradually learns a more robust feature representation by means of feature alignment.
The final loss function is shown in the following equation:
formula (3)
\mathcal{L}(\theta) = L_{CE}(F(x_{clean}; \theta), y_{true}) + L_{CE}(F(x_{adv}; \theta), y_{true}) + \lambda \sum_{l} W_c\big( GCN(LC(f_{clean}^{l})),\, GCN(LC(f_{adv}^{l})) \big)
In formula (3), F denotes the deep neural network used for image classification and θ its parameters, learned during end-to-end training. L_CE denotes the cross-entropy loss function, computed for both the normal sample and the corresponding adversarial sample so that the network can successfully classify both. x_clean denotes a normal sample, x_adv an adversarial sample, and y_true the correct label of the data. f_clean^l and f_adv^l denote the image feature representations extracted from the normal and adversarial samples at the l-th level of the deep neural network F, with l = 1, 2 or l = 1, 2, 3, 4. LC denotes the linear combination of features, and λ the relative weight between the loss terms. When training the model on the training set, the final loss function of formula (3) computes the classification error and the differences between the features of samples from different domains, and the model parameters of the network are then optimized by stochastic gradient descent according to this error, finally finding the optimal model parameters.
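The composition of the final loss in formula (3) can be sketched as follows, with the per-level Wasserstein distances passed in as precomputed values. The logits, the distance values, and the weight λ = 0.01 are illustrative assumptions; the patent does not specify λ.

```python
import numpy as np

def cross_entropy(logits, label):
    """Softmax cross-entropy for a single sample, computed stably."""
    z = logits - logits.max()
    log_probs = z - np.log(np.exp(z).sum())
    return -log_probs[label]

def total_loss(logits_clean, logits_adv, label, layer_distances, lam=0.01):
    """Formula (3): CE on the clean sample + CE on the adversarial sample
    + lambda times the sum of per-level Wasserstein distances between the
    graph-convolved feature representations (precomputed here)."""
    return (cross_entropy(logits_clean, label)
            + cross_entropy(logits_adv, label)
            + lam * sum(layer_distances))

# Illustrative values: 3-class logits and 4 per-level alignment distances.
loss = total_loss(np.array([2.0, 0.1, -1.0]),
                  np.array([1.0, 0.8, -0.5]),
                  label=0,
                  layer_distances=[0.9, 0.6, 0.3, 0.1])
```

Setting all `layer_distances` to zero recovers plain adversarial training with two cross-entropy terms; the alignment terms only add to the loss when the two feature distributions disagree.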
The embodiment of the invention has the following beneficial effects:
Compared with the prior art, the invention provides a novel hierarchical feature alignment method from the perspective of domain adaptation, so that a deep convolutional neural network can learn robust feature representations from adversarial samples. Because the similarity between adversarial-sample features and normal-sample features increases progressively along the network structure of the model, and in order to better let the model learn robust feature representations, the invention measures the difference between adversarial and normal features with a Wasserstein distance based on optimal transport theory.
The proposed method can effectively improve the generalization of deep-convolutional-network-based models to samples from different adversarial domains, and can provide effective defense even under white-box attacks that existing methods handle poorly;
Through the operation of hierarchical feature alignment, the model based on the deep convolutional neural network can obtain more robust image features, solving the prior-art problem of limited generalization to adversarial examples from different domains and providing effective reliability and safety guarantees for the deployment and application of deep model systems.
It will be understood by those skilled in the art that all or part of the steps in the method for implementing the above embodiments may be implemented by relevant hardware instructed by a program, and the program may be stored in a computer-readable storage medium, such as ROM/RAM, a magnetic disk, an optical disk, etc.
The above embodiments are described in detail to further illustrate the present invention and should not be construed as limiting its scope; a skilled engineer can make insubstantial modifications and variations of the present invention based on the above disclosure.

Claims (6)

1. A robust feature learning method based on hierarchical feature alignment, characterized in that the method comprises the following steps:
(1) extracting depth features at different levels from samples of different domains using a deep convolutional neural network;
(2) for the extracted hierarchical features, constraining the channel and spatial relations of the features through a graph convolutional neural network, so that the model learns richer feature representations;
(3) accurately measuring the difference between the feature representations of samples from different domains using the Wasserstein distance based on optimal transport theory;
(4) using the differences between the hierarchical features extracted from samples of different domains as part of the model loss function to help the model learn more robust features, thereby improving the robustness of the deep neural network model.
2. The robust feature learning method based on hierarchical feature alignment as claimed in claim 1, wherein: the image samples of the different domains include normal-domain image samples and adversarial-domain image samples.
3. The robust feature learning method based on hierarchical feature alignment as claimed in claim 2, wherein: in step (1), a ResNet-110 network structure is used to extract features from the image, the image features being divided into 4 structural levels; after a normal sample or an adversarial sample is input, during the network's forward inference, convolutional structures at the 4 structural levels extract image features of different scales and different degrees of abstraction.
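To illustrate the idea of collecting features at 4 scales as in claim 3, a minimal NumPy sketch follows; the real method would tap the intermediate stages of ResNet-110, so the 2x2 average pooling used here as a stand-in for a convolution stage, and the 32x32 input size, are illustrative assumptions only.

```python
import numpy as np

def pool2x(x):
    """2x2 average pooling with stride 2, standing in for one conv stage."""
    c, h, w = x.shape
    return x[:, :h // 2 * 2, :w // 2 * 2].reshape(
        c, h // 2, 2, w // 2, 2).mean(axis=(2, 4))

def hierarchical_features(image, levels=4):
    """Collect feature maps at `levels` successively coarser scales,
    mimicking the 4 structural levels of claim 3."""
    feats, x = [], image
    for _ in range(levels):
        x = pool2x(x)          # each level halves the spatial resolution
        feats.append(x)
    return feats

img = np.random.default_rng(1).normal(size=(3, 32, 32))  # (channels, H, W)
feats = hierarchical_features(img)
# scales: (3, 16, 16), (3, 8, 8), (3, 4, 4), (3, 2, 2)
```

Each returned map plays the role of one level's feature representation; in the patented method these would instead be the activations of the 4 ResNet stages.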
4. The robust feature learning method based on hierarchical feature alignment as claimed in claim 3, wherein: in step (2), the graph convolution operation is performed using two one-dimensional convolutions, formulated as follows:
GCN(f)=Conv1D[Conv1D(f)]
where GCN(·) denotes the graph convolutional neural network and f denotes the dimensionality-reduced feature vector that serves as the input of the graph convolution operation; Conv1D(·) denotes a one-dimensional convolution operation. Two one-dimensional graph convolutions with different orientations are used for feature extraction; after sufficient end-to-end training, they strengthen the representation of the relations between different regions of a feature.
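The two-orientation composition GCN(f) = Conv1D[Conv1D(f)] of claim 4 can be sketched in NumPy as one 1-D pass along the channel axis followed by one along the spatial axis; the choice of axes, the kernel values, and the 8x16 feature shape are illustrative assumptions, since the claim does not fix them.

```python
import numpy as np

def conv1d(x, kernel, axis):
    """Same-size 1-D convolution of a 2-D feature map along one axis."""
    return np.apply_along_axis(
        lambda v: np.convolve(v, kernel, mode="same"), axis, x)

def gcn(f, k_channel=np.array([0.25, 0.5, 0.25]),
           k_spatial=np.array([0.25, 0.5, 0.25])):
    """GCN(f) = Conv1D[Conv1D(f)]: mix information across channels,
    then across spatial positions (two different orientations)."""
    h = conv1d(f, k_channel, axis=0)   # first orientation: channel axis
    return conv1d(h, k_spatial, axis=1)  # second orientation: spatial axis

f = np.random.default_rng(0).normal(size=(8, 16))  # (channels, positions)
out = gcn(f)  # same shape as the input feature
```

In the patented method the two convolutions carry learned weights trained end to end; fixed smoothing kernels are used here only so the composition is concrete and runnable.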
5. The robust feature learning method based on hierarchical feature alignment as claimed in claim 4, wherein: in step (3), X denotes the features at a certain layer extracted by the deep neural network from a normal-domain sample, and Y denotes the features at the same layer extracted by the same deep neural network from an adversarial-domain sample; the optimal transport distance between the two feature distributions X and Y is formulated as follows:
$$W_c(P_X, P_Y) := \inf_{P(X \sim P_X,\, Y \sim P_Y)} \mathbb{E}_{(X,Y) \sim P}\big[c(X, Y)\big]$$
where the symbol $:=$ indicates a definition, the computation on the right defining the notation on the left; $P_X$ and $P_Y$ denote the marginal distributions of the features X and Y respectively, $P(X \sim P_X, Y \sim P_Y)$ denotes a joint distribution of the features X and Y, and c(X, Y) is an arbitrary measurable cost function that measures the distance between X and Y; furthermore, $\mathbb{E}_{(X,Y) \sim P}$ denotes the mathematical expectation under the joint distribution, and inf denotes the infimum of the computed expectations. Thus, given a measurable cost function c, $W_c(P_X, P_Y)$ takes the marginal distributions $P_X$ and $P_Y$ of the features X and Y as input; among all transport plans, the one under which the distance from X to Y is smallest is called the optimal transport plan, and the distance value it attains is the required optimal transport distance.
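The infimum over couplings in claim 5 is hard to compute in general, but for two equal-size one-dimensional empirical distributions with cost c(x, y) = |x - y|, the optimal coupling is known to pair sorted samples, so the distance reduces to a sort-and-average. This standard special case is sketched below; the sample values are illustrative.

```python
import numpy as np

def wasserstein_1d(x, y):
    """W_1 between two equal-size 1-D empirical distributions.
    On the real line with cost |x - y|, the infimum over couplings
    is attained by the monotone (sorted) matching."""
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    return np.abs(x - y).mean()

clean = np.array([0.0, 1.0, 2.0])  # features from a normal-domain sample
adv   = np.array([0.5, 1.5, 2.5])  # features from an adversarial-domain sample
d = wasserstein_1d(clean, adv)
```

For higher-dimensional feature distributions the patent's distance would need a general optimal-transport solver rather than this 1-D shortcut.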
6. The robust feature learning method based on hierarchical feature alignment as claimed in claim 5, wherein: in step (4), specifically, the feature representations extracted hierarchically from the normal-domain and adversarial-domain image samples are processed with graph convolutions, the differences between them are computed with the Wasserstein distance, and the adversarial-sample and normal-sample feature representations at the different layers are added into the final loss function used to optimize the network parameters; through sufficient end-to-end training, feature alignment gradually drives the network model to learn more robust feature representations;
the final loss function is shown in the following equation:
$$\min_{\theta} \; L_{CE}\big(F(x_{clean}; \theta), y_{true}\big) + L_{CE}\big(F(x_{adv}; \theta), y_{true}\big) + \lambda \sum_{l} W_c\Big(LC\big(f_l^{clean}\big), LC\big(f_l^{adv}\big)\Big)$$
where F denotes the deep neural network for image classification and θ denotes the parameters of the deep neural network, learned during end-to-end training; $L_{CE}$ denotes the cross-entropy loss function, computed for both the normal sample and the corresponding adversarial sample so that the network can successfully classify both; $x_{clean}$ denotes a normal sample, $x_{adv}$ denotes an adversarial sample, and $y_{true}$ denotes the correct label of the data; $f_l^{clean}$ and $f_l^{adv}$ denote the image feature representations extracted from the normal sample and the adversarial sample at the l-th layer of the deep neural network F, with l = 1, 2 or l = 1, 2, 3, 4; LC denotes a linear combination of features; λ denotes the relative weight among the multiple loss terms. When the model is trained on the training set, the final loss function computes the classification error and the differences between the sample features of the different domains, and the model parameters of the network are then optimized with a stochastic gradient descent algorithm according to these errors until the optimal model parameters are found.
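The loss of claim 6, cross-entropy on both samples plus λ times the summed per-layer feature distances, can be sketched as follows; the weight lam=0.1, the flattening of feature maps to 1-D for the Wasserstein term, and the tiny two-class example are illustrative assumptions, not values fixed by the patent.

```python
import numpy as np

def cross_entropy(logits, label):
    """Softmax cross-entropy for a single example (numerically stable)."""
    z = logits - logits.max()
    return -(z[label] - np.log(np.exp(z).sum()))

def wasserstein_1d(x, y):
    """1-D Wasserstein distance via the sorted-matching shortcut."""
    x, y = np.sort(np.ravel(x)), np.sort(np.ravel(y))
    return np.abs(x - y).mean()

def total_loss(logits_clean, logits_adv, label,
               feats_clean, feats_adv, lam=0.1):
    """CE on the clean and the adversarial sample, plus lambda times the
    summed layer-wise Wasserstein differences between their features."""
    ce = cross_entropy(logits_clean, label) + cross_entropy(logits_adv, label)
    align = sum(wasserstein_1d(fc, fa)
                for fc, fa in zip(feats_clean, feats_adv))
    return ce + lam * align

logits_c = np.array([3.0, 0.0])      # network output on the clean sample
logits_a = np.array([1.0, 0.5])      # network output on the adversarial sample
feats_c  = [np.zeros(4)]             # one layer of clean features
feats_a  = [np.ones(4)]              # the same layer of adversarial features
loss = total_loss(logits_c, logits_a, label=0,
                  feats_clean=feats_c, feats_adv=feats_a)
```

In the patented method this scalar would be minimized over θ with stochastic gradient descent; here only the forward computation of the loss value is shown.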
CN202010809932.6A 2020-08-12 2020-08-12 Robust feature learning method based on layered feature alignment Active CN111950635B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010809932.6A CN111950635B (en) 2020-08-12 2020-08-12 Robust feature learning method based on layered feature alignment


Publications (2)

Publication Number Publication Date
CN111950635A true CN111950635A (en) 2020-11-17
CN111950635B CN111950635B (en) 2023-08-25

Family

ID=73331806

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010809932.6A Active CN111950635B (en) 2020-08-12 2020-08-12 Robust feature learning method based on layered feature alignment

Country Status (1)

Country Link
CN (1) CN111950635B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112465019A (en) * 2020-11-26 2021-03-09 重庆邮电大学 Countermeasure sample generation and countermeasure defense method based on disturbance
CN113436073A (en) * 2021-06-29 2021-09-24 中山大学 Real image super-resolution robust method and device based on frequency domain

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018028255A1 (en) * 2016-08-11 2018-02-15 深圳市未来媒体技术研究院 Image saliency detection method based on adversarial network
US20190128989A1 (en) * 2017-11-01 2019-05-02 Siemens Healthcare Gmbh Motion artifact reduction of magnetic resonance images with an adversarial trained network
US20190303720A1 (en) * 2018-03-30 2019-10-03 Arizona Board Of Regents On Behalf Of Arizona State University Systems and methods for feature transformation, correction and regeneration for robust sensing, transmission, computer vision, recognition and classification
CN110674866A (en) * 2019-09-23 2020-01-10 兰州理工大学 Method for detecting X-ray breast lesion images by using transfer learning characteristic pyramid network
CN110728219A (en) * 2019-09-29 2020-01-24 天津大学 3D face generation method based on multi-column multi-scale graph convolution neural network
CN110738622A (en) * 2019-10-17 2020-01-31 温州大学 Lightweight neural network single image defogging method based on multi-scale convolution
CN111126258A (en) * 2019-12-23 2020-05-08 深圳市华尊科技股份有限公司 Image recognition method and related device
CN111178504A (en) * 2019-12-17 2020-05-19 西安电子科技大学 Information processing method and system of robust compression model based on deep neural network
CN111242227A (en) * 2020-01-16 2020-06-05 天津师范大学 Multi-modal foundation cloud identification method based on heterogeneous depth features
US20200234110A1 (en) * 2019-01-22 2020-07-23 Adobe Inc. Generating trained neural networks with increased robustness against adversarial attacks
CN111476200A (en) * 2020-04-27 2020-07-31 华东师范大学 Face de-identification generation method based on generation of confrontation network


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Fan Baojie, et al.: "Multi-channel image inpainting method based on convolutional adversarial networks", Computer Applications and Software (《计算机应用与软件》), vol. 37, no. 7, pages 176-179 *


Also Published As

Publication number Publication date
CN111950635B (en) 2023-08-25

Similar Documents

Publication Publication Date Title
CN111275115B (en) Method for generating counterattack sample based on generation counternetwork
CN110941794B (en) Challenge attack defense method based on general inverse disturbance defense matrix
CN113554089B (en) Image classification countermeasure sample defense method and system and data processing terminal
CN111461307A (en) General disturbance generation method based on generation countermeasure network
CN112396129B (en) Challenge sample detection method and universal challenge attack defense system
CN107844743A (en) A kind of image multi-subtitle automatic generation method based on multiple dimensioned layering residual error network
CN113283599B (en) Attack resistance defense method based on neuron activation rate
CN111967006A (en) Adaptive black box anti-attack method based on neural network model
CN113627543B (en) Anti-attack detection method
CN115860112B (en) Model inversion method-based countermeasure sample defense method and equipment
CN111950635A (en) Robust feature learning method based on hierarchical feature alignment
Suzuki et al. Adversarial example generation using evolutionary multi-objective optimization
CN116250020A (en) Detecting an antagonism example using a potential neighborhood graph
CN114419413A (en) Method for constructing sensing field self-adaptive transformer substation insulator defect detection neural network
CN111178504B (en) Information processing method and system of robust compression model based on deep neural network
CN113033822A (en) Antagonistic attack and defense method and system based on prediction correction and random step length optimization
CN115239760B (en) Target tracking method, system, equipment and storage medium
CN114387449A (en) Image processing method and system for coping with adversarial attack of neural network
CN116912568A (en) Noise-containing label image recognition method based on self-adaptive class equalization
CN112487933B (en) Radar waveform identification method and system based on automatic deep learning
CN116109649A (en) 3D point cloud instance segmentation method based on semantic error correction
CN113177599B (en) Reinforced sample generation method based on GAN
CN115062306A (en) Black box anti-attack method for malicious code detection system
CN113487506A (en) Countermeasure sample defense method, device and system based on attention denoising
CN115510986A (en) Countermeasure sample generation method based on AdvGAN

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant