CN113411329A - DAGMM-based federated learning backdoor attack defense method - Google Patents

DAGMM-based federated learning backdoor attack defense method

Info

Publication number
CN113411329A
CN113411329A
Authority
CN
China
Prior art keywords
model
dagmm
client
local
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110675081.5A
Other languages
Chinese (zh)
Other versions
CN113411329B (en)
Inventor
陈晋音
刘涛
张龙源
李荣昌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN202110675081.5A priority Critical patent/CN113411329B/en
Publication of CN113411329A publication Critical patent/CN113411329A/en
Application granted granted Critical
Publication of CN113411329B publication Critical patent/CN113411329B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/14Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
    • H04L63/1408Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic by monitoring network traffic
    • H04L63/1416Event detection, e.g. attack signature detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/142Network analysis or design using statistical or mathematical methods
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/14Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
    • H04L63/1441Countermeasures against malicious traffic
    • H04L63/1466Active attacks involving interception, injection, modification, spoofing of data unit addresses, e.g. hijacking, packet injection or TCP sequence number attacks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/20Network architectures or network communication protocols for network security for managing network security; network security policies in general

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Pure & Applied Mathematics (AREA)
  • Medical Informatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Probability & Statistics with Applications (AREA)
  • Algebra (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention discloses a DAGMM-based federated learning backdoor attack defense method, which comprises the following steps: (1) the client receives the global model, trains it, and uploads the local model and the corresponding neuron activation condition; (2) the server receives the updates and uses the DAGMM to calculate the loss of the corresponding client; (3) defense based on multiple rounds of reconstruction errors. The method effectively protects the global model from backdoor attacks.

Description

DAGMM-based federated learning backdoor attack defense method
Technical Field
The invention relates to the technical field of backdoor attack defense, in particular to a DAGMM-based federated learning backdoor attack defense method.
Background
Federated learning has been proposed to facilitate joint model training on data from multiple clients, with the training process coordinated by a central server. Throughout this process each client's data remains local; only model parameters are exchanged between the clients and the parameter server.
A typical training iteration works as follows: first, the central server sends the latest global model to each client. Each client then updates the model locally using the local data and reports the updated model to the parameter server. Finally, the server performs model aggregation on all submitted local updates to form a new global model that has better performance than models trained using data of any single client.
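For illustration only, one such training iteration can be sketched in Python as follows; the client objects and the local_train routine are hypothetical stand-ins, not part of the claimed method:

import numpy as np

def federated_round(global_model, clients, local_train, eta=1.0):
    # One training iteration: broadcast, local training, aggregation of updates.
    updates = []
    for client in clients:
        local_model = local_train(global_model.copy(), client)  # client trains locally
        updates.append(local_model - global_model)               # reported model update
    # the server averages all submitted updates with its own learning rate eta
    return global_model + eta * np.mean(updates, axis=0)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clients = [rng.normal(size=4) for _ in range(3)]             # stand-ins for client data
    print(federated_round(np.zeros(4), clients, lambda m, c: m + 0.1 * c))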
In contrast to the alternative of simply collecting all client data and training a model centrally, federated learning saves communication overhead by transmitting only model parameters and protects privacy because all data remains local. Federated learning has therefore attracted wide attention and is widely used for model training with data from multiple users and organizations.
However, federated learning systems are vulnerable to malicious clients. The central server has no access to the clients' data and therefore cannot verify the model updates coming from them, especially when the system adds a secure aggregation protocol to further protect client privacy. In theory, a malicious client can send arbitrary updates to the server, and without effective protection that identifies malicious updates, the learned network weights are easily corrupted.
Backdoor attacks are among the most common attacks in federated learning: an attacker can modify or spoof a classifier so that it assigns a label of the attacker's choice to samples bearing a particular characteristic. Backdoor attacks typically rely on "backdoor neurons" that are activated only when a backdoor sample is present. Research shows that the activation behavior of normal-model neurons differs greatly from that of backdoor-model neurons, and that backdoor attacks can be largely mitigated by pruning the backdoor neurons without damaging too much model performance. However, this pruning approach relies on a reliable "clean" data source, which is not guaranteed in the federated learning scenario.
The invention provides a backdoor attack defense method for horizontal federated learning based on the deep autoencoding Gaussian mixture model (DAGMM), i.e. an anomaly detection mechanism built on the DAGMM. The method performs anomaly detection on client updates based on the difference between backdoor models and normal models; it does not need access to the clients' raw data and only requires the neuron activation conditions of the local models. During federated training, the central server requires each client to provide its neuron activation conditions, puts them together with the updates into the DAGMM, and detects backdoor models so as to defend against backdoor attacks.
Disclosure of Invention
The invention aims to provide a DAGMM-based federated learning backdoor attack defense method to protect a model from backdoor attacks.
A DAGMM-based federated learning backdoor attack defense method comprises the following steps:
(1) the client receives the global model, trains and uploads the local model and the corresponding neuron activation condition;
(2) the server receives the update and calculates the loss of the corresponding client by using the DAGMM;
(3) defense based on multiple rounds of reconstruction errors.
The technical conception of the invention is as follows: backdoor input samples trigger neurons that are not normally activated by clean input samples. These so-called "backdoor neurons" are exploited by attackers to recognize the backdoor pattern and trigger the intended misbehavior, while remaining silent when the input data is clean. At the neuron level, the backdoor model therefore differs noticeably from a normal model.
Based on this observation, a deep autoencoding Gaussian mixture model (DAGMM) is combined with neuron activation information to defend against federated learning backdoor attacks. First, each client is required to upload its model's neuron activation condition together with the updated model. The updates of all clients are put into the DAGMM, the reconstruction probability of every client's update is estimated jointly, and abnormal clients are screened out. Second, the anomalies recorded by the DAGMM in each round are counted, and clients marked many times are screened out. Finally, the aggregated global model is sent to each client, while no global model is issued to clients identified as attackers.
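The overall server-side flow can be summarized by the following illustrative Python sketch; the callable dagmm_energy, the threshold and the flag bookkeeping are placeholders for the components detailed in the embodiments below, not a definitive implementation:

def defense_round(global_model, client_packages, dagmm_energy, threshold, flags):
    # One server round of the DAGMM-based defense (sketch).
    # client_packages: list of (client_id, local_model, activation_ranking)
    # dagmm_energy:    callable returning a reconstruction-energy score per update
    # flags:           dict counting how often each client has been marked anomalous
    accepted = []
    for cid, model, ranking in client_packages:
        if dagmm_energy(model, ranking) > threshold:   # large energy -> likely backdoor
            flags[cid] = flags.get(cid, 0) + 1         # mark, verify over several rounds
        accepted.append(model)
    new_global = sum(accepted) / len(accepted)         # simplified averaging step
    return new_global, flags

After enough rounds, clients whose mark count exceeds a chosen limit would be excluded from aggregation and would no longer receive the global model, as described above.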
Preferably, step (1) comprises:
(1.1) the server retrieves the global model and distributes it to each client;
(1.2) after receiving the global model, the client trains it with its local data to obtain the local model of the current round, and then ranks all neurons according to each neuron's activation;
(1.3) the model and the activation ranking are packaged and sent to the server.
Further preferably, the global model is obtained through federated learning training and, after aggregating the distributed training results from the N parties, should generalize well to the test data. The training objective of federated learning is summarized as the finite-sum optimization:

min_{w ∈ R^d} f(w) = Σ_{i=1}^{N} (a_i / a) f_i(w), with a = Σ_{i=1}^{N} a_i,

where N is the number of parties, each handling its own local model w; party i trains on a private dataset D_i = {(x_j^i, y_j^i)}_{j=1}^{a_i} with the local objective f_i : R^d → R, where a_i = |D_i| and (x_j^i, y_j^i) denotes a data sample together with its corresponding label.
Further preferably, the global model aggregates the distributed training results from the N parties so as to generalize to the test data, specifically:

in the t-th round, the central server sends the current shared model G^t to the N selected parties, where [N] represents the integer set {1, 2, ..., N}; the selected party i uses its own dataset D_i and the learning rate lr, running the optimization algorithm for E local epochs, to locally optimize the function f_i and obtain a new local model L_i^{t+1}. The client then sends the model update L_i^{t+1} − G^t to the central server, which averages all updates with its own learning rate η to generate the new global model G^{t+1}:

G^{t+1} = G^t + (η / N) Σ_{i=1}^{N} (L_i^{t+1} − G^t)
Preferably, step (2) comprises:
(2.1) for the updated local model matrix, all rows of the matrix are first concatenated to create a one-dimensional vector, which is then fed to the autoencoder of the DAGMM;
(2.2) the neuron activation ranking matrix is compressed into a one-dimensional vector, the standard deviation of this input vector is computed, and this metric is stacked to create a new vector;
(2.3) the new vector is concatenated with the low-dimensional representation learned by the autoencoder to form the output concatenated vector, which is fed to the estimation network for multivariate Gaussian estimation to obtain the reconstruction energy.
Further preferably, the overall network structure of the DAGMM comprises a compression network and an estimation network;
the compression network is a deep autoencoding network; through it, the low-dimensional representation z_c of the input x is obtained, together with the reconstruction-error feature z_r between the input x and the reconstruction x' and the standard deviation z_s computed from the neuron activation matrix, and the three are concatenated to form z. The estimation network takes z as input and produces a probability distribution through several fully connected layers.
Further preferably, the compression network calculates its low dimensional representation z as follows:
z_c = h(x; θ_e), x' = g(z_c; θ_d),
z_r = f(x, x'),
z_s = σ(x*),
z = [z_c, z_r, z_s],

where z_c is the reduced low-dimensional representation learned by the deep autoencoder, z_r comprises the features derived from the reconstruction error, z_s is the standard deviation computed from the neuron activation matrix, x* denotes the sample's neuron activation matrix, θ_e and θ_d are the parameters of the encoder and decoder of the deep autoencoder (x' being the reconstruction of x), h(·) denotes the encoding function, g(·) the decoding function, f(·) the function computing the reconstruction-error features, and σ(·) the standard deviation function; finally, the compression network feeds z to the subsequent estimation network.
Preferably, step (3) comprises:
(3.1) the central server records the reconstruction loss of each client in the first several rounds and marks abnormal clients, without taking further action;
(3.2) after enough rounds have been recorded, the number of times each client has been marked is counted, and clients marked many times are screened out;
(3.3) steps (3.1) and (3.2) are repeated and screening continues until no abnormal updates remain.
The invention has the beneficial effects that:
(1) Backdoor model detection is performed with the DAGMM, protecting the global model and improving robustness.
(2) During federated learning, the neuron activation condition is tied to the anomaly detection, improving the efficiency of backdoor attack detection.
(3) Attackers are screened out in the early stage of training, so that even if a backdoor has already been injected into the global model, the backdoor features are erased by newly learned features as training progresses and cease to exist.
Drawings
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 is a block diagram of a DAGMM-based federated learning backdoor attack defense system of the method of the present invention;
FIG. 3 is the overall network structure of the DAGMM.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to FIG. 1 to FIG. 3, a DAGMM-based federated learning backdoor attack defense method includes the following steps:
(1) The client receives the global model, trains it, and uploads the local model together with the corresponding neuron activation condition. The training objective of federated learning can be summarized as the finite-sum optimization:

min_{w ∈ R^d} f(w) = Σ_{i=1}^{N} (a_i / a) f_i(w), with a = Σ_{i=1}^{N} a_i,

where N is the number of parties, each handling its own local model w; party i trains on a private dataset D_i = {(x_j^i, y_j^i)}_{j=1}^{a_i} with the local objective f_i : R^d → R, where a_i = |D_i| and (x_j^i, y_j^i) denotes a data sample together with its corresponding label. The goal of federated learning is to obtain a global model that generalizes well to test data after aggregating the distributed training results from the N parties.
Specifically, in the t-th round, the central server sends the current shared model G^t to the N selected parties, where [N] represents the integer set {1, 2, ..., N}. The selected party i uses its own dataset D_i and the learning rate lr, running the optimization algorithm for E local epochs, to locally optimize the function f_i and obtain a new local model L_i^{t+1}. The client then sends the model update L_i^{t+1} − G^t to the central server, which averages all updates with its own learning rate η to generate the new global model G^{t+1}:

G^{t+1} = G^t + (η / N) Σ_{i=1}^{N} (L_i^{t+1} − G^t)
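For illustration, the aggregation rule above can be sketched in Python as follows (a toy numpy implementation; the vectors stand in for flattened model parameters):

import numpy as np

def aggregate(global_model, local_models, eta):
    # G^{t+1} = G^t + (eta / N) * sum_i (L_i^{t+1} - G^t)
    updates = [lm - global_model for lm in local_models]
    return global_model + (eta / len(local_models)) * np.sum(updates, axis=0)

if __name__ == "__main__":
    g = np.array([0.0, 0.0])
    locals_ = [np.array([1.0, 0.0]), np.array([0.0, 1.0])]
    print(aggregate(g, locals_, eta=1.0))   # -> [0.5 0.5]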
For an attacker, the backdoor attack aims to mislead the trained model into predicting the target label τ on any input that embeds the pattern (i.e., the trigger) chosen by the attacker. The goal of a backdoor attack in federated learning is to manipulate a local model so that it fits the main task and the backdoor task simultaneously: the global model behaves normally on untampered data samples while achieving a high attack success rate on backdoored samples. Attacker i, holding local data D_i, pursues the following objective in round t for the target label τ:

w_i^* = argmax_{w_i} [ Σ_{j ∈ D_i^{bd}} P( G^{t+1}( R(x_j^i, φ) ) = τ ) + Σ_{j ∈ D_i^{cln}} P( G^{t+1}( x_j^i ) = y_j^i ) ],

where x_j^i is a data sample, y_j^i is the corresponding true label of the sample, and the backdoor-triggered data D_i^{bd} and the clean data D_i^{cln} satisfy D_i^{bd} ∪ D_i^{cln} = D_i and D_i^{bd} ∩ D_i^{cln} = ∅. The function P is the corresponding training optimization function, and the function R uses a set of parameters φ to convert clean data of any class into backdoor data carrying the trigger pattern selected by the attacker. The normal model w_i is thus converted into the backdoor model w_i^* by maximizing the formula above.
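As a purely illustrative sketch of the attacker-side transformation R described above (the patch-style trigger, its parameters φ and the poisoning fraction are hypothetical, not taken from this description):

import numpy as np

def apply_trigger(x, phi):
    # R(x, phi): stamp a small pixel pattern onto a clean image sample.
    x = x.copy()
    r, c, size, value = phi            # trigger position, extent and intensity
    x[r:r + size, c:c + size] = value  # overwrite a small patch with the trigger
    return x

def poison(dataset, phi, tau, fraction=0.2, seed=0):
    # Relabel a fraction of samples to the attacker-chosen target label tau.
    rng = np.random.default_rng(seed)
    poisoned = []
    for x, y in dataset:
        if rng.random() < fraction:
            poisoned.append((apply_trigger(x, phi), tau))   # backdoored sample
        else:
            poisoned.append((x, y))                         # clean sample kept
    return poisoned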
Therefore, the algorithm steps are as follows:
(1.1) the server retrieves the global model and distributes it to each client;
(1.2) after receiving the global model, the client trains it with its local data to obtain the local model of the current round, and then ranks all neurons according to each neuron's activation (a sketch of this ranking step follows below);
(1.3) the model and the activation ranking are packaged and sent to the server.
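A minimal sketch of how a client might derive the activation ranking packaged in step (1.3); the single fully connected layer, the ReLU activation and the averaging over a local batch are assumptions made only for illustration:

import numpy as np

def activation_ranking(weight, bias, local_batch):
    # Rank the neurons of one fully connected layer by mean activation.
    pre = local_batch @ weight + bias          # (batch, n_neurons) pre-activations
    act = np.maximum(pre, 0.0)                 # assumed ReLU activation
    mean_act = act.mean(axis=0)                # average activation per neuron
    return np.argsort(-mean_act)               # neuron indices, most active first

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W, b = rng.normal(size=(8, 16)), np.zeros(16)
    batch = rng.normal(size=(32, 8))           # stand-in for local client data
    print(activation_ranking(W, b, batch))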
(2) The server receives the update and calculates the loss of the corresponding client by using the DAGMM;
the depth self-coding Gaussian mixture model (DAGMM) is an organic combination of a neural network, EM and GMM, a low-dimensional representation and reconstruction error are generated for each input data point by using a depth automatic encoder, and the low-dimensional representation and the reconstruction error are further input into the Gaussian Mixture Model (GMM) for anomaly detection. FIG. 3 is an overall network structure of DAGMM, which is divided into two sub-structures, the left part is compressed, and is a depth self-coding network, and a low-dimensional representation Zc of an input x can be obtained through the self-coding, and meanwhile, a reconstruction error characteristic Zr between the input x and a reconstructed x' and a standard deviation Zs calculated by a neuron activation matrix are obtained, and the three are spliced to form Z; the estimation network is arranged on the right side, the neural network is also a multilayer neural network, the input is Z, the probability distribution is obtained through multilayer full connection, and the length of the probability distribution is the number of the sub-distributions in the Gaussian mixture distribution.
The compression network is a deep autoencoding network, and the low-dimensional representation it provides combines three feature sources: (1) the reduced low-dimensional representation learned by the deep autoencoder; (2) features derived from the reconstruction error; (3) the standard deviation of the neuron activation matrix contributed by the client. Given a sample x and its neuron activation matrix x*, the compression network computes the low-dimensional representation z as follows.
z_c = h(x; θ_e), x' = g(z_c; θ_d),
z_r = f(x, x'),
z_s = σ(x*),
z = [z_c, z_r, z_s],

where z_c is the reduced low-dimensional representation learned by the deep autoencoder, z_r comprises the features derived from the reconstruction error, z_s is the standard deviation computed from the neuron activation matrix, x* denotes the sample's neuron activation matrix, θ_e and θ_d are the parameters of the encoder and decoder of the deep autoencoder (x' being the reconstruction of x), h(·) denotes the encoding function, g(·) the decoding function, f(·) the function computing the reconstruction-error features, and σ(·) the standard deviation function. Finally, the compression network feeds z to the subsequent estimation network.
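By way of illustration only, the compression step can be sketched in numpy as follows, assuming a one-layer encoder/decoder and a relative-Euclidean reconstruction error; the actual network depth and the choice of f(·) are design choices not fixed by this description:

import numpy as np

def compress(x, x_act, W_e, W_d):
    # Compute z = [z_c, z_r, z_s] for one flattened model update x.
    z_c = np.tanh(W_e @ x)                      # h(x; theta_e): low-dimensional code
    x_rec = W_d @ z_c                           # g(z_c; theta_d): reconstruction x'
    z_r = np.linalg.norm(x - x_rec) / (np.linalg.norm(x) + 1e-12)  # f(x, x')
    z_s = x_act.std()                           # sigma(x*): std of activation matrix
    return np.concatenate([z_c, [z_r, z_s]])

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    d, k = 64, 3
    W_e = rng.normal(size=(k, d)) / np.sqrt(d)
    W_d = rng.normal(size=(d, k)) / np.sqrt(k)
    x = rng.normal(size=d)                      # flattened local-model update
    x_act = rng.normal(size=(10, 16))           # stand-in neuron activation matrix
    print(compress(x, x_act, W_e, W_d))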
Given the low-dimensional representation z of the input samples, the estimation network performs density estimation under the GMM framework. In the training phase, with unknown mixture-component distribution φ, mixture means μ and mixture covariances Σ, the estimation network estimates the parameters of the GMM and evaluates the likelihood/energy of the samples. It does so by using a multi-layer neural network to predict the mixture membership of each sample. Given the low-dimensional representation z and an integer K as the number of mixture components, the estimation network performs membership prediction as follows.
p = MLN(z; θ_m),
γ̂ = softmax(p),

where γ̂ is a K-dimensional vector of membership predictions over the mixture components, p is the output of the multi-layer network parameterized by θ_m, and MLN denotes that multi-layer network. Given a batch of n samples and their membership predictions γ̂_1, ..., γ̂_n, the parameters of the GMM can be further estimated as follows:

φ̂_k = (1/n) Σ_{i=1}^{n} γ̂_{ik},
μ̂_k = Σ_{i=1}^{n} γ̂_{ik} z_i / Σ_{i=1}^{n} γ̂_{ik},
Σ̂_k = Σ_{i=1}^{n} γ̂_{ik} (z_i − μ̂_k)(z_i − μ̂_k)^T / Σ_{i=1}^{n} γ̂_{ik},

where γ̂_i is the membership prediction of the low-dimensional representation z_i, and φ̂_k, μ̂_k and Σ̂_k denote respectively the mixture probability, mean and covariance of the k-th component of the GMM.
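For illustration, the batch estimates above can be written compactly in numpy; this sketch treats the membership matrix gamma (n × K) and the representations z as given:

import numpy as np

def gmm_parameters(z, gamma):
    # Estimate mixture weights, means and covariances from soft memberships.
    # z:     (n, d) low-dimensional representations
    # gamma: (n, K) softmax membership predictions from the estimation network
    n_k = gamma.sum(axis=0)                              # effective count per component
    phi = n_k / len(z)                                   # mixture probabilities
    mu = (gamma.T @ z) / n_k[:, None]                    # component means
    cov = []
    for k in range(gamma.shape[1]):
        diff = z - mu[k]                                 # (n, d) centered representations
        cov.append((gamma[:, k, None] * diff).T @ diff / n_k[k])
    return phi, mu, np.stack(cov)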
With the estimated parameters, the sample energy can be further inferred as:

E(z) = −log( Σ_{k=1}^{K} φ̂_k · exp( −(1/2) (z − μ̂_k)^T Σ̂_k^{−1} (z − μ̂_k) ) / √| 2π Σ̂_k | ),

where | · | denotes the determinant of a matrix, z is the given low-dimensional representation, and φ̂_k, μ̂_k and Σ̂_k denote respectively the probability, mean and covariance of the k-th distribution in the GMM.
In anomaly detection, the value of E(z) is computed by the model. In theory, the smaller this value the better (a negative sign precedes the likelihood), so the larger E(z) is, the more likely the corresponding client is an anomalous attacker. Whether an update contributed by a client is backdoored can therefore be judged against a prior threshold obtained from the training-set data.
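A small numpy sketch of the sample energy E(z) and the threshold decision described above; the diagonal regularizer added to each covariance is an assumption for numerical stability, and the threshold is a hypothetical prior value:

import numpy as np

def sample_energy(z, phi, mu, cov, eps=1e-6):
    # E(z) = -log sum_k phi_k N(z | mu_k, cov_k); larger means more anomalous.
    d = len(z)
    total = 0.0
    for k in range(len(phi)):
        c = cov[k] + eps * np.eye(d)                     # regularized covariance
        diff = z - mu[k]
        expo = -0.5 * diff @ np.linalg.solve(c, diff)
        norm = np.sqrt(np.linalg.det(2 * np.pi * c))
        total += phi[k] * np.exp(expo) / norm
    return -np.log(total + 1e-12)

def is_backdoor(z, params, threshold):
    # Flag a client update whose energy exceeds the prior threshold.
    return sample_energy(z, *params) > threshold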
Therefore, the algorithm steps are as follows:
(2.1) for the updated local model matrix, all rows of the matrix are first concatenated to create a one-dimensional vector, which is then fed to the autoencoder (compression network) of the DAGMM;
(2.2) the neuron activation ranking matrix is then compressed into a one-dimensional vector, the standard deviation of this input vector is computed, and this metric is stacked to create a new vector;
(2.3) finally, the new vector is concatenated with the low-dimensional representation learned by the autoencoder (compression network) to form the output concatenated vector, which is fed to the estimation network for multivariate Gaussian estimation to obtain the reconstruction energy.
(3) Defense based on multiple rounds of reconstruction errors: the central server marks the clients detected as abnormal in each round and screens them out only after verification over multiple rounds (a sketch follows after the steps below). The specific process is as follows:
(3.1) in the first several rounds, the central server records the reconstruction loss of each client and marks abnormal clients, but takes no further action, because a detection result from a single round may be coincidental and must be verified over multiple rounds. Backdoor attacks also have limited persistence: even if a backdoor has already been injected into the global model, once the backdoor attacker is screened out and stops injecting it, the original backdoor features are erased by newly learned features as training proceeds, and the attack becomes invalid;
(3.2) after enough rounds have been recorded, the number of times each client has been marked is counted, and clients that have been marked many times are screened out;
(3.3) the above steps are repeated and screening continues until no abnormal updates remain.
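A minimal sketch of the multi-round screening bookkeeping (the observation window and the mark-count limit are hypothetical parameters, not fixed by this description):

from collections import Counter

def screen_clients(round_marks, window=10, max_marks=3):
    # round_marks: list (one entry per round) of sets of client ids marked anomalous.
    # Returns the ids to exclude once at least `window` rounds have been recorded.
    if len(round_marks) < window:
        return set()                         # not enough rounds yet: only record
    counts = Counter(cid for marks in round_marks[-window:] for cid in marks)
    return {cid for cid, n in counts.items() if n >= max_marks}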
Although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that various changes in the embodiments and/or modifications of the invention can be made, and equivalents and modifications of some features of the invention can be made without departing from the spirit and scope of the invention.

Claims (8)

1. A DAGMM-based federated learning backdoor attack defense method is characterized by comprising the following steps:
(1) the client receives the global model, trains and uploads the local model and the corresponding neuron activation condition;
(2) the server receives the update and calculates the loss of the corresponding client by using the DAGMM;
(3) defense based on multiple rounds of reconstruction errors.
2. The DAGMM-based federated learning backdoor attack defense method according to claim 1, wherein step (1) includes:
(1.1) the server retrieves the global model and distributes it to each client;
(1.2) after receiving the global model, the client trains it with its local data to obtain the local model of the current round, and then ranks all neurons according to each neuron's activation;
(1.3) the model and the activation ranking are packaged and sent to the server.
3. The DAGMM-based federated learning backdoor attack defense method according to claim 1 or 2, wherein the global model is obtained through federated learning training and, after aggregating the distributed training results from the N parties, generalizes to the test data; the training objective of federated learning is summarized as the finite-sum optimization:

min_{w ∈ R^d} f(w) = Σ_{i=1}^{N} (a_i / a) f_i(w), with a = Σ_{i=1}^{N} a_i,

where N is the number of parties, each handling its own local model w; party i trains on a private dataset D_i = {(x_j^i, y_j^i)}_{j=1}^{a_i} with the local objective f_i : R^d → R, where a_i = |D_i| and (x_j^i, y_j^i) denotes a data sample together with its corresponding label.
4. The DAGMM-based federated learning backdoor attack defense method according to claim 3, wherein the global model aggregates the distributed training results from the N parties so as to generalize to test data, specifically:
in the t-th round, the central server sends the current shared model G^t to the N selected parties, where [N] represents the integer set {1, 2, ..., N}; the selected party i uses its own dataset D_i and the learning rate lr, running the optimization algorithm for E local epochs, to locally optimize the function f_i and obtain a new local model L_i^{t+1}; the client then sends the model update L_i^{t+1} − G^t to the central server, which averages all updates with its own learning rate η to generate the new global model G^{t+1}:

G^{t+1} = G^t + (η / N) Σ_{i=1}^{N} (L_i^{t+1} − G^t)
5. The DAGMM-based federated learning backdoor attack defense method according to claim 1, wherein step (2) includes:
(2.1) for the updated local model matrix, all rows of the matrix are first concatenated to create a one-dimensional vector, which is then fed to the autoencoder of the DAGMM;
(2.2) the neuron activation ranking matrix is compressed into a one-dimensional vector, the standard deviation of this input vector is computed, and this metric is stacked to create a new vector;
(2.3) the new vector is concatenated with the low-dimensional representation learned by the autoencoder to form the output concatenated vector, which is fed to the estimation network for multivariate Gaussian estimation to obtain the reconstruction energy.
6. The DAGMM-based federated learning backdoor attack defense method according to claim 1 or 5, wherein the overall network structure of the DAGMM comprises a compression network and an estimation network;
the compression network is a deep autoencoding network; through it, the low-dimensional representation z_c of the input x is obtained, together with the reconstruction-error feature z_r between the input x and the reconstruction x' and the standard deviation z_s computed from the neuron activation matrix, and the three are concatenated to form z; the estimation network takes z as input and produces a probability distribution through several fully connected layers.
7. The DAGMM-based federated learning backdoor attack defense method according to claim 6, wherein the compression network calculates its low-dimensional representation z as follows:
z_c = h(x; θ_e), x' = g(z_c; θ_d),
z_r = f(x, x'),
z_s = σ(x*),
z = [z_c, z_r, z_s],

where z_c is the reduced low-dimensional representation learned by the deep autoencoder, z_r comprises the features derived from the reconstruction error, z_s is the standard deviation computed from the neuron activation matrix, x* denotes the sample's neuron activation matrix, θ_e and θ_d are the parameters of the encoder and decoder of the deep autoencoder (x' being the reconstruction of x), h(·) denotes the encoding function, g(·) the decoding function, f(·) the function computing the reconstruction-error features, and σ(·) the standard deviation function; finally, the compression network feeds z to the subsequent estimation network.
8. The DAGMM-based federated learning backdoor attack defense method according to claim 1, wherein step (3) includes:
(3.1) the central server records the reconstruction loss of each client in the first several rounds and marks abnormal clients, without taking further action;
(3.2) after enough rounds have been recorded, the number of times each client has been marked is counted, and clients marked many times are screened out;
(3.3) steps (3.1) and (3.2) are repeated and screening continues until no abnormal updates remain.
CN202110675081.5A 2021-06-17 2021-06-17 Federal learning backdoor attack defense method based on DAGMM Active CN113411329B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110675081.5A CN113411329B (en) 2021-06-17 2021-06-17 Federal learning backdoor attack defense method based on DAGMM

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110675081.5A CN113411329B (en) 2021-06-17 2021-06-17 Federal learning backdoor attack defense method based on DAGMM

Publications (2)

Publication Number Publication Date
CN113411329A true CN113411329A (en) 2021-09-17
CN113411329B CN113411329B (en) 2022-06-28

Family

ID=77685001

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110675081.5A Active CN113411329B (en) 2021-06-17 2021-06-17 Federal learning backdoor attack defense method based on DAGMM

Country Status (1)

Country Link
CN (1) CN113411329B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113965359A (en) * 2021-09-29 2022-01-21 哈尔滨工业大学(深圳) Defense method and device for federal learning data virus attack
CN113962322A (en) * 2021-11-01 2022-01-21 浙江大学 Federal learning-based backdoor attack defense method and system and storable medium
CN114202397A (en) * 2022-02-17 2022-03-18 浙江君同智能科技有限责任公司 Longitudinal federal learning backdoor defense method based on neuron activation value clustering
CN114548428A (en) * 2022-04-18 2022-05-27 杭州海康威视数字技术股份有限公司 Intelligent attack detection method and device of federated learning model based on instance reconstruction
CN115134114A (en) * 2022-05-23 2022-09-30 清华大学 Longitudinal federated learning attack defense method based on discrete confusion self-encoder
CN115146759A (en) * 2022-03-06 2022-10-04 西安电子科技大学 Plug-and-play pre-training model backdoor removal system, method, device and medium
CN115333825A (en) * 2022-08-10 2022-11-11 浙江工业大学 Defense method aiming at gradient attack of federal learning neurons

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020229684A1 (en) * 2019-05-16 2020-11-19 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concepts for federated learning, client classification and training data similarity measurement
US20200412743A1 (en) * 2019-06-25 2020-12-31 International Business Machines Corporation Detection of an adversarial backdoor attack on a trained model at inference time
CN112231756A (en) * 2020-10-29 2021-01-15 湖南科技学院 FL-EM-GMM medical user privacy protection method and system
CN112329009A (en) * 2020-10-12 2021-02-05 南京理工大学 Defense method for noise attack in joint learning
CN112365005A (en) * 2020-12-11 2021-02-12 浙江工业大学 Neuron distribution characteristic-based federal learning poisoning detection method
CN112434758A (en) * 2020-12-17 2021-03-02 浙江工业大学 Cluster-based federal learning casual vehicle attack defense method
CN112464290A (en) * 2020-12-17 2021-03-09 浙江工业大学 Vertical federal learning defense method based on self-encoder
US20210081708A1 (en) * 2019-09-16 2021-03-18 International Business Machines Corporation Automatically Determining Whether an Activation Cluster Contains Poisonous Data

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020229684A1 (en) * 2019-05-16 2020-11-19 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Concepts for federated learning, client classification and training data similarity measurement
US20200412743A1 (en) * 2019-06-25 2020-12-31 International Business Machines Corporation Detection of an adversarial backdoor attack on a trained model at inference time
US20210081708A1 (en) * 2019-09-16 2021-03-18 International Business Machines Corporation Automatically Determining Whether an Activation Cluster Contains Poisonous Data
CN112329009A (en) * 2020-10-12 2021-02-05 南京理工大学 Defense method for noise attack in joint learning
CN112231756A (en) * 2020-10-29 2021-01-15 湖南科技学院 FL-EM-GMM medical user privacy protection method and system
CN112365005A (en) * 2020-12-11 2021-02-12 浙江工业大学 Neuron distribution characteristic-based federal learning poisoning detection method
CN112434758A (en) * 2020-12-17 2021-03-02 浙江工业大学 Cluster-based federal learning casual vehicle attack defense method
CN112464290A (en) * 2020-12-17 2021-03-09 浙江工业大学 Vertical federal learning defense method based on self-encoder

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
ANTRN: "[Translation] How to Backdoor Federated Learning", HTTPS://BLOG.CSDN.NET/QQ_38232598/ARTICLE/DETAILS/90511818, 24 May 2019 (2019-05-24)
王蓉 et al.: "Intrusion detection method based on federated learning and convolutional neural networks" (基于联邦学习和卷积神经网络的入侵检测方法), 《信息网络安全》, no. 04, 10 April 2020 (2020-04-10)
陈晋音 et al.: "Survey of poisoning attacks and defenses on deep learning models" (深度学习模型的中毒攻击与防御综述), 《信息安全学报》, no. 04, 15 July 2020 (2020-07-15)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113965359A (en) * 2021-09-29 2022-01-21 哈尔滨工业大学(深圳) Defense method and device for federal learning data virus attack
CN113965359B (en) * 2021-09-29 2023-08-04 哈尔滨工业大学(深圳) Federal learning data poisoning attack-oriented defense method and device
CN113962322A (en) * 2021-11-01 2022-01-21 浙江大学 Federal learning-based backdoor attack defense method and system and storable medium
CN114202397A (en) * 2022-02-17 2022-03-18 浙江君同智能科技有限责任公司 Longitudinal federal learning backdoor defense method based on neuron activation value clustering
CN114202397B (en) * 2022-02-17 2022-05-10 浙江君同智能科技有限责任公司 Longitudinal federal learning backdoor defense method based on neuron activation value clustering
CN115146759A (en) * 2022-03-06 2022-10-04 西安电子科技大学 Plug-and-play pre-training model backdoor removal system, method, device and medium
CN114548428A (en) * 2022-04-18 2022-05-27 杭州海康威视数字技术股份有限公司 Intelligent attack detection method and device of federated learning model based on instance reconstruction
CN115134114A (en) * 2022-05-23 2022-09-30 清华大学 Longitudinal federated learning attack defense method based on discrete confusion self-encoder
CN115134114B (en) * 2022-05-23 2023-05-02 清华大学 Longitudinal federal learning attack defense method based on discrete confusion self-encoder
CN115333825A (en) * 2022-08-10 2022-11-11 浙江工业大学 Defense method aiming at gradient attack of federal learning neurons
CN115333825B (en) * 2022-08-10 2024-04-09 浙江工业大学 Defense method for federal learning neuron gradient attack

Also Published As

Publication number Publication date
CN113411329B (en) 2022-06-28

Similar Documents

Publication Publication Date Title
CN113411329B (en) Federal learning backdoor attack defense method based on DAGMM
US11481622B2 (en) Continuous learning neural network system using rolling window
Kathareios et al. Catch it if you can: Real-time network anomaly detection with low false alarm rates
DE112021002259T5 (en) NETWORK INTRUSION DETECTION THROUGH DEEP LEARNING
US20230012220A1 (en) Method for determining likely malicious behavior based on abnormal behavior pattern comparison
CN112365005B (en) Federal learning poisoning detection method based on neuron distribution characteristics
CN109951462B (en) Application software flow anomaly detection system and method based on holographic modeling
CN112560059B (en) Vertical federal model stealing defense method based on neural pathway feature extraction
CN114189347B (en) Data safety transmission method combining data granulation and gatekeeper
CN116192523A (en) Industrial control abnormal flow monitoring method and system based on neural network
Tang et al. Low-rate DoS attack detection based on two-step cluster analysis and UTR analysis
CN113179244A (en) Federal deep network behavior feature modeling method for industrial internet boundary safety
Zhang et al. Many-objective optimization based intrusion detection for in-vehicle network security
CN114726634B (en) Knowledge graph-based hacking scene construction method and device
CN116467663A (en) Directed dynamic graph data anomaly detection method and system
CN108111539B (en) Network escape behavior detection method based on Bayesian classifier
Liu et al. Artificial Immunity-based Security Response Model for the Internet of Things.
CN114338853B (en) Block chain flow monitoring and detecting method under industrial internet
CN114912927A (en) Block chain anti-fraud analysis method and system
Chu et al. A machine learning classification model using random forest for detecting DDoS attacks
Callegari et al. Statistical approaches for network anomaly detection
CN112861913A (en) Intrusion alarm message correlation method based on graph convolution network
CN112311813B (en) Network attack identification method and device
CN112437085B (en) Network attack identification method and device
CN115758350B (en) Aggregation defense method and device for resisting poisoning attack and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant