CN113806768A - Lightweight federated learning privacy protection method based on decentralized security aggregation - Google Patents

Lightweight federated learning privacy protection method based on decentralized security aggregation

Info

Publication number
CN113806768A
Authority
CN
China
Prior art keywords
user
model
global
aggregation
random number
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110966055.8A
Other languages
Chinese (zh)
Inventor
沈蒙
顾艾婧
张�杰
王婧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Institute of Technology BIT
Original Assignee
Beijing Institute of Technology BIT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Institute of Technology BIT filed Critical Beijing Institute of Technology BIT
Priority to CN202110966055.8A priority Critical patent/CN113806768A/en
Publication of CN113806768A publication Critical patent/CN113806768A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00 Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60 Protecting data
    • G06F21/602 Providing cryptographic facilities or services
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00 Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60 Protecting data
    • G06F21/62 Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218 Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245 Protecting personal data, e.g. for financial or medical purposes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/04 Network architectures or network communication protocols for network security for providing a confidential data exchange among entities communicating through data packet networks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/10 Network architectures or network communication protocols for network security for controlling access to devices or network resources
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/14 Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
    • H04L63/1441 Countermeasures against malicious traffic
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/14 Network architectures or network communication protocols for network security for detecting or protecting against malicious traffic
    • H04L63/1441 Countermeasures against malicious traffic
    • H04L63/1466 Active attacks involving interception, injection, modification, spoofing of data unit addresses, e.g. hijacking, packet injection or TCP sequence number attacks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00 Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/008 Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols involving homomorphic encryption
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00 Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/08 Key distribution or management, e.g. generation, sharing or updating, of cryptographic keys or passwords
    • H04L9/0816 Key establishment, i.e. cryptographic processes or cryptographic protocols whereby a shared secret becomes available to two or more parties, for subsequent use
    • H04L9/085 Secret sharing or secret splitting, e.g. threshold schemes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L9/00 Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
    • H04L9/50 Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols using hash chains, e.g. blockchains or hash trees
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L2209/00 Additional information or applications relating to cryptographic mechanisms or cryptographic arrangements for secret or secure communication H04L9/00
    • H04L2209/42 Anonymization, e.g. involving pseudonyms

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Hardware Design (AREA)
  • Computing Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Bioethics (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Computation (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a lightweight federated learning privacy protection method based on decentralized secure aggregation, and belongs to the technical field of data privacy protection. A secure decentralized aggregation platform is constructed on the user side from edge nodes and a consortium blockchain, and the aggregation process is carried out collaboratively on this platform. Each user partitions its local model and sends the shares separately to each connected edge node. Each user also generates a global random number, partitions it, and shares the pieces with its connected edge nodes. All edge nodes then perform secure decentralized aggregation: each user receives the global model perturbed by its own self-defined global random number, the edge nodes participating in aggregation cannot learn the global model, and each user can remove the added perturbation to obtain the original global model. The method achieves privacy protection without encryption operations and outperforms the prior art in computational efficiency, model accuracy, and privacy protection against membership inference attacks.

Description

Lightweight federated learning privacy protection method based on decentralized security aggregation
Technical Field
The invention relates to a lightweight federated learning privacy protection method based on decentralized secure aggregation. It aims to achieve lightweight training on the user side and to reduce the privacy disclosure threats posed by a traditional central aggregator by means of decentralized secure aggregation, and belongs to the technical field of data privacy protection.
Background
In recent years, Federated Learning (FL) has been widely adopted as a new distributed learning framework.
Federated learning allows multiple participants to collaboratively train a unified machine learning model on their local data while preserving privacy. In each round of training, the participants obtain local models from their own data sets; a central aggregator then aggregates these local models into a global model and sends it to the participants for the next round of training, as sketched below. Although users' local training data is never disclosed during federated learning, the frequent parameter sharing between training participants and the aggregator can be exploited by malicious parties, resulting in leakage of data privacy.
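For orientation only, the following is a minimal Python sketch of one such round with a central aggregator, the setting whose leakage the invention later removes; the toy local step, the FedAvg-style mean, and all names are illustrative assumptions rather than part of the invention.

```python
# Illustrative plain federated-learning round: every client trains locally,
# then a central aggregator averages the local models. The aggregator sees
# each local model in the clear, which is the exposure described above.
import numpy as np

def fl_round(w_global, clients, local_train):
    local_models = [local_train(w_global, X) for X in clients]
    return np.mean(local_models, axis=0)     # central FedAvg-style aggregation

# Toy local step: nudge parameters toward the client's data mean.
step = lambda w, X: w + 0.1 * (X.mean(axis=0) - w)

rng = np.random.default_rng(0)
clients = [rng.normal(loc=i, size=(20, 3)) for i in range(4)]
w = np.zeros(3)
for _ in range(5):
    w = fl_round(w, clients, step)           # next round's global model
```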
In recent years, attacks against federated learning have emerged continuously, and membership inference attacks are a typical class. A membership inference attack trains an attack model to infer whether a data record is present in a training data set. These attacks are roughly classified into local attacks and global attacks according to the prior knowledge available to the attacker. By observing changes in local model updates, a malicious participant can launch a local attack on another participant. A global attack is usually launched by a malicious aggregator, which isolates a participant and sends it a carefully constructed global model. Since the victim trains its local model using this crafted global model, the attacker can infer more private information from the local model updates. Membership inference attacks therefore pose a serious threat to the privacy of federated learning participants' data sets.
The prior art generally implements federated learning privacy protection in the following ways:
(1) Protecting the privacy of the global model by adding Differential Privacy (DP) noise to it.
The purpose of differential privacy is to hide each client's contribution during training, ensuring that participants cannot extract private information from the global model.
However, this method cannot protect local privacy: the aggregator still obtains all local models, which exposes local privacy to membership inference attacks by the aggregator, and model accuracy is also compromised.
(2) Protecting gradients on honest-but-curious cloud servers by means of Homomorphic Encryption (HE).
However, the encryption and decryption operations in each round of training incur substantial computation and communication costs. The cost of applying HE in a large-scale environment is high, and it may affect the efficiency of the machine learning model.
(3) Protecting the privacy of the local model with secret sharing techniques and added random perturbations.
A homomorphic hash function is integrated with a pseudorandom technique as the infrastructure of a verifiable method, allowing participants to verify the correctness of the cloud server's execution at an acceptable cost.
However, the participants are burdened with secret sharing computations and frequent communication costs, and the global model still faces the risk of privacy leakage.
In view of the foregoing, privacy protection for federated learning still faces a number of challenges.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and creatively provides a lightweight federated learning privacy protection method based on decentralized secure aggregation, solving the technical problem of federated learning privacy protection.
The innovation points of the invention are as follows. On the user side, a secure decentralized aggregation platform is constructed from edge nodes and a consortium blockchain. On this platform, the aggregation process is carried out collaboratively. Each user connects to N edge nodes, and model partitioning is performed locally.
To protect the privacy of the local model while preserving accuracy, the invention designs a security parameter partition-and-recovery algorithm based on random perturbation: each user partitions its local model (that is, its parameters) and sends the shares separately to each connected edge node. Under the Byzantine assumption, neither a single edge node nor a group of colluding edge nodes can recover the local model. Meanwhile, to protect the privacy of the global model, each user generates a global random number, partitions it, and shares the pieces with its connected edge nodes. All edge nodes then perform secure decentralized aggregation, and each user receives the global model perturbed by its own global random number.
Therefore, the edge nodes participating in aggregation cannot learn the global model, while each user can remove the added perturbation to obtain the original global model. The method achieves privacy protection without time-consuming encryption operations, ensuring a lightweight training process on the user side.
In the method, each user is a data holder and is responsible for updating its local model during federated learning. Each user randomly connects to N edge nodes, where N is not less than the total number of Byzantine nodes. The edge nodes provide users with secure decentralized local model aggregation and play two roles: local model aggregator and blockchain consensus node. The edge nodes form a secure aggregation service platform established at the edge of the user network; the service provider supplies storage, computation, and network resources, holds local model shares, and performs partial model aggregation.
First, each user partitions its model parameters, generates a well-constructed global random number, and partitions the global random number with the parameter partition algorithm.
The user then sends the partitioned model parameters and global random number shares to its connected edge nodes.
Next, the edge nodes perform local model aggregation and upload the results to the blockchain. The blockchain distributed ledger serves as a data sharing platform, and global model aggregation is completed by a global model aggregation contract, yielding a global model masked by the global random numbers. While the smart contract runs, each edge node queries the data of the other edge nodes from the blockchain shared ledger. Meanwhile, an access control security policy ensures that no entity other than the edge nodes and users can obtain the data uploaded to the ledger.
Finally, the edge nodes send the masked global model to the corresponding users, and each user completely removes its global random number to obtain the final global model. User-side lightweight training is then performed on the basis of this global model, reducing the threat of privacy disclosure.
Advantageous effects
Compared with the prior art, the method of the invention has the following advantages:
A secure decentralized aggregation platform is constructed on the user side from edge nodes and a consortium blockchain, and the aggregation process is carried out collaboratively on this platform, protecting the privacy of both the global and local models.
(1) The method is suitable for federated learning privacy protection in a decentralized platform environment.
(2) The invention allows privacy-preserving training to be performed in a lightweight manner without loss of model accuracy. A secure decentralized aggregation platform replaces the centralized aggregator, avoiding data privacy leakage.
(3) The invention designs a security parameter partition-and-recovery algorithm based on one-time padding, which protects the local and global models, reduces the computational overhead on the user side, and ensures high availability of the model without losing precision.
(4) The invention provides a rigorous security analysis that proves the security of the scheme.
Extensive experiments demonstrate that the method outperforms the prior art in computational efficiency, model accuracy, and privacy protection against membership inference attacks.
Drawings
FIG. 1 is the collaboration model on which the method of the present invention relies;
FIG. 2 is the interaction protocol process of the method of the present invention.
Detailed Description
The present invention will be described in further detail with reference to the accompanying drawings and examples. It should be noted that the practice of the present invention is not limited to the following examples; any modification or variation may be made without departing from the scope of the invention.
A lightweight federated learning privacy protection method based on decentralized secure aggregation: on the user side, a secure decentralized aggregation platform is constructed from edge nodes and a consortium blockchain, and on this platform the aggregation process is carried out collaboratively.
Specifically, the method comprises the following steps:
Step 1: each user in federated learning connects to N edge nodes. Each user partitions its model parameters and generates a global random number. The global random number is partitioned by the parameter partition algorithm, and the shares are sent separately to each connected edge node.
Step 1 specifically comprises:
Step 1.1: the edge nodes establish a consortium blockchain, reach consensus, and set up the secure decentralized aggregation platform.
Step 1.2: each user generates a global random number to mask the global model in each subsequent training round.
In federated learning training, the selected users train local models on their respective local data sets; user $u_i$'s local model $w_i^t$ is computed by the following steps (an illustrative sketch follows these steps):
Step 1.2.1: initialize the global model $w_0$, the number of training rounds $T$, and the learning rate $\eta$;
Step 1.2.2: in each round of training, select a subset of users $U_s$ from the user set $U$ to participate in training;
Step 1.2.3: each user in $U_s$ trains its local model by formula 1:

$$w_i^t = w^{t-1} - \eta \nabla L(w^{t-1}, b) \qquad (1)$$

where $w^{t-1}$ denotes the model parameters of the previous training round, $\eta$ denotes the learning rate, $\nabla$ denotes the gradient operator, $L$ denotes the loss function, $w$ the model parameters, and $b$ a mini-batch of the user's local data.
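For illustration, the following minimal Python sketch performs the local update of formula 1, assuming a mean-squared-error loss and NumPy parameter vectors; the loss function, data shapes, and function names are illustrative assumptions rather than part of the invention.

```python
# Illustrative sketch of formula (1): w_i^t = w^{t-1} - eta * grad L(w^{t-1}, b),
# with L taken here to be mean squared error on a mini-batch b = (X_b, y_b).
import numpy as np

def local_update(w_prev, X_b, y_b, eta=0.05):
    grad = 2 * X_b.T @ (X_b @ w_prev - y_b) / len(y_b)  # gradient of MSE loss
    return w_prev - eta * grad

rng = np.random.default_rng(0)
X_b, y_b = rng.normal(size=(16, 3)), rng.normal(size=16)
w_t = local_update(np.zeros(3), X_b, y_b)  # this round's local model
```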
Step 1.3: using the parameter partition algorithm, partition the local model parameters $w_i^t$ and user $u_i$'s global random number $R_i^t$, send the shares separately to each connected edge node, and upload them to the smart contract for decentralized secure aggregation.
In the parameter partition algorithm, let $v$ be the parameter to partition, $n$ the number of shares to generate, $seed$ a random number seed, and $PRG$ a pseudo-random number generator. The algorithm comprises the following steps (a runnable sketch is given after these steps):
Step 1.3.1: generate a set of pseudo-random numbers with $PRG(seed)$, obtaining $\{r_1, r_2, \ldots, r_n\}$, i.e., $n$ pseudo-random numbers.
Step 1.3.2: partition the parameter $v$:

$$v_i = v + r_i - r_{i+1}, \quad i \in \{1, 2, \ldots, n-1\}, \qquad v_n = v + r_n - r_1,$$

where $v_i$ and $v_n$ denote parameter shares and $r_i$, $r_{i+1}$ denote pseudo-random numbers.
Step 1.3.3: return the set of shares $v_i$, $i \in \{1, 2, \ldots, n\}$.
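By way of illustration, the following Python sketch implements the partition step above together with a matching recovery step, under the assumption (implied by the telescoping construction) that the $n$ shares sum to $n \cdot v$, so recovery divides the sum by $n$; the names split_parameter and recover_parameter are illustrative.

```python
# Illustrative sketch of the random-perturbation parameter partition.
import numpy as np

def split_parameter(v, n, seed):
    """Split v into n shares v_i = v + r_i - r_{i+1} (indices wrap around)."""
    prg = np.random.default_rng(seed)            # PRG(seed)
    r = prg.normal(size=n)                       # pseudo-random masks r_1..r_n
    return [v + r[i] - r[(i + 1) % n] for i in range(n)]

def recover_parameter(shares):
    """The masks telescope away: the shares sum to n*v, so divide by n."""
    return sum(shares) / len(shares)

shares = split_parameter(0.42, n=5, seed=7)
assert abs(recover_parameter(shares) - 0.42) < 1e-12
# A single share reveals nothing about v without the pseudo-random masks.
```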
Step 2: the aggregators perform decentralized secure aggregation, which comprises the following steps:
Step 2.1: each edge node $Edg_j$ aggregates all the partitioned local model shares it holds, together with the global random number shares of the users selected for the next training round, obtaining a partial aggregate $W_j^t$.
Step 2.2: the aggregated partial local model $W_j^t$ is uploaded to the blockchain distributed ledger. The ledger serves as a data sharing platform, and global model aggregation is carried out by the global model aggregation contract.
Step 2.3: once all edge nodes have returned the complete set $\{W_1^t, \ldots, W_N^t\}$, the edge nodes compute and return the masked global model $g_t + R_i$, i.e., the global model covered by user $u_i$'s global random number $R_i$.
Step 3: the user removes the global random number covering the global model, obtaining the global model (a toy end-to-end simulation of steps 1 to 3 follows).
After user $u_i$ obtains the masked global model $g_t + R_i$, it removes its self-defined global random number $R_i$ via

$$g_t = (g_t + R_i) - R_i,$$

thereby obtaining the global model $g_t$.
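The following toy Python simulation ties steps 1 to 3 together under the same telescoping-share assumption as above; the blockchain ledger is modeled as a plain Python list, and all names (split, ledger, masks) as well as the scaling details (where the division by the number of shares happens) are illustrative assumptions rather than the invention's concrete interfaces.

```python
# Toy simulation: users split models and masks, edge nodes post partial
# aggregates to a shared "ledger", and each user unmasks its own result.
import numpy as np

def split(v, n, prg):
    """Telescoping shares that sum to n * v (cf. split_parameter above)."""
    r = prg.normal(size=(n,) + np.shape(v))
    return [v + r[i] - r[(i + 1) % n] for i in range(n)]

n_users, n_edges, dim = 3, 4, 5
prg = np.random.default_rng(1)

local_models = [prg.normal(size=dim) for _ in range(n_users)]
masks = [prg.normal(size=dim) for _ in range(n_users)]      # R_i, one per user

# Step 1: each user sends one model share and one mask share to every edge node.
model_shares = [split(w, n_edges, prg) for w in local_models]
mask_shares = [split(R, n_edges, prg) for R in masks]

# Step 2: edge node j posts, for each user i, the sum of all model shares it
# holds plus user i's mask share; no node ever sees a whole local model.
ledger = [[sum(model_shares[u][j] for u in range(n_users)) + mask_shares[i][j]
           for j in range(n_edges)] for i in range(n_users)]

# Step 3: summing user i's ledger entries gives n_edges * (sum_of_models + R_i);
# user i rescales, removes its own mask R_i, and averages over users.
for i in range(n_users):
    masked = sum(ledger[i]) / n_edges        # = sum of local models + R_i
    g = (masked - masks[i]) / n_users        # unmasked FedAvg global model
    assert np.allclose(g, sum(local_models) / n_users)
```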
Step 4: user-side lightweight training is then performed on the basis of the global model, reducing the threat of privacy disclosure.
Example 1
In this embodiment, the collaboration model of the lightweight federated learning privacy protection method based on decentralized secure aggregation is established, as shown in Fig. 1.
Fig. 1 depicts the following decentralized secure aggregation scenario: each user holds a local data set and updates its local model in the FL flow. Each user randomly connects to several edge nodes; the user partitions its model parameters, generates a well-constructed global random number, and partitions the global random number with the parameter partition algorithm. The partitioned parameters and global random number shares are transmitted to the connected nodes. The edge nodes provide users with secure, decentralized local model aggregation: they receive the partitioned local models and perform partial model aggregation. The partial aggregates are uploaded to the blockchain ledger for global aggregation, yielding a global model covered by the global random numbers. The blockchain ledger serves as a data sharing platform that helps complete model aggregation; a global model aggregation contract runs on the blockchain. While the smart contract runs, each edge node can query the data of the other edge nodes from the blockchain shared ledger, complete the global aggregation, and send each user the global model covered by that user's self-defined random number. The user finally removes the random number to obtain the final model.
Following the model in Fig. 1, the method of the invention is implemented in the following steps:
step A: the number of users is set to 100, 1000 and 10000, which respectively represent the applications of small, medium and large user scale, and the proportion of users selected to participate in training is 5%, 10% and 15%.
Training and testing was performed on CNN networks according to steps 1 to 3 using MNIST data sets (http:// yann. letter. com/exdb/MNIST /), with MNIST set to IID by default. The model accuracy R is calculated by equation 2, where tpIs the number of correctly classified positive instances, fnIs the number of active instances of misclassification.
R=tp/(fn+tp) (2)
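For reference, formula 2 in code; this quantity is the recall of the positive class, which the embodiment uses as its accuracy measure. The function name and values are illustrative.

```python
# Formula (2): R = t_p / (f_n + t_p).
def model_accuracy(t_p: int, f_n: int) -> float:
    return t_p / (f_n + t_p)

assert model_accuracy(t_p=90, f_n=10) == 0.9  # illustrative values
```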
Example model accuracies obtained from training at different user scale scenarios are shown in table 1.
TABLE 1 accuracy results for different user scales
As the results show, the training loss drops faster at the small and medium user scales and is almost independent of the percentage of users selected. At the large user scale, however, the greater the proportion of users selected, the faster the training loss decreases. This is because in the large-scale case more users participate in training, so each user holds fewer data instances. The highest accuracy of the trained model at the different user scales is likewise almost unrelated to the percentage of users selected. At all three user scales, the method trains a high-accuracy model.
Step B: the user-side training time and the aggregator-side (edge node side) time are measured.
Step C: the blockchain communication time, i.e., the latency of invoking the smart contract to share and query data, and the communication time between users and edge nodes are measured.
Example training time overhead in different user size scenarios is shown in table 2.
TABLE 2 training time overhead
As can be seen from Table 2, the percentage of users selected has no effect on training time on either the user side or the aggregator side, because increasing the number of selected users only adds the overhead of random number generation, which is a lightweight operation. At all three user scales the time cost is very small, demonstrating that training is lightweight on the user side; this part of the protocol involves no encryption or decryption work. The larger the user scale, the lower the per-user training time, since more users means fewer data instances each. In all cases the time cost on the aggregator side is very small, because the edge nodes only perform addition operations and need no additional heavy computation.
Example communication time overhead per round of training blockchains is shown in table 3.
TABLE 3 Block chain communication time overhead for each round of training
As can be seen from Table 3, the overhead increases with the number of blockchain nodes, which is caused by the characteristics of the blockchain. However, since the local aggregation of each blockchain node proceeds independently and in parallel, the time does not change significantly. Similarly, as the number of users increases, each user operates independently of the others, so the communication time overhead does not vary significantly.
Example 2
This embodiment compares the results of the method in various scenarios, showing that the privacy protection method achieves high training accuracy and efficiency. The method is compared with existing approaches, all of which aim to protect data privacy during federated learning: plain federated learning has no privacy protection measures; HEDL protects the privacy of the local model with homomorphic encryption in distributed deep learning; DPFed keeps the global model private by adding DP noise to it; PSA and VerifyNet protect the privacy of local models by covering them with random perturbations. Comparison experiments between these methods and the proposed method yield the model accuracy and time overhead results shown in Tables 4 and 5.
TABLE 4 comparison of model accuracy of different methods at different user scales
TABLE 5 comparison of the calculation time overhead for different methods at different user scales
As can be seen from Table 4, the method outperforms DPFed in model accuracy, since no noise mechanism is added to the global model and the model is therefore not degraded. Meanwhile, the method reaches the same level of accuracy as HEDL, VerifyNet, and PSA; compared with the traditional federated learning method, none of these methods suffers a significant accuracy loss.
Table 5 gives the computation time comparison with the existing methods. The method has an obvious advantage because it requires no time-consuming operations such as encryption and decryption. VerifyNet and PSA, which both adopt secret sharing, incur higher user-side computation time than the method. Because the client in HEDL must perform a large number of homomorphic operations, the method also outperforms HEDL. The server-side time cost is comparable to the other methods. The overall time cost of the method increases only slightly as the client scale grows, while the costs of HEDL, VerifyNet, and PSA increase markedly.
Example 3
This embodiment compares the results of the method in various scenarios, showing that the privacy protection method resists membership inference attacks. The membership inference attack is launched against five methods: plain federated learning, HEDL, DPFed, VerifyNet, and PSA, using the CIFAR-10 data set (https://www.cs.toronto.edu/~kriz/cifar.html); the attack comparison results are shown in Table 6.
TABLE 6 comparison results of different methods for resisting member reasoning attack
As can be seen from Table 6, conventional FL, VerifyNet, and PSA cannot defend against membership inference attacks, because the central server can still expose the global model. In HEDL, the server can obtain only encrypted local and global models, so the attack precision is very low. In the proposed method, the attack precision also stays at a low level, meaning the attack model is merely guessing at the global model, because an attacker can obtain only model parameters masked by random numbers; other attacks, such as attribute inference attacks, likewise presuppose knowledge of the local or global model. On this basis, such attacks cannot achieve high attack precision against the method either.
While the embodiments of the present invention have been described in connection with the drawings and examples, it will be apparent to those skilled in the art that various modifications can be made without departing from the principles of this patent, and it is intended to cover all modifications that are within the scope of this patent.

Claims (5)

1. A lightweight federated learning privacy protection method based on decentralized secure aggregation, characterized in that each user is a data holder and is responsible for updating its local model during federated learning; each user randomly connects to N edge nodes, where N is not less than the total number of Byzantine nodes;
the edge nodes provide users with secure decentralized local model aggregation and play two roles: local model aggregator and blockchain consensus node; the edge nodes form a secure aggregation service platform established at the edge of the user network, where the service provider supplies storage, computation, and network resources, holds local model shares, and performs partial model aggregation;
first, each user partitions its model parameters, generates a well-constructed global random number, and partitions the global random number with the parameter partition algorithm;
then, the user sends the partitioned model parameters and global random number shares to its connected edge nodes;
next, the edge nodes perform local model aggregation and upload the results to the blockchain;
the blockchain distributed ledger serves as a data sharing platform, and global model aggregation is completed by a global model aggregation contract, yielding a global model covered by the global random numbers; while the smart contract runs, each edge node queries the data of the other edge nodes from the blockchain shared ledger; meanwhile, an access control security policy ensures that no entity other than the edge nodes and users can obtain the data uploaded to the ledger;
finally, the edge nodes send the masked global model to the corresponding users, and each user completely removes its global random number to obtain the final global model;
user-side lightweight training is then performed on the basis of the global model, reducing the threat of privacy disclosure.
2. The lightweight federated learning privacy protection method based on decentralized secure aggregation according to claim 1, characterized in that each user partitions its model parameters and generates the global random number as follows:
in federated learning training, the selected users train local models on their respective local data sets, and user $u_i$'s local model $w_i^t$ is computed by the following steps:
first, initialize the global model $w_0$, the number of training rounds $T$, and the learning rate $\eta$;
then, in each round of training, select a subset of users $U_s$ from the user set $U$ to participate in training;
then, each user in $U_s$ trains its local model by formula 1:

$$w_i^t = w^{t-1} - \eta \nabla L(w^{t-1}, b) \qquad (1)$$

where $w^{t-1}$ denotes the model parameters of the previous training round, $\eta$ denotes the learning rate, $\nabla$ denotes the gradient operator, $L$ denotes the loss function, $w$ the model parameters, and $b$ a mini-batch of the user's local data.
3. The lightweight federated learning privacy protection method based on decentralized secure aggregation according to claim 1, characterized in that the global random number is partitioned by the parameter partition algorithm and the shares are sent separately to each connected edge node as follows:
using the parameter partition algorithm, partition the local model parameters $w_i^t$ and user $u_i$'s global random number $R_i^t$, send the shares separately to each connected edge node, and upload them to the smart contract for decentralized secure aggregation;
in the parameter partition algorithm, let $v$ be the parameter to partition, $n$ the number of shares to generate, $seed$ a random number seed, and $PRG$ a pseudo-random number generator; the algorithm comprises the following steps:
first, generate a set of pseudo-random numbers with $PRG(seed)$, obtaining $\{r_1, r_2, \ldots, r_n\}$, i.e., $n$ pseudo-random numbers;
then, partition the parameter $v$:

$$v_i = v + r_i - r_{i+1}, \quad i \in \{1, 2, \ldots, n-1\}, \qquad v_n = v + r_n - r_1,$$

where $v_i$ and $v_n$ denote parameter shares and $r_i$, $r_{i+1}$ denote pseudo-random numbers;
finally, return the set of shares $v_i$, $i \in \{1, 2, \ldots, n\}$.
4. The lightweight federated learning privacy protection method based on decentralized secure aggregation according to claim 1, characterized in that the aggregators perform decentralized secure aggregation as follows:
first, each edge node $Edg_j$ aggregates all the partitioned local model shares it holds, together with the global random number shares of the users selected for the next training round, obtaining a partial aggregate $W_j^t$;
the aggregated partial local model $W_j^t$ is then uploaded to the blockchain distributed ledger; the ledger serves as a data sharing platform, and global model aggregation is carried out by the global model aggregation contract;
once all edge nodes have returned the complete set $\{W_1^t, \ldots, W_N^t\}$, the edge nodes compute and return the masked global model $g_t + R_i$, i.e., the global model covered by user $u_i$'s global random number $R_i$.
5. The lightweight federated learning privacy protection method based on decentralized secure aggregation according to claim 1, characterized in that after user $u_i$ obtains the masked global model $g_t + R_i$, it removes its self-defined random number $R_i$ via $g_t = (g_t + R_i) - R_i$, thereby obtaining the global model $g_t$, where $R_i$ denotes user $u_i$'s global random number.
CN202110966055.8A 2021-08-23 2021-08-23 Lightweight federated learning privacy protection method based on decentralized security aggregation Pending CN113806768A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110966055.8A CN113806768A (en) 2021-08-23 2021-08-23 Lightweight federated learning privacy protection method based on decentralized security aggregation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110966055.8A CN113806768A (en) 2021-08-23 2021-08-23 Lightweight federated learning privacy protection method based on decentralized security aggregation

Publications (1)

Publication Number Publication Date
CN113806768A true CN113806768A (en) 2021-12-17

Family

ID=78893847

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110966055.8A Pending CN113806768A (en) 2021-08-23 2021-08-23 Lightweight federated learning privacy protection method based on decentralized security aggregation

Country Status (1)

Country Link
CN (1) CN113806768A (en)


Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114297722A (en) * 2022-03-09 2022-04-08 广东工业大学 Privacy protection asynchronous federal sharing method and system based on block chain
CN114357526A (en) * 2022-03-15 2022-04-15 中电云数智科技有限公司 Differential privacy joint training method for medical diagnosis model for resisting inference attack
CN116109608A (en) * 2023-02-23 2023-05-12 智慧眼科技股份有限公司 Tumor segmentation method, device, equipment and storage medium
CN116489637A (en) * 2023-04-25 2023-07-25 北京交通大学 Mobile edge computing method oriented to meta universe and based on privacy protection
CN116489637B (en) * 2023-04-25 2023-11-03 北京交通大学 Mobile edge computing method oriented to meta universe and based on privacy protection
CN116756764A (en) * 2023-05-04 2023-09-15 浙江大学 Model blocking aggregation privacy protection method for lithography hotspot detection
CN116756764B (en) * 2023-05-04 2024-06-04 浙江大学 Model blocking aggregation privacy protection method for lithography hotspot detection
CN116760634A (en) * 2023-08-14 2023-09-15 国网天津市电力公司信息通信公司 Data privacy protection method, system, equipment and storage medium
CN116760634B (en) * 2023-08-14 2023-11-07 国网天津市电力公司信息通信公司 Data privacy protection method, system, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN113806768A (en) Lightweight federated learning privacy protection method based on decentralized security aggregation
Liu et al. Privacy-enhanced federated learning against poisoning adversaries
Li et al. Privacy-preserving federated learning framework based on chained secure multiparty computing
Zhu et al. Privacy-preserving DDoS attack detection using cross-domain traffic in software defined networks
Mansouri et al. Sok: Secure aggregation based on cryptographic schemes for federated learning
Zhang et al. Dubhe: Towards data unbiasedness with homomorphic encryption in federated learning client selection
Rathee et al. Elsa: Secure aggregation for federated learning with malicious actors
Hao et al. Efficient, private and robust federated learning
Veugen et al. A framework for secure computations with two non-colluding servers and multiple clients, applied to recommendations
Lycklama et al. Rofl: Robustness of secure federated learning
Erkin et al. Privacy enhanced recommender system
Liu et al. Privacy preserving decision tree mining from perturbed data
Li et al. Efficient privacy-preserving federated learning with unreliable users
Zhang et al. Safelearning: Enable backdoor detectability in federated learning with secure aggregation
Xu et al. LaF: Lattice-based and communication-efficient federated learning
CN114363043A (en) Asynchronous federated learning method based on verifiable aggregation and differential privacy in peer-to-peer network
Liu et al. Privacy preserving pca for multiparty modeling
Cheung et al. Fedsgc: Federated simple graph convolution for node classification
CN117216805A (en) Data integrity audit method suitable for resisting Bayesian and hordeolum attacks in federal learning scene
Zhou et al. Securing federated learning enabled NWDAF architecture with partial homomorphic encryption
CN114760023A (en) Model training method and device based on federal learning and storage medium
Li et al. An adaptive communication-efficient federated learning to resist gradient-based reconstruction attacks
Zhang et al. Safelearning: Secure aggregation in federated learning with backdoor detectability
Li et al. Privacy-Preserving and Poisoning-Defending Federated Learning in Fog Computing
CN116861994A (en) Privacy protection federal learning method for resisting Bayesian attack

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination