CN112668044A - Privacy protection method and device for federal learning
- Publication number: CN112668044A (application CN202011523140.9A)
- Authority: CN (China)
- Prior art keywords: noise, privacy, privacy attribute, parameter, output probability
- Legal status: Granted
Abstract
The embodiment of the invention provides a privacy protection method and device for federal learning, comprising the following steps: a parameter setting step, a data dividing step, a first training step, a second training step, a first calculating step, a second calculating step, and a countermeasure sample generating step. In this embodiment, the concept of adversarial examples (countermeasure samples) is adopted: a certain amount of noise is added during parameter updating to perturb the distribution characteristics of the parameters, so that after the parameters pass through a privacy attribute inference model, the privacy inference result is output randomly according to the probability distribution expected by the user, thereby resisting privacy attribute inference attacks and alleviating the privacy attribute disclosure problem of federal learning.
Description
Technical Field
The invention relates to the technical field of computers, in particular to a privacy protection method and device for federal learning.
Background
Federal Learning (FL) is a distributed deep learning method that balances efficiency, accuracy, and privacy to some extent, and has therefore attracted much attention. The main process of federal learning is as follows: the server randomly initializes the global model parameters and distributes the model to each user; the users train the model locally with their own data and send the updated model parameters back to the server; the server updates the global model according to the updated parameters and distributes it to the users again, after which a new round of iterative updating is carried out. In this process, the server only collects the parameters of the user models rather than the original data, which facilitates data privacy protection. In addition, different users participate in training together, which enhances model generalization and training efficiency while realizing personalized model training on the user side.
However, although users participating in federal learning do not have to submit training data directly to the server, the parameters sent by the users can still indirectly cause privacy leaks. This privacy disclosure problem is particularly prominent in situations where the user data is not distributed identically: at this time, data of different users often have different privacy attributes, for example, for users with different genders, races, or income levels, shopping data of the users often have a certain difference in distribution, and this difference further affects parameter distribution of the target model in a model training stage of the users, and by using the difference in parameter distribution among the different users, an attacker (including an external malicious user and an untrusted server) can infer information such as the genders, races, or income levels of the users, thereby bringing a great threat to the privacy of the users. The specific inference method is that some known parameter vectors are used as training data, corresponding privacy attributes of the training data are used as training data labels, and a privacy attribute classifier is trained, so that the privacy attributes of user data are inferred through parameters sent by a user. Therefore, in a federal learning scenario, how to sufficiently ensure the accuracy of a global model and prevent privacy leakage caused by model parameter exchange is an urgent problem to be solved.
In order to solve the privacy problem, various protection technologies have been proposed, such as homomorphic encryption, secure multi-party computation, and differential privacy, all of which protect user privacy to some extent. Among these schemes, the homomorphic-encryption-based method provides reliable security and accuracy guarantees. However, homomorphic encryption algorithms often have high computational complexity, low efficiency, high communication overhead for the parameters, and a complex key management mechanism, so they are difficult to popularize and apply. Secure multi-party computation and differential privacy are widely applied due to their strong theoretical support and ease of implementation. Abadi et al. proposed applying differential privacy to the gradient descent algorithm of deep learning to ensure that the true training data cannot be recovered from the model parameters; however, this algorithm requires each user to add noise to the gradient independently, which has a large impact on the accuracy of the aggregated model. Bonawitz et al. designed a federal learning architecture based on a secure multi-party computation protocol to ensure that all users participating in learning share the same global model and that the server can only obtain the updated global model but cannot obtain the real parameters submitted by any user; this scheme, however, cannot resist collusion attacks among participants. To make up for these defects, Truex et al. combined differential privacy with multi-party computation, which reduces the amount of noise, preserves model accuracy and privacy, and resists collusion threats among users. In conclusion, it is difficult for existing federal learning privacy protection schemes to balance accuracy, privacy, and generality well. Differential-privacy-based protection schemes need to add a large amount of noise to the gradient in the user's model training stage to guarantee privacy, which sacrifices the accuracy of the aggregated global model, and differential privacy cannot protect the privacy attributes of the training data in a targeted manner. When differential privacy is combined with secure multi-party computation, even though the accuracy improves, the correct computation of each round of the secure multi-party computation protocol requires a certain number of users to perform model training and aggregation online simultaneously, so the method is not suitable for the update mode of asynchronous federal learning; in addition, the homomorphic encryption technology introduced into the multi-party computation greatly reduces the operational efficiency.
Disclosure of Invention
Aiming at the problems in the prior art, the embodiment of the invention provides a privacy protection method and device facing federal learning.
In a first aspect, an embodiment of the present invention provides a privacy protection method for federal learning, including:
setting parameters, namely setting parameters according to actual requirements and outputting the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to availability budget constraint of noise amount;
a data dividing step, namely dividing a data set based on the privacy attribute set, and determining m training data subsets corresponding to the privacy attributes;
a first training step, training m target models according to the m training data subsets corresponding to the privacy attributes, and determining a model iteration parameter data set;
a second training step, namely, taking the model iteration parameter data set as training data, taking the privacy attribute value corresponding to each set of parameters in the model iteration parameter data set as a data tag to train a privacy attribute inference model, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier;
a first calculation step of determining a minimized noise by a fast gradient method based on the privacy attribute inference result;
a second calculation step of determining a noise actual output probability distribution based on the minimized noise, the expected output probability vector of the noise, and the availability budget;
and generating a countermeasure sample, generating a countermeasure sample based on the noise actual output probability distribution, and sending the countermeasure sample to a parameter server so that the server performs parameter updating based on the countermeasure sample.
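As a concrete illustration of the parameter setting step in the flow above, the following minimal sketch shows what the user-side configuration could look like; the attribute values follow the skin-colour example given later in the description, while the probability values and the availability budget are made-up placeholders rather than values from the patent:

```python
# Illustrative parameter-setting output for m = 3 privacy attribute values.
privacy_attributes = ["black", "yellow", "white"]  # privacy attribute set (values to protect)
expected_output_probs = [1/3, 1/3, 1/3]            # p: output any privacy inference result with equal probability
availability_budget = 0.5                          # epsilon: budget constraint on the amount of added noise
```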
Further, the first calculating step, based on the privacy attribute inference result, determines the minimized noise by using a fast gradient method, specifically includes:
finding a set of noises r = {r_1, r_2, ..., r_m} by the fast gradient method, such that after noise r_i is added to the parameter x to be transmitted, the privacy attribute value output by the countermeasure sample x_i* after passing through the privacy attribute inference model f is i;
the calculation method of the fast gradient method comprises the following steps:
the noise calculation method is as follows:
r_i = x_i* - x
where r represents the set of noises; r_1, r_2, ..., r_m and r_i all represent noises; x represents the parameter to be transmitted; x_i* represents the generated countermeasure sample; f represents the privacy attribute inference model; i represents the privacy attribute value; ε is the availability budget; and l represents a loss function used to calculate the distance between the privacy attribute f(x) output by the privacy attribute inference model and the target privacy attribute i.
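The fast gradient update itself is not reproduced in this text (the formula appears only as a figure in the original). The following sketch therefore assumes a standard targeted fast-gradient step combined with the clip operation mentioned later in the description; the function and argument names are illustrative:

```python
import numpy as np

def targeted_fast_gradient_noise(x, grad_loss, epsilon, lower, upper):
    """One assumed targeted fast-gradient step: move the parameter x against the
    gradient of the loss l(f(x), i) so that the inference model f is pushed toward
    the target privacy attribute i, then clip to the valid parameter domain."""
    x_adv = np.clip(x - epsilon * np.sign(grad_loss), lower, upper)
    return x_adv - x  # r_i = x_i* - x
```

Under this reading, one noise r_i is generated per privacy attribute value i, each time using the gradient of the loss toward that target class.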
Further, the second calculating step determines an actual output probability distribution of the noise based on the minimized noise, the expected output probability vector of the noise, and the availability budget, and specifically includes:
calculating a noise actual output probability distribution q based on the minimized noise, the expected output probability vector p of the noise, and the availability budget by adopting a first relation model; the first relation model is as follows:
min KL(q, p)  s.t.  Σ q_i · ||r_i|| ≤ ε,  i ∈ {1, 2, ..., m}
where min KL(q, p) is the optimization target; q is the actual output probability of each noise; p represents the expected output probability of the noise; KL(q, p) calculates the KL divergence between the two vector distributions q and p; the constraint Σ q_i · ||r_i|| ≤ ε ensures that the actual output probability q meets the user's availability budget constraint; ||r_i|| is the L2 norm of the noise vector r_i; and q_i represents the actual output probability of the i-th noise.
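A minimal sketch of how this constrained problem could be solved numerically is given below; the use of SciPy's SLSQP solver is an illustrative choice, not something specified by the patent:

```python
import numpy as np
from scipy.optimize import minimize

def noise_output_probabilities(p, noise_norms, epsilon):
    """Solve min KL(q, p) s.t. sum_i q_i * ||r_i|| <= epsilon over probability vectors q.
    p is the expected output probability vector, noise_norms the list of L2 norms ||r_i||."""
    p = np.asarray(p, dtype=float)
    norms = np.asarray(noise_norms, dtype=float)
    m = len(p)

    def kl(q):
        return float(np.sum(q * np.log(q / p)))  # KL(q, p)

    constraints = [
        {"type": "eq", "fun": lambda q: np.sum(q) - 1.0},        # q is a distribution
        {"type": "ineq", "fun": lambda q: epsilon - q @ norms},  # availability budget
    ]
    result = minimize(kl, x0=np.full(m, 1.0 / m), bounds=[(1e-9, 1.0)] * m,
                      constraints=constraints, method="SLSQP")
    return result.x  # actual output probabilities q_1, ..., q_m
```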
Further, the generating of the countermeasure sample step generates the countermeasure sample based on the actual output probability distribution of the noise, and sends the countermeasure sample to a parameter server, so that the server performs parameter update based on the countermeasure sample, specifically including:
when the user needs to send the parameter x to the server, randomly selecting a noise r_i from r = {r_1, r_2, ..., r_m} according to the noise actual output probability distribution q (noise r_i being selected with probability q_i), and generating a countermeasure sample x_i* = x + r_i;
and sending the countermeasure sample x_i* = x + r_i to the parameter server, so that the server performs parameter updating based on the countermeasure sample x_i* = x + r_i.
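A sketch of this final step is given below; `send_to_server` is a placeholder for whatever upload mechanism the federal learning client uses and is not part of the patent:

```python
import numpy as np

def send_countermeasure_sample(x, noises, q, send_to_server):
    """Pick one noise according to the actual output probabilities q and upload the
    resulting countermeasure sample instead of the raw parameter vector x."""
    i = np.random.choice(len(noises), p=q)  # select r_i with probability q_i
    x_adv = x + noises[i]                   # countermeasure sample x_i* = x + r_i
    send_to_server(x_adv)                   # the server aggregates as usual (e.g., weighted average)
    return x_adv
```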
In a second aspect, an embodiment of the present invention provides a privacy protection apparatus for federal learning, including:
the parameter setting module is used for setting parameters according to actual requirements and outputting the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to availability budget constraint of noise amount;
the data dividing module is used for dividing a data set based on the privacy attribute set and determining m training data subsets corresponding to the privacy attributes;
the first training module trains m target models according to the m training data subsets corresponding to the privacy attributes and determines a model iteration parameter data set;
the second training module is used for training a privacy attribute inference model by taking the model iteration parameter data set as training data and the privacy attribute value corresponding to each set of parameter in the model iteration parameter data set as a data tag, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier;
a first calculation module for determining a minimized noise by a fast gradient method based on the privacy attribute inference result;
a second calculation module that determines a noise actual output probability distribution based on the minimized noise, a desired output probability vector of the noise, and an availability budget;
and the countermeasure sample generation module generates countermeasure samples based on the noise actual output probability distribution and sends the countermeasure samples to a parameter server so that the server performs parameter updating based on the countermeasure samples.
Further, the first calculating module is specifically configured to:
finding a set of noises r = {r_1, r_2, ..., r_m} by the fast gradient method, such that after noise r_i is added to the parameter x to be transmitted, the privacy attribute value output by the countermeasure sample x_i* after passing through the privacy attribute inference model f is i;
the calculation method of the fast gradient method comprises the following steps:
the noise calculation method is as follows:
r_i = x_i* - x
where r represents the set of noises; r_1, r_2, ..., r_m and r_i all represent noises; x represents the parameter to be transmitted; x_i* represents the generated countermeasure sample; f represents the privacy attribute inference model; i represents the privacy attribute value; ε is the availability budget; and l represents a loss function used to calculate the distance between the privacy attribute f(x) output by the privacy attribute inference model and the target privacy attribute i.
Further, the second calculation module is specifically configured to:
calculating a noise actual output probability distribution q based on the minimized noise, the expected output probability vector p of the noise, and the availability budget by adopting a first relation model; the first relation model is as follows:
min KL(q, p)  s.t.  Σ q_i · ||r_i|| ≤ ε,  i ∈ {1, 2, ..., m}
where min KL(q, p) is the optimization target; q is the actual output probability of each noise; p represents the expected output probability of the noise; KL(q, p) calculates the KL divergence between the two vector distributions q and p; the constraint Σ q_i · ||r_i|| ≤ ε ensures that the actual output probability q meets the user's availability budget constraint; ||r_i|| is the L2 norm of the noise vector r_i; and q_i represents the actual output probability of the i-th noise.
Further, the generate confrontation sample module is specifically configured to:
when the user needs to send the parameter x to the server, randomly selecting a noise r_i from r = {r_1, r_2, ..., r_m} according to the noise actual output probability distribution q (noise r_i being selected with probability q_i), and generating a countermeasure sample x_i* = x + r_i;
and sending the countermeasure sample x_i* = x + r_i to the parameter server, so that the server performs parameter updating based on the countermeasure sample x_i* = x + r_i.
In a third aspect, an embodiment of the present invention further provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of the privacy protection method for federated learning according to the first aspect when executing the program.
In a fourth aspect, embodiments of the present invention further provide a non-transitory computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the federal learning oriented privacy protection method as described in the first aspect.
According to the technical scheme, the privacy protection method and device for federal learning provided by the embodiment of the invention have the advantages that through the parameter setting step, parameters are set according to actual requirements, and the parameters are output; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to availability budget constraint of noise amount; a data dividing step, namely dividing a data set based on the privacy attribute set, and determining m training data subsets corresponding to the privacy attributes; a first training step, training m target models according to the m training data subsets corresponding to the privacy attributes, and determining a model iteration parameter data set; a second training step, namely, taking the model iteration parameter data set as training data, taking the privacy attribute value corresponding to each set of parameters in the model iteration parameter data set as a data tag to train a privacy attribute inference model, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier; a first calculation step of determining a minimized noise by a fast gradient method based on the privacy attribute inference result; a second calculation step of determining a noise actual output probability distribution based on the minimized noise, the expected output probability vector of the noise, and the availability budget; and generating a countermeasure sample, generating the countermeasure sample based on the actual noise output probability distribution, and sending the countermeasure sample to a parameter server, so that the server updates parameters based on the countermeasure sample, thereby facing a federal learning scene, effectively realizing the balance of privacy and accuracy, and being suitable for two updating mechanisms, namely a synchronous updating mechanism and an asynchronous updating mechanism.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
Fig. 1 is a schematic flowchart of a privacy protection method for federated learning according to an embodiment of the present invention;
fig. 2 is a schematic flow chart illustrating setting parameters according to an embodiment of the present invention;
fig. 3 is a schematic structural diagram of a privacy protecting apparatus for federal learning according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a privacy protecting apparatus for federal learning according to another embodiment of the present invention;
fig. 5 is a schematic physical structure diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some embodiments, but not all embodiments, of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention. The privacy protection method for federal learning provided by the invention will be explained and illustrated in detail by specific embodiments.
Fig. 1 is a schematic flowchart of a privacy protection method for federated learning according to an embodiment of the present invention; as shown in fig. 1, the method includes:
step 101: setting parameters, namely setting parameters according to actual requirements and outputting the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to a privacy inference result expected to be output, and the availability budget refers to an availability budget constraint of noise amount.
In this step, it can be understood that the first step sets parameters, which include three parts, a privacy attribute set, a desired output probability vector of noise, and an availability budget. A privacy attribute set refers to a set of all privacy attribute values that a user specifies need to protect. The expected output probability vector refers to the probability with which the user desires to select one of the generated noises, for example, the user desires to output any privacy inference result with an equal probability uniform distribution, so as to prevent an attacker from correctly inferring the privacy attributes of the user. The availability budget constrains the amount of noise added by the user, guaranteeing the availability of countermeasure samples.
Step 102: and a data dividing step, namely dividing the data set based on the privacy attribute set and determining m training data subsets corresponding to the privacy attributes.
In this step, it can be understood that, in the second step, the user data set is divided, and it is assumed that the user-defined privacy attribute set includes m privacy attribute values, each piece of data corresponds to only one privacy attribute value, and the privacy attribute set does not coincide with the target attribute set. The training data set is partitioned into m subsets according to these privacy attributes. Wherein m is a positive integer greater than zero.
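A minimal sketch of this partitioning is shown below; the helper `privacy_attribute_of` is an assumed user-supplied mapping from a data record to its privacy attribute value, not something named in the patent:

```python
from collections import defaultdict

def partition_by_privacy_attribute(dataset, privacy_attribute_of):
    """Split the local training data into one subset per privacy attribute value;
    each record is assumed to carry exactly one privacy attribute value."""
    subsets = defaultdict(list)
    for record in dataset:
        subsets[privacy_attribute_of(record)].append(record)
    return dict(subsets)  # {privacy attribute value: records with that value}
```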
Step 103: and a first training step, namely training m target models according to the m training data subsets corresponding to the privacy attributes, and determining a model iteration parameter data set.
In this step, it can be understood that, in the third step, the target model is trained, and the user trains m target models respectively by using the m training subsets.
Step 104: a second training step, namely, taking the model iteration parameter data set as training data, taking the privacy attribute value corresponding to each set of parameters in the model iteration parameter data set as a data tag to train a privacy attribute inference model, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier.
In this step, it can be understood that, in the fourth step, the privacy attribute inference model is trained, parameters in each round of update iteration of the target model are used as training data of the privacy attribute inference model, and corresponding privacy attribute values are used as data labels, so that an m-class privacy attribute classifier is trained.
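The following sketch illustrates this step under the assumption that each training example is a flattened parameter vector from one update iteration; the choice of logistic regression as the m-class classifier is illustrative, since the patent only requires some m-class privacy attribute classifier:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def train_privacy_attribute_inference_model(param_snapshots, attribute_labels):
    """param_snapshots: one flattened parameter vector per target-model update iteration;
    attribute_labels: the privacy attribute value of the subset that model was trained on."""
    clf = LogisticRegression(max_iter=1000)
    clf.fit(np.asarray(param_snapshots), np.asarray(attribute_labels))
    return clf  # maps a parameter vector to an inferred privacy attribute value
```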
Step 105: a first calculation step of determining a minimized noise by a fast gradient method based on the privacy attribute inference result.
In this step, it can be understood that, after the noise is calculated in the fifth step and the privacy attribute inference model is obtained, a set of minimized noises is found through a fast gradient method, so that a set of parameters respectively become countermeasure samples corresponding to one of the m privacy attributes.
Step 106: a second calculation step of determining a noise actual output probability distribution based on the minimized noise, the desired output probability vector of the noise and the availability budget.
In this step, it can be understood that the sixth step calculates the actual output probability of the noise. The optimization goal is to minimize the distance between the output vector distribution of the training data after passing through the privacy attribute inference model and the expected distribution of the user, thereby quantifying the privacy protection effect of the algorithm. While the actual output probability is to satisfy the user's availability budget constraint.
Step 107: and generating a countermeasure sample, generating a countermeasure sample based on the noise actual output probability distribution, and sending the countermeasure sample to a parameter server so that the server performs parameter updating based on the countermeasure sample.
In this step, it can be understood that the seventh step outputs a challenge sample. And when the user participates in the federal learning, randomly selecting one sample from a group of confrontation samples corresponding to the user according to the output probability calculated in the sixth step, and sending the selected sample to the parameter server, so that the server updates the parameters based on the confrontation samples.
In this embodiment, it should be noted that the main purpose of this embodiment is to add countersample noise in the parameter update of the model, so that the countersample noise passes through the privacy attribute inference model and then outputs the privacy inference result with a specific probability distribution (e.g., uniform distribution), thereby alleviating the privacy attribute disclosure problem of the user in federal learning. The privacy protection method for federal learning provided by this embodiment is implemented by users participating in federal learning, and the server still performs parameter aggregation according to a general manner (e.g., weighted average, etc.).
According to the technical scheme, the privacy protection method for federal learning provided by the embodiment of the invention is oriented to the federal learning scenario, can effectively balance privacy and accuracy, and is suitable for both the synchronous and the asynchronous updating mechanism. Because existing perturbation-based privacy protection methods suffer from problems such as low accuracy and the inability to measure the privacy protection effect, this embodiment improves the way the parameters are perturbed, reducing the influence of the privacy mechanism on parameter availability while guaranteeing data privacy, and improving the accuracy of the aggregated model. The model parameters actually submitted by the user are perturbed by the algorithm so that the privacy attribute inference result given by the inference model deviates from the user's real privacy attribute, while the user's perturbation of the model parameters does not excessively affect the accuracy of the finally aggregated global model. In addition, how to accurately measure the availability loss caused by the privacy mechanism and the privacy protection effect also needs to be considered. Therefore, the concept of countermeasure samples is adopted: a certain amount of noise is added during the parameter update of the model to perturb the distribution characteristics of the parameters, so that after passing through the privacy attribute inference model, the privacy inference result is randomly output according to the probability distribution (such as a uniform distribution) expected by the user, thereby resisting the privacy attribute inference attack and alleviating the privacy attribute disclosure problem of federal learning. It should be noted that the probability distribution desired by the user refers to the probability with which the user desires to select one of the generated noises; for example, the user may desire to output any privacy inference result with an equal-probability uniform distribution, so as to prevent an attacker from correctly inferring the privacy attributes of the user.
On the basis of the foregoing embodiment, in this embodiment, the determining, by the first calculating step, the minimized noise by using a fast gradient method based on the privacy attribute inference result specifically includes:
finding a set of noises r = {r_1, r_2, ..., r_m} by the fast gradient method, such that after noise r_i is added to the parameter x to be transmitted, the privacy attribute value output by the countermeasure sample x_i* after passing through the privacy attribute inference model f is i;
the calculation method of the fast gradient method comprises the following steps:
the noise calculation method is as follows:
r_i = x_i* - x
where r represents the set of noises; r_1, r_2, ..., r_m and r_i all represent noises; x represents the parameter to be transmitted; x_i* represents the generated countermeasure sample; f represents the privacy attribute inference model; i represents the privacy attribute value; ε is the availability budget; and l represents a loss function used to calculate the distance between the privacy attribute f(x) output by the privacy attribute inference model and the target privacy attribute i.
In this embodiment, for example, when a user needs to submit a parameter vector x to the server, a set of noises r = {r_1, r_2, ..., r_m} is found by the fast gradient method, such that after noise r_i is added to the parameter x to be transmitted, the privacy attribute value output by the countermeasure sample x_i* after passing through the privacy attribute inference model f is i, that is, f(x + r_i) = f(x_i*) = i, i ∈ {1, 2, ..., m}.
The fast gradient method and the noise calculation method are as follows:
r_i = x_i* - x
where ε is the availability budget specified in the parameter setting step, and l represents the loss function used to calculate the distance between the privacy attribute f(x) output by the privacy attribute inference model and the target privacy attribute i; common loss functions include, but are not limited to, cross entropy, mean square error, etc. The purpose of the clip operation is to confine the generated countermeasure sample values to the domain of the data itself, wiping out values that exceed the domain, so as to ensure that the finally generated x* = {x_1*, x_2*, ..., x_m*} meets the practical application conditions.
According to the technical scheme, the privacy protection method facing the federal learning provided by the embodiment of the invention has the advantages that the privacy inference result is randomly output according to the probability distribution (such as uniform distribution) expected by the user after the model parameters pass through the privacy attribute inference model by adding the least noise, so that the privacy attribute inference attack is resisted, and the privacy attribute disclosure problem of the federal learning is relieved.
On the basis of the foregoing embodiment, in this embodiment, the second calculating step determines an actual output probability distribution of noise based on the minimized noise, the expected output probability vector of noise, and the availability budget, and specifically includes:
calculating a noise actual output probability distribution q based on the minimized noise, the expected output probability vector p of the noise, and the availability budget by adopting a first relation model; the first relation model is as follows:
min KL(q, p)  s.t.  Σ q_i · ||r_i|| ≤ ε,  i ∈ {1, 2, ..., m}
where min KL(q, p) is the optimization target; q is the actual output probability of each noise; p represents the expected output probability of the noise; KL(q, p) calculates the KL divergence between the two vector distributions q and p; the constraint Σ q_i · ||r_i|| ≤ ε ensures that the actual output probability q meets the user's availability budget constraint; ||r_i|| is the L2 norm of the noise vector r_i; and q_i represents the actual output probability of the i-th noise.
In the present embodiment, for example, the noise actual output probability q is calculated. The calculation process can be formalized as the following optimization problem:
min KL(q, p)  s.t.  Σ q_i · ||r_i|| ≤ ε,  i ∈ {1, 2, ..., m}
where min KL(q, p) is the optimization objective, which aims to minimize the KL distance between the output vector distribution q obtained after the parameters x* to be sent pass through the privacy attribute inference model C and the distribution p desired by the user; q is the actual output probability of each noise, with q_i ∈ (0, 1], so the value of KL(q, p) quantifies the privacy protection effect of the algorithm. The constraint Σ q_i · ||r_i|| ≤ ε requires the actual output probability q to satisfy the user's availability budget constraint, where ||r_i|| is the L2 norm of the noise vector r_i.
The calculation formula of the KL distance is as follows: KL(q, p) = Σ_{i=1}^{m} q_i · log(q_i / p_i).
according to the technical scheme, the privacy protection method facing the federal learning provided by the embodiment of the invention has the advantages that the privacy inference result is randomly output according to the probability distribution (such as uniform distribution) expected by the user after the model parameters pass through the privacy attribute inference model by adding the least noise, so that the privacy attribute inference attack is resisted, and the privacy attribute disclosure problem of the federal learning is relieved.
On the basis of the foregoing embodiment, in this embodiment, the generating a challenge sample step generates a challenge sample based on the actual noise output probability distribution, and sends the challenge sample to a parameter server, so that the server performs parameter update based on the challenge sample, specifically including:
when the user needs to send the parameter x to the server, randomly selecting a noise r_i from r = {r_1, r_2, ..., r_m} according to the noise actual output probability distribution q (noise r_i being selected with probability q_i), and generating a countermeasure sample x_i* = x + r_i;
and sending the countermeasure sample x_i* = x + r_i to the parameter server, so that the server performs parameter updating based on the countermeasure sample x_i* = x + r_i.
In the present embodiment, for example, the countermeasure sample is output as follows: when the user needs to send the parameter x to the server, a noise r_i is randomly selected from r = {r_1, r_2, ..., r_m} based on the noise actual output probability distribution q, and the countermeasure sample x_i* = x + r_i is sent to the server.
In order to better understand the present invention, the following examples are further provided to illustrate the present invention, but the present invention is not limited to the following examples.
The method can be applied to the user equipment participating in learning in the federal learning. After the user equipment completes model training, a small amount of noise is added to the parameters to make the parameters become countermeasure samples, then the countermeasure samples are sent to the server for aggregation, and the server still conducts parameter aggregation according to a general mode (such as weighted average and the like) to obtain a global model. The countermeasure sample generated by the user can prevent an attacker from correctly judging the privacy attribute of the user training data, so that the real privacy information of the user is protected. The method comprises the following specific steps:
step one, setting parameters. The parameters include three parts: a set of privacy attributes, a desired output probability vector p of noise and an availability budget epsilon. The specific steps are shown in fig. 2, and include the following three steps.
For example, for a gender classification model, it is assumed that the privacy attribute is skin color, where the set of privacy attributes is skin color attribute set { black race, yellow race, white race }, and the set of target attributes is gender attribute set { male, female }.
Step 112: defining a desired output probability vector p, where p is a one-dimensional vector of length m with p_i ∈ [0, 1], i.e., the user desires to randomly select noise r_i from the m generated noises r = {r_1, r_2, ..., r_m} with probability p_i.
For example, a probability vector p = (p_1, p_2, p_3) indicates that the user desires to select noise r_1 with probability p_1, noise r_2 with probability p_2, and noise r_3 with probability p_3.
Step two: dividing the user data set. The user training data set X is divided into m subsets X = (X_1, X_2, ..., X_m). The privacy attribute corresponding to each piece of data in the data set X_i is i, i ∈ {1, 2, ..., m}.
Step three: training the target models. The user uses each training subset X_i to train a target model T_i, with τ_i iteration rounds, obtaining a target model set T = {T_1, T_2, ..., T_m}.
Step four: training the privacy attribute inference model. The parameters from every update iteration of the training stage of the target model set T = {T_1, T_2, ..., T_m} are all collected to obtain D = (d_1, d_2, ...), which is used as the training data of the privacy attribute inference model; the privacy attribute value of the training subset corresponding to each set of parameters is used as the data label, and an m-class privacy attribute classifier C is trained as the privacy attribute inference model. The privacy attribute inference model training data D thus contains Σ τ_i pieces of data in total.
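A sketch of steps three and four combined is given below; `make_model`, `train_one_round` and `rounds` are illustrative stand-ins for the user's local training routine and the per-subset iteration counts τ_i, none of which are named in the patent:

```python
import numpy as np

def train_target_models_and_collect(subsets, make_model, train_one_round, rounds):
    """Train one target model T_i per privacy-attribute subset X_i and collect the
    flattened parameters of every update iteration as the inference-model data set D,
    labelled with the privacy attribute value i of the subset the model was trained on."""
    D, labels = [], []
    for attr_value, X_i in subsets.items():
        model = make_model()
        for _ in range(rounds[attr_value]):       # tau_i iteration rounds
            params = train_one_round(model, X_i)  # returns the updated parameter vector
            D.append(np.ravel(params))
            labels.append(attr_value)
    return np.array(D), np.array(labels)
```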
Step five: calculating the noise. When the user needs to submit a parameter vector x to the server, a set of noises r = {r_1, r_2, ..., r_m} is found through the fast gradient method, such that after noise r_i is added to the parameter x to be transmitted, the privacy attribute value output by the countermeasure sample x_i* after passing through the privacy attribute inference model C is i, that is, C(x + r_i) = C(x_i*) = i, i ∈ {1, 2, ..., m}.
The fast gradient method and the noise calculation method are as follows:
r_i = x_i* - x
where ε is the availability budget specified in the parameter setting step. The purpose of the clip operation is to confine the generated countermeasure sample values to the domain of the data itself, wiping out values that exceed the domain, so as to ensure that the finally generated x* = {x_1*, x_2*, ..., x_m*} meets the practical application conditions.
And step six, calculating the actual noise output probability q. The calculation process can be formalized as the following optimization problem:
min KL(q, p)  s.t.  Σ q_i · ||r_i|| ≤ ε,  i ∈ {1, 2, ..., m}
where min KL(q, p) is the optimization objective, which aims to minimize the KL distance between the output vector distribution q obtained after the parameters x* to be sent pass through the privacy attribute inference model C and the distribution p desired by the user; q is the actual output probability of each noise, with q_i ∈ (0, 1], so the value of KL(q, p) quantifies the privacy protection effect of the algorithm. The constraint Σ q_i · ||r_i|| ≤ ε requires the actual output probability q to satisfy the user's availability budget constraint, where ||r_i|| is the L2 norm of the noise vector r_i.
The calculation formula of the KL distance is as follows: KL(q, p) = Σ_{i=1}^{m} q_i · log(q_i / p_i).
Step seven: outputting the countermeasure sample. When the user needs to send the parameter x to the server, a noise r_i is randomly selected from r = {r_1, r_2, ..., r_m} according to the probability q_i calculated in step six, and the countermeasure sample x_i* = x + r_i is sent to the server.
The method provided by the embodiment of the invention has the following advantages:
1. when the privacy is ensured by the existing protection method based on differential privacy, a large amount of noise needs to be added on the gradient in the stage of training a model by a user, and the accuracy of the model is sacrificed. The embodiment of the invention uses the generation mode of the countermeasure sample for reference, reduces the noise added to the parameters, improves the usability of the parameters and improves the precision of the global model.
2. According to the embodiment of the invention, the attack is inferred according to the privacy attributes, so that the privacy attributes of the user training data are effectively ensured not to be acquired by an attacker.
3. The existing scheme based on the safe multi-party computing protocol needs to meet the requirement that a certain number of users simultaneously perform model training and aggregation on line so as to correctly obtain the aggregation result of the global model, and therefore, the scheme is not suitable for updating of asynchronous federal learning. In the embodiment of the invention, the confrontation samples can be independently generated among the users, and the constraint parameters of usability and privacy are set in a personalized manner, so that two updating mechanisms of synchronization and asynchronization in federal learning can be covered.
4. The privacy attribute inference model is trained locally by the user, and the success rate of attacking the privacy attributes of the user data is improved. The attribute inference model trained by using the local data of the user has stronger attack capability, and the designed privacy protection scheme is more reliable by defending the stronger privacy attribute inference model.
5. According to the embodiment of the invention, the countermeasure sample idea is introduced, the model parameters are led to pass through the privacy attribute inference model by adding minimum noise, and then the privacy inference result is randomly output in the probability distribution (such as uniform distribution) expected by the user, so as to resist the privacy attribute inference attack, and further the privacy attribute leakage problem of federal learning is relieved. In addition, the user only needs to perform one-time disturbance on the sent model parameters, and the usability of the parameters is guaranteed.
Fig. 3 is a schematic structural diagram of a privacy protecting apparatus for federal learning according to an embodiment of the present invention, and as shown in fig. 3, the apparatus includes: a parameter setting module 201, a data dividing module 202, a first training module 203, a second training module 204, a first calculating module 205, a second calculating module 206 and a confrontation sample generating module 207, wherein:
the parameter setting module 201 sets parameters according to actual requirements and outputs the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to availability budget constraint of noise amount;
a data dividing module 202, configured to divide a data set based on the privacy attribute set, and determine m training data subsets corresponding to the privacy attributes;
the first training module 203 trains m target models according to the m training data subsets corresponding to the privacy attributes to determine a model iteration parameter data set;
the second training module 204 is configured to train a privacy attribute inference model by using the model iteration parameter data set as training data and using a privacy attribute value corresponding to each set of parameter in the model iteration parameter data set as a data tag, and determine a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier;
a first calculation module 205, which determines a minimized noise by using a fast gradient method based on the privacy attribute inference result;
a second calculation module 206 for determining a noise actual output probability distribution based on the minimized noise, the expected output probability vector of the noise and the availability budget;
and a countermeasure sample generation module 207 for generating a countermeasure sample based on the noise actual output probability distribution and sending the countermeasure sample to a parameter server so that the server performs parameter update based on the countermeasure sample.
In order to better understand the present invention, the following examples are further provided to illustrate the present invention, but the present invention is not limited to the following examples.
Referring to fig. 4, the system is composed of a parameter setting module, a data dividing module, a training module (i.e., the first training module), a privacy attribute inference module (i.e., the second training module), a noise calculation module (i.e., the first calculation module), a probability calculation module (i.e., the second calculation module), and a countermeasure sample generation module. The working process is as follows:
a user sets parameters in a parameter setting module according to actual requirements, and the set parameters are output after the setting is finished and serve as the input of a data dividing module, a noise calculating module and a probability calculating module;
in the data dividing module, according to the privacy attribute set, aiming at each privacy attribute, dividing a corresponding training data subset, wherein each subset corresponds to one privacy attribute and is used as the input of the training module;
respectively training corresponding target models by using each training data subset at a training module, and taking parameters obtained by each training update and corresponding privacy attributes as the input of a privacy attribute inference module and a noise calculation module;
taking the input parameters and the corresponding privacy attributes as training data of a privacy attribute inference model, training an m-class privacy attribute classifier as the privacy attribute inference model, and taking a probability vector generated after a parameter x to be sent by a user passes through the privacy attribute inference model as the input of a noise calculation module;
in the noise calculation module, a set of noises r = {r_1, r_2, ..., r_m} is searched for the parameter x to be sent by the user by using the fast gradient method, so that the sample obtained after adding noise r_i to the parameter outputs the privacy attribute value i, i ∈ {1, 2, ..., m}, after passing through the privacy attribute inference model; the noise vector group r = {r_1, r_2, ..., r_m} is taken as the input of the probability calculation module;
the probability calculation module is responsible for calculating the actual output probability distribution q of the noise and is used as the input of the confrontation sample generation module;
the challenge sample generation module takes the probability of input q from r ═ { r ═ r1,r2,...rmRandomly selecting a noise riOutputs the confrontation sample xi *=x+ri。
The privacy protection device for federated learning provided in the embodiments of the present invention may be specifically used to execute the privacy protection method for federated learning in the embodiments described above, and the technical principle and the beneficial effects thereof are similar, which may be specifically referred to the embodiments described above, and are not described herein again.
Based on the same inventive concept, an embodiment of the present invention provides an electronic device, and referring to fig. 5, the electronic device specifically includes the following contents: a processor 301, a communication interface 303, a memory 302, and a communication bus 304;
the processor 301, the communication interface 303 and the memory 302 complete mutual communication through the communication bus 304; the communication interface 303 is used for realizing information transmission between related devices such as modeling software, an intelligent manufacturing equipment module library and the like; the processor 301 is used for calling the computer program in the memory 302, and the processor executes the computer program to implement the method provided by the above method embodiments, for example, the processor executes the computer program to implement the following steps: setting parameters, namely setting parameters according to actual requirements and outputting the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to availability budget constraint of noise amount; a data dividing step, namely dividing a data set based on the privacy attribute set, and determining m training data subsets corresponding to the privacy attributes; a first training step, training m target models according to the m training data subsets corresponding to the privacy attributes, and determining a model iteration parameter data set; a second training step, namely, taking the model iteration parameter data set as training data, taking the privacy attribute value corresponding to each set of parameters in the model iteration parameter data set as a data tag to train a privacy attribute inference model, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier; a first calculation step of determining a minimized noise by a fast gradient method based on the privacy attribute inference result; a second calculation step of determining a noise actual output probability distribution based on the minimized noise, the expected output probability vector of the noise, and the availability budget; and generating a countermeasure sample, generating a countermeasure sample based on the noise actual output probability distribution, and sending the countermeasure sample to a parameter server so that the server performs parameter updating based on the countermeasure sample.
Based on the same inventive concept, another embodiment of the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, the computer program being implemented to perform the methods provided by the above method embodiments when executed by a processor, for example, the steps of setting parameters, setting parameters according to actual requirements, and outputting the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to availability budget constraint of noise amount; a data dividing step, namely dividing a data set based on the privacy attribute set, and determining m training data subsets corresponding to the privacy attributes; a first training step, training m target models according to the m training data subsets corresponding to the privacy attributes, and determining a model iteration parameter data set; a second training step, namely, taking the model iteration parameter data set as training data, taking the privacy attribute value corresponding to each set of parameters in the model iteration parameter data set as a data tag to train a privacy attribute inference model, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier; a first calculation step of determining a minimized noise by a fast gradient method based on the privacy attribute inference result; a second calculation step of determining a noise actual output probability distribution based on the minimized noise, the expected output probability vector of the noise, and the availability budget; and generating a countermeasure sample, generating a countermeasure sample based on the noise actual output probability distribution, and sending the countermeasure sample to a parameter server so that the server performs parameter updating based on the countermeasure sample.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods of the various embodiments or some parts of the embodiments.
In addition, in the present invention, terms such as "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.
Moreover, in the present invention, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
Furthermore, in the description herein, reference to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.
Claims (10)
1. A privacy protection method facing federal learning is characterized by comprising the following steps:
a parameter setting step, namely setting parameters according to actual requirements and outputting the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to the availability budget constraint on the noise amount;
a data dividing step, namely dividing a data set based on the privacy attribute set, and determining m training data subsets corresponding to the privacy attributes;
a first training step, training m target models according to the m training data subsets corresponding to the privacy attributes, and determining a model iteration parameter data set;
a second training step, namely, taking the model iteration parameter data set as training data, taking the privacy attribute value corresponding to each set of parameters in the model iteration parameter data set as a data tag to train a privacy attribute inference model, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier;
a first calculation step of determining a minimized noise by a fast gradient method based on the privacy attribute inference result;
a second calculation step of determining a noise actual output probability distribution based on the minimized noise, the expected output probability vector of the noise, and the availability budget;
and a countermeasure sample generating step, namely generating a countermeasure sample based on the noise actual output probability distribution and sending the countermeasure sample to a parameter server, so that the server performs parameter updating based on the countermeasure sample.
2. The privacy protection method for federal learning as claimed in claim 1, wherein the first calculation step of determining the minimized noise by the fast gradient method based on the privacy attribute inference result specifically includes:
finding a set of noises r = {r_1, r_2, …, r_m} by the fast gradient method, such that the countermeasure sample x_i^*, obtained by adding the noise r_i to the parameter x to be transmitted, is assigned the privacy attribute value i after passing through the privacy attribute inference model f;
the calculation of the fast gradient method is as follows:
the noise is calculated as follows:
r_i = x_i^* - x
wherein r represents the set of noises, r_1, r_2, …, r_m and r_i all represent noises, x represents the parameter to be transmitted, x_i^* represents the generated countermeasure sample, f represents the privacy attribute inference model, i represents the target privacy attribute value, ε is the availability budget, and l represents a loss function for calculating the distance between the privacy attribute f(x) output by the privacy attribute inference model and the target privacy attribute i.
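A minimal sketch of this first calculation step in Python follows. It assumes the privacy attribute inference model f is an m-class torch.nn.Module over parameter vectors and uses the standard targeted fast gradient step x_i^* = x - ε·sign(∇_x l(f(x), i)); the claim states the fast gradient method generically, so this concrete update rule, and the use of the availability budget ε as the step size, are assumptions made only for illustration.

```python
import torch
import torch.nn.functional as F

def minimized_noises(f, x, m, eps):
    """Return the noise set r = {r_1, ..., r_m}, with r_i = x_i^* - x for each target i.

    f   : privacy attribute inference model (m-class torch.nn.Module, assumed)
    x   : parameter vector to be transmitted, 1-D tensor of shape (d,)
    eps : availability budget, used here as the fast gradient step size (assumption)
    """
    noises = []
    for i in range(m):
        x_adv = x.clone().detach().requires_grad_(True)
        # l(f(x), i): distance between the inferred privacy attribute and target attribute i
        loss = F.cross_entropy(f(x_adv.unsqueeze(0)), torch.tensor([i]))
        loss.backward()
        x_i_star = x - eps * x_adv.grad.sign()   # step towards being classified as attribute i
        noises.append((x_i_star - x).detach())   # r_i = x_i^* - x
    return noises
```

Here f could be, for instance, a small linear classifier trained on the model iteration parameter data set from the second training step.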
3. The privacy preserving method for federal learning as claimed in claim 2, wherein the second calculating step determines an actual output probability distribution of noise based on the minimized noise, an expected output probability vector of noise and an availability budget, and specifically includes:
calculating the noise actual output probability distribution q based on the minimized noise, the expected output probability vector p of the noise, and the availability budget by adopting a first relation model; the first relation model is as follows:
min KL(q, p)  s.t.  Σ_i q_i · ‖r_i‖ ≤ ε,  i ∈ {1, 2, …, m}
wherein min KL(q, p) is the optimization objective, q is the vector of actual output probabilities of the noises, p represents the expected output probability vector of the noise, KL(q, p) calculates the KL-divergence between the two distributions q and p, the constraint Σ_i q_i · ‖r_i‖ ≤ ε ensures that the actual output probability q satisfies the user's availability budget constraint, ‖r_i‖ is the L2 norm of the noise vector r_i, and q_i represents the actual output probability of the i-th noise.
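A sketch of this second calculation step under the above definitions is given below; the claim fixes only the objective KL(q, p) and the budget constraint, so the choice of scipy's SLSQP solver and the uniform starting point are assumptions.

```python
import numpy as np
from scipy.optimize import minimize

def noise_output_distribution(noises, p, eps):
    """Solve min KL(q, p) s.t. sum_i q_i * ||r_i|| <= eps, with q a probability vector."""
    p = np.asarray(p, dtype=float)                           # expected output probabilities (assumed > 0)
    norms = np.array([np.linalg.norm(r) for r in noises])    # L2 norms ||r_i||
    m = len(p)

    def kl(q):                                               # KL divergence KL(q, p)
        q = np.clip(q, 1e-12, None)
        return float(np.sum(q * np.log(q / p)))

    constraints = [
        {"type": "eq",   "fun": lambda q: np.sum(q) - 1.0},          # q sums to 1
        {"type": "ineq", "fun": lambda q: eps - np.dot(q, norms)},   # availability budget
    ]
    res = minimize(kl, x0=np.full(m, 1.0 / m), bounds=[(0.0, 1.0)] * m,
                   constraints=constraints, method="SLSQP")
    return res.x                                             # actual output probabilities q_i
```

When the budget ε is large enough the constraint is inactive and q simply equals p, so the privacy inference result is randomized exactly as the user expects; a tighter budget shifts probability mass towards the smaller noises.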
4. The privacy protection method for federal learning according to claim 3, wherein the countermeasure sample generating step of generating a countermeasure sample based on the noise actual output probability distribution and sending the countermeasure sample to a parameter server, so that the server performs parameter updating based on the countermeasure sample, specifically includes:
when the user needs to send a parameter x to the server, randomly selecting a noise r_i from r = {r_1, r_2, …, r_m} according to the noise actual output probability distribution q, and generating a countermeasure sample x_i^* = x + r_i;
and sending the countermeasure sample x_i^* = x + r_i to a parameter server, so that the server performs parameter updating based on the countermeasure sample x_i^* = x + r_i.
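A short sketch of this generating step follows; send_to_server is a hypothetical stand-in for whatever upload call the federated learning client actually uses.

```python
import numpy as np

def make_countermeasure_sample(x, noises, q, rng=np.random.default_rng()):
    q = np.asarray(q, dtype=float)
    q = q / q.sum()                          # guard against numerical drift in q
    i = rng.choice(len(noises), p=q)         # pick noise r_i with probability q_i
    return x + noises[i]                     # countermeasure sample x_i^* = x + r_i

# x_adv = make_countermeasure_sample(x, noises, q)
# send_to_server(x_adv)                      # the server then updates the global model from x_adv
```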
5. A privacy preserving apparatus for federal learning, comprising:
the parameter setting module is used for setting parameters according to actual requirements and outputting the parameters; the parameters include a set of privacy attributes, an expected output probability vector for noise, and an availability budget; the privacy attribute set refers to a set of privacy attribute values to be protected, the expected output probability vector of the noise refers to an expected output privacy inference result, and the availability budget refers to availability budget constraint of noise amount;
the data dividing module is used for dividing a data set based on the privacy attribute set and determining m training data subsets corresponding to the privacy attributes;
the first training module trains m target models according to the m training data subsets corresponding to the privacy attributes and determines a model iteration parameter data set;
the second training module is used for training a privacy attribute inference model by taking the model iteration parameter data set as training data and the privacy attribute value corresponding to each set of parameter in the model iteration parameter data set as a data tag, and determining a privacy attribute inference result; the privacy attribute inference model is an m-class privacy attribute classifier;
the first calculation module is used for determining a minimized noise by a fast gradient method based on the privacy attribute inference result;
the second calculation module is used for determining a noise actual output probability distribution based on the minimized noise, the expected output probability vector of the noise, and the availability budget;
and the countermeasure sample generation module is used for generating a countermeasure sample based on the noise actual output probability distribution and sending the countermeasure sample to a parameter server, so that the server performs parameter updating based on the countermeasure sample.
6. The privacy preserving apparatus for federated learning as claimed in claim 5, wherein the first computing module is specifically configured to:
finding a set of noises r = {r_1, r_2, …, r_m} by the fast gradient method, such that the countermeasure sample x_i^*, obtained by adding the noise r_i to the parameter x to be transmitted, is assigned the privacy attribute value i after passing through the privacy attribute inference model f;
the calculation of the fast gradient method is as follows:
the noise is calculated as follows:
r_i = x_i^* - x
wherein r represents the set of noises, r_1, r_2, …, r_m and r_i all represent noises, x represents the parameter to be transmitted, x_i^* represents the generated countermeasure sample, f represents the privacy attribute inference model, i represents the target privacy attribute value, ε is the availability budget, and l represents a loss function for calculating the distance between the privacy attribute f(x) output by the privacy attribute inference model and the target privacy attribute i.
7. The privacy preserving apparatus for federated learning as claimed in claim 6, wherein the second computing module is specifically configured to:
calculating the noise actual output probability distribution q based on the minimized noise, the expected output probability vector p of the noise, and the availability budget by adopting a first relation model; the first relation model is as follows:
min KL(q, p)  s.t.  Σ_i q_i · ‖r_i‖ ≤ ε,  i ∈ {1, 2, …, m}
wherein min KL(q, p) is the optimization objective, q is the vector of actual output probabilities of the noises, p represents the expected output probability vector of the noise, KL(q, p) calculates the KL-divergence between the two distributions q and p, the constraint Σ_i q_i · ‖r_i‖ ≤ ε ensures that the actual output probability q satisfies the user's availability budget constraint, ‖r_i‖ is the L2 norm of the noise vector r_i, and q_i represents the actual output probability of the i-th noise.
8. The privacy preserving apparatus for federal learning as claimed in claim 7, wherein the generate confrontation sample module is specifically configured to:
when the user needs to send a parameter x to the server, randomly selecting a noise r_i from r = {r_1, r_2, …, r_m} according to the noise actual output probability distribution q, and generating a countermeasure sample x_i^* = x + r_i;
and sending the countermeasure sample x_i^* = x + r_i to a parameter server, so that the server performs parameter updating based on the countermeasure sample x_i^* = x + r_i.
9. An electronic device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, wherein the processor implements the federal learning oriented privacy protection method of any of claims 1-4 when executing the program.
10. A non-transitory computer readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, implements the federated learning-oriented privacy protection method of any one of claims 1-4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011523140.9A CN112668044B (en) | 2020-12-21 | 2020-12-21 | Privacy protection method and device for federal learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112668044A (en) | 2021-04-16
CN112668044B (en) | 2022-04-12
Family
ID=75407419
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011523140.9A Active CN112668044B (en) | 2020-12-21 | 2020-12-21 | Privacy protection method and device for federal learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112668044B (en) |
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107368752A (en) * | 2017-07-25 | 2017-11-21 | 北京工商大学 | A kind of depth difference method for secret protection based on production confrontation network |
CN107766742A (en) * | 2017-11-02 | 2018-03-06 | 广西师范大学 | Dependent is the same as more correlation difference privacy matrix disassembling methods under distributional environment |
US20200202243A1 (en) * | 2019-03-05 | 2020-06-25 | Allegro Artificial Intelligence Ltd | Balanced federated learning |
CN110443063A (en) * | 2019-06-26 | 2019-11-12 | 电子科技大学 | The method of the federal deep learning of self adaptive protection privacy |
CN110572253A (en) * | 2019-09-16 | 2019-12-13 | 济南大学 | Method and system for enhancing privacy of federated learning training data |
CN111091199A (en) * | 2019-12-20 | 2020-05-01 | 哈尔滨工业大学(深圳) | Federal learning method and device based on differential privacy and storage medium |
CN111190487A (en) * | 2019-12-30 | 2020-05-22 | 中国科学院计算技术研究所 | Method for establishing data analysis model |
CN111625820A (en) * | 2020-05-29 | 2020-09-04 | 华东师范大学 | Federal defense method based on AIoT-oriented security |
CN111860832A (en) * | 2020-07-01 | 2020-10-30 | 广州大学 | Method for enhancing neural network defense capacity based on federal learning |
Non-Patent Citations (5)
Title |
---|
ALEKSEI TRIASTCYN et al.: "Federated Learning with Bayesian Differential Privacy", 2019 IEEE International Conference on Big Data (Big Data) *
JIAWEN KANG et al.: "Incentive Mechanism for Reliable Federated Learning: A Joint Optimization Approach to Combining Reputation and Contract Theory", IEEE Internet of Things Journal *
ZHOU Jun et al.: "A survey of security and privacy protection in federated learning", Journal of Xihua University (Natural Science Edition) *
LI Fenghua et al.: "Efficient trajectory privacy protection scheme", Journal on Communications *
DONG Ye et al.: "Efficient and secure federated learning based on secret sharing and gradient selection", Journal of Computer Research and Development *
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113032838A (en) * | 2021-05-24 | 2021-06-25 | 易商征信有限公司 | Label prediction model generation method, prediction method, model generation device, system and medium based on privacy calculation |
CN113360945B (en) * | 2021-06-29 | 2023-04-07 | 招商局金融科技有限公司 | Noise adding method, device, equipment and medium based on differential privacy |
CN113360945A (en) * | 2021-06-29 | 2021-09-07 | 招商局金融科技有限公司 | Noise adding method, device, equipment and medium based on differential privacy |
CN113626866A (en) * | 2021-08-12 | 2021-11-09 | 中电积至(海南)信息技术有限公司 | Localized differential privacy protection method and system for federal learning, computer equipment and storage medium |
CN113626866B (en) * | 2021-08-12 | 2023-10-13 | 积至(海南)信息技术有限公司 | Federal learning-oriented localization differential privacy protection method, system, computer equipment and storage medium |
CN114118407B (en) * | 2021-10-29 | 2023-10-24 | 华北电力大学 | Differential privacy availability measurement method for deep learning |
CN114118407A (en) * | 2021-10-29 | 2022-03-01 | 华北电力大学 | Deep learning-oriented differential privacy usability measurement method |
CN114169007A (en) * | 2021-12-10 | 2022-03-11 | 西安电子科技大学 | Medical privacy data identification method based on dynamic neural network |
CN114169007B (en) * | 2021-12-10 | 2024-05-14 | 西安电子科技大学 | Medical privacy data identification method based on dynamic neural network |
WO2024051456A1 (en) * | 2022-09-05 | 2024-03-14 | 北京火山引擎科技有限公司 | Multi-party collaborative model training method and apparatus, and device and medium |
CN115587381B (en) * | 2022-12-12 | 2023-04-07 | 四川大学华西医院 | Medical diagnosis model combined training method and system based on differential privacy |
CN115587381A (en) * | 2022-12-12 | 2023-01-10 | 四川大学华西医院 | Medical diagnosis model combined training method and system based on differential privacy |
CN117313135A (en) * | 2023-08-02 | 2023-12-29 | 东莞理工学院 | Efficient reconfiguration personal privacy protection method based on attribute division |
CN117313135B (en) * | 2023-08-02 | 2024-04-16 | 东莞理工学院 | Efficient reconfiguration personal privacy protection method based on attribute division |
Also Published As
Publication number | Publication date |
---|---|
CN112668044B (en) | 2022-04-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112668044B (en) | Privacy protection method and device for federal learning | |
Hao et al. | Towards efficient and privacy-preserving federated deep learning | |
Song et al. | FDA $^ 3$: Federated defense against adversarial attacks for cloud-based IIoT applications | |
CN112749392B (en) | Method and system for detecting abnormal nodes in federated learning | |
Hao et al. | Efficient, private and robust federated learning | |
Fang et al. | A privacy-preserving and verifiable federated learning method based on blockchain | |
CN107612878B (en) | Dynamic window selection method based on game theory and wireless network trust management system | |
Wang et al. | Privacy protection federated learning system based on blockchain and edge computing in mobile crowdsourcing | |
Zhang et al. | G-VCFL: Grouped verifiable chained privacy-preserving federated learning | |
Zhang et al. | A survey on security and privacy threats to federated learning | |
CN115719085B (en) | Deep neural network model inversion attack defense method and device | |
Li et al. | Ubiquitous intelligent federated learning privacy-preserving scheme under edge computing | |
CN116187482A (en) | Lightweight trusted federation learning method under edge scene | |
CN114239860A (en) | Model training method and device based on privacy protection | |
CN116127519A (en) | Dynamic differential privacy federal learning system based on blockchain | |
Xu et al. | CGIR: Conditional generative instance reconstruction attacks against federated learning | |
Cui et al. | Boosting accuracy of differentially private federated learning in industrial IoT with sparse responses | |
CN117094412A (en) | Federal learning method and device aiming at non-independent co-distributed medical scene | |
Ni et al. | Federated learning model with adaptive differential privacy protection in medical IoT | |
Pei et al. | Privacy-enhanced graph neural network for decentralized local graphs | |
Li et al. | Trustiness-based hierarchical decentralized federated learning | |
Lu et al. | Robust and verifiable privacy federated learning | |
Liu et al. | Dynamic user clustering for efficient and privacy-preserving federated learning | |
Kang et al. | Communicational and computational efficient federated domain adaptation | |
Wang et al. | Lds-fl: Loss differential strategy based federated learning for privacy preserving |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||