CN116451275B - Privacy protection method based on federal learning and computing equipment - Google Patents

Privacy protection method based on federal learning and computing equipment

Info

Publication number
CN116451275B
CN116451275B CN202310706551.9A
Authority
CN
China
Prior art keywords
model
aggregation
local
update
protection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202310706551.9A
Other languages
Chinese (zh)
Other versions
CN116451275A (en)
Inventor
王志强
于欣月
薛培阳
罗乐琦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE
Original Assignee
BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE filed Critical BEIJING ELECTRONIC SCIENCE AND TECHNOLOGY INSTITUTE
Priority to CN202310706551.9A priority Critical patent/CN116451275B/en
Publication of CN116451275A publication Critical patent/CN116451275A/en
Application granted granted Critical
Publication of CN116451275B publication Critical patent/CN116451275B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/62Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218Protecting access to data via a platform, e.g. using keys or access control rules to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245Protecting personal data, e.g. for financial or medical purposes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60Protecting data
    • G06F21/602Providing cryptographic facilities or services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • G06N20/20Ensemble learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2211/00Indexing scheme relating to details of data-processing equipment not covered by groups G06F3/00 - G06F13/00
    • G06F2211/007Encryption, En-/decode, En-/decipher, En-/decypher, Scramble, (De-)compress
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/50Reducing energy consumption in communication networks in wire-line communication networks, e.g. low power modes or reduced link rate

Abstract

The invention provides a privacy protection method and computing equipment based on federated learning, and relates to the technical field of computer applications. The method comprises the following steps: acquiring a local model update and a global model; determining a direction consistency coefficient according to the local model update and the global model; comparing the direction consistency coefficient with a preset discarding threshold and a preset protection threshold respectively, and determining a marker of the local model update and a protection mode for uploading the local model update to a server according to the comparison result; receiving a first aggregation model and a second aggregation model issued by the server, wherein the first aggregation model and the second aggregation model are determined according to the marker, the first aggregation model needs to be decrypted, and the second aggregation model does not need to be decrypted; and performing decryption aggregation on the first aggregation model and the second aggregation model to obtain a global model update. The beneficial effects of this technical scheme are: the federated learning training time and traffic are reduced on the premise of ensuring training security and model performance.

Description

Privacy protection method based on federal learning and computing equipment
Technical Field
The invention relates to the technical field of computer application, in particular to a privacy protection method and computing equipment based on federal learning.
Background
Federated learning (FL, Federated Learning) is a distributed machine learning and model training technique. In the distributed learning process, each role involved in learning does not exchange its own individual samples and data fields; the whole model training process is completed only by exchanging model parameters or intermediate results, thereby meeting a series of business-model creation and optimization requirements through sample or feature expansion.
Privacy protection is one of the key research problems of federated learning: gradient data and the like exchanged during training are protected using methods such as differential privacy, homomorphic encryption, and secure multi-party computation. In the prior art, adding noise via differential privacy easily degrades model performance; homomorphic encryption requires a large amount of computation and leads to long training times; and secure multi-party computation requires multiple servers, easily causing large communication overhead.
Disclosure of Invention
The invention solves the problem of how to reduce the federal learning training time and the traffic on the premise of ensuring the training safety and the model performance.
In order to solve the above problems, the present invention provides a privacy protection method based on federal learning, including:
acquiring a local model update and a global model;
determining a direction consistency coefficient according to the local model update and the global model;
comparing the direction consistency coefficient with a preset discarding threshold value and a preset protection threshold value respectively, and determining a marker of the local model update and a protection mode of uploading the local model update to a server according to a comparison result; wherein the protection threshold is greater than the discard threshold;
receiving a first aggregation model and a second aggregation model issued by the server; wherein the first aggregate model and the second aggregate model are determined according to the marker, the first aggregate model needs decryption, and the second aggregate model does not need decryption;
and carrying out decryption aggregation on the first aggregation model and the second aggregation model to obtain global model updating.
The beneficial effects of the invention are as follows: a local model update and a global model are acquired, and a direction consistency coefficient is determined according to the local model update and the global model. The direction consistency coefficient is used for judging the degree to which the update direction of the local model meets the requirement, and the manner in which the local model update is uploaded to the server can be determined from it. The direction consistency coefficient is compared with a preset discarding threshold and a preset protection threshold respectively, and a marker of the local model update and a protection mode for uploading the local model update to the server are determined according to the comparison result, wherein the protection threshold is greater than the discarding threshold. By comparing the direction consistency coefficient with the discarding threshold and the protection threshold, the protection mode for uploading the local model update to the server is determined: when the direction consistency coefficient is higher, the local model update can be uploaded in a mode with stronger protection, and when the direction consistency coefficient falls in the middle interval, a method that is easier to implement can improve the efficiency of uploading the local model update to the server on the premise of ensuring the security of the model data. A first aggregation model and a second aggregation model issued by the server are received, wherein the first aggregation model and the second aggregation model are determined according to the marker, the first aggregation model requires decryption, and the second aggregation model does not.
In the process of issuing the models, a classified aggregation mode is adopted: whether a local model update is aggregated into the first aggregation model or the second aggregation model is determined according to the marker, and the first aggregation model and the second aggregation model are issued to the clients together, so that hierarchical aggregation is realized and server resources can be saved. Decryption aggregation is performed on the first aggregation model and the second aggregation model to obtain a global model update. Because only the first aggregation model needs to be decrypted while the second aggregation model does not, the global model update obtained through classified uploading and classified aggregation can reduce training time and traffic on the premise of guaranteeing training security and model performance.
Optionally, the determining the direction consistency coefficient according to the local model update and the global model includes:
determining the direction consistency coefficient through a first formula; wherein the first formula comprises:
$c_i = \dfrac{n_{same}}{n_{all}}$
where $c_i$ is the direction consistency coefficient, $n_{same}$ is the number of identical signs between the local model update and the global model, and $n_{all}$ is the number of all signs of the global model.
Optionally, the comparing the direction consistency coefficient with a preset discard threshold and a preset protection threshold respectively, and determining the marker of the local model update and the protection mode of uploading the local model update to the server according to the comparison result includes:
if the direction consistency coefficient is less than the discard threshold, discarding the local model update and resetting the corresponding client's selection right to zero, without setting the marker;
if the direction consistency coefficient is greater than or equal to the protection threshold, the local model update is protected through homomorphic encryption and uploaded to a server, and the marker is set to be 1;
and if the direction consistency coefficient is greater than or equal to the discard threshold and less than the protection threshold, the local model update is protected through differential privacy and uploaded to the server, and the marker is set to 2.
Optionally, the protecting the local model update by homomorphic encryption includes:
protecting the local model update according to a second formula, wherein the second formula comprises:
$W_i^{*} = \mathrm{Enc}_k\!\left(s \cdot W_i\right), \quad c_i \ge \tau_p$
where $W_i^{*}$ is the local model update protection, $k$ is the key used in the homomorphic encryption, $s$ is a single weighted scalar variable, $W_i$ is the local model update of the i-th client, $c_i$ is the direction consistency coefficient of the i-th client, and $\tau_p$ is the protection threshold.
Optionally, the protecting the local model update by differential privacy includes protecting the local model update according to a third formula, wherein the third formula includes:
$W_i^{*} = \mathrm{Lap}\!\left(W_i\right), \quad \tau_d \le c_i < \tau_p$
where $W_i^{*}$ is the local model update protection, $W_i$ is the local model update of the i-th client, $\mathrm{Lap}(\cdot)$ is the algorithm for adding noise using the Laplace mechanism, $c_i$ is the direction consistency coefficient of the i-th client, $\tau_d$ is the discard threshold, and $\tau_p$ is the protection threshold.
Optionally, the first aggregation model and the second aggregation model are determined according to the marker, including:
if the marker is 1, aggregating the local model update of the homomorphic encryption protection in a ciphertext model form to obtain the first aggregation model;
and if the marker is 2, aggregating the local model update of the differential privacy protection in the form of added noise to obtain the second aggregation model.
Optionally, the aggregating the local model updates of the homomorphic encryption protection in the form of a ciphertext model to obtain the first aggregate model includes:
obtaining the first aggregation model according to a fourth formula, wherein the fourth formula comprises:
$A_1 = \sum_{z=1}^{m} \mathrm{Enc}_k\!\left(s \cdot W_z\right), \quad f_z = 1$
where $A_1$ is the first aggregation model, $m$ is the number of clients protecting the local model update using the homomorphic encryption, $k$ is the key used in the homomorphic encryption, $s$ is a single weighted scalar variable, $W_z$ is the local model update of the z-th client, and $f_z$ is the marker;
said aggregating said local model updates of said differential privacy protection in the form of added noise to obtain said second aggregate model, comprising:
obtaining the second aggregate model according to a fifth formula, wherein the fifth formula comprises:
$A_2 = \sum_{j=1}^{d} \mathrm{Lap}\!\left(W_j\right), \quad f_j = 2$
where $A_2$ is the second aggregation model, $W_j$ is the local model update of the j-th client, $d$ is the number of clients protecting the local model update using the differential privacy, $\mathrm{Lap}(\cdot)$ is the algorithm for adding noise using the Laplace mechanism, and $f_j$ is the marker.
Optionally, the performing decryption aggregation on the first aggregation model and the second aggregation model to obtain global model update includes:
decrypting the first aggregation model according to the decryption algorithm corresponding to homomorphic encryption to obtain a decryption model;
and aggregating the decryption model and the second aggregation model to obtain the global model update.
Optionally, the decrypting the first aggregation model according to the decryption algorithm corresponding to the homomorphic encryption to obtain a decryption model includes:
obtaining the decryption model according to a sixth formula, wherein the sixth formula comprises:
$M = \dfrac{1}{s}\,\mathrm{Dec}_k\!\left(A_1\right)$
where $M$ is the decryption model (the plaintext sum of the homomorphically protected local model updates), $\mathrm{Dec}_k(\cdot)$ is the decryption algorithm corresponding to the homomorphic encryption with key $k$, $A_1$ is the first aggregation model, and $s$ is the single weighted scalar variable;
the step of aggregating the decryption model and the second aggregation model to obtain the global model update includes:
obtaining the global model update according to a seventh formula, wherein the seventh formula comprises:
$W_g = \dfrac{M + A_2}{N}$
where $W_g$ is the global model update, $M$ is the decryption model, $A_2$ is the second aggregation model, and $N$ is the number of clients participating in training.
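The decryption-aggregation step can be sketched as follows. This is a minimal illustration only: the function and parameter names (`decrypt_fn`, `s`, `num_clients`) are assumptions, the models are treated as plain vectors processed elementwise, and the decryption model is assumed to be the decrypted first aggregation model rescaled by the single weighted scalar variable.

```python
def decrypt_aggregate(first_agg, second_agg, decrypt_fn, s, num_clients):
    """Decrypt the first aggregation model, undo the scalar weighting s,
    then average with the second aggregation model over all clients."""
    # Decryption model: plaintext sum of homomorphically protected updates.
    decryption_model = [decrypt_fn(c) / s for c in first_agg]
    # Global model update: elementwise average over participating clients.
    return [(m + a) / num_clients
            for m, a in zip(decryption_model, second_agg)]
```

With an identity "decryption" and `s = 1`, `decrypt_aggregate([2, 4], [4, 8], lambda c: c, 1, 2)` averages the two pools elementwise.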
The invention also provides a computing device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor; the computer program, when executed by the processor, implements the privacy protection method based on federated learning described above.
The computing device of the present invention has the same advantages as the federal learning-based privacy protection method described above relative to the prior art, and is not described in detail herein.
Drawings
Fig. 1 is a schematic flow chart of a privacy protection method based on federal learning according to an embodiment of the present invention;
fig. 2 is a second flowchart of a federal learning-based privacy protection method according to an embodiment of the present invention.
Detailed Description
In order that the above objects, features and advantages of the invention will be readily understood, a more particular description of the invention will be rendered by reference to specific embodiments thereof which are illustrated in the appended drawings.
It is noted that the terms "first," "second," and the like in the description and claims of the invention and in the foregoing figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the invention described herein may be implemented in sequences other than those illustrated or otherwise described herein.
In the description of the present specification, reference to the terms "embodiment," "some embodiments," "optional embodiments," etc., means that a particular feature, structure, material, or characteristic described in connection with the embodiment or implementation is included in at least one embodiment or implementation of the present invention. In this specification, schematic representations of these terms do not necessarily refer to the same embodiment or implementation. Furthermore, the particular features, structures, materials, or characteristics may be combined in any suitable manner in any one or more embodiments or implementations.
Referring to fig. 1 and 2, an embodiment of the present invention provides a privacy protection method based on federal learning, including the steps of:
s1, acquiring local model updating and a global model;
the local model update refers to the update of a single client to the global model in one training, and the global model can be an initial global model or a global model update obtained by the client in the last training, and is applicable according to the condition whether the initial training is performed or not. And if the client is in primary training, applying an initial global model, and if the client is not in primary training, applying global model updating obtained in the last training.
S2, determining a direction consistency coefficient according to the local model update and the global model;
and determining a direction consistency coefficient according to the local model update and the global model, namely judging the problem of the direction consistency of the local model update and the previous global model in the training process, and avoiding great update deviation.
S3, comparing the direction consistency coefficient with a preset discarding threshold value and a preset protection threshold value respectively, and determining a marker of the local model update and a protection mode of uploading the local model update to a server according to a comparison result; wherein the protection threshold is greater than the discard threshold;
the discarding threshold and the protecting threshold are manually set according to actual needs, wherein the protecting threshold is larger than the discarding threshold.
S4, receiving a first aggregation model and a second aggregation model which are issued by the server; wherein the first aggregate model and the second aggregate model are determined according to the marker, the first aggregate model needs decryption, and the second aggregate model does not need decryption;
the server updates the local models uploaded by a plurality of clients, determines to aggregate into a first aggregation model or a second aggregation model according to the markers, and transmits all the first aggregation model and the second aggregation model obtained by the round of aggregation to the clients for training in the next round, namely, each client for training in the next round can receive the complete first aggregation model and second aggregation model.
S5, performing decryption aggregation on the first aggregation model and the second aggregation model to obtain global model updating.
A client participating in the next round of training receives the first aggregation model and the second aggregation model and then performs decryption aggregation to obtain the global model update of this round.
It should be noted that the local model update refers to a single client's update of the previous round's global model update (or of the initial global model). The global model update refers to the new global model obtained by the client performing decryption aggregation on the first aggregation model and the second aggregation model issued by the server.
It should be further noted that, as follows from the basics of federated learning, there are multiple clients, all in communication with the server: the clients upload their local model updates to the server, and the server issues the first aggregation model and the second aggregation model to the clients. The privacy protection method based on federated learning in the embodiment of the present invention describes only one round of uploading local model updates and issuing the first and second aggregation models; in practical applications, the uploading and issuing processes are repeated over multiple training rounds until the model reaches the convergence condition.
In this embodiment, a local model update and a global model are obtained, and a direction consistency coefficient is determined according to the local model update and the global model. The direction consistency coefficient is used for judging the degree to which the update direction of the local model meets the requirement, and the manner in which the local model update is uploaded to the server can be determined from it. The direction consistency coefficient is compared with a preset discarding threshold and a preset protection threshold respectively, and a marker of the local model update and a protection mode for uploading the local model update to the server are determined according to the comparison result, wherein the protection threshold is greater than the discarding threshold. By comparing the direction consistency coefficient with the discarding threshold and the protection threshold, the protection mode for uploading the local model update to the server is determined: when the direction consistency coefficient is higher, the local model update can be uploaded in a mode with stronger protection, and when the direction consistency coefficient falls in the middle interval, a method that is easier to implement can improve the efficiency of uploading the local model update to the server on the premise of ensuring the security of the model data. A first aggregation model and a second aggregation model issued by the server are received, wherein the first aggregation model and the second aggregation model are determined according to the marker, the first aggregation model requires decryption, and the second aggregation model does not.
In the process of issuing the models, a classified aggregation mode is adopted: whether a local model update is aggregated into the first aggregation model or the second aggregation model is determined according to the marker, and the first aggregation model and the second aggregation model are issued to the clients together, so that hierarchical aggregation is realized and server resources can be saved. Decryption aggregation is performed on the first aggregation model and the second aggregation model to obtain a global model update. Because only the first aggregation model needs to be decrypted while the second aggregation model does not, the global model update obtained through classified uploading and classified aggregation can reduce training time and traffic on the premise of guaranteeing training security and model performance.
In another alternative embodiment of the present invention, the determining the direction consistency coefficient according to the local model update and the global model includes:
determining the direction consistency coefficient through a first formula; wherein the first formula comprises:
$c_i = \dfrac{n_{same}}{n_{all}}$
where $c_i$ is the direction consistency coefficient, $n_{same}$ is the number of identical signs between the local model update and the global model, and $n_{all}$ is the number of all signs of the global model.
Specifically, in this embodiment, the symbols refer to the signs of the vector space coordinates in the local model update and the global model. Illustratively, the local model update is (-1, 2, -3), the global model is (1, 2, -1), then the sign of the local model update is (-, +, -), and the global model sign is (+, +, -). The symbols of the last two bits are the same, namely the number of the same symbols between the local model update and the global model is 2, the number of all symbols of the global model is 3, and the direction consistency coefficient is 2/3.
In this embodiment, different uploading manners can be determined for different local model updates according to the proportion of signs that each local model update shares with the global model.
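The sign-agreement computation in the worked example above can be sketched as follows; the function names are illustrative, and the models are treated as flat vectors of coordinates.

```python
def sign(x):
    """Return -1, 0, or 1 depending on the sign of x."""
    return (x > 0) - (x < 0)

def direction_consistency(local_update, global_model):
    """Direction consistency coefficient: fraction of coordinates whose
    signs agree between the local model update and the global model."""
    same = sum(1 for l, g in zip(local_update, global_model)
               if sign(l) == sign(g))
    return same / len(global_model)
```

For the example in the text, `direction_consistency([-1, 2, -3], [1, 2, -1])` yields 2/3, since only the last two coordinates agree in sign.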
In another alternative embodiment of the present invention, as shown in fig. 2, the comparing the direction consistency coefficient with a preset discard threshold and a preset protection threshold respectively, and determining the marker of the local model update and the protection manner of uploading the local model update to the server according to the comparison result includes:
if the direction consistency coefficient is less than the discard threshold, discarding the local model update and resetting the corresponding client's selection right to zero, without setting the marker;
if the direction consistency coefficient is greater than or equal to the protection threshold, the local model update is protected through homomorphic encryption and uploaded to a server, and the marker is set to be 1;
and if the direction consistency coefficient is greater than or equal to the discard threshold and less than the protection threshold, the local model update is protected through differential privacy and uploaded to the server, and the marker is set to 2.
In this embodiment, if the direction consistency coefficient is smaller than the discard threshold, the local model update is not transmitted, so that the security of the local model update in the transmission process is directly ensured.
If the direction consistency coefficient is greater than or equal to the protection threshold, the local model update is protected by homomorphic encryption. Since the key itself is never transmitted, an attacker cannot acquire the key during transmission and can hardly decrypt the local model updates transmitted in this round, which ensures the security of the local model update during transmission.
And if the direction consistency coefficient is greater than or equal to the discard threshold and less than the protection threshold, the local model update is protected by differential privacy. The differential privacy mechanism may include, but is not limited to: the Laplace mechanism, the exponential mechanism, the Gaussian mechanism, etc. The Laplace mechanism is preferably used in this embodiment. Adding noise with the Laplace mechanism satisfies the $\varepsilon$-differential privacy guarantee, enhancing the protection of the local model update during transmission.
According to the above judging conditions, the three cases are mutually exclusive, so the security of the local model update is protected throughout the process of uploading to the server.
It should be noted that, in one round of training, the clients that upload local model updates and the clients that receive the issued first and second aggregation models may be different clients. Resetting the selection right to zero means that, in the next round of training, the client whose selection right has been reset to zero is not selected. Not setting the marker means that no marker is added to the model. For example, if the clients uploading local model updates are numbered 1, 2, 3, 4 and 5, and the selection right of client 3 is reset to zero, then when the first aggregation model and the second aggregation model are issued, client 3 is not selected and the models are issued to clients 1, 2, 4 and 5.
Optionally, as shown in fig. 2, if the direction consistency coefficient is greater than or equal to the protection threshold, the local model update is protected by homomorphic encryption and uploaded to the server, and the selection weight of the corresponding client is increased; and if the direction consistency coefficient is greater than or equal to the discard threshold and less than the protection threshold, the local model update is protected by differential privacy and uploaded to the server, and the selection weight of the corresponding client is reduced.
The increasing selection weight of the corresponding client indicates an increase in the probability that the client is selected in the next round of training, and the decreasing selection weight of the corresponding client indicates a decrease in the probability that the client is selected in the next round of training.
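The three comparison cases and their markers can be sketched as a small dispatch function; the mode names and the `None` marker for the discard case are illustrative assumptions.

```python
def choose_protection(coeff, discard_threshold, protection_threshold):
    """Map a direction consistency coefficient to a (protection mode,
    marker) pair, following the three mutually exclusive cases."""
    if coeff < discard_threshold:
        return ("discard", None)        # update dropped, no marker set
    if coeff >= protection_threshold:
        return ("homomorphic", 1)       # strongest protection, marker 1
    return ("differential_privacy", 2)  # middle band, marker 2
```

Since the protection threshold is greater than the discard threshold, exactly one branch fires for any coefficient.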
In another alternative embodiment of the present invention, said protecting said local model update by homomorphic encryption comprises:
protecting the local model update according to a second formula, wherein the second formula comprises:
$W_i^{*} = \mathrm{Enc}_k\!\left(s \cdot W_i\right), \quad c_i \ge \tau_p$
where $W_i^{*}$ is the local model update protection, $k$ is the key used in the homomorphic encryption, $s$ is a single weighted scalar variable, $W_i$ is the local model update of the i-th client, $c_i$ is the direction consistency coefficient of the i-th client, and $\tau_p$ is the protection threshold. The single weighted scalar variable is a fixed value in the homomorphic encryption algorithm.
It should be noted that the local model update protection refers to the data uploaded to the server, i.e., the local model update with protection applied.
In the present embodiment, the local model update is encrypted with the key $k$, so that the security of the transmission process is protected.
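The patent does not name a concrete homomorphic scheme. As an illustrative stand-in, the sketch below implements a textbook Paillier cryptosystem, which is additively homomorphic: multiplying ciphertexts modulo $n^2$ adds the plaintexts, which is what lets a server aggregate protected updates without seeing them. The primes here are toy-sized and not secure; real model weights would first have to be scaled/encoded to integers modulo $n$.

```python
import math
import random

def paillier_keygen(p, q):
    """Textbook Paillier with g = n + 1; toy primes only."""
    n = p * q
    lam = (p - 1) * (q - 1) // math.gcd(p - 1, q - 1)  # lcm(p-1, q-1)
    mu = pow(lam, -1, n)  # valid because L(g^lam mod n^2) = lam for g = n+1
    return (n,), (n, lam, mu)

def encrypt(pub, m):
    (n,) = pub
    n2 = n * n
    r = random.randrange(2, n)
    while math.gcd(r, n) != 1:
        r = random.randrange(2, n)
    # (1 + n)^m = 1 + m*n (mod n^2), so g^m is cheap to compute
    return (1 + m * n) % n2 * pow(r, n, n2) % n2

def decrypt(priv, c):
    n, lam, mu = priv
    n2 = n * n
    L = (pow(c, lam, n2) - 1) // n  # L(x) = (x - 1) / n
    return L * mu % n
```

Decrypting the modular product of two ciphertexts recovers the sum of the two plaintexts, mirroring how the first aggregation model can be formed in ciphertext form.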
In another alternative embodiment of the present invention, said protecting said local model updates by differential privacy comprises: protecting the local model update according to a third formula, wherein the third formula comprises:
$W_i^{*} = \mathrm{Lap}\!\left(W_i\right), \quad \tau_d \le c_i < \tau_p$
where $W_i^{*}$ is the local model update protection, $W_i$ is the local model update of the i-th client, $\mathrm{Lap}(\cdot)$ is the algorithm for adding noise using the Laplace mechanism, $c_i$ is the direction consistency coefficient of the i-th client, $\tau_d$ is the discard threshold, and $\tau_p$ is the protection threshold.
It should be noted that the local model update protection refers to the data uploaded to the server, i.e., the local model update with protection applied.
In this embodiment, noise is introduced by the algorithm for adding noise using the Laplace mechanism, so that the original local model update is protected.
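The Laplace-mechanism noise algorithm is not spelled out in the text; the sketch below is one standard realization, assuming a per-coordinate sensitivity and a privacy budget epsilon (both parameter names are illustrative, not from the patent).

```python
import random

def laplace_noise(b):
    """Sample Laplace(0, b): the difference of two i.i.d. Exp(1) draws
    is distributed as Laplace(0, 1)."""
    return b * (random.expovariate(1.0) - random.expovariate(1.0))

def lap_protect(update, sensitivity=1.0, epsilon=1.0):
    """Add Laplace(0, sensitivity/epsilon) noise to every component of the
    local model update -- the Lap(.) step applied before upload."""
    b = sensitivity / epsilon
    return [w + laplace_noise(b) for w in update]
```

Since the noise is zero-mean, it largely cancels when many noisy updates are summed in the second aggregation model, while each individual upload remains perturbed.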
In another alternative embodiment of the present invention, as shown in connection with fig. 2, the first aggregation model and the second aggregation model are determined according to the marker, comprising:
if the marker is 1, aggregating the local model update of the homomorphic encryption protection in a ciphertext model form to obtain the first aggregation model;
and if the marker is 2, aggregating the local model update of the differential privacy protection in the form of added noise to obtain the second aggregation model.
Specifically, in this embodiment, the global model is secure during both the sorting aggregation and the decryption aggregation. While the server transmits the first aggregation model and the second aggregation model, an attacker can only obtain the transmitted data of the two models. Without the key, the attacker cannot decrypt the first aggregation model, so the first aggregation model is secure; the added noise causes the second aggregation model to satisfy the $\epsilon$-differential privacy mechanism, so it is also secure. The security of issuing the first aggregation model and the second aggregation model to the clients is therefore guaranteed.
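The marker assignment above, together with the direction consistency coefficient that drives it, can be sketched as follows. Treating "identical symbols" as sign agreement of corresponding components, and counting zero as positive, is an assumption; the patent only defines the coefficient as a ratio of matching symbols.

```python
def direction_consistency(local_update, global_model):
    """First-formula sketch: fraction of components whose signs agree
    between the local model update and the global model."""
    same = sum(1 for l, g in zip(local_update, global_model)
               if (l >= 0) == (g >= 0))
    return same / len(global_model)

def choose_marker(d, discard_threshold, protection_threshold):
    """Route a local update by its direction consistency coefficient d:
    None -> discard, 1 -> homomorphic encryption, 2 -> differential privacy."""
    if d < discard_threshold:
        return None
    if d >= protection_threshold:
        return 1
    return 2
```

The server then only needs the marker value to decide whether an incoming update joins the ciphertext aggregation (marker 1) or the noisy aggregation (marker 2).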
In another optional embodiment of the present invention, the aggregating the local model updates of the homomorphic encryption protection in the form of a ciphertext model to obtain the first aggregation model includes:
obtaining the first aggregation model according to a fourth formula, wherein the fourth formula comprises:
$W_1 = \sum_{z=1}^{n_1} \mathrm{Enc}_k(\lambda \cdot \Delta w_z), \quad g_z = 1$

where $W_1$ is the first aggregation model, $n_1$ is the number of clients protecting the local model update using the homomorphic encryption, $k$ is the key used in said homomorphic encryption, the sum is evaluated homomorphically on the ciphertexts, $\lambda$ is the single weighted scalar variable, $\Delta w_z$ is the local model update of the z-th client, and $g_z$ is the marker; $g_z = 1$ denotes a local model update uploaded with the homomorphic encryption protection;
said aggregating said local model updates of said differential privacy protection in the form of added noise to obtain said second aggregate model, comprising:
obtaining the second aggregate model according to a fifth formula, wherein the fifth formula comprises:
$W_2 = \sum_{j=1}^{n_2} \mathrm{Lap}(\Delta w_j), \quad g_j = 2$

where $W_2$ is the second aggregation model, $\Delta w_j$ is the local model update of the j-th client, $n_2$ is the number of clients protecting the local model update using the differential privacy, $\mathrm{Lap}(\cdot)$ is the algorithm for adding noise using the Laplace mechanism, and $g_j$ is the marker; $g_j = 2$ denotes a local model update uploaded with the differential privacy protection.
In another alternative embodiment of the present invention, as shown in fig. 2, the performing decryption aggregation on the first aggregation model and the second aggregation model to obtain global model update includes:
decrypting the first aggregation model according to the decryption algorithm corresponding to homomorphic encryption to obtain a decryption model;
and aggregating the decryption model and the second aggregation model to obtain the global model update.
Specifically, in this embodiment, during the decryption aggregation the global model update is obtained by decrypting the first aggregation model and then aggregating it with the second aggregation model; without the key, the first aggregation model cannot be decrypted to yield the correct global model update. Since the key exists only on the clients, an attacker cannot obtain it, and intercepting the transmitted data of the first aggregation model and the second aggregation model does not reveal the global model update, so the security of the global model update is guaranteed.
In another optional embodiment of the present invention, the decrypting the first aggregation model according to the decryption algorithm corresponding to the homomorphic encryption, to obtain a decryption model, includes:
obtaining the decryption model according to a sixth formula, wherein the sixth formula comprises:
$W_d = \dfrac{\mathrm{Dec}_k(W_1)}{\lambda}$

where $W_d$ is the decryption model, $\mathrm{Dec}_k(\cdot)$ is the decryption algorithm corresponding to the homomorphic encryption with the key $k$, $W_1$ is the first aggregation model, and $\lambda$ is the single weighted scalar variable;
the step of aggregating the decryption model and the second aggregation model to obtain the global model update includes:
obtaining the global model update according to a seventh formula, wherein the seventh formula comprises:
$\Delta W = \dfrac{W_d + W_2}{n}$

where $\Delta W$ is the global model update, $W_d$ is the decryption model, $W_2$ is the second aggregation model, and $n$ is the number of clients participating in training.
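Taken together, the sixth and seventh formulas reduce the server's decryption aggregation to a few arithmetic steps. The sketch below treats updates as scalars and takes the decryption routine as a parameter (`decrypt_fn`, an illustrative name); the weight corresponds to the single weighted scalar variable.

```python
def decryption_aggregate(w1_cipher, w2, n_clients, weight, decrypt_fn):
    """Sixth formula: decrypt the first aggregation model and divide out the
    scalar weight to obtain the decryption model. Seventh formula: average
    it with the second aggregation model over all participating clients."""
    w_d = decrypt_fn(w1_cipher) / weight  # decryption model
    return (w_d + w2) / n_clients         # global model update
```

For example, with an identity "decryption" standing in for the real routine: if the decrypted first aggregation model is 6.0 with weight 2.0 (i.e. the homomorphically-protected updates sum to 3.0), the second aggregation model is 1.0, and 3 clients participated, the global model update is (3.0 + 1.0) / 3.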
The embodiment of the invention also provides a computing device, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor executes the program to realize the privacy protection method based on federal learning.
Compared with the prior art, the computing device according to the embodiment of the present invention has the same advantages as the privacy protection method based on federal learning described above, which are not repeated here.
Although the invention is disclosed above, the scope of the invention is not limited thereto. Various changes and modifications may be made by one skilled in the art without departing from the spirit and scope of the invention, and these changes and modifications will fall within the scope of the invention.

Claims (8)

1. A federal learning-based privacy protection method, comprising:
acquiring a local model update and a global model;
determining a direction consistency coefficient according to the local model update and the global model;
determining the direction consistency coefficient through a first formula; wherein the first formula comprises:
$d_i = \dfrac{N_{\mathrm{same}}}{N_{\mathrm{all}}}$

where $d_i$ is the direction consistency coefficient, $N_{\mathrm{same}}$ is the number of identical signs between the local model update and the global model, and $N_{\mathrm{all}}$ is the number of all signs of the global model;
comparing the direction consistency coefficient with a preset discarding threshold value and a preset protection threshold value respectively, and determining a marker of the local model update and a protection mode of uploading the local model update to a server according to a comparison result; wherein the protection threshold is greater than the discard threshold;
if the direction consistency coefficient is less than the discard threshold, discarding the local model update and resetting the selection weight of the corresponding client to zero, without setting the marker;
if the direction consistency coefficient is greater than or equal to the protection threshold, the local model update is protected through homomorphic encryption and uploaded to the server, and the marker is set to 1;
if the direction consistency coefficient is greater than or equal to the discard threshold and less than the protection threshold, the local model update is protected through differential privacy and uploaded to the server, and the marker is set to 2;
receiving a first aggregation model and a second aggregation model issued by the server; wherein the first aggregate model and the second aggregate model are determined according to the marker, the first aggregate model needs decryption, and the second aggregate model does not need decryption;
and carrying out decryption aggregation on the first aggregation model and the second aggregation model to obtain global model updating.
2. The federal learning-based privacy protection method according to claim 1, wherein the protecting the local model update by homomorphic encryption comprises:
protecting the local model update according to a second formula, wherein the second formula comprises:
$\widetilde{\Delta w_i} = \mathrm{Enc}_k(\lambda \cdot \Delta w_i), \quad d_i \ge T_2$

where $\widetilde{\Delta w_i}$ is the local model update protection, $k$ is the key used in said homomorphic encryption, $\lambda$ is the single weighted scalar variable, $\Delta w_i$ is the local model update of the i-th client, $d_i$ is the direction consistency coefficient of the i-th client, and $T_2$ is the protection threshold.
3. The federal learning-based privacy protection method of claim 1, wherein the protecting the local model update by differential privacy comprises protecting the local model update according to a third formula, wherein the third formula comprises:
$\widetilde{\Delta w_i} = \mathrm{Lap}(\Delta w_i), \quad T_1 \le d_i < T_2$

where $\widetilde{\Delta w_i}$ is the local model update protection, $\Delta w_i$ is the local model update of the i-th client, $\mathrm{Lap}(\cdot)$ is the algorithm for adding noise using the Laplace mechanism, $d_i$ is the direction consistency coefficient of the i-th client, $T_1$ is the discard threshold, and $T_2$ is the protection threshold.
4. The federal learning-based privacy protection method of claim 1, wherein the first aggregation model and the second aggregation model are determined according to the marker, comprising:
if the marker is 1, aggregating the local model update of the homomorphic encryption protection in a ciphertext model form to obtain the first aggregation model;
and if the marker is 2, aggregating the local model update of the differential privacy protection in the form of added noise to obtain the second aggregation model.
5. The federal learning-based privacy protection method of claim 4,
wherein the aggregating the local model updates of the homomorphic encryption protection in the form of a ciphertext model to obtain the first aggregation model comprises:
obtaining the first aggregation model according to a fourth formula, wherein the fourth formula comprises:
$W_1 = \sum_{z=1}^{n_1} \mathrm{Enc}_k(\lambda \cdot \Delta w_z), \quad g_z = 1$

where $W_1$ is the first aggregation model, $n_1$ is the number of clients protecting the local model update using the homomorphic encryption, $k$ is the key used in said homomorphic encryption, $\lambda$ is the single weighted scalar variable, $\Delta w_z$ is the local model update of the z-th client, and $g_z$ is the marker;
said aggregating said local model updates of said differential privacy protection in the form of added noise to obtain said second aggregate model, comprising:
obtaining the second aggregate model according to a fifth formula, wherein the fifth formula comprises:
$W_2 = \sum_{j=1}^{n_2} \mathrm{Lap}(\Delta w_j), \quad g_j = 2$

where $W_2$ is the second aggregation model, $\Delta w_j$ is the local model update of the j-th client, $n_2$ is the number of clients protecting the local model update using the differential privacy, $\mathrm{Lap}(\cdot)$ is the algorithm for adding noise using the Laplace mechanism, and $g$ is the marker.
6. The federal learning-based privacy protection method of claim 1, wherein the performing decryption aggregation on the first aggregation model and the second aggregation model to obtain the global model update comprises:
decrypting the first aggregation model according to the decryption algorithm corresponding to homomorphic encryption to obtain a decryption model;
and aggregating the decryption model and the second aggregation model to obtain the global model update.
7. The federal learning-based privacy protection method of claim 6,
the decrypting the first aggregation model according to the decryption algorithm corresponding to homomorphic encryption to obtain a decryption model, including:
obtaining the decryption model according to a sixth formula, wherein the sixth formula comprises:
$W_d = \dfrac{\mathrm{Dec}_k(W_1)}{\lambda}$

where $W_d$ is the decryption model, $\mathrm{Dec}_k(\cdot)$ is the decryption algorithm corresponding to the homomorphic encryption with the key $k$, $W_1$ is the first aggregation model, and $\lambda$ is the single weighted scalar variable;
the step of aggregating the decryption model and the second aggregation model to obtain the global model update includes:
obtaining the global model update according to a seventh formula, wherein the seventh formula comprises:
$\Delta W = \dfrac{W_d + W_2}{n}$

where $\Delta W$ is the global model update, $W_d$ is the decryption model, $W_2$ is the second aggregation model, and $n$ is the number of clients participating in training.
8. A computing device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, which when executed by the processor, implements the federal learning-based privacy protection method of any of claims 1-7.
CN202310706551.9A 2023-06-15 2023-06-15 Privacy protection method based on federal learning and computing equipment Active CN116451275B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310706551.9A CN116451275B (en) 2023-06-15 2023-06-15 Privacy protection method based on federal learning and computing equipment

Publications (2)

Publication Number Publication Date
CN116451275A CN116451275A (en) 2023-07-18
CN116451275B true CN116451275B (en) 2023-08-22

Family

ID=87128819

Country Status (1)

Country Link
CN (1) CN116451275B (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113434873A (en) * 2021-06-01 2021-09-24 内蒙古大学 Federal learning privacy protection method based on homomorphic encryption
CN113468521A (en) * 2021-07-01 2021-10-01 哈尔滨工程大学 Data protection method for federal learning intrusion detection based on GAN
CN114372046A (en) * 2021-05-13 2022-04-19 青岛亿联信息科技股份有限公司 Parking flow prediction model training method based on federal learning
CN114547643A (en) * 2022-01-20 2022-05-27 华东师范大学 Linear regression longitudinal federated learning method based on homomorphic encryption
WO2022110720A1 (en) * 2020-11-24 2022-06-02 平安科技(深圳)有限公司 Selective gradient updating-based federated modeling method and related device
CN115150068A (en) * 2022-06-10 2022-10-04 上海大学 Safe federal learning system and method in quantum automatic driving car networking
WO2023045503A1 (en) * 2021-09-27 2023-03-30 支付宝(杭州)信息技术有限公司 Feature processing method and device based on differential privacy
CN115965093A (en) * 2021-10-09 2023-04-14 北京字节跳动网络技术有限公司 Model training method and device, storage medium and electronic equipment
WO2023092792A1 (en) * 2021-11-29 2023-06-01 深圳前海微众银行股份有限公司 Optimization method for modeling based on federated learning, and electronic device, storage medium and program product

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3970074A1 (en) * 2019-05-16 2022-03-23 FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. Concepts for federated learning, client classification and training data similarity measurement

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Towards Efficient and Privacy-preserving Federated Deep Learning; Meng Hao et al.; IEEE; pp. 1-6 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant