CN113642715A - Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget - Google Patents

Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget

Info

Publication number
CN113642715A
CN113642715A
Authority
CN
China
Prior art keywords
neuron
neural network
model
layer
deep learning
Prior art date
Legal status
Pending
Application number
CN202111009795.9A
Other languages
Chinese (zh)
Inventor
张亚玲
白世博
Current Assignee
Shenzhen Hongyue Information Technology Co., Ltd.
Original Assignee
Xi'an University of Technology
Priority date
Filing date
Publication date
Application filed by Xi'an University of Technology
Priority to CN202111009795.9A
Publication of CN113642715A
Current legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00 Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/60 Protecting data
    • G06F21/62 Protecting access to data via a platform, e.g. using keys or access control rules
    • G06F21/6218 Protecting access to data via a platform, e.g. using keys or access control rules, to a system of files or objects, e.g. local or distributed file system or database
    • G06F21/6245 Protecting personal data, e.g. for financial or medical purposes
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Bioethics (AREA)
  • Molecular Biology (AREA)
  • Mathematical Physics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medical Informatics (AREA)
  • Databases & Information Systems (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Image Analysis (AREA)

Abstract

The invention aims to provide a differential privacy protection deep learning algorithm for self-adaptively allocating dynamic privacy budgets. First, a data set is given, a neural network NN is set up and initialized, and the neural network NN is trained with the data set to obtain a deep learning model M without privacy protection. The average feature correlation is then calculated with the LRP algorithm from the trained model M without privacy protection, and the correlation ratio is calculated from it. Finally, the neural network NN is reinitialized, the number of training iterations is set, and noise is added during training according to the correlation ratio, yielding a deep learning model DPM with differential privacy protection, so that data privacy can be protected when the model is used for prediction.

Description

Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget
Technical Field
The invention belongs to the technical field of information security, and particularly relates to a differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets.
Background
With the development of internet technology, hundreds of millions of data records are generated in daily life every day, and this huge amount of data often contains potential, regular and ultimately understandable knowledge or patterns. Data Mining (DM) technology can discover and extract this useful information from such massive data and feed it back to guide business and everyday life; it mainly applies machine learning methods and statistical principles for knowledge mining, and research on and improvement of machine learning methods therefore often has an important influence on the efficiency and results of data mining. Deep Learning is a branch of machine learning: a class of algorithms that attempts to perform high-level abstraction of data using multiple processing layers that contain complex structures or consist of multiple nonlinear transformations. Deep learning has made impressive breakthroughs in many areas, including computer vision, speech recognition, image recognition, natural language processing and search recommendation. It aims to build a multi-layer network, extract complex features from the original input data and mine the knowledge structure hidden in the data.
Knowledge and patterns hidden in massive data can be discovered with data mining algorithms such as deep learning, but this usually comes at the cost of privacy. If the private data used for training are not well protected, they can be leaked through model parameters or predictions, so data mining technology with privacy-preserving properties has become an important requirement. How to effectively protect the privacy of training sample data from intrusion while applying deep neural network algorithms is therefore of great importance. Dalenius proposed the concept of privacy disclosure control, and the k-anonymity algorithm laid the foundation for anonymous privacy protection algorithms based on equivalence-class grouping, followed by l-diversity, t-closeness, (α, k)-anonymity and the like. These models improve the theory of anonymity protection against attackers with different background knowledge. However, they all share some common drawbacks: they require new designs to cope with rapidly evolving attacks, and they provide no rigorous proof quantifying the privacy protection effect. Many scholars at home and abroad have applied privacy protection techniques to various data mining methods, but these methods all depend on particular background knowledge held by the attacker and cannot provide sufficient security guarantees.
Differential Privacy (DP) is a privacy definition proposed by Dwork in 2006 for the privacy disclosure problem of statistical databases; it is a privacy protection model based on data distortion. Compared with traditional privacy protection models, the differential privacy model is defined on a solid mathematical basis, and the level of privacy protection provided by an algorithm can be controlled; at the same time, it defines the maximum background knowledge that an attacker may possess, i.e. the sum of all other information that the attacker can obtain apart from the target record.
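For reference, the ε-differential privacy definition and the Laplace mechanism on which this model relies can be stated as follows (these are the standard formulations from the differential privacy literature, restated here for completeness; they are not specific to the present invention). A randomized algorithm A satisfies ε-differential privacy if, for any two neighboring data sets D and D' differing in a single record and for any set S of possible outputs,

\Pr[\mathcal{A}(D) \in S] \le e^{\varepsilon} \cdot \Pr[\mathcal{A}(D') \in S]

For a query function f with global sensitivity \Delta f = \max_{D,D'} \lVert f(D) - f(D') \rVert_1, the Laplace mechanism

\mathcal{A}(D) = f(D) + \mathrm{Lap}\!\left(\frac{\Delta f}{\varepsilon}\right)

satisfies ε-differential privacy, where Lap(b) denotes Laplace noise with scale parameter b; a smaller privacy budget ε means more noise and stronger protection.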
Disclosure of Invention
The invention aims to provide a differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets, and solves the problem that in the prior art, the consumed privacy budget is too large, so that the privacy protection level is low.
The technical scheme adopted by the invention is that a differential privacy protection deep learning algorithm for self-adaptively allocating dynamic privacy budgets is implemented according to the following steps:
step 1, a data set D = {(X_1, y_1), (X_2, y_2), ..., (X_j, y_j), ..., (X_n, y_n) | j ∈ (1, n)} is given, where for a piece of data (X_j, y_j), X_j represents the data features and y_j represents the data label; a neural network NN comprising an input layer, hidden layers and an output layer is set up and initialized, and the neural network NN is trained with the data set D to obtain a deep learning model M without privacy protection;
step 2, calculating the average feature correlation R̄ with the LRP algorithm from the deep learning model M without privacy protection trained in step 1;
step 3, calculating the correlation ratio α_j from the average feature correlation R̄ obtained in step 2;
step 4, reinitializing the neural network NN, setting the number of training iterations T, and adding noise during training according to the correlation ratio α_j obtained in step 3, to obtain a deep learning model DPM with differential privacy protection, so that data privacy can be protected when the model is used for prediction.
The present invention is also characterized in that,
the step 1 is implemented according to the following steps:
step 1.1, an image data set D is given, a neural network NN comprising an input layer, three hidden layers and an output layer is set up, and the neural network parameters ω are initialized randomly;
step 1.2, randomly selecting a batch of data B from the data set D and inputting it into the neural network NN;
step 1.3, training the neural network NN with the SGD algorithm and continuously adjusting the neural network parameters ω to obtain the optimal parameters ω_M, thereby obtaining the deep learning model M without differential privacy protection, which is used for calculating the feature correlation R̄ in the subsequent steps.
The step 2 is implemented according to the following steps:
step 2.1, randomly selecting a record (X_i, y_i) from the data set D;
step 2.2, inputting the data X_i into the model M trained in step 1 to obtain the model predicted value ŷ;
step 2.3, according to the model predicted value ŷ, for each neuron p in the last hidden layer l, calculating the feature correlation R_p^(l) between the neuron p and the model output ŷ:

R_p^{(l)} = \frac{z_{pm}}{z_m + \varepsilon \cdot \mathrm{sign}(z_m)} \, R_m

wherein z_pm = a_p·ω_pm is the product of the network parameters between a neuron p in the hidden layer l and the output layer neuron m, a_p represents the value of neuron p, ω_pm represents the weight coefficient from neuron p to neuron m, i.e. the network parameter between neuron p and neuron m, z_m = Σ_p z_pm + b_m is the affine transformation from the hidden layer l to the output layer neuron m, b_m represents the bias value from the last hidden layer l to the output layer neuron m, ε is a predefined stabilizer for avoiding a zero denominator, and R_m denotes the correlation of the output layer neuron m, initialized with the model predicted value ŷ;
step 2.4, calculating the correlation decomposition information R_{q←p} of each neuron p in the last hidden layer l with respect to each neuron q in the previous hidden layer, i.e. layer l-1:

R_{q \leftarrow p} = \frac{z_{qp}}{z_p + \varepsilon \cdot \mathrm{sign}(z_p)} \, R_p^{(l)}

wherein z_qp = a_q·ω_qp is the product of the value a_q of neuron q and the network parameter ω_qp between neuron q and neuron p, z_p = Σ_q z_qp + b_p is the affine transformation from neuron q to neuron p, and b_p represents the bias value from layer l-1 to layer l;
step 2.5, adding noise that follows the Laplace distribution to the correlation decomposition information R_{q←p}:

\tilde{R}_{q \leftarrow p} = R_{q \leftarrow p} + \mathrm{Lap}\!\left(\frac{\Delta}{\varepsilon_r}\right)

wherein Lap(Δ/ε_r) denotes the Laplace noise added to the correlation decomposition information, Δ represents the global sensitivity, and ε_r represents the privacy budget used when adding noise to the correlation decomposition information;
step 2.6, calculating the feature correlation R_q^(l-1) of each neuron q in the hidden layer l-1:

R_q^{(l-1)} = \sum_p \tilde{R}_{q \leftarrow p}

repeating steps 2.3 to 2.6 until the correlations of the input features have been calculated, and finally averaging the obtained correlations to obtain the average feature correlation R̄.
Step 3 is specifically implemented according to the following steps:
the correlation ratio α_j of the j-th neuron of a given layer of the neural network within that layer is:

\alpha_j = \frac{\bar{R}_j}{\sum_k \bar{R}_k}

wherein R̄_j represents the average feature correlation of the j-th neuron in the layer, and the sum in the denominator runs over all neurons k of the same layer.
Step 4 is specifically implemented according to the following steps:
step 4.1, creating a network Net with the same structure as the network in step 1, randomly initializing its parameters to ω_0, setting the training batch size L, the number of training iterations T and the total privacy budget ε, and initializing the iteration counter t to 1;
step 4.2, randomly sampling L samples from the data set D to form the training sample set L_t of the t-th iteration;
step 4.3, feeding the L samples of the training sample set L_t of the t-th iteration into the current neural network Net to obtain the model predicted value ŷ_i of each sample, and calculating, from the model predicted value ŷ_i and the true value y_i of each sample, the model loss function L(ω_t, X_i, y_i) of each sample:

\mathcal{L}(\omega_t, X_i, y_i) = -\sum_c y_{i,c} \log \hat{y}_{i,c}

this loss function, called the cross-entropy loss function, measures the difference between the predicted value and the true value, where ω_t represents the model parameters of the t-th iteration and (X_i, y_i) represents one record in the batch L_t;
step 4.4, using the model loss function of each sample, calculating the partial derivatives with respect to the model parameters ω_t of the t-th iteration through error back propagation, thereby obtaining the L intermediate model gradients g_t of the t-th iteration;
step 4.5, calculating the privacy budget ε_t of the t-th iteration from the total privacy budget ε and the current iteration step t;
step 4.6, calculating the privacy budget ε_jt = α_j·ε_t used when adding noise to the gradient of the j-th neuron of a given layer of the neural network in the t-th iteration;
step 4.7, adding noise to the gradient g_t to obtain the noise gradient g̃_t of the intermediate model:

\tilde{g}_t = \frac{1}{L}\left(\sum_i g_t(X_i) + \mathrm{Lap}\!\left(\frac{\Delta g_t}{\varepsilon_{jt}}\right)\right)

wherein g_t(X_i) represents the gradient calculated from the record (X_i, y_i) in the sample batch L_t, and Δg_t represents the global sensitivity of the parameters ω_t at the t-th iteration;
step 4.8, updating the model parameters:

\omega_{t+1} = \omega_t - \eta_t \, \tilde{g}_t

wherein η_t represents the learning rate of the t-th iteration;
step 4.9, judging whether t is equal to T; if so, taking the optimized parameters ω_{t+1} obtained in the T-th iteration as the final parameters of the neural network Net, thereby obtaining the trained deep learning model DPM with differential privacy protection; otherwise, returning to step 4.2.
Compared with the prior art, the differential privacy protection deep learning algorithm for adaptively allocating a dynamic privacy budget has the beneficial effect that, on top of perturbing the gradient according to correlation, it further considers how strongly the noise magnitude affects model convergence at different stages of training. Training a deep neural network is a process of moving from random weights to optimal weights, i.e. from an initial model to an optimal model. In the initial stage of training the random weights are far from the optimal weights and the gradients are usually large, so adding large noise does not greatly affect the model. In the later stages of training the weights are close to the optimal weights and the gradient values are usually small, so adding the same amount of noise to the gradient at this point may cause the model to oscillate and degrade its accuracy. The invention therefore also dynamically changes the size of the privacy budget over the training phase, reducing the influence of noise on the model and further improving the practicability of the model while still providing an effective differential privacy guarantee.
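A small numerical illustration of this point follows; it is a sketch with arbitrary example values (the gradient magnitudes and the noise scale are assumptions, not values taken from the invention), showing that a fixed amount of Laplace noise distorts a large early-training gradient much less, in relative terms, than a small late-training gradient.

import numpy as np

# Illustration of the argument above: the same Laplace noise is negligible
# relative to a large early-training gradient but overwhelming relative to a
# small late-training gradient, which is why a smaller budget (more noise)
# early and a larger budget (less noise) late is preferable.
rng = np.random.default_rng(0)
noise = rng.laplace(0.0, 0.05, size=1000)          # fixed noise scale
for g in (1.0, 0.01):                               # early vs late gradient magnitude
    rel_err = np.mean(np.abs(noise) / abs(g))
    print(f"gradient magnitude {g:>5}: mean relative distortion {rel_err:.2f}")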
Drawings
FIG. 1 is a flow chart of the LRP algorithm with differential privacy protection of the present invention computing feature correlations;
FIG. 2 is a flow diagram of the adaptive gradient perturbed deep learning differential privacy preserving deep neural network of the present invention;
FIG. 3 is a schematic diagram of the forward and backward passes of the LRP algorithm used in the present invention.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.
In the differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets, while the neural network is optimized with stochastic gradient descent (SGD) or one of its variants, the model obtains the current predicted value from its input data, uses this predicted value to calculate the prediction error of the model, and back-propagates the obtained error value to calculate the current gradient g_i; noise obeying the Laplace distribution is added to the gradient to obtain the noisy gradient g̃_i, which is then used to update the network parameters, thereby protecting the private information. After the specified number of training epochs, or once the model error is smaller than a threshold, the trained model parameters are obtained, and finally a deep neural network classifier with differential privacy protection is obtained. In the invention, self-adaptation means that when the gradient is perturbed, the privacy budget is allocated according to the relevance of each neuron to the model output: the larger the relevance, the larger the allocated privacy budget and the smaller the added noise, and vice versa; the budget allocated in this way is the dynamic privacy budget. The dynamic privacy budget means that the size of the privacy budget is changed dynamically over the course of training, according to how strongly noise affects the gradient at different stages of training. Unlike existing privacy-preserving neural network classifiers, the method can effectively improve the prediction accuracy of a deep neural network with differential privacy protection.
First, the Layer-wise Relevance Propagation (LRP) algorithm is used to calculate the relevance R_i between each neuron and the output of the model; while the relevance is being calculated, Laplace noise is added to the relevance decomposition information to protect data privacy. Secondly, at different stages of training, the dynamic privacy budget ε_t of the current stage is calculated and, based on the calculated relevances, adaptively allocated to obtain the privacy budget ε_jt of each gradient in the current stage. Finally, according to the different privacy budgets ε_jt, different amounts of noise are added to the gradients during training. This reduces the risk of leakage of the network model parameters and protects the privacy of the training data: an attacker cannot infer the model parameters and therefore cannot infer the data in the training set, achieving the purpose of privacy protection.
The invention discloses a differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets, which is implemented, with reference to FIGS. 1-3, according to the following steps:
Step 1, a data set D = {(X_1, y_1), (X_2, y_2), ..., (X_j, y_j), ..., (X_n, y_n) | j ∈ (1, n)} is given, where for a piece of data (X_j, y_j), X_j represents the data features and y_j represents the data label, i.e. the category to which X_j belongs; a neural network NN comprising an input layer, a hidden layer and an output layer is set up and initialized, and a deep learning model M without privacy protection is obtained by training the neural network NN with the data set D;
the step 1 is implemented according to the following steps:
step 1.1, an image data set D is given, a neural network NN comprising an input layer, three hidden layers and an output layer is set up, and the neural network parameters ω are initialized randomly;
step 1.2, randomly selecting a batch of data B from the data set D and inputting it into the neural network NN;
step 1.3, training the neural network NN with the SGD algorithm and continuously adjusting the neural network parameters ω to obtain the optimal parameters ω_M, thereby obtaining the deep learning model M without differential privacy protection, which is used for calculating the feature correlation R̄ in the subsequent steps.
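For illustration, a minimal sketch of this non-private pre-training phase is given below, assuming PyTorch, a flattened 784-dimensional image input, three fully connected hidden layers and a 10-class output; the layer widths, activation function, learning rate, epoch count and batch handling are illustrative assumptions rather than values fixed by the invention.

import torch
import torch.nn as nn

def build_nn(in_dim=784, hidden=256, out_dim=10):
    # Neural network NN of step 1.1: input layer, three hidden layers, output layer.
    return nn.Sequential(
        nn.Linear(in_dim, hidden), nn.ReLU(),
        nn.Linear(hidden, hidden), nn.ReLU(),
        nn.Linear(hidden, hidden), nn.ReLU(),
        nn.Linear(hidden, out_dim),
    )

def train_non_private(model, loader, epochs=10, lr=0.05):
    # Steps 1.2-1.3: repeatedly draw a batch B from the data set D and adjust the
    # parameters omega with SGD, yielding the non-private model M that is later
    # used only for computing the feature correlations.
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for X, y in loader:                 # batch B drawn from data set D
            opt.zero_grad()
            loss = loss_fn(model(X), y)
            loss.backward()
            opt.step()
    return model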
Step 2, calculating the average feature correlation R̄ with the LRP algorithm from the deep learning model M without privacy protection trained in step 1.
The step 2 is implemented according to the following steps:
step 2.1, randomly selecting a record (X_i, y_i) from the data set D;
step 2.2, inputting the data X_i into the model M trained in step 1 to obtain the model predicted value ŷ;
step 2.3, according to the model predicted value ŷ, for each neuron p in the last hidden layer l, calculating the feature correlation R_p^(l) between the neuron p and the model output ŷ:

R_p^{(l)} = \frac{z_{pm}}{z_m + \varepsilon \cdot \mathrm{sign}(z_m)} \, R_m

wherein z_pm = a_p·ω_pm is the product of the network parameters between a neuron p in the hidden layer l and the output layer neuron m, a_p represents the value of neuron p, ω_pm represents the weight coefficient from neuron p to neuron m, i.e. the network parameter between neuron p and neuron m, z_m = Σ_p z_pm + b_m is the affine transformation from the hidden layer l to the output layer neuron m, b_m represents the bias value from the last hidden layer l to the output layer neuron m, ε is a predefined stabilizer for avoiding a zero denominator, and R_m denotes the correlation of the output layer neuron m, initialized with the model predicted value ŷ;
step 2.4, calculating the correlation decomposition information R_{q←p} of each neuron p in the last hidden layer l with respect to each neuron q in the previous hidden layer, i.e. layer l-1:

R_{q \leftarrow p} = \frac{z_{qp}}{z_p + \varepsilon \cdot \mathrm{sign}(z_p)} \, R_p^{(l)}

wherein z_qp = a_q·ω_qp is the product of the value a_q of neuron q and the network parameter ω_qp between neuron q and neuron p, z_p = Σ_q z_qp + b_p is the affine transformation from neuron q to neuron p, and b_p represents the bias value from layer l-1 to layer l;
step 2.5, adding noise that follows the Laplace distribution to the correlation decomposition information R_{q←p}:

\tilde{R}_{q \leftarrow p} = R_{q \leftarrow p} + \mathrm{Lap}\!\left(\frac{\Delta}{\varepsilon_r}\right)

wherein Lap(Δ/ε_r) denotes the Laplace noise added to the correlation decomposition information, Δ represents the global sensitivity, and ε_r represents the privacy budget used when adding noise to the correlation decomposition information;
step 2.6, calculating the feature correlation R_q^(l-1) of each neuron q in the hidden layer l-1:

R_q^{(l-1)} = \sum_p \tilde{R}_{q \leftarrow p}

repeating steps 2.3 to 2.6 until the correlations of the input features have been calculated, and finally averaging the obtained correlations to obtain the average feature correlation R̄.
The above steps are described algorithmically as follows:
Input: data set D, deep learning model M without differential privacy protection
Output: average feature correlation R̄
(1) for (X_i, y_i) ∈ D do
(2)   input X_i into the neural network to obtain the predicted value ŷ
(3)   compute the feature correlation R_p^(l) of each neuron p in the last hidden layer l
(4)   for l, ..., 1 do
(5)     compute the correlation decomposition information R_{q←p}
(6)     add Laplace noise to obtain the noisy decomposition R̃_{q←p}
(7)     compute the correlation R_q^(l-1) of each neuron q
(8)   end for
(9)   compute the average feature correlation R̄
(10) end for
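The listing above can be realized, for example, by the following NumPy sketch for a fully connected network. It assumes the trained model M is available as a list of (W, b) weight/bias arrays together with the per-layer activations of each selected record, uses the ε-rule form of LRP reconstructed in steps 2.3-2.6, and treats the noise budget ε_r, the sensitivity and the stabilizer as illustrative inputs; it is a sketch of the approach rather than the invention's exact implementation.

import numpy as np

def lrp_backward_layer(a_prev, W, b, R_next, eps_r, sensitivity=1.0, stab=1e-6, rng=None):
    # One epsilon-rule LRP step from layer l back to layer l-1 (steps 2.4-2.6):
    # z_qp = a_q * w_qp, z_p = sum_q z_qp + b_p, R_{q<-p} = z_qp / (z_p + stab*sign(z_p)) * R_p,
    # with Laplace noise Lap(sensitivity / eps_r) added to each decomposition term.
    # a_prev: activations of layer l-1, shape [q]; W: weights, shape [q, p];
    # b: biases of layer l, shape [p]; R_next: relevances of layer l, shape [p].
    rng = rng if rng is not None else np.random.default_rng()
    z = a_prev[:, None] * W                        # z_qp, shape [q, p]
    z_p = z.sum(axis=0) + b                        # affine output of each neuron p
    denom = z_p + stab * np.sign(z_p)              # stabilizer avoids a zero denominator
    R_decomp = z / denom * R_next                  # R_{q<-p}, shape [q, p]
    noise = rng.laplace(0.0, sensitivity / eps_r, size=R_decomp.shape)
    return (R_decomp + noise).sum(axis=1)          # R_q, shape [q]

def average_feature_relevance(layers, per_record_activations, eps_r):
    # Steps 2.1-2.6 repeated over several records and averaged (R-bar).
    # `layers` is a list of (W, b) pairs; `per_record_activations` holds, for each
    # record X_i, the list of layer activations ending with the prediction y-hat.
    total = None
    for acts in per_record_activations:
        R = acts[-1]                               # start from the model output
        for (W, b), a_prev in zip(reversed(layers), reversed(acts[:-1])):
            R = lrp_backward_layer(a_prev, W, b, R, eps_r)
        total = R if total is None else total + R
    return total / len(per_record_activations)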
Step 3, calculating the correlation ratio α_j from the average feature correlation R̄ obtained in step 2.
Step 3 is specifically implemented according to the following steps:
the correlation ratio α_j of the j-th neuron of a given layer of the neural network within that layer is:

\alpha_j = \frac{\bar{R}_j}{\sum_k \bar{R}_k}

wherein R̄_j represents the average feature correlation of the j-th neuron in the layer, and the sum in the denominator runs over all neurons k of the same layer.
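As a small illustration, assuming the per-layer average relevances R̄ are available as an array, the ratio α_j of each neuron in a layer might be computed as follows (taking absolute values and normalizing by the layer total is an assumption of this sketch, corresponding to the reading of the formula above):

import numpy as np

def relevance_ratios(layer_avg_relevance):
    # alpha_j: share of the layer's total average relevance attributed to neuron j;
    # absolute values keep the ratios non-negative (an assumption of this sketch,
    # since LRP relevances may be negative).
    r = np.abs(np.asarray(layer_avg_relevance, dtype=float))
    return r / r.sum()

# Example: average relevances 0.5, 1.0, 0.5 -> ratios 0.25, 0.5, 0.25
print(relevance_ratios([0.5, 1.0, 0.5]))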
Step 4, reinitializing the neural network NN, setting the number of training iterations T, and adding noise during training according to the correlation ratio α_j obtained in step 3, to obtain a deep learning model DPM with differential privacy protection, so that data privacy can be protected when the model is used for prediction.
Step 4 is specifically implemented according to the following steps:
step 4.1, creating a network Net with the same structure as the network in step 1, randomly initializing its parameters to ω_0, setting the training batch size L, the number of training iterations T and the total privacy budget ε, and initializing the iteration counter t to 1;
step 4.2, randomly sampling L samples from the data set D to form the training sample set L_t of the t-th iteration;
step 4.3, feeding the L samples of the training sample set L_t of the t-th iteration into the current neural network Net to obtain the model predicted value ŷ_i of each sample, and calculating, from the model predicted value ŷ_i and the true value y_i of each sample, the model loss function L(ω_t, X_i, y_i) of each sample:

\mathcal{L}(\omega_t, X_i, y_i) = -\sum_c y_{i,c} \log \hat{y}_{i,c}

this loss function, called the cross-entropy loss function, measures the difference between the predicted value and the true value, where ω_t represents the model parameters of the t-th iteration and (X_i, y_i) represents one record in the batch L_t;
step 4.4, using the model loss function of each sample, calculating the partial derivatives with respect to the model parameters ω_t of the t-th iteration through error back propagation, thereby obtaining the L intermediate model gradients g_t of the t-th iteration;
step 4.5, calculating the privacy budget ε_t of the t-th iteration from the total privacy budget ε and the current iteration step t;
step 4.6, calculating the privacy budget ε_jt = α_j·ε_t used when adding noise to the gradient of the j-th neuron of a given layer of the neural network in the t-th iteration;
step 4.7, adding noise to the gradient g_t to obtain the noise gradient g̃_t of the intermediate model:

\tilde{g}_t = \frac{1}{L}\left(\sum_i g_t(X_i) + \mathrm{Lap}\!\left(\frac{\Delta g_t}{\varepsilon_{jt}}\right)\right)

wherein g_t(X_i) represents the gradient calculated from the record (X_i, y_i) in the sample batch L_t, and Δg_t represents the global sensitivity of the parameters ω_t at the t-th iteration;
step 4.8, updating the model parameters:

\omega_{t+1} = \omega_t - \eta_t \, \tilde{g}_t

wherein η_t represents the learning rate of the t-th iteration;
step 4.9, judging whether t is equal to T; if so, taking the optimized parameters ω_{t+1} obtained in the T-th iteration as the final parameters of the neural network Net, thereby obtaining the trained deep learning model DPM with differential privacy protection; otherwise, returning to step 4.2.
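A condensed PyTorch-style sketch of this training phase is given below. It assumes the ratios α_j from step 3 are provided per layer as tensors over output neurons, uses a hypothetical linearly increasing schedule for ε_t (the exact formula of step 4.5 is not reproduced here), and uses elementwise gradient clipping with a bound `clip` as a simplified stand-in for the global sensitivity Δg_t; it is a sketch of the approach rather than the invention's exact implementation.

import torch
import torch.nn as nn
from torch.distributions import Laplace
from itertools import cycle

def dp_train(model, loader, alphas, total_eps, T, lr=0.05, clip=1.0):
    # Step 4: re-train the re-initialized network Net for T iterations, perturbing
    # gradients with Laplace noise whose per-neuron budget is eps_jt = alpha_j * eps_t.
    # `alphas` maps a parameter name to a 1-D tensor of ratios over that layer's
    # output neurons; `clip` bounds each gradient entry (a simplifying assumption).
    loss_fn = nn.CrossEntropyLoss()
    # Hypothetical linearly increasing schedule for eps_t; the per-iteration
    # budgets sum to the total budget total_eps.
    schedule = [total_eps * t / (T * (T + 1) / 2.0) for t in range(1, T + 1)]
    batches = cycle(loader)
    for t in range(T):
        X, y = next(batches)                         # training sample set L_t
        model.zero_grad()
        loss_fn(model(X), y).backward()              # gradients g_t via backpropagation
        eps_t = schedule[t]
        with torch.no_grad():
            for name, p in model.named_parameters():
                if p.grad is None:
                    continue
                g = p.grad.clamp(-clip, clip)        # bound each gradient entry
                if name in alphas:                   # per-output-neuron budgets eps_jt
                    scale = clip / (alphas[name] * eps_t)
                    scale = scale.view(-1, *([1] * (g.dim() - 1)))
                else:
                    scale = torch.full_like(g, clip / eps_t)
                scale = torch.broadcast_to(scale, g.shape)
                noise = Laplace(torch.zeros_like(g), scale).sample()
                p.add_(g + noise, alpha=-lr)         # omega_{t+1} = omega_t - eta_t * noisy gradient
    return model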

Claims (5)

1. The differential privacy protection deep learning algorithm for adaptively allocating the dynamic privacy budget is characterized by being implemented according to the following steps:
step 1, a data set D = {(X_1, y_1), (X_2, y_2), ..., (X_j, y_j), ..., (X_n, y_n) | j ∈ (1, n)} is given, where for a piece of data (X_j, y_j), X_j represents the data features and y_j represents the data label, i.e. the category to which X_j belongs; a neural network NN comprising an input layer, a hidden layer and an output layer is set up and initialized, and the neural network NN is trained with the data set D to obtain a deep learning model M without privacy protection;
step 2, calculating the average feature correlation R̄ with the LRP algorithm from the deep learning model M without privacy protection trained in step 1;
step 3, calculating the correlation ratio α_j from the average feature correlation R̄ obtained in step 2;
step 4, reinitializing the neural network NN, setting the number of training iterations T, and adding noise during training according to the correlation ratio α_j obtained in step 3, to obtain a deep learning model DPM with differential privacy protection, so that data privacy can be protected when the model is used for prediction.
2. The differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets according to claim 1, wherein the step 1 is specifically implemented according to the following steps:
step 1.1, an image data set D is given, a neural network NN comprising an input layer, three hidden layers and an output layer is set up, and the neural network parameters ω are initialized randomly;
step 1.2, randomly selecting a batch of data B from the data set D and inputting it into the neural network NN;
step 1.3, training the neural network NN with the SGD algorithm and continuously adjusting the neural network parameters ω to obtain the optimal parameters ω_M, thereby obtaining the deep learning model M without differential privacy protection, which is used for calculating the feature correlation R̄ in the subsequent steps.
3. The differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets according to claim 2, wherein the step 2 is specifically implemented according to the following steps:
step 2.1, randomly selecting a record (X_i, y_i) from the data set D;
step 2.2, inputting the data X_i into the model M trained in step 1 to obtain the model predicted value ŷ;
step 2.3, according to the model predicted value ŷ, for each neuron p in the last hidden layer l, calculating the feature correlation R_p^(l) between the neuron p and the model output ŷ:

R_p^{(l)} = \frac{z_{pm}}{z_m + \varepsilon \cdot \mathrm{sign}(z_m)} \, R_m

wherein z_pm = a_p·ω_pm is the product of the network parameters between a neuron p in the hidden layer l and the output layer neuron m, a_p represents the value of neuron p, ω_pm represents the weight coefficient from neuron p to neuron m, i.e. the network parameter between neuron p and neuron m, z_m = Σ_p z_pm + b_m is the affine transformation from the hidden layer l to the output layer neuron m, b_m represents the bias value from the last hidden layer l to the output layer neuron m, ε is a predefined stabilizer for avoiding a zero denominator, and R_m denotes the correlation of the output layer neuron m, initialized with the model predicted value ŷ;
step 2.4, calculating the correlation decomposition information R_{q←p} of each neuron p in the last hidden layer l with respect to each neuron q in the previous hidden layer, i.e. layer l-1:

R_{q \leftarrow p} = \frac{z_{qp}}{z_p + \varepsilon \cdot \mathrm{sign}(z_p)} \, R_p^{(l)}

wherein z_qp = a_q·ω_qp is the product of the value a_q of neuron q and the network parameter ω_qp between neuron q and neuron p, z_p = Σ_q z_qp + b_p is the affine transformation from neuron q to neuron p, and b_p represents the bias value from layer l-1 to layer l;
step 2.5, adding noise that follows the Laplace distribution to the correlation decomposition information R_{q←p}:

\tilde{R}_{q \leftarrow p} = R_{q \leftarrow p} + \mathrm{Lap}\!\left(\frac{\Delta}{\varepsilon_r}\right)

wherein Lap(Δ/ε_r) denotes the Laplace noise added to the correlation decomposition information, Δ represents the global sensitivity, and ε_r represents the privacy budget used when adding noise to the correlation decomposition information;
step 2.6, calculating the feature correlation R_q^(l-1) of each neuron q in the hidden layer l-1:

R_q^{(l-1)} = \sum_p \tilde{R}_{q \leftarrow p}

repeating steps 2.3 to 2.6 until the correlations of the input features have been calculated, and finally averaging the obtained correlations to obtain the average feature correlation R̄.
4. The differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets according to claim 3, wherein the step 3 is specifically implemented according to the following steps:
the correlation ratio α_j of the j-th neuron of a given layer of the neural network within that layer is:

\alpha_j = \frac{\bar{R}_j}{\sum_k \bar{R}_k}

wherein R̄_j represents the average feature correlation of the j-th neuron in the layer, and the sum in the denominator runs over all neurons k of the same layer.
5. The differential privacy protection deep learning algorithm for adaptively allocating dynamic privacy budgets according to claim 4, wherein the step 4 is specifically implemented according to the following steps:
step 4.1, creating a network Net with the same structure as the network in step 1, randomly initializing its parameters to ω_0, setting the training batch size L, the number of training iterations T and the total privacy budget ε, and initializing the iteration counter t to 1;
step 4.2, randomly sampling L samples from the data set D to form the training sample set L_t of the t-th iteration;
step 4.3, feeding the L samples of the training sample set L_t of the t-th iteration into the current neural network Net to obtain the model predicted value ŷ_i of each sample, and calculating, from the model predicted value ŷ_i and the true value y_i of each sample, the model loss function L(ω_t, X_i, y_i) of each sample:

\mathcal{L}(\omega_t, X_i, y_i) = -\sum_c y_{i,c} \log \hat{y}_{i,c}

this loss function, called the cross-entropy loss function, measures the difference between the predicted value and the true value, where ω_t represents the model parameters of the t-th iteration and (X_i, y_i) represents one record in the batch L_t;
step 4.4, using the model loss function of each sample, calculating the partial derivatives with respect to the model parameters ω_t of the t-th iteration through error back propagation, thereby obtaining the L intermediate model gradients g_t of the t-th iteration;
step 4.5, calculating the privacy budget ε_t of the t-th iteration from the total privacy budget ε and the current iteration step t;
step 4.6, calculating the privacy budget ε_jt = α_j·ε_t used when adding noise to the gradient of the j-th neuron of a given layer of the neural network in the t-th iteration;
step 4.7, adding noise to the gradient g_t to obtain the noise gradient g̃_t of the intermediate model:

\tilde{g}_t = \frac{1}{L}\left(\sum_i g_t(X_i) + \mathrm{Lap}\!\left(\frac{\Delta g_t}{\varepsilon_{jt}}\right)\right)

wherein g_t(X_i) represents the gradient calculated from the record (X_i, y_i) in the sample batch L_t, and Δg_t represents the global sensitivity of the parameters ω_t at the t-th iteration;
step 4.8, updating the model parameters:

\omega_{t+1} = \omega_t - \eta_t \, \tilde{g}_t

wherein η_t represents the learning rate of the t-th iteration;
step 4.9, judging whether t is equal to T; if so, taking the optimized parameters ω_{t+1} obtained in the T-th iteration as the final parameters of the neural network Net, thereby obtaining the trained deep learning model DPM with differential privacy protection; otherwise, returning to step 4.2.
CN202111009795.9A 2021-08-31 2021-08-31 Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget Pending CN113642715A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111009795.9A CN113642715A (en) 2021-08-31 2021-08-31 Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111009795.9A CN113642715A (en) 2021-08-31 2021-08-31 Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget

Publications (1)

Publication Number Publication Date
CN113642715A true CN113642715A (en) 2021-11-12

Family

ID=78424627

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111009795.9A Pending CN113642715A (en) 2021-08-31 2021-08-31 Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget

Country Status (1)

Country Link
CN (1) CN113642715A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113961967A (en) * 2021-12-13 2022-01-21 支付宝(杭州)信息技术有限公司 Method and device for jointly training natural language processing model based on privacy protection
CN114169007A (en) * 2021-12-10 2022-03-11 西安电子科技大学 Medical privacy data identification method based on dynamic neural network
CN114548373A (en) * 2022-02-17 2022-05-27 河北师范大学 Differential privacy deep learning method based on feature region segmentation
CN114780999A (en) * 2022-06-21 2022-07-22 广州中平智能科技有限公司 Deep learning data privacy protection method, system, equipment and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109034228A (en) * 2018-07-17 2018-12-18 陕西师范大学 A kind of image classification method based on difference privacy and level relevance propagation
US20190227980A1 (en) * 2018-01-22 2019-07-25 Google Llc Training User-Level Differentially Private Machine-Learned Models
CN111091193A (en) * 2019-10-31 2020-05-01 武汉大学 Domain-adapted privacy protection method based on differential privacy and oriented to deep neural network
CN111737743A (en) * 2020-06-22 2020-10-02 安徽工业大学 Deep learning differential privacy protection method
CN111814190A (en) * 2020-08-21 2020-10-23 安徽大学 Privacy protection method based on differential privacy distributed deep learning optimization

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190227980A1 (en) * 2018-01-22 2019-07-25 Google Llc Training User-Level Differentially Private Machine-Learned Models
CN109034228A (en) * 2018-07-17 2018-12-18 陕西师范大学 A kind of image classification method based on difference privacy and level relevance propagation
CN111091193A (en) * 2019-10-31 2020-05-01 武汉大学 Domain-adapted privacy protection method based on differential privacy and oriented to deep neural network
CN111737743A (en) * 2020-06-22 2020-10-02 安徽工业大学 Deep learning differential privacy protection method
CN111814190A (en) * 2020-08-21 2020-10-23 安徽大学 Privacy protection method based on differential privacy distributed deep learning optimization

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Li Min; Li Hongjiao; Chen Jie: "Research on the Adam Optimization Algorithm under Differential Privacy Protection", Computer Applications and Software, no. 06, 12 June 2020 (2020-06-12), pages 259-264 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114169007A (en) * 2021-12-10 2022-03-11 西安电子科技大学 Medical privacy data identification method based on dynamic neural network
CN114169007B (en) * 2021-12-10 2024-05-14 西安电子科技大学 Medical privacy data identification method based on dynamic neural network
CN113961967A (en) * 2021-12-13 2022-01-21 支付宝(杭州)信息技术有限公司 Method and device for jointly training natural language processing model based on privacy protection
CN114548373A (en) * 2022-02-17 2022-05-27 河北师范大学 Differential privacy deep learning method based on feature region segmentation
CN114548373B (en) * 2022-02-17 2024-03-26 河北师范大学 Differential privacy deep learning method based on feature region segmentation
CN114780999A (en) * 2022-06-21 2022-07-22 广州中平智能科技有限公司 Deep learning data privacy protection method, system, equipment and medium
CN114780999B (en) * 2022-06-21 2022-09-27 广州中平智能科技有限公司 Deep learning data privacy protection method, system, equipment and medium

Similar Documents

Publication Publication Date Title
CN113642715A (en) Differential privacy protection deep learning algorithm for self-adaptive distribution of dynamic privacy budget
CN110321926B (en) Migration method and system based on depth residual error correction network
Kingma et al. Adam: A method for stochastic optimization
CN111737743A (en) Deep learning differential privacy protection method
CN114548373A (en) Differential privacy deep learning method based on feature region segmentation
CN113642717B (en) Convolutional neural network training method based on differential privacy
Torra et al. On a comparison between Mahalanobis distance and Choquet integral: The Choquet–Mahalanobis operator
Oloso et al. Hybrid functional networks for oil reservoir PVT characterisation
CN110837603A (en) Integrated recommendation method based on differential privacy protection
CN112836802A (en) Semi-supervised learning method, lithology prediction method and storage medium
Dhulipala et al. Active learning with multifidelity modeling for efficient rare event simulation
CN111539444A (en) Gaussian mixture model method for modified mode recognition and statistical modeling
Adesuyi et al. A layer-wise perturbation based privacy preserving deep neural networks
Yang et al. Effective surrogate gradient learning with high-order information bottleneck for spike-based machine intelligence
CN111311324B (en) User-commodity preference prediction system and method based on stable neural collaborative filtering
Ibitoye et al. Differentially private self-normalizing neural networks for adversarial robustness in federated learning
Zhou et al. Deep binarized convolutional neural network inferences over encrypted data
Nilsen et al. Epistemic uncertainty quantification in deep learning classification by the Delta method
Lin et al. Differential privacy protection over deep learning: An investigation of its impacted factors
CN114912142A (en) Data desensitization method and device, electronic equipment and storage medium
CN117313160B (en) Privacy-enhanced structured data simulation generation method and system
CN113935496A (en) Robustness improvement defense method for integrated model
Krishnamoorthy et al. Gas lift optimization under uncertainty
Springer et al. Robust parameter estimation of chaotic systems
CN116933322A (en) Face image privacy protection method based on self-adaptive differential privacy

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20240603

Address after: 518000 1104, Building A, Zhiyun Industrial Park, No. 13, Huaxing Road, Henglang Community, Longhua District, Shenzhen, Guangdong Province

Applicant after: Shenzhen Hongyue Information Technology Co.,Ltd.

Country or region after: China

Address before: 710048 Shaanxi province Xi'an Beilin District Jinhua Road No. 5

Applicant before: Xi'an University of Technology

Country or region before: China