CN115225405B - Matrix decomposition method based on secure aggregation and key exchange under a federated learning framework


Info

Publication number
CN115225405B
CN115225405B (application CN202210899003.8A)
Authority
CN
China
Prior art keywords
client
matrix
gradient
federal learning
key
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210899003.8A
Other languages
Chinese (zh)
Other versions
CN115225405A (en)
Inventor
夏长达
张子扬
夏家骏
张佳辰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Light Tree Technology Co ltd
Original Assignee
Shanghai Light Tree Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Light Tree Technology Co ltd filed Critical Shanghai Light Tree Technology Co ltd
Priority to CN202210899003.8A priority Critical patent/CN115225405B/en
Priority to CN202310620692.9A priority patent/CN116545734A/en
Priority to CN202310622218.XA priority patent/CN116545735A/en
Publication of CN115225405A publication Critical patent/CN115225405A/en
Application granted granted Critical
Publication of CN115225405B publication Critical patent/CN115225405B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • H04L63/061 — Network architectures or network communication protocols for network security, supporting key management in a packet data network, for key exchange, e.g. in peer-to-peer networks
    • G06F17/16 — Digital computing or data processing equipment or methods; complex mathematical operations: matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G06N20/00 — Computing arrangements based on specific computational models: machine learning
    • H04L9/0838 — Key establishment: key agreement, i.e. a shared key is derived by parties as a function of information contributed by, or associated with, each of them
    • H04L9/0869 — Generation of secret information, including derivation or calculation of cryptographic keys or passwords, involving random numbers or seeds
    • H04L9/0891 — Revocation or update of secret information, e.g. encryption key update or rekeying
    • H04L9/40 — Network security protocols
    • H04L2209/08 — Randomization, e.g. dummy operations or using noise
    • Y02D10/00 — Energy efficient computing, e.g. low power processors, power management or thermal management
    • Y02D30/50 — Reducing energy consumption in wire-line communication networks, e.g. low power modes or reduced link rate

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Mathematical Physics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Computational Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Artificial Intelligence (AREA)
  • Computer Hardware Design (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Evolutionary Computation (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Storage Device Security (AREA)

Abstract

The invention discloses a matrix decomposition method based on secure aggregation and key exchange under a federated learning framework. By securely aggregating the gradient of the item matrix I of the matrix decomposition under the federated learning framework, it provides a new idea for federated learning to enhance data security. Using the locally held user matrix $U_X$ and the securely aggregated gradient $\nabla_I$ as the training signal of the recommendation model (i.e., the federated learning model), the training samples are effectively utilized while user data is guaranteed never to leave the local device, making the training process of the recommendation model safer. Masking and noising the gradient effectively avoids the leakage of source data information that exposing the true gradient of the data would cause. Compared with the homomorphic encryption techniques adopted in the background art, the secure-aggregation-based gradient summarization has lower computational complexity for gradient encryption and decryption and a faster computation speed, which improves the training speed of the recommendation model.

Description

Matrix decomposition method based on secure aggregation and key exchange under a federated learning framework
Technical Field
The invention relates to the technical field of information processing, and in particular to a matrix decomposition method based on secure aggregation and key exchange under a federated learning framework.
Background
Existing secure matrix decomposition algorithms are mainly distributed matrix decomposition algorithms that ensure the security of the transmitted information through encryption techniques such as Paillier homomorphic encryption, thereby preventing leakage of users' local data. Their implementation mainly comprises the following steps:
1. The server initializes the item matrix I, each client locally initializes its own user matrix U, the public key is shared between the server and the clients, and the private key is shared only among the clients;
2. The server encrypts I with the public key to obtain the ciphertext $C_I$ and broadcasts it to all clients;
3. Each client, upon receiving $C_I$, decrypts it with the local private key to obtain the real item matrix I, calculates the gradient of the U it holds, updates U, then calculates the gradient G of I after the update and encrypts it to obtain the ciphertext $C_G$;
4. The server collects the ciphertexts $C_G$ and updates the encrypted item matrix, $C_I \leftarrow C_I - C_G$, then broadcasts the updated $C_I$ to all clients;
5. Steps 3-4 are repeated until the algorithm converges.
As can be seen from steps 1-5, the existing scheme ensures that user data does not leave the local device, and homomorphic encryption keeps the server from ever obtaining the plaintext of the gradients during training, so the original data cannot be inferred back from a single gradient. However, the repeated encryption and decryption that homomorphic encryption requires makes training inefficient; and if homomorphic encryption is removed and the plaintext gradients of individual clients are summarized directly, the original data can be inferred back after several training steps, so the security of the local data cannot be guaranteed. How to resolve this dilemma of the existing secure matrix decomposition algorithms has therefore become an urgent problem in the industry.
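For concreteness, a minimal sketch of the homomorphic step of this background scheme follows, written against the open-source python-paillier (`phe`) library; operating on a single scalar entry and the exact ciphertext update shown are illustrative assumptions rather than a definitive implementation:

```python
# Minimal sketch of the background scheme's homomorphic step (illustrative only).
# Requires the open-source `phe` (python-paillier) library.
from phe import paillier

public_key, private_key = paillier.generate_paillier_keypair(n_length=2048)

# Server side: encrypt an entry of the item matrix I and broadcast C_I.
I_entry = 0.37
C_I = public_key.encrypt(I_entry)

# Client side: decrypt with the shared private key, compute a gradient
# entry G locally, and send back its ciphertext C_G.
I_plain = private_key.decrypt(C_I)
G_entry = 0.05  # stand-in for a locally computed gradient entry
C_G = public_key.encrypt(G_entry)

# Server side: update the ciphertext under additive homomorphism,
# never seeing the plaintext gradient.
C_I = C_I - C_G

# Verification only (in the scheme, the private key stays with the clients).
assert abs(private_key.decrypt(C_I) - (I_entry - G_entry)) < 1e-9
```

The per-round keypair operations and ciphertext arithmetic above are exactly the overhead that the secure-aggregation approach of this invention avoids.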
Disclosure of Invention
The invention aims to make the training process of the recommendation model more efficient while ensuring that local data is not leaked during model training, and to this end provides a matrix decomposition method based on secure aggregation and key exchange under a federated learning framework.
To achieve this purpose, the invention adopts the following technical scheme:
the method for matrix decomposition based on secure aggregation and key exchange under the federal learning framework comprises the following steps:
s1, recording a dispatcher of a federal learning framework as a server, and each participating trainer as a client, wherein the server broadcasts an initialized embedded matrix I of an article to each client;
s2, each client X calculates an embedding matrix U related to the respective local user by using the embedding matrix I X Gradient of (2)
Figure BDA0003770190730000021
And utilize->
Figure BDA0003770190730000022
Updating an embedded matrix U of a local user X
S3, each client X uses the locally updated U X Calculating a gradient generated to the embedding matrix I
Figure BDA0003770190730000023
S4, updating the gradient by adopting a key exchange method
Figure BDA0003770190730000024
And is about->
Figure BDA0003770190730000025
Summarizing to obtain->
Figure BDA0003770190730000026
After that, use->
Figure BDA0003770190730000027
Updating the embedded matrix I;
s5, repeating the steps S2-S4 until the termination condition of federal learning is reached.
Preferably, in step S2, the gradient $\nabla_{U_X^{(i)}}$ of the embedding vector $U_X^{(i)}$ of local user i in the embedding matrix $U_X$ is calculated by the following formula (1):

$$\nabla_{U_X^{(i)}} = \sum_{j:\, M_X^{(i,j)}\ \mathrm{exists}} -2\,\big(M_X^{(i,j)} - U_X^{(i)} I_j^{T}\big)\, I_j \tag{1}$$

in formula (1), L is the loss function of client X for the federated learning, $L = \big\|M_X - U_X I^T\big\|_F^2$ restricted to the observed entries of $M_X$, plus the regularization terms;
$M_X$ denotes the scoring matrix at client X;
$I^T$ is the matrix transpose of I;
$\|\cdot\|_F$ is the Frobenius norm of a matrix;
$I_j \in R^{1\times k}$, the embedding vector of item j common to all clients, is the j-th row of the embedding matrix $I = [I_1, I_2, \dots, I_j, \dots, I_d] \in R^{d\times k}$;
$I_j^T$ denotes the vector transpose of $I_j$;
$M_X^{(i,j)}$ denotes the score of user i owned by client X on item j (the missing entries, for which user i has no actual score on item j, are to be predicted after modeling is completed);
$j: M_X^{(i,j)}\ \mathrm{exists}$ denotes the items j actually scored by user i owned by client X;
$\sum_{j:\, M_X^{(i,j)}\ \mathrm{exists}}$ denotes summation over the index j of the items j actually scored by user i owned by client X.
Preferably, in step S2, the local user embedding matrix of each client X is updated by the following formula (2):

$$U_X^{(i)} \leftarrow U_X^{(i)} - \eta\,\big(\nabla_{U_X^{(i)}} + 2\lambda_U U_X^{(i)}\big) \tag{2}$$

in formula (2), $\lambda_U$ denotes the regularization parameter of $U_X$, and $\eta$ denotes the learning rate.
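A minimal NumPy sketch of formulas (1) and (2) on one client follows; the function name, the boolean mask of observed scores, and the default values of the learning rate eta and lam_U are illustrative assumptions:

```python
import numpy as np

def update_local_users(U_X, I, M_X, observed, eta=0.01, lam_U=0.1):
    """One pass of formulas (1)-(2): gradient of each user embedding, then descent.
    M_X: (m_X, d) scoring matrix; observed: boolean (m_X, d) mask of real scores;
    U_X: (m_X, k) user embeddings; I: (d, k) item embeddings."""
    for i in range(U_X.shape[0]):
        js = np.where(observed[i])[0]              # items j actually scored by user i
        err = M_X[i, js] - U_X[i] @ I[js].T        # residuals M_X^(i,j) - U_X^(i) I_j^T
        grad = -2.0 * err @ I[js]                  # formula (1)
        U_X[i] -= eta * (grad + 2.0 * lam_U * U_X[i])  # formula (2)
    return U_X
```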
preferably, in step S3, the embedding vector I of the associated item j in the embedding matrix I j Corresponding gradient
Figure BDA00037701907300000219
Calculated by the following formula (3):
Figure BDA0003770190730000031
in the formula (3),
Figure BDA0003770190730000032
representation->
Figure BDA0003770190730000033
Is the j-th row of (2);
Figure BDA0003770190730000034
an embedded vector I representing the item j common to all the clients j Is a vector transpose of (2);
Figure BDA0003770190730000035
representing the embedding matrix U X An embedded vector of a related local user i;
Figure BDA0003770190730000036
a score representing the user i locally owned by the client X with respect to the item j;
i:
Figure BDA0003770190730000037
exists means those users i owned by the client X who have an over-scoring behavior on item j;
Figure BDA0003770190730000038
representing the summation of those users i owned by the client X who have scored the item j with respect to the token i.
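A matching NumPy sketch of formula (3) follows, under the same illustrative assumptions as the previous sketch; each client only accumulates rows for items its own users have scored:

```python
import numpy as np

def item_gradient(U_X, I, M_X, observed):
    """Formula (3): client X's gradient on the shared item matrix I.
    Returns grad_I_X of shape (d, k); rows for items unscored at X stay zero."""
    grad_I_X = np.zeros_like(I)
    for j in range(I.shape[0]):
        users = np.where(observed[:, j])[0]        # users i at X who scored item j
        if users.size == 0:
            continue
        err = M_X[users, j] - U_X[users] @ I[j]    # residuals for item j
        grad_I_X[j] = -2.0 * err @ U_X[users]      # formula (3)
    return grad_I_X
```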
Preferably, in step S4, the key exchange method adopted to update the gradient $\nabla_I^X$ specifically comprises the following steps:
S41, each client X locally generates a private key $s_X$ and a public key $p_X$; the server exchanges the public keys generated by the clients, and each client X obtains a corresponding set of exchanged public keys, denoted $C_X$;
S42, according to $C_X$ and the locally generated private key $s_X$ of each client X, a key agreement between the client X and every other client Y is generated, denoted key_agreement(X, Y);
S43, the client X generates a mask, denoted mask(X, Y), using the locally generated key_agreement(X, Y) as a seed, and updates the gradient $\nabla_I^X$ obtained in step S3.
Preferably, in step S41, $C_X$ is expressed by the following expression (4):

$$C_X = \{p_1, \dots, p_X, \dots, p_N\} \qquad \text{expression (4)}$$

in expression (4), $p_X = g^{s_X}\,\%\,p$ denotes the public key locally generated by the client X;
p denotes a prime number, agreed in advance by all clients;
g denotes a primitive root modulo p, agreed in advance by all clients;
% p denotes the modulo operation with respect to the prime p;
$\{p_1, \dots, p_X, \dots, p_N\}$ denotes the set of locally generated public keys of all N clients received by the server.
Preferably, in step S42, key_agreement(X, Y) is generated by:
the client X takes the public key $p_Y$ of the client Y out of its exchanged public key set $C_X$;
the client X generates key_agreement(X, Y) from the public key $p_Y$ and the locally generated private key $s_X$.
Preferably, the generation formula of key_agreement(X, Y) is expressed as the following formula (5):

$$\mathrm{key\_agreement}(X, Y) = (p_Y)^{s_X}\,\%\,p \tag{5}$$

in formula (5), $(p_Y)^{s_X}$ denotes $p_Y$ raised to the power $s_X$;
p denotes the prime number agreed in advance by all clients;
% p denotes the modulo operation with respect to the prime p.
Preferably, in step S43, the gradient $\nabla_I^X$ is updated by the following formula (6):

$$\tilde{\nabla}_I^X = \nabla_I^X + \sum_{Y \in \{1,2,\dots,N\}\setminus\{X\}} a(X, Y)\,\mathrm{mask}(X, Y) \tag{6}$$

in formula (6), a(X, Y) is 1 or -1: with the clients numbered {1, 2, …, X, …, N}, a(X, Y) equals 1 if the number of client X is greater than the number of client Y, and equals -1 otherwise;
$\sum_{Y \in \{1,2,\dots,N\}\setminus\{X\}}$ denotes summation over the index Y, over all clients Y other than X. Because mask(X, Y) is derived by both members of the pair from the same shared seed while a(X, Y) = -a(Y, X), the masks cancel pairwise when the masked gradients are summed.
Preferably, in step S4, the summarized gradient $\nabla_I$ is expressed by the following formula (7):

$$\nabla_I = \sum_{X=1}^{N} \tilde{\nabla}_I^X \tag{7}$$

in step S4, the method of updating the embedding matrix I is expressed by the following formula (8):

$$I \leftarrow I - \eta\,\big(\nabla_I + 2\lambda_I I\big) \tag{8}$$

in formula (8), $\lambda_I$ denotes the regularization parameter of the embedding matrix I.
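A NumPy sketch of formulas (6)-(8) follows, demonstrating that the pairwise masks cancel in the summation of formula (7); the seed derivation from the pair of client numbers is an illustrative stand-in for the seed derived from key_agreement(X, Y):

```python
import numpy as np

N, d, k = 3, 5, 4
rng = np.random.default_rng(0)
grads = [rng.normal(size=(d, k)) for _ in range(N)]    # each client's grad on I

def seed(X, Y):
    # Stand-in for a seed derived from key_agreement(X, Y); both members of
    # the pair derive the same value, so they generate the same mask.
    return (min(X, Y) * 1000003 + max(X, Y)) % 2**32

def masked(X, g):
    out = g.copy()
    for Y in range(N):
        if Y != X:
            mask = np.random.default_rng(seed(X, Y)).normal(size=g.shape)
            out += mask if X > Y else -mask            # a(X, Y) of formula (6)
    return out

agg = sum(masked(X, grads[X]) for X in range(N))       # formula (7)
assert np.allclose(agg, sum(grads))                    # masks cancel pairwise

eta, lam_I = 0.01, 0.1
I = rng.normal(size=(d, k))
I -= eta * (agg + 2 * lam_I * I)                       # formula (8)
```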
Preferably, the gradient $\nabla_I^X$ generated in step S3 is noised before proceeding to step S4; the noising method for the gradient $\nabla_I^X$ is expressed by the following formula (9):

$$\nabla_I^X \leftarrow \nabla_I^X + n_X \tag{9}$$

in formula (9), $n_X$ denotes Gaussian noise.
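A short sketch of formula (9) follows; the noise scale sigma is an illustrative assumption that would in practice be set by the differential-privacy budget:

```python
import numpy as np

sigma = 0.01                                  # illustrative noise scale
grad_I_X = np.ones((5, 4))                    # stand-in for the gradient on I
n_X = np.random.default_rng().normal(scale=sigma, size=grad_I_X.shape)
grad_I_X = grad_I_X + n_X                     # formula (9)
```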
The invention has the following beneficial effects:
1. Using the locally held $U_X$ and the securely aggregated gradient $\nabla_I$ as the training signal of the recommendation model ensures that user data never leaves the local device, making the training process of the recommendation model safer.
2. Masking and noising the gradient effectively avoids the leakage of source data information that exposing the true gradient of the data would cause.
3. Compared with the homomorphic encryption techniques adopted in the background art, the secure-aggregation-based gradient summarization has lower computational complexity for gradient encryption and decryption and a faster computation speed, which improves the training speed of the recommendation model.
4. The recommendation model is trained under the federated learning framework with the matrix decomposition algorithm provided by this application; during model training the participants do not need to exchange local data, which more effectively ensures that local data is not leaked.
Drawings
In order to more clearly illustrate the technical solution of the embodiments of the present invention, the drawings that are required to be used in the embodiments of the present invention will be briefly described below. It is evident that the drawings described below are only some embodiments of the present invention and that other drawings may be obtained from these drawings without inventive effort for a person of ordinary skill in the art.
FIG. 1 is a diagram of steps in implementing a matrix decomposition method based on secure aggregation and key exchange in a federal learning framework according to an embodiment of the present invention;
fig. 2 is a flow chart of a matrix decomposition method based on secure aggregation and key exchange under the federal learning framework provided by an embodiment of the present invention.
Detailed Description
The technical scheme of the invention is further described below by the specific embodiments with reference to the accompanying drawings.
The drawings are for illustrative purposes only; they are schematic rather than physical and are not to be construed as limiting this patent. For the purpose of better illustrating the embodiments of the invention, certain elements of the drawings may be omitted, enlarged or reduced, and they do not represent the size of the actual product. It will be appreciated by those skilled in the art that certain well-known structures in the drawings, and descriptions thereof, may be omitted.
The same or similar reference numbers in the drawings of the embodiments of the invention correspond to the same or similar components. In the description of the invention, it should be understood that terms such as "upper", "lower", "left", "right", "inner" and "outer" indicate orientations or positional relationships based on those shown in the drawings; they are used only for convenience in describing the invention and simplifying the description, and do not indicate or imply that the apparatus or elements referred to must have a specific orientation or be constructed and operated in a specific orientation. The terms describing positional relationships in the drawings are therefore merely for exemplary illustration and should not be construed as limiting this patent; the specific meaning of the above terms can be understood by those of ordinary skill in the art according to the specific circumstances.
In the description of the invention, unless explicitly stated and limited otherwise, the term "coupled" and the like should be interpreted broadly: as indicating a relationship of components, it may mean fixedly coupled, detachably coupled, or integrally formed; mechanically or electrically connected; directly connected or indirectly connected through an intermediate medium; or communication between, or an interaction relationship of, two parts. The specific meaning of the above terms in the invention can be understood by those of ordinary skill in the art in light of the specific circumstances.
Taking three clients A, B, C as an example, the following describes how the matrix decomposition method based on secure aggregation and key exchange provided in this embodiment is implemented under the federated learning framework:
Record the dispatcher in the federated learning framework as the server and each participating training party as a client. Let M denote the scoring matrix (for example, the matrix of ratings that a number of IMDb users give to movies, containing missing entries that need to be predicted and filled in), let $U_A$, $U_B$, $U_C$ denote the embedding matrices of the local users of clients A, B, C (the local users are represented numerically by these matrices), and let I denote the embedding matrix of the items (the common items are represented numerically by this matrix). As shown in Fig. 2, the specific implementation steps of the matrix decomposition method based on secure aggregation and key exchange under the federated learning framework provided in this embodiment are as follows:
1. The parties agree on the embedding dimension (the embedding dimension specifies the dimensionality of the space used to represent users and items numerically); the server initializes the item embedding matrix I according to the embedding dimension, and clients A, B, C initialize their own local user embedding matrices $U_A$, $U_B$, $U_C$ according to the embedding dimension;
2. The server broadcasts the embedding matrix I to clients A, B, C;
3. client A calculates U using the embedding matrix I A Gradient of (2)
Figure BDA0003770190730000061
Then updating the embedded matrix U of the local user A
Figure BDA0003770190730000062
Wherein->
Figure BDA0003770190730000063
m A Representing the total number of users of client A, I j An embedded vector representing an item j common to all clients,/->
Figure BDA0003770190730000064
Representation I j Is transposed by the vector of>
Figure BDA0003770190730000065
Representing the score of user i owned by client A with respect to item j, j: ∈>
Figure BDA0003770190730000066
exists means those items j, < >, > which are actually evaluated too much by the user i owned by the client a>
Figure BDA0003770190730000067
Representing those items j that are actually scored too much by user i owned by client a, summed with respect to token i; u (U) A The updating mode of (a) is as follows: />
Figure BDA0003770190730000068
λ U U indicator A Is used for regularization parameters of (a);
gradient corresponding to client B, C respectively
Figure BDA0003770190730000069
Is calculated by (a) and updating U respectively B 、U C The method of (1) is the same as the client A and will not be described in detail here;
4. client A uses locally updated U A Calculating gradients generated for the user on the embedding matrix I
Figure BDA00037701907300000610
Wherein->
Figure BDA00037701907300000611
d represents the total number of common things, i: the total number of the common things>
Figure BDA00037701907300000612
exists represent those users i owned by client a who have scored actions on item j,
Figure BDA00037701907300000613
summing those users i owned by client a who have a scoring behavior for item j with respect to token i;
gradient corresponding to client B, C respectively
Figure BDA00037701907300000614
The calculation method of (1) is the same as the client A and will not be described in detail here;
to avoid exposing the true gradients, the corresponding gradients for each client are preferably noisy, more preferably client A, B, C by differential privacy techniques
Figure BDA00037701907300000615
Respectively plus Gaussian noise n A 、n B 、n C . Taking client A as an example, n A Representing the generated random matrix (size and +.>
Figure BDA00037701907300000616
The same) and (II)>
Figure BDA00037701907300000617
Updated to->
Figure BDA00037701907300000618
5. Clients A, B, C locally generate their own public and private keys; $p_A$, $p_B$, $p_C$ denote the locally generated public keys of clients A, B, C, and $s_A$, $s_B$, $s_C$ denote their locally generated private keys. Taking client A as an example, the private key $s_A$ is a locally generated random number (with value smaller than p), and the public key (computed from the private key $s_A$) is $p_A = g^{s_A}\,\%\,p$, where g denotes the generator (a primitive root modulo p; it can be chosen small, e.g. simply taken as 2), $g^{s_A}$ denotes g raised to the power $s_A$, p is a large prime (2048 bits is typical), and % p denotes the modulo operation with respect to p; the g and p of all clients are agreed in advance;
6. service side collectionAll public keys p A 、p B 、p C And the public key sent to the client A is p B 、p C The public key sent to client B is p A 、p C The public key sent to the client C is p A 、p B
7. Client a is based on public key p B 、p C And a locally generated private key s A Generating a key_agreement (A, B) of the client B and a key_agreement (A, C) of the client C; client B is based on public key p A 、p C And private key s B Generating a key_agreement (A, B) of the client A and a key_agreement (B, C) of the client C; client C is based on public key p A 、p B And its own private key s C Generate key_agreement (a, C) with client a and key_agreement (B, C) with client B. Taking the example of the client a as the example,
Figure BDA0003770190730000073
Figure BDA0003770190730000074
respectively represent p B S of (2) A Power of the power of p C S of (2) A To power,% p represents modulo p.
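A sketch of steps 5-7 above for the three clients follows, again with toy group parameters (an assumption; a 2048-bit p in practice); it checks that both endpoints of every pair derive the same key agreement:

```python
import secrets

p = 2**127 - 1   # toy prime; a large (e.g. 2048-bit) prime in practice
g = 2

# Step 5: each client locally generates a private key and the public key g^s % p.
priv = {name: secrets.randbelow(p - 2) + 1 for name in "ABC"}
pub = {name: pow(g, s, p) for name, s in priv.items()}

# Step 6: the server collects the public keys and sends each client the others'.
received = {name: {o: pub[o] for o in pub if o != name} for name in pub}

# Step 7: each pair derives key_agreement(X, Y) = (p_Y)^(s_X) % p symmetrically.
for X, Y in [("A", "B"), ("A", "C"), ("B", "C")]:
    assert pow(received[X][Y], priv[X], p) == pow(received[Y][X], priv[Y], p)
```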
8. Client A generates mask(A, B) using the local key_agreement(A, B) as a seed and mask(A, C) using the local key_agreement(A, C) as a seed, and updates its gradient to $\tilde{\nabla}_I^A = \nabla_I^A - \mathrm{mask}(A, B) - \mathrm{mask}(A, C)$; client B generates mask(A, B) using the local key_agreement(A, B) as a seed and mask(B, C) using the local key_agreement(B, C) as a seed, and updates its gradient to $\tilde{\nabla}_I^B = \nabla_I^B + \mathrm{mask}(A, B) - \mathrm{mask}(B, C)$; client C generates mask(A, C) using the local key_agreement(A, C) as a seed and mask(B, C) using the local key_agreement(B, C) as a seed, and updates its gradient to $\tilde{\nabla}_I^C = \nabla_I^C + \mathrm{mask}(A, C) + \mathrm{mask}(B, C)$ (with the clients numbered A = 1, B = 2, C = 3, the sign of each mask follows a(X, Y) of formula (6)). Taking client A as an example, mask(A, B) is a random matrix of the same size as $\nabla_I^A$ generated with key_agreement(A, B) as the seed (generated directly by calling an open-source library function with the seed parameter); because clients A and B generate mask(A, B) from the same seed and apply it with opposite signs, it cancels in the summation;
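A sketch of the mask generation in step 8 follows; folding the large agreement value modulo 2^32 into a library seed is an illustrative choice of "open-source library function":

```python
import numpy as np

key_agreement_AB = 123456789          # stand-in for the derived agreement value
shape = (5, 4)                        # same size as the gradient on I

# Clients A and B call the same library function with the same seed,
# so each obtains the identical random matrix mask(A, B).
mask_at_A = np.random.default_rng(key_agreement_AB % 2**32).normal(size=shape)
mask_at_B = np.random.default_rng(key_agreement_AB % 2**32).normal(size=shape)
assert np.array_equal(mask_at_A, mask_at_B)
```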
9. The server sums the masked gradients; the pairwise masks cancel, giving $\nabla_I = \tilde{\nabla}_I^A + \tilde{\nabla}_I^B + \tilde{\nabla}_I^C = \nabla_I^A + \nabla_I^B + \nabla_I^C$. It then updates I as $I \leftarrow I - \eta\,(\nabla_I + 2\lambda_I I)$, where $\lambda_I$ denotes the regularization parameter of the embedding matrix I;
10. Steps 2-9 are repeated until the maximum number of training rounds of the federated recommendation model is reached or the algorithm converges.
Briefly, the matrix decomposition method based on secure aggregation and key exchange under the federated learning framework provided in this embodiment, as shown in Fig. 1, comprises the steps of:
S1, record the dispatcher of the federated learning framework as the server and each participating training party as a client; the server broadcasts an initialized item embedding matrix I to each client;
S2, each client X uses the embedding matrix I to calculate the gradient $\nabla_{U_X}$ of its own local-user embedding matrix $U_X$, and uses $\nabla_{U_X}$ to update the local-user embedding matrix $U_X$;
S3, each client X uses the locally updated $U_X$ to calculate the gradient $\nabla_I^X$ it generates on the embedding matrix I;
S4, the gradients $\nabla_I^X$ are updated by a key exchange method and summarized to obtain $\nabla_I$, after which $\nabla_I$ is used to update the embedding matrix I;
S5, steps S2-S4 are repeated until the termination condition of the federated learning is reached.
In conclusion, securely aggregating the gradient of the item matrix I of the matrix decomposition under the federated learning framework provides a new idea for federated learning to enhance data security. Using the locally held $U_X$ and the securely aggregated gradient $\nabla_I$ as the training signal of the recommendation model (i.e., the federated learning model) ensures that user data never leaves the local device, making the training process of the recommendation model safer. Masking and noising the gradient effectively avoids the leakage of source data information that exposing the true gradient of the data would cause. Compared with the homomorphic encryption techniques adopted in the background art, the secure-aggregation-based gradient summarization has lower computational complexity for gradient encryption and decryption and a faster computation speed, which is beneficial to improving the training speed of the recommendation model.
It should be understood that the above description is merely illustrative of the preferred embodiments of the invention and of the technical principles employed. It will be apparent to those skilled in the art that various modifications, equivalents and variations can be made to the invention; such modifications, made without departing from the spirit of the invention, are intended to fall within its scope. In addition, some terms used in the specification and claims of the present application are not limiting but are merely for convenience of description.

Claims (10)

1. A matrix factorization method based on secure aggregation and key exchange under a federated learning framework, comprising the following steps:
S1, record the dispatcher of the federated learning framework as the server and each participating training party as a client; the server broadcasts an initialized item embedding matrix I to each client;
S2, each client X uses the embedding matrix I to calculate the gradient $\nabla_{U_X}$ of its own local-user embedding matrix $U_X$, and uses $\nabla_{U_X}$ to update the local-user embedding matrix $U_X$;
S3, each client X uses the locally updated $U_X$ to calculate the gradient $\nabla_I^X$ it generates on the embedding matrix I;
S4, the clients X, in conjunction with the server, update the gradients $\nabla_I^X$ by a key exchange method, summarize the $\nabla_I^X$ to obtain $\nabla_I$, and then use $\nabla_I$ to update the embedding matrix I;
S5, steps S2-S4 are repeated until the termination condition of the federated learning is reached;
in step S2, the gradient $\nabla_{U_X^{(i)}}$ of the embedding vector $U_X^{(i)}$ of local user i in the embedding matrix $U_X$ is calculated by the following formula (1):

$$\nabla_{U_X^{(i)}} = \sum_{j:\, M_X^{(i,j)}\ \mathrm{exists}} -2\,\big(M_X^{(i,j)} - U_X^{(i)} I_j^{T}\big)\, I_j \tag{1}$$

in formula (1), L is the loss function of client X for the federated learning, $L = \big\|M_X - U_X I^T\big\|_F^2$ restricted to the observed entries of $M_X$, plus the regularization terms;
$M_X$ denotes the scoring matrix at client X;
$I^T$ is the matrix transpose of I;
$\|\cdot\|_F$ is the Frobenius norm of a matrix;
$I_j \in R^{1\times k}$, the embedding vector of item j common to all clients, is the j-th row of the embedding matrix $I = [I_1, I_2, \dots, I_j, \dots, I_d] \in R^{d\times k}$;
$I_j^T$ denotes the vector transpose of $I_j$;
$M_X^{(i,j)}$ denotes the score of user i owned by client X on item j, where the missing entries, for which user i has no actual score on item j, are to be predicted after modeling;
$j: M_X^{(i,j)}\ \mathrm{exists}$ denotes the items j actually scored by user i owned by client X;
$\sum_{j:\, M_X^{(i,j)}\ \mathrm{exists}}$ denotes summation over the index j of the items j actually scored by user i owned by client X.
2. The matrix factorization method based on secure aggregation and key exchange under the federated learning framework according to claim 1, wherein in step S2 the local user embedding matrix of each client X is updated by the following formula (2):

$$U_X^{(i)} \leftarrow U_X^{(i)} - \eta\,\big(\nabla_{U_X^{(i)}} + 2\lambda_U U_X^{(i)}\big) \tag{2}$$

in formula (2), $\lambda_U$ denotes the regularization parameter of $U_X$, and $\eta$ denotes the learning rate.
3. a matrix factorization method based on secure aggregation and key exchange under a federal learning framework, comprising the steps of:
s1, recording a dispatcher of a federal learning framework as a server, and each participating trainer as a client, wherein the server broadcasts an initialized embedded matrix I of an article to each client;
s2, each client X calculates an embedding matrix U related to the respective local user by using the embedding matrix I X Gradient of (2)
Figure QLYQS_19
And utilize->
Figure QLYQS_20
Updating an embedded matrix U of a local user X
S3, each client X uses the locally updated U X Calculating a gradient generated to the embedding matrix I
Figure QLYQS_21
S4, the client X links the server to update the gradient by adopting a key exchange method
Figure QLYQS_22
And is about->
Figure QLYQS_23
Summarizing to obtain->
Figure QLYQS_24
After that, use->
Figure QLYQS_25
Updating the embedded matrix I;
s5, repeating the steps S2-S4 until the termination condition of federal learning is reached;
in step S3, the embedding vector I of the associated article j in the embedding matrix I j Corresponding gradient
Figure QLYQS_26
Calculated by the following formula (3):
Figure QLYQS_27
in the formula (3),
Figure QLYQS_28
representation->
Figure QLYQS_29
Is the j-th row of (2);
Figure QLYQS_30
an embedded vector I representing the item j common to all the clients j Is a vector transpose of (2);
Figure QLYQS_31
representing the embedding matrix U X An embedded vector of a related local user i;
Figure QLYQS_32
a score representing the user i locally owned by the client X with respect to the item j;
Figure QLYQS_33
representing those users i owned by the client X who have a scoring behavior on item j;
Figure QLYQS_34
representing the summation of those users i owned by the client X who have scored the item j with respect to the token i.
4. A matrix factorization method based on secure aggregation and key exchange under a federated learning framework, comprising the following steps:
S1, record the dispatcher of the federated learning framework as the server and each participating training party as a client; the server broadcasts an initialized item embedding matrix I to each client;
S2, each client X uses the embedding matrix I to calculate the gradient $\nabla_{U_X}$ of its own local-user embedding matrix $U_X$, and uses $\nabla_{U_X}$ to update the local-user embedding matrix $U_X$;
S3, each client X uses the locally updated $U_X$ to calculate the gradient $\nabla_I^X$ it generates on the embedding matrix I;
S4, the clients X, in conjunction with the server, update the gradients $\nabla_I^X$ by a key exchange method, summarize the $\nabla_I^X$ to obtain $\nabla_I$, and then use $\nabla_I$ to update the embedding matrix I;
S5, steps S2-S4 are repeated until the termination condition of the federated learning is reached;
in step S4, the key exchange method adopted to update the gradient $\nabla_I^X$ specifically comprises the following steps:
S41, each client X locally generates a private key $s_X$ and a public key $p_X$; the server exchanges the public keys generated by the clients, and each client X obtains a corresponding set of exchanged public keys, denoted $C_X$;
S42, according to $C_X$ and the locally generated private key $s_X$ of each client X, a key agreement between the client X and every other client Y is generated, denoted key_agreement(X, Y);
S43, the client X generates a mask, denoted mask(X, Y), using the locally generated key_agreement(X, Y) as a seed, and updates the gradient $\nabla_I^X$ obtained in step S3.
5. The matrix factorization method based on secure aggregation and key exchange under the federated learning framework according to claim 4, wherein in step S41 $C_X$ is expressed by the following expression (4):

$$C_X = \{p_1, \dots, p_X, \dots, p_N\} \qquad \text{expression (4)}$$

in expression (4), $p_X = g^{s_X}\,\%\,p$ denotes the public key locally generated by the client X;
p denotes a prime number, agreed in advance by all clients;
g denotes a primitive root modulo p, agreed in advance by all clients;
% p denotes the modulo operation with respect to the prime p;
$\{p_1, \dots, p_X, \dots, p_N\}$ denotes the set of locally generated public keys of all N clients received by the server.
6. The matrix factorization method based on secure aggregation and key exchange under the federated learning framework according to claim 5, wherein in step S42 key_agreement(X, Y) is generated by:
the client X takes the public key $p_Y$ of the client Y out of its exchanged public key set $C_X$;
the client X generates key_agreement(X, Y) from the public key $p_Y$ and the locally generated private key $s_X$.
7. The matrix factorization method based on secure aggregation and key exchange under the federated learning framework according to claim 6, wherein the generation formula of key_agreement(X, Y) is expressed as the following formula (5):

$$\mathrm{key\_agreement}(X, Y) = (p_Y)^{s_X}\,\%\,p \tag{5}$$

in formula (5), $(p_Y)^{s_X}$ denotes $p_Y$ raised to the power $s_X$;
p denotes the prime number agreed in advance by all clients;
% p denotes the modulo operation with respect to the prime p.
8. The matrix factorization method based on secure aggregation and key exchange under the federated learning framework according to claim 4, wherein in step S43 the gradient $\nabla_I^X$ is updated by the following formula (6):

$$\tilde{\nabla}_I^X = \nabla_I^X + \sum_{Y \in \{1,2,\dots,N\}\setminus\{X\}} a(X, Y)\,\mathrm{mask}(X, Y) \tag{6}$$

in formula (6), a(X, Y) is 1 or -1: with the clients numbered {1, 2, …, X, …, N}, a(X, Y) equals 1 if the number of client X is greater than the number of client Y, and equals -1 otherwise;
$\sum_{Y \in \{1,2,\dots,N\}\setminus\{X\}}$ denotes summation over the index Y, over all clients Y other than X.
9. The matrix factorization method based on secure aggregation and key exchange under the federated learning framework according to claim 1, 3 or 4, wherein in step S4 the summarized gradient $\nabla_I$ is expressed by the following formula (7):

$$\nabla_I = \sum_{X=1}^{N} \tilde{\nabla}_I^X \tag{7}$$

in step S4, the method of updating the embedding matrix I is expressed by the following formula (8):

$$I \leftarrow I - \eta\,\big(\nabla_I + 2\lambda_I I\big) \tag{8}$$

in formula (8), $\lambda_I$ denotes the regularization parameter of the embedding matrix I.
10. The matrix factorization method based on secure aggregation and key exchange under the federated learning framework according to claim 1, 3 or 4, wherein the gradient $\nabla_I^X$ generated in step S3 is noised before proceeding to step S4; the noising method for the gradient $\nabla_I^X$ is expressed by the following formula (9):

$$\nabla_I^X \leftarrow \nabla_I^X + n_X \tag{9}$$

in formula (9), $n_X$ denotes Gaussian noise.
CN202210899003.8A 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange under federal learning framework Active CN115225405B (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN202210899003.8A CN115225405B (en) 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange under federal learning framework
CN202310620692.9A CN116545734A (en) 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange
CN202310622218.XA CN116545735A (en) 2022-07-28 2022-07-28 Matrix decomposition method under federal learning framework

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210899003.8A CN115225405B (en) 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange under federal learning framework

Related Child Applications (2)

Application Number Title Priority Date Filing Date
CN202310622218.XA Division CN116545735A (en) 2022-07-28 2022-07-28 Matrix decomposition method under federal learning framework
CN202310620692.9A Division CN116545734A (en) 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange

Publications (2)

Publication Number Publication Date
CN115225405A (en) 2022-10-21
CN115225405B (en) 2023-04-21

Family

ID=83614120

Family Applications (3)

Application Number Title Priority Date Filing Date
CN202310622218.XA Pending CN116545735A (en) 2022-07-28 2022-07-28 Matrix decomposition method under federal learning framework
CN202210899003.8A Active CN115225405B (en) 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange under federal learning framework
CN202310620692.9A Pending CN116545734A (en) 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CN202310622218.XA Pending CN116545735A (en) 2022-07-28 2022-07-28 Matrix decomposition method under federal learning framework

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN202310620692.9A Pending CN116545734A (en) 2022-07-28 2022-07-28 Matrix decomposition method based on security aggregation and key exchange

Country Status (1)

Country Link
CN (3) CN116545735A (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115249074B (en) * 2022-07-28 2023-04-14 上海光之树科技有限公司 Distributed federal learning method based on Spark cluster and Ring-AllReduce architecture
CN115865307B (en) * 2023-02-27 2023-05-09 蓝象智联(杭州)科技有限公司 Data point multiplication operation method for federal learning

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113420232A (en) * 2021-06-02 2021-09-21 杭州电子科技大学 Privacy protection-oriented graph neural network federal recommendation method
CN114510652A (en) * 2022-04-20 2022-05-17 宁波大学 Social collaborative filtering recommendation method based on federal learning
CN114564742A (en) * 2022-02-18 2022-05-31 北京交通大学 Lightweight federated recommendation method based on Hash learning
WO2022141839A1 (en) * 2020-12-31 2022-07-07 平安科技(深圳)有限公司 Method and apparatus for updating federated learning model, and electronic device and storage medium

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10630655B2 (en) * 2017-05-18 2020-04-21 Robert Bosch Gmbh Post-quantum secure private stream aggregation

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022141839A1 (en) * 2020-12-31 2022-07-07 平安科技(深圳)有限公司 Method and apparatus for updating federated learning model, and electronic device and storage medium
CN113420232A (en) * 2021-06-02 2021-09-21 杭州电子科技大学 Privacy protection-oriented graph neural network federal recommendation method
CN114564742A (en) * 2022-02-18 2022-05-31 北京交通大学 Lightweight federated recommendation method based on Hash learning
CN114510652A (en) * 2022-04-20 2022-05-17 宁波大学 Social collaborative filtering recommendation method based on federal learning

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
董业; 侯炜; 陈小军; 曾帅. 基于秘密分享和梯度选择的高效安全联邦学习 (Efficient and secure federated learning based on secret sharing and gradient selection). 计算机研究与发展 (Journal of Computer Research and Development), 2020, (10). *
陈国润; 母美荣; 张蕊; 孙丹; 钱栋军. 基于联邦学习的通信诈骗识别模型的实现 (Implementation of a telecom-fraud recognition model based on federated learning). 电信科学 (Telecommunications Science), 2020, (S1). *

Also Published As

Publication number Publication date
CN116545735A (en) 2023-08-04
CN116545734A (en) 2023-08-04
CN115225405A (en) 2022-10-21

Similar Documents

Publication Publication Date Title
CN115225405B (en) Matrix decomposition method based on security aggregation and key exchange under federal learning framework
US7526084B2 (en) Secure classifying of data with Gaussian distributions
CN112149160B (en) Homomorphic pseudo-random number-based federated learning privacy protection method and system
Xing et al. Mutual privacy preserving $ k $-means clustering in social participatory sensing
US11449753B2 (en) Method for collaborative learning of an artificial neural network without disclosing training data
CN111177791B (en) Method and device for protecting business prediction model of data privacy joint training by two parties
Zhao et al. PVD-FL: A privacy-preserving and verifiable decentralized federated learning framework
CN112989368A (en) Method and device for processing private data by combining multiple parties
CN113761557A (en) Multi-party deep learning privacy protection method based on fully homomorphic encryption algorithm
CN113420232A (en) Privacy protection-oriented graph neural network federal recommendation method
CN111104968A (en) Safety SVM training method based on block chain
Minelli Fully homomorphic encryption for machine learning
Niu et al. Anticontrol of a fractional-order chaotic system and its application in color image encryption
CN115186831A (en) Deep learning method with efficient privacy protection
Zhao et al. SGBoost: An efficient and privacy-preserving vertical federated tree boosting framework
Zhang et al. SecureTrain: An approximation-free and computationally efficient framework for privacy-preserved neural network training
CN115865307B (en) Data point multiplication operation method for federal learning
CN116167088A (en) Method, system and terminal for privacy protection in two-party federal learning
CN114358323A (en) Third-party-based efficient Pearson coefficient calculation method in federated learning environment
CN112819058B (en) Distributed random forest evaluation system and method with privacy protection attribute
CN113962286A (en) Decentralized logistic regression classification prediction method based on piecewise function
Arita et al. Two applications of multilinear maps: group key exchange and witness encryption
CN116248252B (en) Data dot multiplication processing method for federal learning
Weng et al. Privacy-Preserving Neural Network Based on Multi-key NTRU Cryptosystem
Sun et al. A lottery SMC protocol for the selection function in software defined wireless sensor networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant