CN114723067A - Federated hybrid filtering recommendation method based on user privacy protection - Google Patents

Federated hybrid filtering recommendation method based on user privacy protection

Info

Publication number
CN114723067A
CN114723067A (application CN202210379463.8A)
Authority
CN
China
Prior art keywords
user
item
encoder
denoising
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202210379463.8A
Other languages
Chinese (zh)
Other versions
CN114723067B (en)
Inventor
张幸林
卢正东
卢艺灵
卢沁旖
周志炫
谢文灏
林泽蓬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
South China University of Technology SCUT
Original Assignee
South China University of Technology SCUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by South China University of Technology SCUT
Priority to CN202210379463.8A
Publication of CN114723067A
Application granted
Publication of CN114723067B
Legal status: Active
Anticipated expiration

Classifications

    • G06N 20/00 Machine learning
    • G06F 16/9535 Search customisation based on user profiles and personalisation
    • G06F 17/16 Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G06F 21/6245 Protecting personal data, e.g. for financial or medical purposes
    • G06N 3/048 Activation functions
    • G06N 3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Computational Mathematics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioethics (AREA)
  • Medical Informatics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Molecular Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Hardware Design (AREA)
  • Algebra (AREA)
  • Image Processing (AREA)

Abstract

The invention discloses a federated hybrid filtering recommendation method based on user privacy protection, comprising the following steps: 1) collecting publicly available user-item rating information, user attributes and item attributes to form a rating matrix, user side information and item side information; 2) building two denoising autoencoders and pre-training them in a self-supervised manner by stochastic gradient descent; 3) constructing an overall model, and loading the pre-trained encoder parameters of the two denoising autoencoders into the two encoders of the overall model; 4) iteratively updating the overall model parameters with the FedAvg algorithm of federated learning, and using the updated overall model to generate probability values of implicit feedback from different users to items, thereby generating a recommended item list for each user. The method can reasonably extract and exploit public feature information of users and items, and predicts the probability of a user's potential interaction behavior while protecting the user's personal privacy, so as to generate a recommendation list with a high hit rate.

Description

Federated hybrid filtering recommendation method based on user privacy protection
Technical Field
The invention relates to the technical field of federated learning and hybrid collaborative filtering, and in particular to a federated hybrid filtering recommendation method based on user privacy protection.
Background
Federated learning is a distributed machine learning framework built on ideas from deep learning; it aims to enable data use and model training while guaranteeing data privacy, security and legal compliance. As a distributed machine learning paradigm, federated learning can effectively solve the data-silo problem: participants model jointly without sharing their data, so data silos are broken while each participant's data privacy is protected at the technical level, achieving collaborative AI.
Federated learning has the following characteristics: 1. each party's data stays local, so privacy is preserved and legal compliance is maintained; 2. a virtual common model is built jointly from the data of multiple participants; 3. when the parties' data are aligned on users or on features, the modeling performance of federated learning is not weaker than that of conventional modeling on pooled large-scale data; 4. when users or features are not aligned, knowledge transfer can still be achieved by exchanging encrypted parameters between the parties.
Among common recommendation algorithms, two families of methods are widely used: 1. content-based recommendation, which computes the similarity of inherent user attributes and of product attributes in their respective dimensions and recommends preferred products to users with similar attribute values; its extensibility is weak in practical applications; 2. collaborative filtering, which predicts user preferences by analyzing user behavior information, including explicit feedback such as ratings and comments and implicit feedback such as browsing records; it is affected by data sparsity and performs poorly under cold-start conditions.
Disclosure of Invention
The invention aims to overcome the defects and shortcomings of the prior art, and provides a federated hybrid filtering recommendation method based on user privacy protection, which can reasonably extract and exploit public feature information of users and items, models with a neural network structure, and predicts the probability of a user's potential interaction behavior while protecting the user's personal privacy, thereby generating a recommendation list with a high hit rate.
To achieve this purpose, the technical solution provided by the invention is as follows: a federated hybrid filtering recommendation method based on user privacy protection, comprising the following steps:
1) collecting publicly available user-item rating information to form a rating matrix as explicit feedback, obtaining implicit feedback by binarizing the rating matrix, and collecting public user attributes as user side information and public item attributes as item side information; the explicit feedback is a signal that directly expresses the user's degree of preference for an item, i.e. the user's rating of the item, and the implicit feedback is a signal that does not directly reflect the user's preference, i.e. whether or not the user has rated the item;
2) building a denoising autoencoder for the user side information and another for the item side information, the two denoising autoencoders having the same structure and each consisting of three parts, namely noise addition, an encoder and a decoder, and pre-training the two denoising autoencoders separately in a self-supervised manner by stochastic gradient descent to obtain the parameters of the two pre-trained denoising autoencoders;
3) constructing an overall model composed of two encoders, a fully connected layer and an activation function layer, the two encoders being structurally identical to the encoder parts of the two denoising autoencoders and being defined as a user encoder and an item encoder; after the overall model is built, loading the pre-trained encoder parameters of the two denoising autoencoders into the user encoder and the item encoder of the overall model respectively for initialization;
4) iteratively updating the overall model parameters with the FedAvg algorithm of federated learning, and generating probability values of implicit feedback from different users to items with the updated overall model, thereby generating a recommended item list for each user.
Further, step 1) comprises the following steps:
1.1) collecting the publicly available user-item rating information to form a rating matrix R, which is a two-dimensional user-item matrix expressed as:

R = (r_ij) ∈ ℝ^(m×n)

where r_ij is the rating of the i-th user for the j-th item, and there are m users and n items in total;
1.2) applying the custom binarization function Bin(r_ij) to the rating matrix R to obtain the implicit feedback matrix Bin(R) ∈ {0,1}^(m×n), defined as the user-item interaction matrix; that is, each user-item pair carries an implicit feedback label of value 0 or 1, where 0 indicates that no rating occurred and 1 indicates that a rating occurred; the binarization function Bin(r_ij) is defined as:

Bin(r_ij) = 0 if r_ij = 0, and Bin(r_ij) = 1 otherwise;

1.3) collecting the public user attributes and item attributes and, using one-hot encoding, generating for each user a vector of binary user feature values and for each item a vector of binary item feature values representing those attributes, i.e. the user side information vector and the item side information vector, which in turn form the user side information matrix S_user and the item side information matrix S_item, expressed as:

S_user = (s_1^user, s_2^user, …, s_m^user)^T, with s_i^user = (b_user,1, b_user,2, …, b_user,d_user) ∈ ℝ^(d_user)
S_item = (s_1^item, s_2^item, …, s_n^item)^T, with s_j^item = (b_item,1, b_item,2, …, b_item,d_item) ∈ ℝ^(d_item)

where s_i^user is the side information vector of the i-th user, containing d_user binary user feature values, with b_user,x denoting the x-th binary user feature value, and ℝ^(d_user) denotes a d_user-dimensional vector space; s_j^item is the side information vector of the j-th item, containing d_item binary item feature values, with b_item,y denoting the y-th binary item feature value, and ℝ^(d_item) denotes a d_item-dimensional vector space.
Further, step 2) comprises the following steps:
2.1) building a denoising autoencoder for the user side information and another for the item side information, referred to as the user denoising autoencoder and the item denoising autoencoder; each denoising autoencoder has the structure:

ŝ = g_dec(f_enc(d_noi(s)))

where s is the input vector of the denoising autoencoder, i.e. a side information vector; ŝ is the reconstructed vector of the denoising autoencoder; the function d_noi is the noise addition; the function f_enc is the encoder; and the function g_dec is the decoder;
2.2) pre-training the two denoising autoencoders separately in a self-supervised manner by stochastic gradient descent to obtain the parameters of the two pre-trained denoising autoencoders, each consisting of an encoder part and a decoder part; to measure the difference between the reconstructed vector ŝ and the input vector s, the cross entropy is introduced as the loss function of the two denoising autoencoders, giving the loss function L_DAE:

L_DAE = −(1/N) Σ_{p=1..N} Σ_{k=1..d} [ s^(p,k)·log ŝ^(p,k) + (1 − s^(p,k))·log(1 − ŝ^(p,k)) ]

where N is the total number of input vectors in a batch, p indexes the p-th input vector, d is the total dimensionality of an input vector, and k indexes the k-th dimension of an input vector;

the user side information vector and the item side information vector are defined as:

s_i^user = (b_user,1, b_user,2, …, b_user,d_user), i = 1, …, m
s_j^item = (b_item,1, b_item,2, …, b_item,d_item), j = 1, …, n

where s_i^user is the side information vector of the i-th user (m users in total), containing d_user binary user feature values, with b_user,x denoting the x-th binary user feature value; s_j^item is the side information vector of the j-th item (n items in total), containing d_item binary item feature values, with b_item,y denoting the y-th binary item feature value;

taking s_i^user and s_j^item as the input vectors of the user denoising autoencoder and the item denoising autoencoder respectively, the reconstructed vectors ŝ_i^user and ŝ_j^item are obtained by forward propagation; the loss values L_user and L_item are computed with the loss function, and back-propagation yields the gradient ∇_θc L_user of the user denoising autoencoder parameters and the gradient ∇_θc L_item of the item denoising autoencoder parameters, where θ_c denotes the respective parameters of the two denoising autoencoders;

the parameters of the two denoising autoencoders are updated by stochastic gradient descent: N different user side information vectors or item side information vectors are randomly selected as a unit batch, and Q rounds of batch input selection and updating of the user denoising autoencoder parameters or the item denoising autoencoder parameters are performed in total, giving the parameters of the two denoising autoencoders used in the next stage.
Further, step 3) comprises the following steps:
3.1) constructing an overall model composed of two encoders, a fully connected layer and an activation function layer, where the two encoders are structurally identical to the encoder parts of the two denoising autoencoders and are defined as the user encoder and the item encoder; the outputs of the two encoders are fused by a Hadamard product and then passed in turn through the fully connected layer with output dimension 1 and a Sigmoid activation function layer, giving the implicit feedback probability v with values in the interval (0,1); the overall model uses binary cross entropy as its loss function L_total;
3.2) loading the pre-trained encoder parameters of the two denoising autoencoders into the user encoder and the item encoder of the overall model respectively for initialization, with the fully connected layer parameters of the overall model initialized to zero, giving the initialized overall model parameters.
Further, step 4) comprises the following steps:
4.1) taking the user side information vectors, the item side information vectors and the implicit feedback labels as the training data of the overall model, and iteratively updating the overall model parameters with the FedAvg algorithm of federated learning to obtain the trained overall model;
4.2) for each user, randomly drawing M unrated items for prediction, in the following manner: the side information vector of the user and the side information vector of any one of the drawn unrated items are taken as the input vectors of the user encoder and the item encoder of the overall model respectively, and forward propagation yields the probability value v of implicit feedback between the user and the item, so that the implicit feedback probability values of each user for the M unrated items are obtained;
4.3) for each user, the implicit feedback probability values are sorted in descending order, generating a recommended item list ranked from the highest recommendation degree to the lowest.
Compared with the prior art, the invention has the following advantages and beneficial effects:
1. Compared with centralized recommendation systems, the method decentralizes the training process by means of federated learning, keeping private data on the local device, while recommendation accuracy shows no significant loss relative to the non-federated setting.
2. Compared with collaborative filtering methods based on matrix factorization, the method additionally exploits user and item side information as part of the features, better mines user preferences and item characteristics, and alleviates the user cold-start problem.
3. Compared with other collaborative filtering methods based on neural networks, the method uses denoising autoencoders as the encoding structure of the model, so that user and item features are extracted better and recommendation accuracy is improved.
4. Compared with other conventional recommendation methods, the method supports pre-training through the user and item denoising autoencoders, which reduces model training time.
Drawings
FIG. 1 is a schematic logic flow diagram of the method of the invention.
FIG. 2 is a diagram of the overall model structure.
FIG. 3 is a table of experimental results on the MovieLens-100K data set.
Detailed Description
The present invention will be described in further detail with reference to examples and drawings, but the present invention is not limited thereto.
As shown in FIG. 1, this embodiment provides a federated hybrid filtering recommendation method based on user privacy protection, which collects public user attributes and item attributes as side information vectors, builds an overall model, and uses a federated learning framework to break data silos and realize collaborative recommendation, comprising the following steps:
1) collecting the publicly available user-item rating information to form a rating matrix as explicit feedback, obtaining implicit feedback by binarizing the rating matrix, and collecting public user attributes as user side information and public item attributes as item side information. Explicit feedback is behavior in which the user directly expresses a preference for an item, i.e. the user rating the item; implicit feedback is behavior that does not directly reflect the user's preference, i.e. whether or not the user has rated the item. This comprises the following steps:
1.1) collecting the publicly available user-item rating information to form a rating matrix R. The rating matrix R is a two-dimensional user-item matrix, expressed as:

R = (r_ij) ∈ ℝ^(m×n)

where r_ij is the rating of the i-th user for the j-th item, and there are m users and n items in total;
1.2) applying the custom binarization function Bin(r_ij) to the rating matrix R to obtain the implicit feedback matrix Bin(R) ∈ {0,1}^(m×n), defined as the user-item interaction matrix. That is, each user-item pair carries an implicit feedback label of value 0 or 1, where 0 indicates that no rating occurred and 1 indicates that a rating occurred. The binarization function Bin(r_ij) here is defined as:

Bin(r_ij) = 0 if r_ij = 0, and Bin(r_ij) = 1 otherwise;

1.3) collecting the public user attributes and item attributes and, using one-hot encoding, generating for each user a vector of binary user feature values and for each item a vector of binary item feature values representing those attributes, which in turn form the user side information matrix S_user and the item side information matrix S_item, expressed as:

S_user = (s_1^user, s_2^user, …, s_m^user)^T, with s_i^user = (b_user,1, b_user,2, …, b_user,d_user) ∈ ℝ^(d_user)
S_item = (s_1^item, s_2^item, …, s_n^item)^T, with s_j^item = (b_item,1, b_item,2, …, b_item,d_item) ∈ ℝ^(d_item)

where s_i^user is the side information vector of the i-th user, containing d_user binary user feature values, with b_user,x denoting the x-th binary user feature value, and ℝ^(d_user) denotes a d_user-dimensional vector space; s_j^item is the side information vector of the j-th item, containing d_item binary item feature values, with b_item,y denoting the y-th binary item feature value, and ℝ^(d_item) denotes a d_item-dimensional vector space.
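By way of illustration only (this sketch is not part of the original disclosure), the data preparation of steps 1.1)-1.3) could look as follows in Python; the toy data, the attribute choices (occupation, genre) and the helper name one_hot_side_info are assumptions made for the example.

import numpy as np

# Assumed toy data: m = 3 users, n = 4 items; 0 means "no rating yet".
ratings = np.array([[5, 0, 3, 0],
                    [0, 4, 0, 0],
                    [1, 0, 0, 2]], dtype=np.float32)

# Implicit feedback / interaction matrix: Bin(r_ij) = 1 iff a rating occurred.
interactions = (ratings != 0).astype(np.float32)

def one_hot_side_info(attr_values, vocab):
    """One-hot encode one categorical attribute into binary feature columns."""
    mat = np.zeros((len(attr_values), len(vocab)), dtype=np.float32)
    for row, value in enumerate(attr_values):
        mat[row, vocab.index(value)] = 1.0
    return mat

# Assumed attributes: user occupation and item genre.
S_user = one_hot_side_info(["student", "engineer", "student"],
                           ["student", "engineer"])            # m x d_user
S_item = one_hot_side_info(["action", "comedy", "comedy", "drama"],
                           ["action", "comedy", "drama"])      # n x d_item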
2) Building a denoising autoencoder for the user side information and another for the item side information; the two denoising autoencoders have the same structure, each consisting of noise addition, an encoder and a decoder, and are pre-trained separately in a self-supervised manner by stochastic gradient descent to obtain the parameters of the two pre-trained denoising autoencoders, comprising the following steps:
2.1) building a denoising autoencoder for the user side information and another for the item side information, referred to as the user denoising autoencoder and the item denoising autoencoder; each denoising autoencoder has the structure:

ŝ = g_dec(f_enc(d_noi(s)))

In this structural formula, s is the input vector of the denoising autoencoder, corresponding to the side information vector s_i^user of the i-th user or the side information vector s_j^item of the j-th item, where s_i^user and s_j^item are defined as:

s_i^user = (b_user,1, b_user,2, …, b_user,d_user), i = 1, …, m
s_j^item = (b_item,1, b_item,2, …, b_item,d_item), j = 1, …, n

where s_i^user is the side information vector of the i-th user (m users in total), containing d_user binary user feature values, with b_user,x denoting the x-th binary user feature value; s_j^item is the side information vector of the j-th item (n items in total), containing d_item binary item feature values, with b_item,y denoting the y-th binary item feature value. The binary user feature values and binary item feature values are produced from the collected user attributes and item attributes by one-hot encoding.
In the structural formula of the denoising autoencoder, ŝ is the reconstructed vector of the denoising autoencoder, and the function d_noi is the noise addition: additive noise is blended into the input vector s and range clipping is applied, giving the vector s̃. The additive noise is blended in as follows: to each element of the vector, the product of a noise factor t and a random number q following a Gaussian distribution is added, giving the vector z:

z^(p,k) = s^(p,k) + t·q,  q ~ N(0,1),  t ∈ [0,1]

where p indexes the different input vectors and s^(p,k) is the element of dimension k in one input vector.

The range clipping is formulated as:

s̃^(p,k) = min(max(z^(p,k), 0), 1)

where z is the vector blended with additive noise. This finally gives the vector s̃, blended with additive noise and range-clipped.
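Purely as an illustrative sketch (not part of the original disclosure), the noise addition d_noi and range clipping can be written as below; the noise factor value t = 0.2 is an assumption within the stated range t ∈ [0,1].

import numpy as np

def d_noi(s, t=0.2, rng=None):
    """Blend Gaussian additive noise into s, then clip back to [0, 1].

    Implements z = s + t*q with q ~ N(0, 1), followed by range clipping.
    """
    rng = np.random.default_rng() if rng is None else rng
    z = s + t * rng.standard_normal(s.shape)
    return np.clip(z, 0.0, 1.0)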
In the structural formula of the denoising autoencoder, the function f_enc is the encoder, which encodes the noised vector s̃ into the hidden vector h ∈ ℝ^(d_hidden), where ℝ^(d_hidden) denotes a d_hidden-dimensional vector space. The encoder f_enc consists of three fully connected layers and is formulated as:

h = W_3·Dropout(ReLU(W_2·Tanh(W_1·s̃ + b_1) + b_2)) + b_3

where W_2 ∈ ℝ^(d_2×d_1) and W_3 ∈ ℝ^(d_hidden×d_2). The encoder parts of the two denoising autoencoders each have their own set of learnable parameters, comprising the matrices W_1, W_2, W_3 and the vectors b_1, b_2, b_3. Specifically, for the encoder in the user denoising autoencoder the learnable parameters are the matrices W_user,1, W_user,2, W_user,3 and the vectors b_user,1, b_user,2, b_user,3, where W_user,1 has dimension d_1×d_user; for the encoder in the item denoising autoencoder the learnable parameters are the matrices W_item,1, W_item,2, W_item,3 and the vectors b_item,1, b_item,2, b_item,3, where W_item,1 has dimension d_1×d_item. Tanh and ReLU are two activation functions, and Dropout is a random inactivation function;
In the structural formula of the denoising autoencoder, the function g_dec is the decoder, which decodes the hidden vector h into the reconstructed vector ŝ approximating the input vector s. Mirroring the encoder, the decoder g_dec is formulated as:

ŝ = Sigmoid(W_6·Dropout(ReLU(W_5·Tanh(W_4·h + b_4) + b_5)) + b_6)

The decoder parts of the two denoising autoencoders each have their own set of learnable parameters, comprising the matrices W_4, W_5, W_6 and the vectors b_4, b_5, b_6. Specifically, for the decoder in the user denoising autoencoder the learnable parameters are the matrices W_user,4, W_user,5, W_user,6 and the vectors b_user,4, b_user,5, b_user,6, where W_user,6 has dimension d_user×d_5 and b_user,6 has dimension d_user; for the decoder in the item denoising autoencoder the learnable parameters are the matrices W_item,4, W_item,5, W_item,6 and the vectors b_item,4, b_item,5, b_item,6, where W_item,6 has dimension d_item×d_5 and b_item,6 has dimension d_item. Sigmoid is an activation function;
2.2) pre-training the two denoising autoencoders separately in a self-supervised manner by stochastic gradient descent to obtain the parameters of the two pre-trained denoising autoencoders, in detail as follows:

to measure the difference between the reconstructed vector ŝ and the input vector s, the cross entropy is introduced as the loss function of the denoising autoencoder, giving the loss function L_DAE:

L_DAE = −(1/N) Σ_{p=1..N} Σ_{k=1..d} [ s^(p,k)·log ŝ^(p,k) + (1 − s^(p,k))·log(1 − ŝ^(p,k)) ]

where N is the total number of input vectors in a batch, p indexes the p-th input vector, d is the total dimensionality of an input vector, and k indexes the k-th dimension of an input vector;

taking s_i^user and s_j^item as the input vectors of the user denoising autoencoder and the item denoising autoencoder respectively, the reconstructed vectors ŝ_i^user and ŝ_j^item are obtained by forward propagation; the loss values L_user and L_item are computed with the loss function, and back-propagation yields the gradient ∇_θc L_user of the user denoising autoencoder parameters and the gradient ∇_θc L_item of the item denoising autoencoder parameters, where θ_c denotes the respective parameters of the two denoising autoencoders;

the parameters of the two denoising autoencoders are updated by stochastic gradient descent: N different user side information vectors or N different item side information vectors are randomly selected as a unit batch and fed into the model; from this input, the mean gradient ḡ of the corresponding denoising autoencoder's parameter gradients over the batch is computed, and the model parameters are updated with the learning rate η as:

θ_c ← θ_c − η·ḡ

Q rounds of batch input selection and model parameter updates are performed in total, giving the parameters of the two denoising autoencoders used in the next stage.
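A minimal pre-training loop matching the update rule above might be sketched as follows (illustrative only; the batch size, number of rounds and learning rate are assumed values, and DenoisingAutoencoder refers to the sketch above):

import torch
import torch.nn as nn

def pretrain_dae(dae, side_info, N=128, Q=50, lr=0.01):
    """Self-supervised DAE pre-training with SGD and the cross-entropy loss L_DAE."""
    opt = torch.optim.SGD(dae.parameters(), lr=lr)   # theta <- theta - eta * grad
    loss_fn = nn.BCELoss()                           # binary cross entropy, i.e. L_DAE
    dae.train()
    for _ in range(Q):                               # Q rounds of batch updates
        idx = torch.randint(0, side_info.shape[0], (N,))
        s = side_info[idx]                           # random unit batch of N vectors
        s_hat, _ = dae(s)                            # forward propagation
        loss = loss_fn(s_hat, s)                     # compare reconstruction with s
        opt.zero_grad()
        loss.backward()                              # back-propagate the gradients
        opt.step()
    return dae

In this sketch, pretrain_dae would be called once with the user side information matrix and once with the item side information matrix, yielding the two pre-trained encoders used in the next stage.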
3) Constructing an overall model composed of two encoders, a fully connected layer and an activation function layer, where the two encoders are structurally identical to the encoder parts of the two denoising autoencoders and are defined as the user encoder and the item encoder; after the overall model is built, the pre-trained encoder parameters of the two denoising autoencoders are loaded into the user encoder and the item encoder of the overall model respectively for initialization, comprising the following steps:

3.1) constructing the overall model composed of two encoders, a fully connected layer and an activation function layer, where the two encoders are structurally identical to the encoder parts of the two denoising autoencoders and are defined as the user encoder and the item encoder; the outputs of the two encoders are fused by a Hadamard product and then passed in turn through the fully connected layer with output dimension 1 and the Sigmoid activation function layer, giving the implicit feedback probability v with values in the interval (0,1), formulated as:

v = Sigmoid(W_7·(h_i^user ⊙ h_j^item) + b_7)

where h_i^user denotes the hidden vector of the i-th user (m users in total, one hidden vector per user) and h_j^item denotes the hidden vector of the j-th item (n items in total, one hidden vector per item); the user hidden vector is the output of the user encoder and the item hidden vector is the output of the item encoder; ⊙ denotes the Hadamard product; W_7 has dimension 1×d_hidden and b_7 has dimension 1;

to estimate the prediction error of the overall model, binary cross entropy is used as the loss function to measure the error between the predicted probability value and the implicit feedback label.

3.2) loading the pre-trained encoder parameters of the two denoising autoencoders into the user encoder and the item encoder of the overall model respectively for initialization; the fully connected layer parameters of the overall model are initialized to zero, giving the initialized overall model parameters.
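An illustrative sketch of steps 3.1) and 3.2) follows (not part of the original disclosure; it reuses the DenoisingAutoencoder sketch above, and d_hidden = 64 is an assumed size):

import torch
import torch.nn as nn

class OverallModel(nn.Module):
    """Two pre-trained encoders fused by a Hadamard product, then FC + Sigmoid."""

    def __init__(self, user_dae, item_dae, d_hidden=64):
        super().__init__()
        self.user_encoder = user_dae.encoder   # pre-trained user encoder parameters
        self.item_encoder = item_dae.encoder   # pre-trained item encoder parameters
        self.fc = nn.Linear(d_hidden, 1)       # W_7 (1 x d_hidden) and b_7
        nn.init.zeros_(self.fc.weight)         # FC parameters initialized to zero
        nn.init.zeros_(self.fc.bias)

    def forward(self, s_user, s_item):
        h_user = self.user_encoder(s_user)                 # user hidden vector
        h_item = self.item_encoder(s_item)                 # item hidden vector
        fused = h_user * h_item                            # Hadamard product
        return torch.sigmoid(self.fc(fused)).squeeze(-1)   # v in (0, 1)

Training would then compare v with the implicit feedback label through nn.BCELoss(), i.e. the binary cross entropy L_total.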
4) Iteratively updating the overall model parameters with the FedAvg algorithm of federated learning, generating probability values of implicit feedback from different users to items with the updated overall model, and thereby generating a recommended item list for each user, comprising the following steps:

4.1) taking the user side information vectors, the item side information vectors and the implicit feedback labels as the training data of the overall model, and iteratively updating the overall model parameters with the FedAvg algorithm of federated learning to obtain the trained overall model.

The FedAvg algorithm of federated learning is a model training algorithm that, when the data involves user privacy, updates parameters using protected user data. The algorithm proceeds as follows:

a. the central server sends the initialized or updated cloud overall model parameters and the full item data set to each of a randomly selected fraction C of the clients; each selected client splits its internal user data set into batches of size B and performs E rounds of training on a local overall model whose parameters are initialized to the cloud overall model parameters, obtaining different updated local overall model parameters; here the overall model parameters held on the central server are called the cloud overall model parameters, and the overall model parameters held on each client are called the local overall model parameters, one set per client;

b. taking the ratio of each client's internal user data volume to the total user data volume of all selected clients as its weight, the updated local overall model parameters of each selected client are uploaded to the server and aggregated by weighted summation into new cloud overall model parameters, which is recorded as one global round;

c. the algorithm terminates once T global rounds have been performed; otherwise it returns to step a.

Here, jointly considering the communication overhead, the training convergence speed and the overall model accuracy, the number of global rounds T is set to 5, the fraction C of clients selected per round is set to 0.4, the batch size B is set to 512, and the number of in-client rounds E is set to 4.
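For illustration only, a simplified, single-process, non-secure FedAvg sketch with the hyperparameters above is given below; client selection, data handling and aggregation are reduced to their essentials, the learning rate is an assumed value, and each client dataset is assumed to yield (s_user, s_item, label) tuples.

import copy
import random
import torch

def fedavg(global_model, clients, T=5, C=0.4, E=4, B=512, lr=0.01):
    """clients: list of (dataset, num_users) pairs; returns the updated global model."""
    for _ in range(T):                                    # T global rounds
        selected = random.sample(clients, max(1, int(C * len(clients))))
        total_users = sum(n for _, n in selected)
        agg_state = None
        for dataset, n in selected:
            local = copy.deepcopy(global_model)           # init from cloud parameters
            opt = torch.optim.SGD(local.parameters(), lr=lr)
            loader = torch.utils.data.DataLoader(dataset, batch_size=B, shuffle=True)
            local.train()
            for _ in range(E):                            # E in-client rounds
                for s_user, s_item, label in loader:
                    loss = torch.nn.functional.binary_cross_entropy(
                        local(s_user, s_item), label)
                    opt.zero_grad()
                    loss.backward()
                    opt.step()
            w = n / total_users                           # weight by local data share
            state = {k: w * v for k, v in local.state_dict().items()}
            agg_state = state if agg_state is None else {
                k: agg_state[k] + state[k] for k in state}
        global_model.load_state_dict(agg_state)           # weighted aggregation
    return global_model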
To evaluate the accuracy of the overall model, the MovieLens-100K data set is used, and the HitRatio@K and NDCG@K metrics are used to test the overall model (denoted Hybrid CF in the table) against the existing MLP, GMF and NeuMF models; the experimental results are shown in FIG. 3.
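As a brief illustrative aside (not from the original disclosure), the two metrics have standard definitions, sketched here for the common setting of one held-out test item per user:

import math

def hit_ratio_at_k(ranked_items, test_item, k=10):
    """HR@K: 1 if the held-out test item appears in the top-K recommendations."""
    return 1.0 if test_item in ranked_items[:k] else 0.0

def ndcg_at_k(ranked_items, test_item, k=10):
    """NDCG@K with one relevant item: 1/log2(rank + 2) if it is ranked in the top K."""
    for rank, item in enumerate(ranked_items[:k]):
        if item == test_item:
            return 1.0 / math.log2(rank + 2)
    return 0.0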
4.2) for each user, M unrated items are randomly drawn for prediction. The prediction proceeds as follows: the side information vector of the user and the side information vector of any one of the drawn unrated items are taken as the input vectors of the user encoder and the item encoder of the overall model respectively, and forward propagation yields the probability value v of implicit feedback between the user and the item. Each user thus obtains M implicit feedback probability values for the M randomly drawn items;

4.3) for each user, the implicit feedback probability values are sorted in descending order; the input item behind each implicit feedback probability value in the sorted sequence is a recommended item, thereby generating a recommended item sequence ranked from the highest recommendation degree to the lowest.
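Steps 4.2) and 4.3) could be sketched as below (illustrative only; the tensor shapes, M = 100 and the candidate-drawing scheme are assumptions, and OverallModel refers to the sketch above):

import torch

def recommend_for_user(model, s_user, unrated_item_vecs, M=100):
    """Score M randomly drawn unrated items for one user and rank them."""
    model.eval()
    idx = torch.randperm(unrated_item_vecs.shape[0])[:M]    # draw M unrated items
    items = unrated_item_vecs[idx]
    with torch.no_grad():
        v = model(s_user.expand(items.shape[0], -1), items)  # implicit-feedback probs
    order = torch.argsort(v, descending=True)                # sort descending by v
    return idx[order]                                        # ranked recommendation list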
The above embodiment is a preferred embodiment of the present invention, but the embodiments of the present invention are not limited thereto; any other change, modification, substitution, combination or simplification that does not depart from the spirit and principle of the present invention shall be an equivalent replacement and is included within the scope of protection of the present invention.

Claims (5)

1. A federated hybrid filtering recommendation method based on user privacy protection, characterized by comprising the following steps:
1) collecting publicly available user-item rating information to form a rating matrix as explicit feedback, obtaining implicit feedback by binarizing the rating matrix, and collecting public user attributes as user side information and public item attributes as item side information; the explicit feedback is a signal that directly expresses the user's degree of preference for an item, i.e. the user's rating of the item, and the implicit feedback is a signal that does not directly reflect the user's preference, i.e. whether or not the user has rated the item;
2) building a denoising autoencoder for the user side information and another for the item side information, the two denoising autoencoders having the same structure and each consisting of three parts, namely noise addition, an encoder and a decoder, and pre-training the two denoising autoencoders separately in a self-supervised manner by stochastic gradient descent to obtain the parameters of the two pre-trained denoising autoencoders;
3) constructing an overall model composed of two encoders, a fully connected layer and an activation function layer, the two encoders being structurally identical to the encoder parts of the two denoising autoencoders and being defined as a user encoder and an item encoder; after the overall model is built, loading the pre-trained encoder parameters of the two denoising autoencoders into the user encoder and the item encoder of the overall model respectively for initialization;
4) iteratively updating the overall model parameters with the FedAvg algorithm of federated learning, and generating probability values of implicit feedback from different users to items with the updated overall model, thereby generating a recommended item list for each user.
2. The federated hybrid filtering recommendation method based on user privacy protection according to claim 1, characterized in that step 1) comprises the following steps:
1.1) collecting the publicly available user-item rating information to form a rating matrix R, which is a two-dimensional user-item matrix expressed as:

R = (r_ij) ∈ ℝ^(m×n)

where r_ij is the rating of the i-th user for the j-th item, and there are m users and n items in total;
1.2) applying the custom binarization function Bin(r_ij) to the rating matrix R to obtain the implicit feedback matrix Bin(R) ∈ {0,1}^(m×n), defined as the user-item interaction matrix; that is, each user-item pair carries an implicit feedback label of value 0 or 1, where 0 indicates that no rating occurred and 1 indicates that a rating occurred; the binarization function Bin(r_ij) is defined as:

Bin(r_ij) = 0 if r_ij = 0, and Bin(r_ij) = 1 otherwise;

1.3) collecting the public user attributes and item attributes and, using one-hot encoding, generating for each user a vector of binary user feature values and for each item a vector of binary item feature values representing those attributes, i.e. the user side information vector and the item side information vector, which in turn form the user side information matrix S_user and the item side information matrix S_item, expressed as:

S_user = (s_1^user, s_2^user, …, s_m^user)^T, with s_i^user = (b_user,1, b_user,2, …, b_user,d_user) ∈ ℝ^(d_user)
S_item = (s_1^item, s_2^item, …, s_n^item)^T, with s_j^item = (b_item,1, b_item,2, …, b_item,d_item) ∈ ℝ^(d_item)

where s_i^user is the side information vector of the i-th user, containing d_user binary user feature values, with b_user,x denoting the x-th binary user feature value, and ℝ^(d_user) denotes a d_user-dimensional vector space; s_j^item is the side information vector of the j-th item, containing d_item binary item feature values, with b_item,y denoting the y-th binary item feature value, and ℝ^(d_item) denotes a d_item-dimensional vector space.
3. The federated hybrid filtering recommendation method based on user privacy protection according to claim 1, characterized in that step 2) comprises the following steps:
2.1) building a denoising autoencoder for the user side information and another for the item side information, referred to as the user denoising autoencoder and the item denoising autoencoder; each denoising autoencoder has the structure:

ŝ = g_dec(f_enc(d_noi(s)))

where s is the input vector of the denoising autoencoder, i.e. a side information vector; ŝ is the reconstructed vector of the denoising autoencoder; the function d_noi is the noise addition; the function f_enc is the encoder; and the function g_dec is the decoder;
2.2) pre-training the two denoising autoencoders separately in a self-supervised manner by stochastic gradient descent to obtain the parameters of the two pre-trained denoising autoencoders, each consisting of an encoder part and a decoder part; to measure the difference between the reconstructed vector ŝ and the input vector s, the cross entropy is introduced as the loss function of the two denoising autoencoders, giving the loss function L_DAE:

L_DAE = −(1/N) Σ_{p=1..N} Σ_{k=1..d} [ s^(p,k)·log ŝ^(p,k) + (1 − s^(p,k))·log(1 − ŝ^(p,k)) ]

where N is the total number of input vectors in a batch, p indexes the p-th input vector, d is the total dimensionality of an input vector, and k indexes the k-th dimension of an input vector;

the user side information vector and the item side information vector are defined as:

s_i^user = (b_user,1, b_user,2, …, b_user,d_user), i = 1, …, m
s_j^item = (b_item,1, b_item,2, …, b_item,d_item), j = 1, …, n

where s_i^user is the side information vector of the i-th user (m users in total), containing d_user binary user feature values, with b_user,x denoting the x-th binary user feature value; s_j^item is the side information vector of the j-th item (n items in total), containing d_item binary item feature values, with b_item,y denoting the y-th binary item feature value;

taking s_i^user and s_j^item as the input vectors of the user denoising autoencoder and the item denoising autoencoder respectively, the reconstructed vectors ŝ_i^user and ŝ_j^item are obtained by forward propagation; the loss values L_user and L_item are computed with the loss function, and back-propagation yields the gradient ∇_θc L_user of the user denoising autoencoder parameters and the gradient ∇_θc L_item of the item denoising autoencoder parameters, where θ_c denotes the respective parameters of the two denoising autoencoders;

the parameters of the two denoising autoencoders are updated by stochastic gradient descent: N different user side information vectors or item side information vectors are randomly selected as a unit batch, and Q rounds of batch input selection and updating of the user denoising autoencoder parameters or the item denoising autoencoder parameters are performed in total, giving the parameters of the two denoising autoencoders used in the next stage.
4. The federated hybrid filtering recommendation method based on user privacy protection according to claim 1, characterized in that step 3) comprises the following steps:
3.1) constructing an overall model composed of two encoders, a fully connected layer and an activation function layer, where the two encoders are structurally identical to the encoder parts of the two denoising autoencoders and are defined as the user encoder and the item encoder; the outputs of the two encoders are fused by a Hadamard product and then passed in turn through the fully connected layer with output dimension 1 and a Sigmoid activation function layer, giving the implicit feedback probability v with values in the interval (0,1); the overall model uses binary cross entropy as its loss function L_total;
3.2) loading the pre-trained encoder parameters of the two denoising autoencoders into the user encoder and the item encoder of the overall model respectively for initialization, with the fully connected layer parameters of the overall model initialized to zero, giving the initialized overall model parameters.
5. The federated hybrid filtering recommendation method based on user privacy protection according to claim 1, characterized in that step 4) comprises the following steps:
4.1) taking the user side information vectors, the item side information vectors and the implicit feedback labels as the training data of the overall model, and iteratively updating the overall model parameters with the FedAvg algorithm of federated learning to obtain the trained overall model;
4.2) for each user, randomly drawing M unrated items for prediction, in the following manner: the side information vector of the user and the side information vector of any one of the drawn unrated items are taken as the input vectors of the user encoder and the item encoder of the overall model respectively, and forward propagation yields the probability value v of implicit feedback between the user and the item, so that the implicit feedback probability values of each user for the M unrated items are obtained;
4.3) for each user, the implicit feedback probability values are sorted in descending order, generating a recommended item list ranked from the highest recommendation degree to the lowest.
CN202210379463.8A 2022-04-12 2022-04-12 Federated hybrid filtering recommendation method based on user privacy protection Active CN114723067B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210379463.8A CN114723067B (en) Federated hybrid filtering recommendation method based on user privacy protection

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210379463.8A CN114723067B (en) Federated hybrid filtering recommendation method based on user privacy protection

Publications (2)

Publication Number Publication Date
CN114723067A 2022-07-08
CN114723067B CN114723067B (en) 2023-05-23

Family

ID=82243698

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210379463.8A Active CN114723067B (en) 2022-04-12 2022-04-12 Federal mixed filtering recommendation method based on user privacy protection

Country Status (1)

Country Link
CN (1) CN114723067B (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190303838A1 (en) * 2018-03-30 2019-10-03 Atlassian Pty Ltd Using a productivity index and collaboration index for validation of recommendation models in federated collaboration systems
CN109783739A * 2019-01-23 2019-05-21 北京工业大学 Collaborative filtering recommendation method based on stacked sparse denoising autoencoder enhancement
CN110297848A * 2019-07-09 2019-10-01 深圳前海微众银行股份有限公司 Recommendation model training method, terminal and storage medium based on federated learning
CN111553744A * 2020-05-08 2020-08-18 深圳前海微众银行股份有限公司 Federated product recommendation method, device, equipment and computer storage medium

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
MUHAMMAD AMMAD-UD-DIN et al.: "Federated collaborative filtering for privacy-preserving personalized recommendation system"
PASCAL VINCENT et al.: "Extracting and composing robust features with denoising autoencoders"
李康康 et al.: "Research on federated personalized learning recommendation systems" (联邦个性化学习推荐系统研究)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117454185A * 2023-12-22 2024-01-26 深圳市移卡科技有限公司 Federated model training method and device, computer device, and storage medium
CN117454185B * 2023-12-22 2024-03-12 深圳市移卡科技有限公司 Federated model training method and device, computer device, and storage medium

Also Published As

Publication number Publication date
CN114723067B (en) 2023-05-23


Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant