CN116226540A

CN116226540A - End-to-end federation personalized recommendation method and system based on user interest domain

Info

Publication number: CN116226540A
Application number: CN202310512758.2A
Authority: CN
Inventors: 吴超; 章焕锭; 李皓
Original assignee: Zhejiang University ZJU
Current assignee: Zhejiang University ZJU
Priority date: 2023-05-09
Filing date: 2023-05-09
Publication date: 2023-06-06
Anticipated expiration: 2043-05-09
Also published as: CN116226540B

Abstract

The invention discloses an end-to-end federal personalized recommendation method and system based on a user interest domain, and belongs to the technical field of personalized recommendation. Based on the cluster-like personalized thought, the recommendation model with the personalized prediction network group and the client classification network based on the graph neural network are arranged, the classification network is embedded into the personalized recommendation model, and the recommendation model and the client classification network are used as a global model in a federal recommendation system for federal training. Through the design of the end-to-end personalized recommendation model, the clustering process of the client can be automatically completed in the forward calculation of the model, so that the training efficiency is improved; the client classification network can also be trained by recommending targets so that the clustering process becomes learnable. The invention solves the problems of low training efficiency and non-learning in the conventional cluster-like personalized recommendation method, and improves the performance of the personalized federal recommendation model.

Description

End-to-end federation personalized recommendation method and system based on user interest domain

Technical Field

The invention belongs to the technical field of personalized recommendation, and particularly relates to an end-to-end federal personalized recommendation method and system based on a user interest domain.

Background

The recommendation system is an effective means for relieving information overload, and is also an engine for increasing users, and is visible everywhere on each large Internet information platform. In recent years, with the increase of the privacy awareness of users and the release of various data security regulations, it is recommended to face privacy protection problems due to the need to use personal data of users. The combination of federal learning and recommendation is currently the main idea to solve this problem. However, the general federal recommendation method only focuses on privacy protection, ignores the problem of non-independent co-distribution (non-IID) of client (user) data under federal settings, and the problem can affect training of a model, resulting in impaired recommendation performance of a final model.

At present, the non-IID problem is solved in a plurality of ways, one of which is to adopt a personalized idea, namely, conversely, different models are customized for different clients by utilizing the nature of the non-IID. Besides the individuation method of the granularity of the client, the individuation scheme mainly clusters the clients to form a plurality of class clusters, and endows different models to different class clusters, wherein the clients in the same class cluster share one model. The two existing personalization methods have the following three defects in federal recommendation:

(1) Personalization of client granularity easily results in model overfitting local data, compromises generalization performance of the model, and once interest of a user is transformed, the model is difficult to recommend with high quality against new interests.

(2) The training model under the cluster-like granularity has more abundant data and wider coverage interest, so that the fitting problem is relieved; however, the method requires that a certain number of clients are clustered in each round of federal training, which affects training efficiency.

(3) Besides the training efficiency problem, the implementation process of the cluster granularity individuation method basically adopts an unsupervised clustering method and a non-learning method.

Disclosure of Invention

In order to solve the problem of non-IID of client data in the traditional federation recommendation application, the invention also adopts a personalized solution idea, and provides an end-to-end federation personalized recommendation method and system based on a cluster personalized idea, thereby improving the problems of low training efficiency and non-learning in the existing cluster personalized recommendation method and improving the performance of a federation recommendation model.

The invention provides the following technical scheme:

one of the purposes of the invention is to provide an end-to-end federal personalized recommendation method which mainly comprises two large contents of an end-to-end personalized recommendation model and a federal training process.

The end-to-end personalized recommendation model performs personalized design on the prediction network group part on the basis of the depth recommendation model, namely, z prediction networks are set, and z client class clusters are respectively corresponding to the z prediction networks. The invention does not personalize the whole recommendation model, but only sets a personalized prediction network group, because (1) the object ID embedding layer of the recommendation model has the characteristic sparsity problem, and users in one class cluster cannot cover all objects, so personalizing the layer can lead to some hidden vectors in one class cluster not to be trained, which brings the object cold start problem and is not beneficial to the generalization of the model; (2) Among the parameters of the federal recommendation model, the item ID embedding layer accounts for a large part, and sharing the layer by all clients is beneficial to saving the storage cost of the parameters.

After the personalized prediction network group is set, the invention also sets a client classification network based on a graph neural network, mainly regards a client (user) and an object interacted with the client as a star graph, namely a client-object interaction graph, takes the interaction graph, a user ID embedded representation and an object ID embedded representation in the interaction graph as the input of the graph neural network, wherein the user ID embedded representation and the object ID embedded representation are the characteristic representation of each node in the graph, and outputs probability distribution vectors of the class cluster of the client corresponding to the interaction graph after passing through the graph neural network.

An end-to-end federation personalized recommendation method based on a user interest domain mainly comprises the following steps:

step 1: the method comprises the steps that a service end builds a global model based on the preset client class cluster number and initializes model parameters, wherein the global model consists of a client classification network and a recommendation model, and the recommendation model comprises a prediction network group, a user ID embedding layer and an article ID embedding layer; distributing the user ID embedded layer parameters in the global model to all clients participating in federal training;

step 2, constructing a local training data set by all clients of federal training;

step 3: the server randomly selects a part of clients to participate in the federal training of the current round, and distributes other parameters except the user ID embedding layer in the global model to the selected clients;

step 4: the selected client trains the received global model by using the local training data set, updates local user ID embedded layer parameters and sends the rest parameters back to the server;

step 5: the server side aggregates global model training parameters except the user ID embedded layer sent back by the client side, and takes an aggregate result as a latest global model;

step 6: repeating the steps 3 to 5 until the global model converges; and each client terminal carries out personalized recommendation of the target object according to the locally stored user ID embedded layer parameters and the latest global model parameters which are transmitted by the server terminal and are except the user ID embedded layer.

Further, the prediction network group is composed of a plurality of parallel multi-layer perceptron (MLP).

Further, the client classification network comprises a graph neural network, a multi-layer perceptron and a softmax layer; the input of the graphic neural network is a user-object interaction graphic, a user ID embedded representation and a history interaction object ID embedded representation; the user ID embedded representation and the historical interaction article ID embedded representation are respectively generated by a user ID embedded layer and an article ID embedded layer in the recommendation model; and the output of the graph neural network sequentially passes through a multi-layer perceptron and a softmax layer to obtain cluster probability distribution vectors of the client.

Further, the number of the client class clusters, the number of the parallel multi-layer perceptrons in the predicted network group and the dimension of the class cluster probability distribution vector output by the client classification network are the same.

Further, the cluster probability distribution vector output by the client classification network is used as the weight of the output result of the prediction network group, the weight is in a soft label form, and the weighted average value of the output result of the prediction network group is used as the final output of the recommendation model.

Further, in the recommendation model, the input of the user ID embedding layer is user ID, the input of the object ID embedding layer is history interaction object ID and target object ID, and the user ID embedding layer and the object ID embedding layer generate user ID embedding representation, history interaction object ID embedding representation and target object ID embedding representation; and splicing the user ID embedded representation, the average value of the historical interactive object ID embedded representation and the target object ID embedded representation to be used as the input of the prediction network group in the recommendation model.

Another object of the present invention is to provide an end-to-end federal personalized recommendation system based on a user interest domain for implementing the above method; the end-to-end federal personalized recommendation system comprises: the system comprises a server side and a plurality of clients, wherein the server side is used for constructing a global model based on the preset client class cluster number, initializing model parameters and aggregating the global model parameters except the user ID embedding layer after the training of each client side; the global model consists of a client classification network and a recommendation model, wherein the recommendation model comprises a prediction network group, a user ID embedding layer and an article ID embedding layer; the client is used for training the global model by using the local training data set and returning the global model training parameters except the user ID embedding layer to the server;

each client comprises:

the data construction module is used for constructing a local training data set of the client, marking the article actually interacted with the user as a positive sample based on the historical interaction data of the user and the article, setting a negative sampling rate, randomly sampling a plurality of negative samples in a global article set, and ensuring that no intersection exists between the positive sample and the negative sample;

the model training module is used for receiving global model parameters sent by the server side and then loading the latest global model, training the global model based on the training data set obtained by the data construction module, updating local user ID embedded layer parameters and sending the rest parameters back to the server side;

the data transmission module is used for data transmission between the server and the client and comprises model parameters sent by the client to the server after the client receives the model parameters sent by the server and model parameters sent by the client to the server after the local training is finished.

Further, the client classification network in the global model is used for generating a class cluster probability distribution vector of the client, the class cluster probability distribution vector is used as a weight of the output result of the prediction network group, and a weighted average value of the output result of the prediction network group is used as a final output of the recommendation model.

The invention has the technical effects that: according to the invention, through personalized recommendation of the predicted network group part of the model and embedding the cluster-like division process into the model, the clustering of the client is automatically completed in the forward calculation of the model, and the training efficiency is improved; the set client classification network can train through the recommended targets, so that the clustering process becomes learnable; in addition, the classification network is set as the graph neural network, so that the classification effect of the class clusters is enhanced, and the recommendation performance of the model is improved.

Drawings

FIG. 1 is a schematic diagram of a framework of an end-to-end federal personalized recommendation method according to an embodiment of the present invention;

FIG. 2 is a schematic diagram of a federal training process in an end-to-end federal personalized recommendation method according to an embodiment of the present invention;

FIG. 3 is a hierarchical structure diagram of an end-to-end federal personalized recommendation system according to an embodiment of the present invention;

fig. 4 shows training results on a public recommended dataset according to an embodiment of the present invention, wherein (a) the evaluation index is AUC, and (b) the evaluation index is hitrate@10.

Detailed Description

The invention is further described below with reference to the accompanying drawings.

The invention discloses an end-to-end training idea, which embeds a client classification network into a personalized recommendation model, and specifically comprises the following steps: designing a client classification network and a recommendation model with prediction network groups, taking each probability value in probability distribution vectors output by the client classification network as the weight output by each prediction network in the prediction network groups in the recommendation model, namely carrying out weighted average on the output of the prediction network groups through the probability distribution vectors, and taking the weighted average result as the final output of the personalized recommendation model. In this way, the client can calculate the probability distribution of the class cluster to which the client belongs according to the characteristics (interaction diagram) of the client when the model is calculated in the forward direction, and select the most suitable prediction network combination according to the probability distribution vector, and the whole process is completed in the forward direction calculation of the model, so that the end-to-end individuation is realized, and the training efficiency of the individuation of the class cluster is improved; when the model gradient is reversely updated, the client classification network can also update through the recommended target, so that the purpose of learning is realized. The output of the client classification network is set in the form of a soft label, and no hardening process is performed (e.g., [0.1, 0.1, 0.8] can be hardened to be [0, 1 ]), because different clusters are not completely irrelevant, and certain similarity exists, so that weighted average by the soft label is more reasonable.

As shown in fig. 1, the end-to-end personalized recommendation model provided by the invention mainly comprises two core parts, wherein one of the core parts is that a personalized prediction network group is arranged in the recommendation model, and the number z of the prediction networks is a super parameter, which means that clients are divided into z class clusters; the other core is to design a client classification network based on a graph neural network, and for each client, the input is a client (user) -object interaction graph and the output is a cluster probability distribution vector to which the corresponding client belongs. The invention embeds the client classification network into the personalized recommendation model, which is specifically as follows: the method not only realizes end-to-end class cluster division, but also enables the client classification network to learn through a recommendation target.

FIG. 2 is a federal training process in accordance with one embodiment of the present invention, wherein the global model is the model shown in FIG. 1, including a recommendation model and a client classification network; the parameters of the recommendation model can be divided into three parts, namely a predicted network group parameter, a user ID embedded layer parameter and an article ID embedded layer parameter, and the user ID embedded layer parameter cannot be acquired by a server side in consideration of privacy protection of a client side, so that the part of parameters need to be updated locally, namely after the server side is initialized, the user ID embedded layer parameter is distributed to a corresponding client side, the part of parameters are not local any more, and the rest of parameters of the global model are still uploaded to the server side for aggregation update after being trained locally.

The whole federal training process is as follows:

step 1: the server sets training parameters such as the sampling rate of the client, the local training round of the client, the batch size of the local training of the client and the like; setting the number z of class clusters, initializing model parameters, wherein the global model consists of a recommendation model and a client classification network, the recommendation model comprises a user ID embedding layer, an article ID embedding layer and a prediction network group, and the client classification network comprises a graph neural network and a multi-layer perceptron. The user ID embedded layer parameters in the global model are distributed to all clients participating in federal training.

Step 2: the client builds a local training dataset.

Step 3: the server randomly selects a part of clients to participate in the federal training of the round based on the client sampling rate, and distributes other parameters except the user ID embedding layer in the global model to the selected clients.

Step 4: the selected client trains the received global model locally according to training parameters set by the server to obtain parameter updating gradients, updates own user ID embedded layer parameters locally, and sends the trained parameters except the user ID embedded layer back to the server.

In the step, as shown in fig. 1, each client participating in the present training takes a user ID as a user ID embedding layer in a recommendation model, takes a history interaction article ID and a target article ID as an article ID embedding layer in the recommendation model, and respectively obtains a user ID embedding representation, a history interaction article ID embedding representation and a target article ID embedding representation; the user ID embedded representation, the average value of the historical interactive object ID embedded representation and the target object ID embedded representation are spliced to be used as the input of a prediction network group in the recommendation model; meanwhile, a user-object interaction diagram, a user ID embedded representation and a history interaction object ID embedded representation are used as inputs of a graphic neural network in a client classification network, an output result of the graphic neural network is used as inputs of a multi-layer perceptron, and then a cluster probability distribution vector corresponding to the client is output through softmax; the dimension of the class cluster probability distribution vector is consistent with the number of the prediction networks in the prediction network group, the class cluster probability distribution vector of the classification network is used as weight, the output of the prediction network group is subjected to weighted average, the output after weighted average is used as the final output of the recommendation model, and the gradient is updated according to the final output calculation parameter.

Step 5: the server adopts FedAvg algorithm to aggregate the global model training parameters except the user ID embedded layer sent back by the client, and takes the aggregate result as the latest global model.

Step 6: the server repeats the steps 3 to 5 until the whole training process is converged; and each client terminal carries out personalized recommendation of the target object according to the locally stored user ID embedded layer parameters and the latest global model parameters which are transmitted by the server terminal and are except the user ID embedded layer.

In one implementation of the invention, the global model is based on a PyTorch implementation. The local training parameters set by the server in the step 1 are 10% of the sampling rate of the client, the local training turns are 5, and the local training batch size is 1024. The number z of the predicted networks in the predicted network group of the end-to-end personalized recommendation model is 16, each predicted network is a four-layer perceptron with the size of [48, 200, 80, 1], reLu is sampled as a nonlinear activation function, only the ID characteristics (ID embedded representation extracted by an ID embedded layer) of a user and an article are utilized, the dimension of the ID embedded layer of the user and the article is set to be 16, and the input of the predicted network group in the recommendation model is a 48-dimensional vector formed by splicing three types of user ID embedded representation, average value of historical interaction article ID embedded representation and target article ID embedded representation. The client classification network part is provided with two layers of graph neural networks in total, wherein the two layers of graph neural networks are in a GCN neighborhood information aggregation form, and the two layers of graph neural networks are input into a user-object interaction graph, a user ID embedded representation and a history interaction object ID embedded representation by adopting a vector splicing mode as an aggregation function of different-order neighborhood, the output of each layer of the graph neural network part is a 16-dimensional vector, and the output of the graph neural network part is subjected to three-layer perceptrons and softmax layers of [16, 80, 16] to finally output a cluster probability distribution vector of the client.

The window of the user history interactive objects is set to be 50, and the situation that less than 50 interactive objects are replaced by the No. 0 object.

In one implementation of the present invention, step 2 is primarily a local dataset used to build a training recommendation model. The local interaction data of all clients are ordered according to time sequence, wherein the first 80% is used as a training set, and the second 20% is used as a testing set. The negative sampling rate of the training set is set to 1:4, i.e. 1 positive sample corresponds to 4 negative samples, and the negative sampling rate of the test set is set to 1:99, i.e. 1 positive sample corresponds to 99 negative samples.

In one embodiment of the present invention, the optimizer of the client local training model in step 4 uses Adam, the learning rate is set to 0.01, adam optimizer parameters are initialized before each client starts training in each round of federal training, and the parameters do not participate in federal aggregation.

In one embodiment of the present invention, the service end in step 5 adopts FedAvg algorithm aggregation model, that is, fedAvg algorithm, and the parameter aggregation process can be expressed as

Wherein t represents the t-th round of federal training, K represents the number of clients participating in the present round of federal training (which may be numbered 1,2, …, K), n _k Representing the data set size of the local client, N is the overall data set size formed by the clients participating in the current round of federal training,

representing global model parameters to be aggregated, W, obtained by local training of kth client of t-th round _t+1 Represents the initial aggregation parameters for the t+1st round of federal training.

In one implementation of the present invention, the movie rating dataset MovieLens-1M is employed as the experimental dataset and the interaction data is partitioned by user ID to simulate the federal recommended scenario. AUC and HitRate@10 were used as evaluation indices.

Because the user is recorded as the client, the MovieLens-1M data set contains 6040 users, and therefore 6040 intelligent terminals are required to perform experiments according to actual conditions, and the experiment resources are limited. Fig. 4 (a) and (b) each show experimental results of an embodiment of the present invention, the abscissa shows federal communication round, and the ordinate of (a) and (b) in fig. 4 is AUC and hitrate@10, respectively, wherein the dotted line shows a conventional recommendation model, only including an embedded layer and a multi-layer perceptron, and the solid line shows a global model shown in fig. 1 proposed by the present invention. The experimental result shows that the effect of the invention is obviously better than that of the traditional recommendation model, and the effectiveness of the invention is demonstrated.

In this embodiment, an end-to-end federal personalized recommendation system based on the user interest domain is also provided, as shown in fig. 1, and the system is used to implement the foregoing embodiments. The terms "module," "unit," and the like, as used below, may be a combination of software and/or hardware that performs a predetermined function. Although the system described in the following embodiments is preferably implemented in software, implementation in hardware, or a combination of software and hardware, is also possible.

The federal learning aggregation system based on no data distillation provided in this embodiment includes:

fig. 3 is a hierarchical structure diagram of the end-to-end federal personalized recommendation system proposed by the present invention, and the whole is composed of a server and a plurality of clients.

The server side represents a certain article recommendation platform in practical application, and needs to train to obtain a global model through a federal learning mode under the condition of privacy protection, and a specific flow is shown in fig. 2 and is already described in detail in the method section.

The client represents an intelligent terminal (such as a mobile phone) in practical application, and a registered user of the article recommendation platform browses articles through the intelligent terminal, so that one terminal can be considered to correspond to one user, namely one client corresponds to one user.

Each client comprises three modules:

and the data construction module is used for constructing a local training data set and a test data set of the client. The training set and the testing set are divided through the interaction time of the user and the article, and the training set and the testing set are specifically as follows: the local interaction data of the client are ordered in time sequence, the former part of data is used for constructing a training set, and the latter part of data is used for constructing a testing set.

In this embodiment, the training data set and the test data set are constructed according to implicit feedback, specifically: based on the interaction data of the user and the objects, the objects actually interacted with the user are marked as positive samples, the negative sampling rate is set, a plurality of negative samples are randomly sampled in the global object set, and no intersection of the positive samples and the negative samples is ensured.

The model training module is mainly used for locally training the global model, a model framework is arranged in the model training module, the latest global model is loaded after receiving the global model parameters sent by the server, the global model is trained based on the training data set obtained by the data construction module, and after the gradient is obtained, the user ID embedded layer parameters in the global model are locally updated.

For the system embodiment, since the system embodiment basically corresponds to the method embodiment, the relevant parts only need to be referred to in the description of the method embodiment, and the implementation methods of the remaining modules are not repeated herein. The system embodiments described above are merely illustrative, wherein the elements illustrated as separate elements may or may not be physically separate, and the elements shown as elements may or may not be physical elements, may be located in one place, or may be distributed over a plurality of network elements. Some or all of the modules may be selected according to actual needs to achieve the purposes of the present invention. Those of ordinary skill in the art will understand and implement the present invention without undue burden.

Embodiments of the system of the present invention may be applied to any device having data processing capabilities, such as a computer or the like. The system embodiment may be implemented by software, or may be implemented by hardware or a combination of hardware and software. Taking software implementation as an example, the device in a logic sense is formed by reading corresponding computer program instructions in a nonvolatile memory into a memory by a processor of any device with data processing capability.

The foregoing list is only illustrative of specific embodiments of the invention. Obviously, the invention is not limited to the above embodiments, but many variations are possible. All modifications directly derived or suggested to one skilled in the art from the present disclosure should be considered as being within the scope of the present invention.

Claims

1. The end-to-end federal personalized recommendation method based on the user interest domain is characterized by comprising the following steps of:

2. The method for personalized end-to-end federation recommendation based on user interest domains according to claim 1, wherein in step 1, the server needs to preset a client sampling rate and training parameters, wherein the training parameters include a client local training round and a client local training batch size.

3. The end-to-end federal personalized recommendation method based on user interest domains according to claim 1, wherein the predictive network group is composed of a plurality of parallel multi-layer perceptrons.

4. The end-to-end federal personalized recommendation method based on user interest domains of claim 3, wherein said client classification network comprises a graph neural network, a multi-layer perceptron and a softmax layer; the input of the graphic neural network is a user-object interaction graphic, a user ID embedded representation and a history interaction object ID embedded representation; the user ID embedded representation and the historical interaction article ID embedded representation are respectively generated by a user ID embedded layer and an article ID embedded layer in the recommendation model; and the output of the graph neural network sequentially passes through a multi-layer perceptron and a softmax layer to obtain cluster probability distribution vectors of the client.

5. The end-to-end federation personalized recommendation method based on user interest domains according to claim 4, wherein the number of client class clusters, the number of parallel multi-layer perceptrons in the predicted network group and the class cluster probability distribution vector dimension output by the client classification network are the same.

6. The personalized end-to-end federation recommendation method based on user interest domain according to claim 5, wherein the cluster probability distribution vector output by the client classification network is used as the weight of the output result of the prediction network group, and the weighted average value of the output result of the prediction network group is used as the final output of the recommendation model.

7. The end-to-end federation personalized recommendation method based on user interest domain according to claim 1, wherein the input of the user ID embedding layer in the recommendation model is user ID, the input of the object ID embedding layer is history interactive object ID and target object ID, and the user ID embedding layer and the object ID embedding layer generate user ID embedding representation, history interactive object ID embedding representation and target object ID embedding representation; and splicing the user ID embedded representation, the average value of the historical interactive object ID embedded representation and the target object ID embedded representation to be used as the input of the prediction network group in the recommendation model.

8. The end-to-end federation personalized recommendation method based on user interest domain according to claim 1, wherein step 5 uses federal averaging algorithm to aggregate global model training parameters except for user ID embedded layer sent back by each client.

9. An end-to-end federal personalized recommendation system based on a user interest domain, comprising: the system comprises a server side and a plurality of clients, wherein the server side is used for constructing a global model based on the preset client class cluster number, initializing model parameters and aggregating the global model parameters except the user ID embedding layer after the training of each client side; the global model consists of a client classification network and a recommendation model, wherein the recommendation model comprises a prediction network group, a user ID embedding layer and an article ID embedding layer; the client is used for training the global model by using the local training data set and returning the global model training parameters except the user ID embedding layer to the server;

each client comprises:

10. The end-to-end federation personalized recommendation system based on user interest domains of claim 9, wherein the client classification network in the global model is configured to generate a class cluster probability distribution vector of the client, the class cluster probability distribution vector is used as a weight of the output result of the prediction network group, and a weighted average of the output results of the prediction network group is used as a final output of the recommendation model.