WO2021121106A1 - Federated learning-based personalized recommendation method, apparatus and device, and medium - Google Patents

Federated learning-based personalized recommendation method, apparatus and device, and medium Download PDF

Info

Publication number
WO2021121106A1
WO2021121106A1 PCT/CN2020/135030 CN2020135030W WO2021121106A1 WO 2021121106 A1 WO2021121106 A1 WO 2021121106A1 CN 2020135030 W CN2020135030 W CN 2020135030W WO 2021121106 A1 WO2021121106 A1 WO 2021121106A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
personalized recommendation
model
federated learning
gradient
Prior art date
Application number
PCT/CN2020/135030
Other languages
French (fr)
Chinese (zh)
Inventor
黄福华
刘畅
郑文琛
Original Assignee
深圳前海微众银行股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳前海微众银行股份有限公司 filed Critical 深圳前海微众银行股份有限公司
Publication of WO2021121106A1 publication Critical patent/WO2021121106A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9538Presentation of query results

Definitions

  • This application relates to the field of artificial intelligence technology of financial technology (Fintech), and in particular to a method, device, device, and medium for personalized recommendation based on federated learning.
  • personalized recommendation technology With the continuous development of computer software and artificial intelligence, personalized recommendation technology has become more and more widely used.
  • personalized recommendation providers usually obtain user attribute data, user behavior data, user behavior context data and other data obtained by themselves. , Predict the user’s personalized behavior or item, for example, predict the user’s favorite mobile phone and predict the user’s web page click-through rate, etc.
  • the feature richness of user data in this method often has a great impact on the prediction result. Due to the tightening of data privacy protection legislation, user data cannot be shared in clear text between different data owners. Therefore, the feature richness of user data of a single personalized recommendation provider is often low, which in turn leads to personalized behavior or The prediction accuracy of items is also low, which in turn leads to a poor personalized recommendation effect. Therefore, there is a technical problem in related technologies that the personalized recommendation effect is poor.
  • the main purpose of this application is to provide a method, device, device and medium for personalized recommendation based on federated learning, aiming to solve the technical problem of poor personalized recommendation effect in related technologies.
  • the embodiments of the present application provide a personalized recommendation method based on federated learning.
  • the personalized recommendation method based on federated learning is applied to a personalized recommendation device based on federated learning.
  • the personalized recommendation method based on federated learning Recommended methods include:
  • the output results of the model are screened to obtain personalized recommendation results.
  • the target recall set includes a list of items to be recommended
  • the upload data includes user data, item data, and behavior data
  • the step of obtaining the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted into a personalized recommendation model obtained based on the federated learning, and obtaining the model output result includes:
  • the data to be predicted is input into the personalized recommendation model to score and sort the recommended items in the list of recommended items to obtain model output results.
  • the step of inputting the to-be-predicted data into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list, and obtaining model output results includes:
  • the items to be recommended are sorted, and the model output result is obtained.
  • the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning
  • the step of obtaining the to-be-predicted data corresponding to the uploaded data and the target recall set, and inputting the to-be-predicted data into the personalized recommendation model obtained based on the federated learning, before the step of obtaining the model output result includes:
  • the personalized recommendation model is obtained by interacting with the second device for federated learning.
  • the personalized recommendation model includes a logistic regression model
  • the step of obtaining the personalized recommendation model by interacting with the second device for federated learning based on the public training sample ID includes:
  • Receive the second weight value sent by the second device calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value through a preset intermediate parameter formula, and send the gradient auxiliary variable to A second device, wherein the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
  • the first gradient is calculated, and the first gradient is sent to a preset federation server, where the preset federation server is used to send data based on the first gradient and the second device
  • the second gradient calculates the total gradient of the federated model
  • the total gradient of the federation model fed back by the federation server is received, and the local training model of the first device is iteratively updated based on the total gradient of the federation model to obtain the logistic regression model.
  • the step of receiving uploaded data and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database includes:
  • the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning
  • the step of receiving uploaded data and extracting the target recall set corresponding to the uploaded data from a preset recall set storage database includes:
  • the second device associated with the first device performs federated learning to obtain a federated recall algorithm model
  • the present application also provides a personalized recommendation device based on federated learning.
  • the personalized recommendation device based on federated learning is applied to a personalized recommendation device based on federated learning.
  • the personalized recommendation device based on federated learning includes:
  • the extraction module is configured to receive the uploaded data, and extract the target recall set corresponding to the uploaded data from a preset recall set storage database;
  • a prediction module configured to obtain the data to be predicted corresponding to the uploaded data and the target recall set, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
  • the screening module is used to screen the output results of the model to obtain personalized recommendation results.
  • the prediction module includes:
  • the screening unit is configured to screen the item data based on the list of items to be recommended to obtain the item data to be recommended, and set the user data, the item data to be recommended and the behavior data as all State the data to be predicted;
  • the prediction unit is used for inputting the data to be predicted into the personalized recommendation model, so as to score and sort the recommended items in the list of to-be-recommended items, and obtain model output results.
  • the prediction unit includes:
  • a scoring subunit for inputting the data to be predicted into the personalized recommendation model to score the item to be recommended based on the user data and the behavior data to obtain a scoring result
  • the sorting subunit is used for sorting the items to be recommended based on the scoring result to obtain a model output result.
  • the personalized recommendation device based on federated learning further includes:
  • a sample matching module configured to perform sample matching on the second device associated with the first device to obtain a public training sample ID
  • the first federated learning module is used to obtain the personalized recommendation model by interacting with the second device to perform federated learning based on the public training sample ID.
  • the federated learning module includes:
  • a first calculation unit configured to extract the first sample data corresponding to the public training sample ID based on the public training sample ID, and calculate the first weight corresponding to the first sample data
  • the second calculation unit is configured to receive the second weight value sent by the second device, and calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value by using a preset intermediate parameter formula, And sending the gradient auxiliary variable to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
  • the third calculation unit is configured to calculate a first gradient based on the gradient auxiliary variable, and send the first gradient to a preset federated server, where the preset federated server is used to calculate the first gradient based on the first gradient.
  • An iterative update unit configured to receive the total gradient of the federation model fed back by the federation server, and to iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model .
  • the extraction module includes:
  • An extraction unit configured to receive the uploaded data and extract the sample ID in the uploaded data
  • the query unit is configured to query the corresponding target recall set in the preset recall set storage database based on the sample ID.
  • the personalized recommendation device based on federated learning includes:
  • a second federated learning module configured to perform federated learning for the second device associated with the first device to obtain a federated recall algorithm model
  • the storage module is used to obtain the sample upload data and input the sample upload data into the federated recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage In the database.
  • the personalized recommendation device based on federated learning includes: a memory, a processor, and the The program of the personalized recommendation method based on federated learning, when the program of the personalized recommendation method based on federated learning is executed by the processor, can realize the steps of the above-mentioned personalized recommendation method based on federated learning.
  • This application also provides a medium, the medium is a readable storage medium, the medium stores a program for implementing a personalized recommendation method based on federated learning, and the program for the personalized recommendation method based on federated learning is processed by a processor During execution, the steps of the above-mentioned personalized recommendation method based on federated learning are realized.
  • This application receives uploaded data, extracts the target recall set corresponding to the uploaded data from a preset recall set storage database, and then obtains the to-be-predicted data corresponding to the uploaded data and the target recall set, and combines the
  • the data to be predicted is input based on the personalized recommendation model obtained by the federated learning, and the model output result is obtained, and then the model output result is filtered to obtain the personalized recommendation result. That is, this application first receives the upload data, and then extracts the target recall set corresponding to the uploaded data from the preset recall set storage database, and then performs the common correspondence between the upload data and the target recall set.
  • the personalized recommendation model is acquired based on federated learning, that is, through the federated learning, the training of the personalized recommendation model can be combined with multi-party data, thereby increasing the feature richness of the training samples of the personalized recommendation model, and The data privacy of each data provider and data user will not be disclosed, thereby improving the robustness and broadness of the personalized recommendation model, thereby improving the prediction accuracy of the personalized recommendation model, and avoiding the result of the personalized recommendation model
  • the prediction accuracy rate of which leads to poor personalized recommendation effect, and by inputting the data to be predicted into the personalized recommendation model obtained based on the federated learning, the joint multi-party data for prediction is achieved when only local data is used for prediction.
  • the prediction effect of the model reduces the amount of calculation for personalized recommendation, thereby improving the response speed during personalized recommendation, and enhancing the effect of personalized recommendation. Therefore, the technical problem of poor personalized recommendation effect is solved.
  • the embodiment of the present application provides a personalized recommendation method based on federated learning.
  • the personalized recommendation method based on federated learning is applied to a personalized recommendation device based on federated learning.
  • the personalized recommendation method based on federated learning includes:
  • Step S10 receiving uploaded data, and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database;
  • the upload data includes sample ID, user data, item data, and behavior data.
  • the upload data is uploaded by the client, and the user data includes user natural attribute data and user data. Interest attribute data, etc.
  • the item data includes data such as the item name and item attributes of the item to be personalized and recommended
  • the behavior data includes user behavior data on the item and context data when the behavior occurs, such as user behavior on the item
  • the data includes browsing, clicking, etc.
  • the context data when the behavior occurs includes geographic location, network type, etc.
  • the sample ID includes the user's name, user ID number, and user phone number.
  • the database is queried for the target recall set corresponding to the user data, where the target recall set includes one or more initial items to be recommended, where the initial items to be recommended refer to items that may be of interest to the user.
  • the step of receiving uploaded data and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database includes:
  • Step S11 receiving uploaded data, and extracting the sample ID in the uploaded data
  • the sample ID includes identity tags such as user IDs and character strings
  • the uploaded data includes one or more data samples
  • the data samples are represented by a preset data sample format.
  • id, label, user_a_feature_i, item_feature_i, action_i, other_a_i is a preset data sample format, where id is the sample ID and label is the sample label, where the sample label identifies the type of user, for example, The user is a good customer or a bad customer, user_a_feature_i is user data, item_feature_i is item data, action_i is behavior data, and other_a_i is other data.
  • Step S12 based on the sample ID, query a corresponding target recall set in the preset recall set storage database.
  • the sample ID includes identity tags such as user ID, character string, phone number, etc. That is, the sample ID can be represented by a character string, or can be directly represented by a phone number .
  • the corresponding target recall set is queried in the preset recall set storage database.
  • the sample ID is used as a keyword to search in the preset recall set storage database to obtain all The target recall set.
  • the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning
  • the step of receiving uploaded data and extracting the target recall set corresponding to the uploaded data from a preset recall set storage database includes:
  • Step A10 the second device associated with the first device performs federated learning to obtain a federated recall algorithm model
  • the federated learning includes vertical federated learning and horizontal federated learning, and the first device and the second device may be communicatively connected.
  • the second device associated with the first device performs federated learning to obtain a federated recall algorithm model, specifically, performs sample alignment with the second device to align the sample ID of the first device with the second device Align the sample ID of the sample to obtain the public training sample ID, train the first training model of the first device based on the uploaded data corresponding to the public training sample ID, obtain the first training result, and calculate the first training result
  • the error from the theoretical training result is further based on the error and the network weight of the first training model to obtain a partial derivative of the preset objective function to obtain the first gradient, wherein the preset objective function is related to the error And the network weight function, and then send the first gradient to a preset federation server, so that the preset federation server sends the first gradient and the second device to the first gradient and the second device based on preset federation rules.
  • the two gradients are federated to obtain the federated gradient, and then the federated gradient fed back by the preset federated server is received, and the first training model is iteratively updated based on the federated gradient to obtain the federated recall algorithm model.
  • the preset federal rules include weighted average, and the conditions for stopping the iterative update include reaching the maximum number of iterations, and the model converges to a preset error threshold.
  • Step A20 Obtain sample upload data, and input the sample upload data into the federal recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage database.
  • the target recall set includes one or more target recall sets, wherein each sample ID corresponds to a target recall set.
  • sample upload data and input the sample upload data into the federal recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage database, specifically, obtain Sample upload data, where the sample upload data includes upload data of each user stored in the first device, and each user corresponds to a sample ID, where the upload data includes user data, item data, and behavior data Etc., and then input the sample upload data into the federal recall algorithm model to predict the items that may be of interest to the user of the first device, obtain one or more items to be recommended, and assign them to the items to be recommended corresponding to the same sample ID.
  • the recommended item is divided into a target recall set, the target recall set is obtained, and the target recall set is stored in the preset recall set storage database, where the target recall set is in the preset recall set
  • the collection storage database is stored in the form of item lists, item collections, etc., and the query keywords corresponding to the target recall collection include sample IDs.
  • Step S20 Obtain the data to be predicted corresponding to the uploaded data and the target recall set, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
  • the participants include a first device and a second device, and the personalized learning obtained based on the federated learning
  • the recommendation model includes a logistic regression model that has been trained based on the federated learning, the output of the model includes items to be recommended and their ratings and rankings, and the data to be predicted includes user data, behavior data, and corresponding target recall sets. Item data, etc.
  • the data to be predicted corresponding to the target recall set is input based on the personalized recommendation model obtained by the federated learning to score and rank the recommended items corresponding to the target recall set, and obtain the corresponding items to be recommended
  • the sorted list of items and the scores corresponding to the items to be recommended, that is, the output results of the model are obtained.
  • the target recall set includes a list of items to be recommended
  • the upload data includes user data, item data, and behavior data
  • the step of obtaining the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted into a personalized recommendation model obtained based on the federated learning, and obtaining the model output result includes:
  • Step S21 Screen the item data based on the list of items to be recommended to obtain item data to be recommended, and set the user data, the item data to be recommended and the behavior data as the data to be predicted ;
  • the items in the list of items to be recommended are associated with the item data to be recommended.
  • Step S22 Input the to-be-predicted data into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list to obtain model output results.
  • the to-be-predicted data is input into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list to obtain the model output result
  • the The prediction data is input into the personalized recommendation model to score the items in the list of items to be recommended based on the user data, the behavior data, and the item data to be recommended to obtain a scoring result, which is then based on the score
  • the items in the list of items to be recommended are sorted, and the output result of the model is obtained.
  • step S22 the step of inputting the to-be-predicted data into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list, and obtaining model output results includes:
  • Step S221 Input the to-be-predicted data into the personalized recommendation model to score the to-be-recommended item based on the user data and the behavior data to obtain a scoring result;
  • the to-be-predicted data is input into the personalized recommendation model to score the to-be-recommended item based on the user data and the behavior data to obtain a scoring result
  • the The data to be predicted is input into the personalized recommendation model to score the items to be recommended based on the user data and the behavior data, to obtain a first scoring result and a second scoring result, based on the first scoring result
  • the scoring result is calculated by a preset calculation rule, where the preset calculation rule includes weighted average, summation, quadrature, etc., for example, assuming that the calculation rule is a summation, If the first scoring result is 1 point and the second scoring result is 2 points, then the scoring result is 3 points.
  • Step S222 Sort the items to be recommended based on the scoring result to obtain a model output result.
  • the items to be recommended are sorted to obtain a model output result, specifically, based on the scoring result, the scoring items to be recommended are performed in a preset sorting manner Sorting, wherein the preset sorting method includes sorting from small to large, sorting from large to small, etc., to obtain model output results.
  • Step S30 Screen the output results of the model to obtain personalized recommendation results.
  • the model output results are filtered to obtain personalized recommendation results.
  • the model output results are filtered based on preset business logic, and the items to be recommended in the model output results are extracted as The personalized recommendation result, for example, extracting a preset number of items with the highest score in the model output result as the personalized recommendation result.
  • upload data is received, and the target recall set corresponding to the uploaded data is extracted from a preset recall set storage database, and then the to-be-predicted data corresponding to the uploaded data and the target recall set are obtained, and all The data to be predicted is input based on the personalized recommendation model obtained by the federated learning, and the model output result is obtained, and then the model output result is filtered to obtain the personalized recommendation result. That is, this embodiment first receives the upload data, and then extracts the target recall set corresponding to the uploaded data from the preset recall set storage database, and then performs the common correspondence between the upload data and the target recall set.
  • the personalized recommendation model is obtained based on federated learning, that is, through the federated learning, the training of the personalized recommendation model can be combined with multi-party data, thereby increasing the feature richness of the training samples of the personalized recommendation model, The data privacy of each data provider and data user will not be leaked, thereby improving the robustness and broadness of the personalized recommendation model, thereby improving the prediction accuracy of the personalized recommendation model, and avoiding the result of personalized recommendation
  • the prediction accuracy of the model leads to poor personalized recommendation effects, and by inputting the data to be predicted into the personalized recommendation model obtained based on the federated learning, joint multi-party data processing is achieved when only local data is used for prediction.
  • the prediction effect of the predicted model reduces the amount of calculation when making personalized recommendation, thereby improving the response speed during personalized recommendation and enhancing the effect of personalized recommendation. Therefore, the technical problem of poor personalized recommendation effect is solved.
  • the personalized recommendation method based on federated learning is applied to the federated learning
  • the first device
  • the step of obtaining the to-be-predicted data corresponding to the uploaded data and the target recall set, and inputting the to-be-predicted data into the personalized recommendation model obtained based on the federated learning, before the step of obtaining the model output result includes:
  • Step B10 Perform sample matching with the second device associated with the first device to obtain a public training sample ID
  • the public training sample ID includes identity tags such as user IDs and identification strings.
  • the second device associated with the first device performs sample matching to obtain a common training sample ID, specifically, the first training sample ID in the first device is compared with the second training sample ID in the second device Perform intersection processing to obtain the public training sample ID.
  • Step B20 Based on the public training sample ID, the personalized recommendation model is obtained by interacting with the second device for federated learning.
  • the personalized recommendation model is obtained by interacting with the second device for federated learning.
  • the public The first sample data corresponding to the training sample ID is obtained, and the first gradient is obtained based on the first sample data, and the second device interacts with the second device to assist the second device in obtaining the second gradient, and to The first gradient is sent to the preset federation server, and further, the federated model total gradient fed back by the preset federated server is received, wherein the federated model total gradient is calculated by the preset federated server through preset federation rules.
  • the second gradient sent by the second device where the preset federation rules include weighted average, summation, etc.
  • the first training model in the first device is further determined based on the total gradient of the federation model. Iterative update is performed to obtain the personalized recommendation model.
  • the personalized recommendation model includes a logistic regression model
  • the step of obtaining the personalized recommendation model by interacting with the second device for federated learning based on the public training sample ID includes:
  • Step B21 based on the public training sample ID, extract the first sample data corresponding to the public training sample ID, and calculate the first weight corresponding to the first sample data;
  • the first sample data is upload data of the user client
  • the upload data includes one or more data samples
  • the first weight value is the same as the first The product of the first sample feature corresponding to this data and the first weight in the first training model, where the first weight is the network weight of the first training model, for example, suppose the first weight Is WB, and the first sample feature is XB, then the first weight value is WBXB.
  • Step B22 Receive the second weight value sent by the second device, calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value through a preset intermediate parameter formula, and assist the gradient The variable is sent to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
  • the second weight value sent by the second device is received, and the gradient auxiliary variable corresponding to the first weight value and the second weight value is calculated through a preset intermediate parameter formula, and the all The gradient auxiliary variable is sent to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable, specifically, receiving a second weight encrypted and sent by the second device, and encrypting The latter second weight is decrypted to obtain the second weight, where the second weight is the second sample feature corresponding to the second sample data and the second in the second training model in the second device.
  • the product of weights wherein the second weight is the network weight of the second training model, and further, the first weight and the second weight are combined to obtain the first device and the second The total weight corresponding to the two devices, and then substitute the total weight variable into the preset intermediate parameter formula to obtain the gradient auxiliary variable, and send the gradient auxiliary variable to the second device, and the The gradient auxiliary variable is associated with the sample label, wherein the preset intermediate parameter formula is as follows:
  • wTx is the total weight
  • y is the sample label
  • Step B23 Calculate a first gradient based on the gradient auxiliary variable, and send the first gradient to a preset federated server, where the preset federated server is configured to be based on the first gradient and the second
  • the second gradient sent by the device calculates the total gradient of the federated model
  • the first gradient is calculated based on the gradient auxiliary variable, and the first gradient is sent to a preset federated server, where the preset federated server is used to calculate the first gradient based on the first gradient and the
  • the second gradient sent by the second device calculates the total gradient of the federated model, specifically, the product of the gradient auxiliary variable and the first sample feature is calculated to obtain the first gradient, where the first gradient can be obtained by the following formula Calculated,
  • gB is the first sampling gradient
  • wTx is the total weight variable
  • y is the sample label
  • xB is the first sample feature of the first device.
  • the first gradient is sent to the preset federation server to pass the pre- It is assumed that the federation server calculates the total gradient of the federation model corresponding to the first gradient and the second gradient sent by the second device based on a preset federation rule, where the preset federation rule includes weighted average, summation, etc.,
  • the second gradient is the product of the gradient auxiliary variable and the second sample feature.
  • Step B24 Receive the total gradient of the federation model fed back by the federation server, and iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model.
  • the local training model is the first training model.
  • the federation server Receive the total gradient of the federation model fed back by the federation server, and iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model, specifically, receive the federation
  • the total gradient of the federated model fed back by the server, and based on the total gradient of the federated model, the local training model of the first device is trained and updated, and it is judged whether the updated local training device meets the preset training completion condition. If the local training device meets the preset training completion condition, the updated local training device is used as the logistic regression model.
  • the next federated learning The local training device performs training updates until the local training device reaches a preset training completion condition, where the training completion condition includes the model reaching the maximum number of iterations, the model error is less than the preset error threshold and converging.
  • a public training sample ID is obtained by performing sample matching with a second device associated with the first device, and then based on the public training sample ID, by interacting with the second device to perform federated learning, the Personalized recommendation model. That is, this embodiment first performs sample matching of the second device associated with the first device to obtain a public training sample ID, and then based on the public training sample ID, through interaction with the second device, In order to perform federated learning, the personalized recommendation model is obtained. That is, this embodiment provides a method for obtaining a personalized recommendation model through federated learning. That is, by interacting with the second device, the training sample data of the second device can be combined to perform federated learning.
  • FIG. 3 is a schematic diagram of the device structure of the hardware operating environment involved in the solution of the embodiment of the present application.
  • the personalized recommendation device based on federated learning may include a processor 1001, such as a CPU, a memory 1005, and a communication bus 1002.
  • the communication bus 1002 is used to implement connection and communication between the processor 1001 and the memory 1005.
  • the memory 1005 can be a high-speed RAM memory or a stable memory (non-volatile memory), such as disk storage.
  • the memory 1005 may also be a storage device independent of the aforementioned processor 1001.
  • the personalized recommendation device based on federated learning may also include a rectangular user interface, a network interface, a camera, and RF (Radio Frequency (radio frequency) circuits, sensors, audio circuits, WiFi modules, etc.
  • the rectangular user interface may include a display screen (Display) and an input sub-module such as a keyboard (Keyboard), and the optional rectangular user interface may also include a standard wired interface and a wireless interface.
  • the optional network interface can include standard wired interface and wireless interface (such as WI-FI interface).
  • the structure of the personalized recommendation device based on federated learning shown in FIG. 3 does not constitute a limitation on the personalized recommendation device based on federated learning, and may include more or fewer components than shown in the figure. Or some parts are combined, or different parts are arranged.
  • the memory 1005 as a computer storage medium may include an operating system, a network communication module, and a personalized recommendation program based on federated learning.
  • the operating system is a program that manages and controls the hardware and software resources of the personalized recommendation device based on federated learning, and supports the running of the personalized recommendation program based on federated learning and other software and/or programs.
  • the network communication module is used to realize the communication between the components in the memory 1005 and the communication with other hardware and software in the personalized recommendation system based on federated learning.
  • the processor 1001 is configured to execute the personalized recommendation program based on federated learning stored in the memory 1005 to realize the personalized recommendation based on federated learning described in any of the above items. Recommended method steps.
  • the specific implementation of the personalized recommendation device based on federated learning in this application is basically the same as the above-mentioned embodiments of the personalized recommendation method based on federated learning, and will not be repeated here.
  • An embodiment of the present application also provides a personalized recommendation device based on federated learning.
  • the personalized recommendation device based on federated learning includes:
  • the extraction module is configured to receive the uploaded data, and extract the target recall set corresponding to the uploaded data from a preset recall set storage database;
  • a prediction module configured to obtain the data to be predicted corresponding to the uploaded data and the target recall set, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
  • the screening module is used to screen the output results of the model to obtain personalized recommendation results.
  • the prediction module includes:
  • the screening unit is configured to screen the item data based on the list of items to be recommended to obtain the item data to be recommended, and set the user data, the item data to be recommended and the behavior data as all State the data to be predicted;
  • the prediction unit is used for inputting the data to be predicted into the personalized recommendation model, so as to score and sort the recommended items in the list of to-be-recommended items, and obtain model output results.
  • the prediction unit includes:
  • a scoring subunit for inputting the data to be predicted into the personalized recommendation model to score the item to be recommended based on the user data and the behavior data to obtain a scoring result
  • the sorting subunit is used for sorting the items to be recommended based on the scoring result to obtain a model output result.
  • the personalized recommendation device based on federated learning further includes:
  • a sample matching module configured to perform sample matching on the second device associated with the first device to obtain a public training sample ID
  • the first federated learning module is used to obtain the personalized recommendation model by interacting with the second device to perform federated learning based on the public training sample ID.
  • the federated learning module includes:
  • a first calculation unit configured to extract the first sample data corresponding to the public training sample ID based on the public training sample ID, and calculate the first weight corresponding to the first sample data
  • the second calculation unit is configured to receive the second weight value sent by the second device, and calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value by using a preset intermediate parameter formula, And sending the gradient auxiliary variable to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
  • the third calculation unit is configured to calculate a first gradient based on the gradient auxiliary variable, and send the first gradient to a preset federated server, where the preset federated server is used to calculate the first gradient based on the first gradient.
  • An iterative update unit configured to receive the total gradient of the federation model fed back by the federation server, and to iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model .
  • the extraction module includes:
  • An extraction unit configured to receive the uploaded data and extract the sample ID in the uploaded data
  • the query unit is configured to query the corresponding target recall set in the preset recall set storage database based on the sample ID.
  • the personalized recommendation device based on federated learning includes:
  • a second federated learning module configured to perform federated learning for the second device associated with the first device to obtain a federated recall algorithm model
  • the storage module is used to obtain the sample upload data and input the sample upload data into the federated recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage In the database.
  • the specific implementation of the personalized recommendation device based on federated learning in this application is basically the same as the above-mentioned embodiments of the personalized recommendation method based on federated learning, and will not be repeated here.
  • the embodiment of the present application provides a medium, the medium is a readable storage medium, and the medium stores one or more programs, and the one or more programs may also be executed by one or more processors to It is used to implement the steps of any one of the above-mentioned personalized recommendation methods based on federated learning.

Abstract

A federated learning-based personalized recommendation method, apparatus and device, and a medium. The federated learning-based personalized recommendation method comprises: receiving uploaded data, and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database (S10); acquiring predicting data corresponding to both the uploaded data and the target recall set, and inputting the predicting data into a personalized recommendation model acquired on the basis of the federated learning and acquiring a model output result (S20); and filtering the model output result to obtain a personalized recommendation result (S30).

Description

基于联邦学习的个性化推荐方法、装置、设备及介质Personalized recommendation method, device, equipment and medium based on federated learning
本申请要求:2019年12月20日申请的、申请号为201911326853.3、名称为“基于联邦学习的个性化推荐方法、装置、设备及介质”的中国专利申请的优先权,在此将其引入作为参考。This application requires: the priority of the Chinese patent application filed on December 20, 2019, with the application number 201911326853.3, and the name "Personalized recommendation method, device, equipment and medium based on federal learning", which is hereby introduced as reference.
技术领域Technical field
本申请涉及金融科技(Fintech)的人工智能技术领域,尤其涉及一种基于联邦学习的个性化推荐方法、装置、设备及介质。This application relates to the field of artificial intelligence technology of financial technology (Fintech), and in particular to a method, device, device, and medium for personalized recommendation based on federated learning.
背景技术Background technique
随着金融科技,尤其是互联网科技金融的不断发展,越来越多的技术(如分布式、区块链Blockchain、人工智能等)应用在金融领域,但金融业也对技术提出了更高的要求,如对金融业对应待办事项的分发也有更高的要求。With the continuous development of financial technology, especially Internet technology and finance, more and more technologies (such as distributed, blockchain, artificial intelligence, etc.) are applied in the financial field, but the financial industry has also proposed higher technology Requirements, such as the distribution of to-do items in the financial industry, also have higher requirements.
随着计算机软件和人工智能的不断发展,个性化推荐技术也应用的越来越广泛,目前,个性化推荐提供商通常是通过己方获取的用户属性数据、用户行为数据、用户行为上下文数据等数据,预测用户的个性化的行为或者物品,例如,预测用户喜爱的手机和预测用户的网页点击率等,但是,在该方法中用户数据的特征丰富度往往对预测结果有着很大影响,而随着数据隐私保护立法的趋严,导致用户数据无法在不同数据所有方进行明文共享,所以单个个性化推荐提供商的用户数据的特征丰富度往往较低,进而导致对用户的个性化的行为或者物品的预测精度也较低,进而导致个性化推荐效果差,所以,相关技术中存在个性化推荐效果差的技术问题。With the continuous development of computer software and artificial intelligence, personalized recommendation technology has become more and more widely used. At present, personalized recommendation providers usually obtain user attribute data, user behavior data, user behavior context data and other data obtained by themselves. , Predict the user’s personalized behavior or item, for example, predict the user’s favorite mobile phone and predict the user’s web page click-through rate, etc. However, the feature richness of user data in this method often has a great impact on the prediction result. Due to the tightening of data privacy protection legislation, user data cannot be shared in clear text between different data owners. Therefore, the feature richness of user data of a single personalized recommendation provider is often low, which in turn leads to personalized behavior or The prediction accuracy of items is also low, which in turn leads to a poor personalized recommendation effect. Therefore, there is a technical problem in related technologies that the personalized recommendation effect is poor.
上述内容仅用于辅助理解本申请的技术方案,并不代表承认上述内容是现有技术。The above content is only used to assist the understanding of the technical solution of the application, and does not mean that the above content is recognized as prior art.
技术解决方案Technical solutions
本申请的主要目的在于提供一种基于联邦学习的个性化推荐方法、装置、设备和介质,旨在解决相关技术中个性化推荐效果差的技术问题。The main purpose of this application is to provide a method, device, device and medium for personalized recommendation based on federated learning, aiming to solve the technical problem of poor personalized recommendation effect in related technologies.
为实现上述目的,本申请实施例提供一种基于联邦学习的个性化推荐方法,所述基于联邦学习的个性化推荐方法应用于基于联邦学习的个性化推荐设备,所述基于联邦学习的个性化推荐方法包括:In order to achieve the above objective, the embodiments of the present application provide a personalized recommendation method based on federated learning. The personalized recommendation method based on federated learning is applied to a personalized recommendation device based on federated learning. The personalized recommendation method based on federated learning Recommended methods include:
接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集;Receiving uploaded data, and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database;
获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果;Acquiring the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
对所述模型输出结果进行筛选,获得个性化推荐结果。The output results of the model are screened to obtain personalized recommendation results.
在一实施例中,所述目标召回集包括待推荐物品列表,所述上传数据包括用户数据、物品数据和行为数据,In an embodiment, the target recall set includes a list of items to be recommended, and the upload data includes user data, item data, and behavior data,
所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果的步骤包括:The step of obtaining the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted into a personalized recommendation model obtained based on the federated learning, and obtaining the model output result includes:
基于所述待推荐物品列表,对所述物品数据进行筛选,获得待推荐物品数据,并将所述用户数据、所述待推荐物品数据和所述行为数据设置为所述待预测数据;Filtering the item data based on the list of items to be recommended to obtain item data to be recommended, and setting the user data, the item data to be recommended and the behavior data as the data to be predicted;
将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果。The data to be predicted is input into the personalized recommendation model to score and sort the recommended items in the list of recommended items to obtain model output results.
在一实施例中,所述将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果的步骤包括:In an embodiment, the step of inputting the to-be-predicted data into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list, and obtaining model output results includes:
将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据和所述行为数据对所述待推荐物品进行评分,获得评分结果;Input the to-be-predicted data into the personalized recommendation model to score the to-be-recommended item based on the user data and the behavior data to obtain a scoring result;
基于所述评分结果,对所述待推荐物品进行排序,获得模型输出结果。Based on the scoring result, the items to be recommended are sorted, and the model output result is obtained.
在一实施例中,所述基于联邦学习的个性化推荐方法应用于进行所述联邦学习的第一设备,In an embodiment, the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning,
所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果的步骤之前包括:The step of obtaining the to-be-predicted data corresponding to the uploaded data and the target recall set, and inputting the to-be-predicted data into the personalized recommendation model obtained based on the federated learning, before the step of obtaining the model output result includes:
与所述第一设备关联的第二设备进行样本匹配,获得公共训练样本ID(Identity document,身份证标识号);Perform sample matching with the second device associated with the first device to obtain a public training sample ID (Identity document, identification number);
基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型。Based on the public training sample ID, the personalized recommendation model is obtained by interacting with the second device for federated learning.
在一实施例中,所述个性化推荐模型包括逻辑回归模型,In an embodiment, the personalized recommendation model includes a logistic regression model,
所述基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型的步骤包括:The step of obtaining the personalized recommendation model by interacting with the second device for federated learning based on the public training sample ID includes:
基于所述公共训练样本ID,提取所述公共训练样本ID对应的第一样本数据,并计算所述第一样本数据对应的第一权值;Based on the public training sample ID, extract the first sample data corresponding to the public training sample ID, and calculate the first weight corresponding to the first sample data;
接收所述第二设备发送的第二权值,并通过预设中间参数公式计算所述第一权值和所述第二权值共同对应的梯度辅助变量,并将所述梯度辅助变量发送至第二设备,其中,所述第二设备用于计算所述梯度辅助变量对应的第二梯度;Receive the second weight value sent by the second device, calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value through a preset intermediate parameter formula, and send the gradient auxiliary variable to A second device, wherein the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
基于所述梯度辅助变量,计算第一梯度,并将所述第一梯度发送至预设联邦服务器,其中,所述预设联邦服务器用于基于所述第一梯度和所述第二设备发送的第二梯度计算联邦模型总梯度;Based on the gradient auxiliary variable, the first gradient is calculated, and the first gradient is sent to a preset federation server, where the preset federation server is used to send data based on the first gradient and the second device The second gradient calculates the total gradient of the federated model;
接收所述联邦服务器反馈的所述联邦模型总梯度,并基于所述联邦模型总梯度对所述第一设备的本地训练模型进行迭代更新,获得所述逻辑回归模型。The total gradient of the federation model fed back by the federation server is received, and the local training model of the first device is iteratively updated based on the total gradient of the federation model to obtain the logistic regression model.
在一实施例中,所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集的步骤包括:In an embodiment, the step of receiving uploaded data and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database includes:
接收上传数据,并提取所述上传数据中的样本ID;Receiving uploaded data, and extracting the sample ID in the uploaded data;
基于所述样本ID,在所述预设召回集存储数据库中查询相对应的目标召回集。Based on the sample ID, query the corresponding target recall set in the preset recall set storage database.
在一实施例中,所述基于联邦学习的个性化推荐方法应用于进行所述联邦学习的第一设备,In an embodiment, the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning,
所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集的步骤之前包括:The step of receiving uploaded data and extracting the target recall set corresponding to the uploaded data from a preset recall set storage database includes:
与所述第一设备关联的第二设备进行联邦学习,获得联邦召回算法模型;The second device associated with the first device performs federated learning to obtain a federated recall algorithm model;
获取样本上传数据,并将所述样本上传数据输入所述联邦召回算法模型,获得目标召回集集合,并将所述目标召回集集合存储于所述预设召回集存储数据库中。Obtain sample upload data, and input the sample upload data into the federal recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage database.
本申请还提供一种基于联邦学习的个性化推荐装置,所述基于联邦学习的个性化推荐装置应用于基于联邦学习的个性化推荐设备,所述基于联邦学习的个性化推荐装置包括:The present application also provides a personalized recommendation device based on federated learning. The personalized recommendation device based on federated learning is applied to a personalized recommendation device based on federated learning. The personalized recommendation device based on federated learning includes:
提取模块,用于所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集;The extraction module is configured to receive the uploaded data, and extract the target recall set corresponding to the uploaded data from a preset recall set storage database;
预测模块,用于所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果;A prediction module, configured to obtain the data to be predicted corresponding to the uploaded data and the target recall set, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
筛选模块,用于所述对所述模型输出结果进行筛选,获得个性化推荐结果。The screening module is used to screen the output results of the model to obtain personalized recommendation results.
在一实施例中,所述预测模块包括:In an embodiment, the prediction module includes:
筛选单元,用于所述基于所述待推荐物品列表,对所述物品数据进行筛选,获得待推荐物品数据,并将所述用户数据、所述待推荐物品数据和所述行为数据设置为所述待预测数据;The screening unit is configured to screen the item data based on the list of items to be recommended to obtain the item data to be recommended, and set the user data, the item data to be recommended and the behavior data as all State the data to be predicted;
预测单元,用于所述将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果。The prediction unit is used for inputting the data to be predicted into the personalized recommendation model, so as to score and sort the recommended items in the list of to-be-recommended items, and obtain model output results.
在一实施例中,所述预测单元包括:In an embodiment, the prediction unit includes:
评分子单元,用于所述将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据和所述行为数据对所述待推荐物品进行评分,获得评分结果;A scoring subunit for inputting the data to be predicted into the personalized recommendation model to score the item to be recommended based on the user data and the behavior data to obtain a scoring result;
排序子单元,用于所述基于所述评分结果,对所述待推荐物品进行排序,获得模型输出结果。The sorting subunit is used for sorting the items to be recommended based on the scoring result to obtain a model output result.
在一实施例中,所述基于联邦学习的个性化推荐装置还包括:In an embodiment, the personalized recommendation device based on federated learning further includes:
样本匹配模块,用于所述与所述第一设备关联的第二设备进行样本匹配,获得公共训练样本ID;A sample matching module, configured to perform sample matching on the second device associated with the first device to obtain a public training sample ID;
第一联邦学习模块,用于所述基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型。The first federated learning module is used to obtain the personalized recommendation model by interacting with the second device to perform federated learning based on the public training sample ID.
在一实施例中,所述联邦学习模块包括:In an embodiment, the federated learning module includes:
第一计算单元,用于所述基于所述公共训练样本ID,提取所述公共训练样本ID对应的第一样本数据,并计算所述第一样本数据对应的第一权值;A first calculation unit, configured to extract the first sample data corresponding to the public training sample ID based on the public training sample ID, and calculate the first weight corresponding to the first sample data;
第二计算单元,用于所述接收所述第二设备发送的第二权值,并通过预设中间参数公式计算所述第一权值和所述第二权值共同对应的梯度辅助变量,并将所述梯度辅助变量发送至第二设备,其中,所述第二设备用于计算所述梯度辅助变量对应的第二梯度;The second calculation unit is configured to receive the second weight value sent by the second device, and calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value by using a preset intermediate parameter formula, And sending the gradient auxiliary variable to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
第三计算单元,用于所述基于所述梯度辅助变量,计算第一梯度,并将所述第一梯度发送至预设联邦服务器,其中,所述预设联邦服务器用于基于所述第一梯度和所述第二设备发送的第二梯度计算联邦模型总梯度;The third calculation unit is configured to calculate a first gradient based on the gradient auxiliary variable, and send the first gradient to a preset federated server, where the preset federated server is used to calculate the first gradient based on the first gradient. The gradient and the second gradient sent by the second device to calculate the total gradient of the federated model;
迭代更新单元,用于所述接收所述联邦服务器反馈的所述联邦模型总梯度,并基于所述联邦模型总梯度对所述第一设备的本地训练模型进行迭代更新,获得所述逻辑回归模型。An iterative update unit, configured to receive the total gradient of the federation model fed back by the federation server, and to iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model .
在一实施例中,所述提取模块包括:In an embodiment, the extraction module includes:
提取单元,用于所述接收上传数据,并提取所述上传数据中的样本ID;An extraction unit, configured to receive the uploaded data and extract the sample ID in the uploaded data;
查询单元,用于所述基于所述样本ID,在所述预设召回集存储数据库中查询相对应的目标召回集。The query unit is configured to query the corresponding target recall set in the preset recall set storage database based on the sample ID.
在一实施例中,所述基于联邦学习的个性化推荐装置包括:In an embodiment, the personalized recommendation device based on federated learning includes:
第二联邦学习模块,用于所述与所述第一设备关联的第二设备进行联邦学习,获得联邦召回算法模型;A second federated learning module, configured to perform federated learning for the second device associated with the first device to obtain a federated recall algorithm model;
存储模块,用于所述获取样本上传数据,并将所述样本上传数据输入所述联邦召回算法模型,获得目标召回集集合,并将所述目标召回集集合存储于所述预设召回集存储数据库中。The storage module is used to obtain the sample upload data and input the sample upload data into the federated recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage In the database.
本申请还提供一种基于联邦学习的个性化推荐设备,所述基于联邦学习的个性化推荐设备包括:存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的所述基于联邦学习的个性化推荐方法的程序,所述基于联邦学习的个性化推荐方法的程序被处理器执行时可实现如上述的基于联邦学习的个性化推荐方法的步骤。This application also provides a personalized recommendation device based on federated learning. The personalized recommendation device based on federated learning includes: a memory, a processor, and the The program of the personalized recommendation method based on federated learning, when the program of the personalized recommendation method based on federated learning is executed by the processor, can realize the steps of the above-mentioned personalized recommendation method based on federated learning.
本申请还提供一种介质,所述介质为可读存储介质,所述介质上存储有实现基于联邦学习的个性化推荐方法的程序,所述基于联邦学习的个性化推荐方法的程序被处理器执行时实现如上述的基于联邦学习的个性化推荐方法的步骤。This application also provides a medium, the medium is a readable storage medium, the medium stores a program for implementing a personalized recommendation method based on federated learning, and the program for the personalized recommendation method based on federated learning is processed by a processor During execution, the steps of the above-mentioned personalized recommendation method based on federated learning are realized.
本申请通过接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集,进而获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果,进而对所述模型输出结果进行筛选,获得个性化推荐结果。也即,本申请首先进行上传数据的接收,进而进行从预设召回集存储数据库中对所述上传数据对应的目标召回集的提取,进而进行所述上传数据和所述目标召回集共同对应的待预测数据的获取,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果,进而进行对所述模型输出结果的筛选,获得个性化推荐结果。也即,本申请在基于上传数据和目标召回集获取待预测数据之后,通过将待预测数据输入基于所述联邦学习获取的个性化推荐模型,进而预测用户的个性化推荐结果,其中,所述个性化推荐模型是基于联邦学习而获取的,也即,通过所述联邦学习可联合多方数据进行个性化推荐模型的训练,进而提高了所述个性化推荐模型的训练样本的特征丰富度,且不会泄露各数据提供方和数据使用方的数据隐私,进而提高了个性化推荐模型的健壮性和宽泛性,进而提高了所述个性化推荐模型的预测准确率,避免了由于个性化推荐模型的预测准确率而导致个性化推荐效果差的情况发生,且通过将待预测数据输入基于所述联邦学习获取的个性化推荐模型,在仅仅使用本地数据进行预测的情况下达到了联合多方数据进行预测的模型预测效果,减少了进行个性化推荐时的计算量,进而提高了个性化推荐时的响应速度,增强了个性化推荐的效果,所以,解决了个性化推荐效果差的技术问题。This application receives uploaded data, extracts the target recall set corresponding to the uploaded data from a preset recall set storage database, and then obtains the to-be-predicted data corresponding to the uploaded data and the target recall set, and combines the The data to be predicted is input based on the personalized recommendation model obtained by the federated learning, and the model output result is obtained, and then the model output result is filtered to obtain the personalized recommendation result. That is, this application first receives the upload data, and then extracts the target recall set corresponding to the uploaded data from the preset recall set storage database, and then performs the common correspondence between the upload data and the target recall set. Obtain the data to be predicted, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain model output results, and then screen the model output results to obtain personalized recommendation results. That is, after the application obtains the data to be predicted based on the uploaded data and the target recall set, the data to be predicted is input into the personalized recommendation model obtained based on the federated learning to further predict the personalized recommendation result of the user. The personalized recommendation model is acquired based on federated learning, that is, through the federated learning, the training of the personalized recommendation model can be combined with multi-party data, thereby increasing the feature richness of the training samples of the personalized recommendation model, and The data privacy of each data provider and data user will not be disclosed, thereby improving the robustness and broadness of the personalized recommendation model, thereby improving the prediction accuracy of the personalized recommendation model, and avoiding the result of the personalized recommendation model The prediction accuracy rate of, which leads to poor personalized recommendation effect, and by inputting the data to be predicted into the personalized recommendation model obtained based on the federated learning, the joint multi-party data for prediction is achieved when only local data is used for prediction. The prediction effect of the model reduces the amount of calculation for personalized recommendation, thereby improving the response speed during personalized recommendation, and enhancing the effect of personalized recommendation. Therefore, the technical problem of poor personalized recommendation effect is solved.
本发明的实施方式Embodiments of the present invention
应当理解,此处所描述的具体实施例仅仅用以解释本申请,并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not used to limit the present application.
本申请实施例提供一种基于联邦学习的个性化推荐方法,所述基于联邦学习的个性化推荐方法应用于基于联邦学习的个性化推荐设备,在本申请基于联邦学习的个性化推荐方法的第一实施例中,参照图1,所述基于联邦学习的个性化推荐方法包括:The embodiment of the present application provides a personalized recommendation method based on federated learning. The personalized recommendation method based on federated learning is applied to a personalized recommendation device based on federated learning. In an embodiment, referring to FIG. 1, the personalized recommendation method based on federated learning includes:
步骤S10,接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集;Step S10, receiving uploaded data, and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database;
在本实施例中,需要说明的是,所述上传数据包括样本ID、用户数据、物品数据和行为数据,其中,所述上传数据由客户端上传,所述用户数据包括用户自然属性数据和用户兴趣属性数据等,所述物品数据包括待进行个性化推荐物品的物品名称、物品属性等数据,所述行为数据包括用户对物品的行为数据和发生行为时的上下文数据,例如用户对物品的行为数据包括浏览、点击等,发生行为时的上下文数据包括地理位置、网络类型等,所述样本ID包括用户姓名、用户身份证号和用户电话号码等。In this embodiment, it should be noted that the upload data includes sample ID, user data, item data, and behavior data. The upload data is uploaded by the client, and the user data includes user natural attribute data and user data. Interest attribute data, etc. The item data includes data such as the item name and item attributes of the item to be personalized and recommended, and the behavior data includes user behavior data on the item and context data when the behavior occurs, such as user behavior on the item The data includes browsing, clicking, etc., the context data when the behavior occurs includes geographic location, network type, etc., and the sample ID includes the user's name, user ID number, and user phone number.
接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集,具体地,接收上传数据,并基于所述上传数据中的用户数据,在所述预设召回集存储数据库中查询所述用户数据对应的目标召回集,其中,所述目标召回集包括一个或者多个初始待推荐物品,其中,所述初始待推荐物品指的是用户可能感兴趣的物品。Receive upload data, and extract the target recall set corresponding to the upload data from the preset recall set storage database, specifically, receive the upload data, and store it in the preset recall set based on the user data in the upload data The database is queried for the target recall set corresponding to the user data, where the target recall set includes one or more initial items to be recommended, where the initial items to be recommended refer to items that may be of interest to the user.
其中,所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集的步骤包括:Wherein, the step of receiving uploaded data and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database includes:
步骤S11,接收上传数据,并提取所述上传数据中的样本ID;Step S11, receiving uploaded data, and extracting the sample ID in the uploaded data;
在本实施例中,需要说明的是,所述样本ID包括用户ID、字符串等身份标签,所述上传数据包括一条或者多条数据样本,且所述数据样本由预设数据样本格式表示,例如,“id,label,user_a_feature_i,item_feature_i,action_i,other_a_i”为一种预设数据样本格式,其中,id为样本ID,label为样本标签,其中,所述样本标签标识了用户的类型,例如,用户为好客户或者坏客户等,user_a_feature_i为用户数据,item_feature_i为物品数据,action_i为行为数据,other_a_i为其他数据。In this embodiment, it should be noted that the sample ID includes identity tags such as user IDs and character strings, the uploaded data includes one or more data samples, and the data samples are represented by a preset data sample format. For example, "id, label, user_a_feature_i, item_feature_i, action_i, other_a_i" is a preset data sample format, where id is the sample ID and label is the sample label, where the sample label identifies the type of user, for example, The user is a good customer or a bad customer, user_a_feature_i is user data, item_feature_i is item data, action_i is behavior data, and other_a_i is other data.
步骤S12,基于所述样本ID,在所述预设召回集存储数据库中查询相对应的目标召回集。Step S12, based on the sample ID, query a corresponding target recall set in the preset recall set storage database.
在本实施例中,需要说明的是,所述样本ID包括用户ID、字符串、电话号码等身份标签,也即,所述样本ID可使用字符串进行表示,也可直接使用电话号码进行表示。In this embodiment, it should be noted that the sample ID includes identity tags such as user ID, character string, phone number, etc. That is, the sample ID can be represented by a character string, or can be directly represented by a phone number .
基于所述样本ID,在所述预设召回集存储数据库中查询相对应的目标召回集,具体地,将所述样本ID作为关键词,在所述预设召回集存储数据库进行检索,获得所述目标召回集。Based on the sample ID, the corresponding target recall set is queried in the preset recall set storage database. Specifically, the sample ID is used as a keyword to search in the preset recall set storage database to obtain all The target recall set.
其中,所述基于联邦学习的个性化推荐方法应用于进行所述联邦学习的第一设备,Wherein, the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning,
所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集的步骤之前包括:The step of receiving uploaded data and extracting the target recall set corresponding to the uploaded data from a preset recall set storage database includes:
步骤A10,与所述第一设备关联的第二设备进行联邦学习,获得联邦召回算法模型;Step A10, the second device associated with the first device performs federated learning to obtain a federated recall algorithm model;
在本实施例中,需要说明的是,所述联邦学习包括纵向联邦学习和横向联邦学习,所述第一设备和所述第二设备可进行通信连接。In this embodiment, it should be noted that the federated learning includes vertical federated learning and horizontal federated learning, and the first device and the second device may be communicatively connected.
与所述第一设备关联的第二设备进行联邦学习,获得联邦召回算法模型,具体地,与所述第二设备进行样本对齐,以将所述第一设备的样本ID和所述第二设备的样本ID进行样本对齐,获得公共训练样本ID,基于所述公共训练样本ID对应的上传数据对第一设备的第一训练模型进行训练,获得第一训练结果,并计算所述第一训练结果与理论训练结果的误差,进而基于所述误差和所述第一训练模型的网络权重对预设目标函数进行求偏导,获得第一梯度,其中,所述预设目标函数是关于所述误差和所述网络权重的函数,进而将所述第一梯度发送至预设联邦服务器,以通过所述预设联邦服务器基于预设联邦规则对所述第一梯度和所述第二设备发送的第二梯度进行联邦,获得联邦梯度,进而接收所述预设联邦服务器反馈的联邦梯度,并基于所述联邦梯度对所述第一训练模型进行迭代更新,获得所述联邦召回算法模型,其中,所述预设联邦规则包括加权平均等,停止所述迭代更新的条件包括达到最大迭代次数、模型收敛于预设误差阀值等。The second device associated with the first device performs federated learning to obtain a federated recall algorithm model, specifically, performs sample alignment with the second device to align the sample ID of the first device with the second device Align the sample ID of the sample to obtain the public training sample ID, train the first training model of the first device based on the uploaded data corresponding to the public training sample ID, obtain the first training result, and calculate the first training result The error from the theoretical training result is further based on the error and the network weight of the first training model to obtain a partial derivative of the preset objective function to obtain the first gradient, wherein the preset objective function is related to the error And the network weight function, and then send the first gradient to a preset federation server, so that the preset federation server sends the first gradient and the second device to the first gradient and the second device based on preset federation rules. The two gradients are federated to obtain the federated gradient, and then the federated gradient fed back by the preset federated server is received, and the first training model is iteratively updated based on the federated gradient to obtain the federated recall algorithm model. The preset federal rules include weighted average, and the conditions for stopping the iterative update include reaching the maximum number of iterations, and the model converges to a preset error threshold.
步骤A20,获取样本上传数据,并将所述样本上传数据输入所述联邦召回算法模型,获得目标召回集集合,并将所述目标召回集集合存储于所述预设召回集存储数据库中。Step A20: Obtain sample upload data, and input the sample upload data into the federal recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage database.
在本实施例中,需要说明的是,所述目标召回集集合包括一个或者多个目标召回集,其中,每一样本ID对应一目标召回集。In this embodiment, it should be noted that the target recall set includes one or more target recall sets, wherein each sample ID corresponds to a target recall set.
获取样本上传数据,并将所述样本上传数据输入所述联邦召回算法模型,获得目标召回集集合,并将所述目标召回集集合存储于所述预设召回集存储数据库中,具体地,获取样本上传数据,其中,所述样本上传数据包括存储于所述第一设备中的各用户的上传数据,每一用户对应一样本ID,其中,所述上传数据包括用户数据、物品数据和行为数据等,进而将所述样本上传数据输入所述联邦召回算法模型,以预测所述第一设备的用户可能感兴趣的物品,获得一个或者多个待推荐物品,并将属于同一样本ID对应的待推荐物品划分为一个目标召回集,获得所述目标召回集集合,并将所述目标召回集集合存储于所述预设召回集存储数据库中,其中,所述目标召回集在所述预设召回集存储数据库中以物品列表、物品集合等形式进行存储,所述目标召回集对应的查询关键词包括样本ID。Obtain sample upload data, and input the sample upload data into the federal recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage database, specifically, obtain Sample upload data, where the sample upload data includes upload data of each user stored in the first device, and each user corresponds to a sample ID, where the upload data includes user data, item data, and behavior data Etc., and then input the sample upload data into the federal recall algorithm model to predict the items that may be of interest to the user of the first device, obtain one or more items to be recommended, and assign them to the items to be recommended corresponding to the same sample ID. The recommended item is divided into a target recall set, the target recall set is obtained, and the target recall set is stored in the preset recall set storage database, where the target recall set is in the preset recall set The collection storage database is stored in the form of item lists, item collections, etc., and the query keywords corresponding to the target recall collection include sample IDs.
步骤S20,获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果;Step S20: Obtain the data to be predicted corresponding to the uploaded data and the target recall set, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
在本实施例中,需要说明的是,所述联邦学习的参与方为两个或者两个以上,所述参与方包括第一设备和第二设备,所述基于所述联邦学习获取的个性化推荐模型包括基于所述联邦学习已经训练好的逻辑回归模型,所述模型输出结果包括待推荐物品及其评分和排序,所述待预测数据包括用户数据、行为数据和所述目标召回集对应的物品数据等。In this embodiment, it should be noted that there are two or more participants in the federated learning, and the participants include a first device and a second device, and the personalized learning obtained based on the federated learning The recommendation model includes a logistic regression model that has been trained based on the federated learning, the output of the model includes items to be recommended and their ratings and rankings, and the data to be predicted includes user data, behavior data, and corresponding target recall sets. Item data, etc.
获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果,具体地,将所述上传数据和所述目标召回集共同对应的待预测数据输入基于所述联邦学习获取的个性化推荐模型,以对所述目标召回集对应的待推荐物品进行评分和排序,获得所述待推荐物品对应的物品排序列表及待推荐物品对应的评分,也即,获得模型输出结果。Obtain the data to be predicted corresponding to the upload data and the target recall set, and input the data to be predicted into the personalized recommendation model obtained based on the federated learning to obtain the model output result, specifically, upload the The data to be predicted corresponding to the target recall set is input based on the personalized recommendation model obtained by the federated learning to score and rank the recommended items corresponding to the target recall set, and obtain the corresponding items to be recommended The sorted list of items and the scores corresponding to the items to be recommended, that is, the output results of the model are obtained.
其中,所述目标召回集包括待推荐物品列表,所述上传数据包括用户数据、物品数据和行为数据,Wherein, the target recall set includes a list of items to be recommended, and the upload data includes user data, item data, and behavior data,
所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果的步骤包括:The step of obtaining the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted into a personalized recommendation model obtained based on the federated learning, and obtaining the model output result includes:
步骤S21,基于所述待推荐物品列表,对所述物品数据进行筛选,获得待推荐物品数据,并将所述用户数据、所述待推荐物品数据和所述行为数据设置为所述待预测数据;Step S21: Screen the item data based on the list of items to be recommended to obtain item data to be recommended, and set the user data, the item data to be recommended and the behavior data as the data to be predicted ;
在本实施例中,需要说明的是,所述待推荐物品列表中的物品与所述待推荐物品数据相关联。In this embodiment, it should be noted that the items in the list of items to be recommended are associated with the item data to be recommended.
步骤S22,将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果。Step S22: Input the to-be-predicted data into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list to obtain model output results.
在本实施例中,将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果,具体地,将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据、所述行为数据和所述待推荐物品数据对所述待推荐物品列表中的物品进行评分,获得评分结果,进而基于所述评分结果,对所述待推荐物品列表中的物品进行排序,获得模型输出结果。In this embodiment, the to-be-predicted data is input into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list to obtain the model output result, specifically, the The prediction data is input into the personalized recommendation model to score the items in the list of items to be recommended based on the user data, the behavior data, and the item data to be recommended to obtain a scoring result, which is then based on the score As a result, the items in the list of items to be recommended are sorted, and the output result of the model is obtained.
其中,在步骤S22中,所述将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果的步骤包括:Wherein, in step S22, the step of inputting the to-be-predicted data into the personalized recommendation model to score and sort the to-be-recommended items in the to-be-recommended item list, and obtaining model output results includes:
步骤S221,将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据和所述行为数据对所述待推荐物品进行评分,获得评分结果;Step S221: Input the to-be-predicted data into the personalized recommendation model to score the to-be-recommended item based on the user data and the behavior data to obtain a scoring result;
在本实施例中,将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据和所述行为数据对所述待推荐物品进行评分,获得评分结果,具体地,将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据和所述行为数据分别对所述待推荐物品进行评分,获得第一评分结果和第二评分结果,基于所述第一评分结果和所述第二评分结果,通过预设计算规则计算所述评分结果,其中,所述预设计算规则包括加权平均、求和、求积等,例如,假设所述计算规则为求和,所述第一评分结果为1分,所述第二评分结果为2分,则所述评分结果为3分。In this embodiment, the to-be-predicted data is input into the personalized recommendation model to score the to-be-recommended item based on the user data and the behavior data to obtain a scoring result, specifically, the The data to be predicted is input into the personalized recommendation model to score the items to be recommended based on the user data and the behavior data, to obtain a first scoring result and a second scoring result, based on the first scoring result And the second scoring result, the scoring result is calculated by a preset calculation rule, where the preset calculation rule includes weighted average, summation, quadrature, etc., for example, assuming that the calculation rule is a summation, If the first scoring result is 1 point and the second scoring result is 2 points, then the scoring result is 3 points.
步骤S222,基于所述评分结果,对所述待推荐物品进行排序,获得模型输出结果。Step S222: Sort the items to be recommended based on the scoring result to obtain a model output result.
在本实施例中,基于所述评分结果,对所述待推荐物品进行排序,获得模型输出结果,具体地,基于所述评分结果,以预设排序方式对评分后的所述待推荐物品进行排序,其中,所述预设排序方式包括从小到大排序和从大到小排序等,进而获得模型输出结果。In this embodiment, based on the scoring result, the items to be recommended are sorted to obtain a model output result, specifically, based on the scoring result, the scoring items to be recommended are performed in a preset sorting manner Sorting, wherein the preset sorting method includes sorting from small to large, sorting from large to small, etc., to obtain model output results.
步骤S30,对所述模型输出结果进行筛选,获得个性化推荐结果。Step S30: Screen the output results of the model to obtain personalized recommendation results.
在本实施例中,对所述模型输出结果进行筛选,获得个性化推荐结果,具体地,基于预设业务逻辑对所述模型输出结果进行筛选,提取所述模型输出结果中的待推荐物品作为所述个性化推荐结果,例如,提取模型输出结果中评分最高的预设数量的物品作为所述个性化推荐结果。In this embodiment, the model output results are filtered to obtain personalized recommendation results. Specifically, the model output results are filtered based on preset business logic, and the items to be recommended in the model output results are extracted as The personalized recommendation result, for example, extracting a preset number of items with the highest score in the model output result as the personalized recommendation result.
本实施例通过接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集,进而获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果,进而对所述模型输出结果进行筛选,获得个性化推荐结果。也即,本实施例首先进行上传数据的接收,进而进行从预设召回集存储数据库中对所述上传数据对应的目标召回集的提取,进而进行所述上传数据和所述目标召回集共同对应的待预测数据的获取,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果,进而进行对所述模型输出结果的筛选,获得个性化推荐结果。也即,本实施例在基于上传数据和目标召回集获取待预测数据之后,通过将待预测数据输入基于所述联邦学习获取的个性化推荐模型,进而预测用户的个性化推荐结果,其中,所述个性化推荐模型是基于联邦学习而获取的,也即,通过所述联邦学习可联合多方数据进行个性化推荐模型的训练,进而提高了所述个性化推荐模型的训练样本的特征丰富度,且不会泄露各数据提供方和数据使用方的数据隐私,进而提高了个性化推荐模型的健壮性和宽泛性,进而提高了所述个性化推荐模型的预测准确率,避免了由于个性化推荐模型的预测准确率而导致个性化推荐效果差的情况发生,且通过将待预测数据输入基于所述联邦学习获取的个性化推荐模型,在仅仅使用本地数据进行预测的情况下达到了联合多方数据进行预测的模型预测效果,减少了进行个性化推荐时的计算量,进而提高了个性化推荐时的响应速度,增强了个性化推荐的效果,所以,解决了个性化推荐效果差的技术问题。In this embodiment, upload data is received, and the target recall set corresponding to the uploaded data is extracted from a preset recall set storage database, and then the to-be-predicted data corresponding to the uploaded data and the target recall set are obtained, and all The data to be predicted is input based on the personalized recommendation model obtained by the federated learning, and the model output result is obtained, and then the model output result is filtered to obtain the personalized recommendation result. That is, this embodiment first receives the upload data, and then extracts the target recall set corresponding to the uploaded data from the preset recall set storage database, and then performs the common correspondence between the upload data and the target recall set. Obtain the data to be predicted, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain model output results, and then screen the model output results to obtain personalized recommendation results. That is, in this embodiment, after obtaining the data to be predicted based on the uploaded data and the target recall set, the data to be predicted is input into the personalized recommendation model obtained based on the federated learning to further predict the personalized recommendation result of the user. The personalized recommendation model is obtained based on federated learning, that is, through the federated learning, the training of the personalized recommendation model can be combined with multi-party data, thereby increasing the feature richness of the training samples of the personalized recommendation model, The data privacy of each data provider and data user will not be leaked, thereby improving the robustness and broadness of the personalized recommendation model, thereby improving the prediction accuracy of the personalized recommendation model, and avoiding the result of personalized recommendation The prediction accuracy of the model leads to poor personalized recommendation effects, and by inputting the data to be predicted into the personalized recommendation model obtained based on the federated learning, joint multi-party data processing is achieved when only local data is used for prediction. The prediction effect of the predicted model reduces the amount of calculation when making personalized recommendation, thereby improving the response speed during personalized recommendation and enhancing the effect of personalized recommendation. Therefore, the technical problem of poor personalized recommendation effect is solved.
进一步地,参照图2,基于本申请中第一实施例,在基于联邦学习的个性化推荐方法的另一实施例中,所述基于联邦学习的个性化推荐方法应用于进行所述联邦学习的第一设备,Further, referring to FIG. 2, based on the first embodiment of this application, in another embodiment of the personalized recommendation method based on federated learning, the personalized recommendation method based on federated learning is applied to the federated learning The first device,
所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果的步骤之前包括:The step of obtaining the to-be-predicted data corresponding to the uploaded data and the target recall set, and inputting the to-be-predicted data into the personalized recommendation model obtained based on the federated learning, before the step of obtaining the model output result includes:
步骤B10,与所述第一设备关联的第二设备进行样本匹配,获得公共训练样本ID;Step B10: Perform sample matching with the second device associated with the first device to obtain a public training sample ID;
在本实施例中,需要说明的是,所述公共训练样本ID包括用户ID、标识字符串等身份标签。In this embodiment, it should be noted that the public training sample ID includes identity tags such as user IDs and identification strings.
与所述第一设备关联的第二设备进行样本匹配,获得公共训练样本ID,具体地,将所述第一设备中的第一训练样本ID与所述第二设备中的第二训练样本ID进行交集处理,获得公共训练样本ID。The second device associated with the first device performs sample matching to obtain a common training sample ID, specifically, the first training sample ID in the first device is compared with the second training sample ID in the second device Perform intersection processing to obtain the public training sample ID.
步骤B20,基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型。Step B20: Based on the public training sample ID, the personalized recommendation model is obtained by interacting with the second device for federated learning.
在本实施例中,基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型,具体地,基于所述公共训练样本ID,提取所述公共训练样本ID对应的第一样本数据,并基于所述第一样本数据获取第一梯度,进而通过与所述第二设备进行交互,协助所述第二设备获取第二梯度,并将所述第一梯度发送至预设联邦服务器,进一步地,接收预设联邦服务器反馈的联邦模型总梯度,其中,所述联邦模型总梯度由预设联邦服务器通过预设联邦规则对所述第一梯度和所述第二设备发送的第二梯度进行联邦获得的,其中,所述预设联邦规则包括加权平均、求和等,进而基于所述联邦模型总梯度对第一设备中的第一训练模型进行迭代更新,获得所述个性化推荐模型。In this embodiment, based on the public training sample ID, the personalized recommendation model is obtained by interacting with the second device for federated learning. Specifically, based on the public training sample ID, the public The first sample data corresponding to the training sample ID is obtained, and the first gradient is obtained based on the first sample data, and the second device interacts with the second device to assist the second device in obtaining the second gradient, and to The first gradient is sent to the preset federation server, and further, the federated model total gradient fed back by the preset federated server is received, wherein the federated model total gradient is calculated by the preset federated server through preset federation rules. And the second gradient sent by the second device, where the preset federation rules include weighted average, summation, etc., and the first training model in the first device is further determined based on the total gradient of the federation model. Iterative update is performed to obtain the personalized recommendation model.
其中,所述个性化推荐模型包括逻辑回归模型,Wherein, the personalized recommendation model includes a logistic regression model,
所述基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型的步骤包括:The step of obtaining the personalized recommendation model by interacting with the second device for federated learning based on the public training sample ID includes:
步骤B21,基于所述公共训练样本ID,提取所述公共训练样本ID对应的第一样本数据,并计算所述第一样本数据对应的第一权值;Step B21, based on the public training sample ID, extract the first sample data corresponding to the public training sample ID, and calculate the first weight corresponding to the first sample data;
在本实施例中,需要说明的是是,所述第一样本数据为用户客户端的上传数据,所述上传数据包括一条或者多条数据样本,所述第一权值为所述第一样本数据对应的第一样本特征和所述第一训练模型中的第一权重之积,其中,所述第一权重为所述第一训练模型的网络权重,例如,假设所述第一权重为WB,第一样本特征为XB,则所述第一权值为WBXB。In this embodiment, it should be noted that the first sample data is upload data of the user client, the upload data includes one or more data samples, and the first weight value is the same as the first The product of the first sample feature corresponding to this data and the first weight in the first training model, where the first weight is the network weight of the first training model, for example, suppose the first weight Is WB, and the first sample feature is XB, then the first weight value is WBXB.
步骤B22,接收所述第二设备发送的第二权值,并通过预设中间参数公式计算所述第一权值和所述第二权值共同对应的梯度辅助变量,并将所述梯度辅助变量发送至第二设备,其中,所述第二设备用于计算所述梯度辅助变量对应的第二梯度;Step B22: Receive the second weight value sent by the second device, calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value through a preset intermediate parameter formula, and assist the gradient The variable is sent to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
在本实施例中,接收所述第二设备发送的第二权值,并通过预设中间参数公式计算所述第一权值和所述第二权值共同对应的梯度辅助变量,并将所述梯度辅助变量发送至第二设备,其中,所述第二设备用于计算所述梯度辅助变量对应的第二梯度,具体地,接收所述第二设备加密发送的第二权值,对加密后的第二权值进行解密,获得第二权值,其中,所述第二权值为所述第二样本数据对应的第二样本特征和第二设备中的第二训练模型中的第二权重之积,其中,所述第二权重为所述第二训练模型的网络权重,进一步地,将所述第一权值和所述第二权值进行并集处理,获得第一设备和第二设备共同对应的的总权值,进而将所述总权值变量代入所述预设中间参数公式,获得所述梯度辅助变量,并将所述梯度辅助变量发送至第二设备,且所述梯度辅助变量与所述样本标签相关联,其中,所述预设中间参数公式如下所示,In this embodiment, the second weight value sent by the second device is received, and the gradient auxiliary variable corresponding to the first weight value and the second weight value is calculated through a preset intermediate parameter formula, and the all The gradient auxiliary variable is sent to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable, specifically, receiving a second weight encrypted and sent by the second device, and encrypting The latter second weight is decrypted to obtain the second weight, where the second weight is the second sample feature corresponding to the second sample data and the second in the second training model in the second device. The product of weights, wherein the second weight is the network weight of the second training model, and further, the first weight and the second weight are combined to obtain the first device and the second The total weight corresponding to the two devices, and then substitute the total weight variable into the preset intermediate parameter formula to obtain the gradient auxiliary variable, and send the gradient auxiliary variable to the second device, and the The gradient auxiliary variable is associated with the sample label, wherein the preset intermediate parameter formula is as follows:
其中,为所述梯度辅助变量,wTx为所述总权值,y为所述样本标签,y可取值1或者-1,例如,当y=1时,可表示客户为好客户,当y=-1时,可表示客户为坏客户。Where is the gradient auxiliary variable, wTx is the total weight, y is the sample label, and y can take the value 1 or -1. For example, when y=1, it can indicate that the customer is a good customer, and when y =-1, it can indicate that the customer is a bad customer.
步骤B23,基于所述梯度辅助变量,计算第一梯度,并将所述第一梯度发送至预设联邦服务器,其中,所述预设联邦服务器用于基于所述第一梯度和所述第二设备发送的第二梯度计算联邦模型总梯度;Step B23: Calculate a first gradient based on the gradient auxiliary variable, and send the first gradient to a preset federated server, where the preset federated server is configured to be based on the first gradient and the second The second gradient sent by the device calculates the total gradient of the federated model;
在本实施例中,基于所述梯度辅助变量,计算第一梯度,并将所述第一梯度发送至预设联邦服务器,其中,所述预设联邦服务器用于基于所述第一梯度和所述第二设备发送的第二梯度计算联邦模型总梯度,具体地,计算所述梯度辅助变量和第一样本特征之积,获得所述第一梯度,其中,所述第一梯度可由如下公式计算获得,In this embodiment, the first gradient is calculated based on the gradient auxiliary variable, and the first gradient is sent to a preset federated server, where the preset federated server is used to calculate the first gradient based on the first gradient and the The second gradient sent by the second device calculates the total gradient of the federated model, specifically, the product of the gradient auxiliary variable and the first sample feature is calculated to obtain the first gradient, where the first gradient can be obtained by the following formula Calculated,
其中,gB为所述第一采样梯度,wTx为所述总权值变量,y为所述样本标签,y可取值1或者-1,例如,当y=1时,可表示客户为好客户,当y=-1时,可表示客户为坏客户,xB为所述第一设备的第一样本特征,进一步地,将所述第一梯度发送至预设联邦服务器,以通过所述预设联邦服务器基于预设联邦规则计算所述第一梯度和所述第二设备发送的第二梯度对应的联邦模型总梯度,其中,所述预设联邦规则包括加权平均、求和等,所述第二梯度为所述梯度辅助变量和第二样本特征之积。Wherein, gB is the first sampling gradient, wTx is the total weight variable, y is the sample label, y can take the value 1 or -1, for example, when y=1, it can indicate that the customer is a good customer , When y=-1, it can indicate that the client is a bad client, and xB is the first sample feature of the first device. Further, the first gradient is sent to the preset federation server to pass the pre- It is assumed that the federation server calculates the total gradient of the federation model corresponding to the first gradient and the second gradient sent by the second device based on a preset federation rule, where the preset federation rule includes weighted average, summation, etc., The second gradient is the product of the gradient auxiliary variable and the second sample feature.
步骤B24,接收所述联邦服务器反馈的所述联邦模型总梯度,并基于所述联邦模型总梯度对所述第一设备的本地训练模型进行迭代更新,获得所述逻辑回归模型。Step B24: Receive the total gradient of the federation model fed back by the federation server, and iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model.
在本实施例中,需要说明的是,所述本地训练模型即为所述第一训练模型。In this embodiment, it should be noted that the local training model is the first training model.
接收所述联邦服务器反馈的所述联邦模型总梯度,并基于所述联邦模型总梯度对所述第一设备的本地训练模型进行迭代更新,获得所述逻辑回归模型,具体地,接收所述联邦服务器反馈的所述联邦模型总梯度,并基于所述联邦模型总梯度对第一设备的本地训练模型进行训练更新,并判断更新后的本地训练设备是否达到预设训练完成条件,若更新后的本地训练设备达到预设训练完成条件,则将所述更新后的本地训练设备作为所述逻辑回归模型,若更新后的本地训练设备未达到预设训练完成条件,则进行下一次联邦学习以对所述本地训练设备进行训练更新,直至所述本地训练设备达到预设训练完成条件,其中,所述训练完成条件包括模型达到最大迭代次数、模型误差小于预设误差阀值而收敛等。Receive the total gradient of the federation model fed back by the federation server, and iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model, specifically, receive the federation The total gradient of the federated model fed back by the server, and based on the total gradient of the federated model, the local training model of the first device is trained and updated, and it is judged whether the updated local training device meets the preset training completion condition. If the local training device meets the preset training completion condition, the updated local training device is used as the logistic regression model. If the updated local training device does not meet the preset training completion condition, the next federated learning The local training device performs training updates until the local training device reaches a preset training completion condition, where the training completion condition includes the model reaching the maximum number of iterations, the model error is less than the preset error threshold and converging.
本实施例通过与所述第一设备关联的第二设备进行样本匹配,获得公共训练样本ID,进而基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型。也即,本实施例首先进行与所述第一设备关联的第二设备的样本匹配,获得公共训练样本ID,进而基于所述公共训练样本ID,并通过进行与所述第二设备的交互,以进行联邦学习,获得所述个性化推荐模型。也即,本实施例提供了一种通过联邦学习获取个性化推荐模型的方法,也即,通过与所述第二设备进行交互,可联合所述第二设备的训练样本数据,进行联邦学习以获取所述个性化推荐模型,进而提高了所述个性化推荐模型的训练样本的特征丰富度,进而通过特征丰富度更高的训练样本进行训练获得所述个性化推荐模型,提高了个性化推荐模型的健壮性和宽泛性,进而提高了所述个性化推荐模型的预测准确率,进而通过将待预测数据输入基于所述联邦学习获取的个性化推荐模型,在仅仅使用本地数据进行预测的情况下可达到联合多方数据进行预测的模型预测效果,减少了进行个性化推荐时的计算量,进而提高了个性化推荐时的响应速度,增强了个性化推荐的效果,所以本实施为解决个性化推荐效果差的技术问题奠定了基础。In this embodiment, a public training sample ID is obtained by performing sample matching with a second device associated with the first device, and then based on the public training sample ID, by interacting with the second device to perform federated learning, the Personalized recommendation model. That is, this embodiment first performs sample matching of the second device associated with the first device to obtain a public training sample ID, and then based on the public training sample ID, through interaction with the second device, In order to perform federated learning, the personalized recommendation model is obtained. That is, this embodiment provides a method for obtaining a personalized recommendation model through federated learning. That is, by interacting with the second device, the training sample data of the second device can be combined to perform federated learning. Obtain the personalized recommendation model, thereby improving the feature richness of the training samples of the personalized recommendation model, and then obtain the personalized recommendation model through training with training samples with higher feature richness, which improves the personalized recommendation The robustness and broadness of the model further improve the prediction accuracy of the personalized recommendation model. Then, by inputting the data to be predicted into the personalized recommendation model obtained based on the federated learning, in the case of using only local data for prediction It can achieve the model prediction effect of joint multi-party data prediction, which reduces the amount of calculation when making personalized recommendation, thereby improving the response speed of personalized recommendation, and enhancing the effect of personalized recommendation. Therefore, this implementation is to solve the personalized recommendation. The technical problem of poor recommendation effect laid the foundation.
参照图3,图3是本申请实施例方案涉及的硬件运行环境的设备结构示意图。Referring to FIG. 3, FIG. 3 is a schematic diagram of the device structure of the hardware operating environment involved in the solution of the embodiment of the present application.
如图3所示,该基于联邦学习的个性化推荐设备可以包括:处理器1001,例如CPU,存储器1005,通信总线1002。其中,通信总线1002用于实现处理器1001和存储器1005之间的连接通信。存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储设备。As shown in FIG. 3, the personalized recommendation device based on federated learning may include a processor 1001, such as a CPU, a memory 1005, and a communication bus 1002. Among them, the communication bus 1002 is used to implement connection and communication between the processor 1001 and the memory 1005. The memory 1005 can be a high-speed RAM memory or a stable memory (non-volatile memory), such as disk storage. Optionally, the memory 1005 may also be a storage device independent of the aforementioned processor 1001.
在一实施例中,该基于联邦学习的个性化推荐设备还可以包括矩形用户接口、网络接口、摄像头、RF(Radio Frequency,射频)电路,传感器、音频电路、WiFi模块等等。矩形用户接口可以包括显示屏(Display)、输入子模块比如键盘(Keyboard),可选矩形用户接口还可以包括标准的有线接口、无线接口。网络接口可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。In an embodiment, the personalized recommendation device based on federated learning may also include a rectangular user interface, a network interface, a camera, and RF (Radio Frequency (radio frequency) circuits, sensors, audio circuits, WiFi modules, etc. The rectangular user interface may include a display screen (Display) and an input sub-module such as a keyboard (Keyboard), and the optional rectangular user interface may also include a standard wired interface and a wireless interface. The optional network interface can include standard wired interface and wireless interface (such as WI-FI interface).
本领域技术人员可以理解,图3中示出的基于联邦学习的个性化推荐设备结构并不构成对基于联邦学习的个性化推荐设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the structure of the personalized recommendation device based on federated learning shown in FIG. 3 does not constitute a limitation on the personalized recommendation device based on federated learning, and may include more or fewer components than shown in the figure. Or some parts are combined, or different parts are arranged.
如图3所示,作为一种计算机存储介质的存储器1005中可以包括操作系统、网络通信模块以及基于联邦学习的个性化推荐程序。操作系统是管理和控制基于联邦学习的个性化推荐设备硬件和软件资源的程序,支持基于联邦学习的个性化推荐程序以及其它软件和/或程序的运行。网络通信模块用于实现存储器1005内部各组件之间的通信,以及与基于联邦学习的个性化推荐系统中其它硬件和软件之间通信。As shown in FIG. 3, the memory 1005 as a computer storage medium may include an operating system, a network communication module, and a personalized recommendation program based on federated learning. The operating system is a program that manages and controls the hardware and software resources of the personalized recommendation device based on federated learning, and supports the running of the personalized recommendation program based on federated learning and other software and/or programs. The network communication module is used to realize the communication between the components in the memory 1005 and the communication with other hardware and software in the personalized recommendation system based on federated learning.
在图3所示的基于联邦学习的个性化推荐设备中,处理器1001用于执行存储器1005中存储的基于联邦学习的个性化推荐程序,实现上述任一项所述的基于联邦学习的个性化推荐方法的步骤。In the personalized recommendation device based on federated learning shown in FIG. 3, the processor 1001 is configured to execute the personalized recommendation program based on federated learning stored in the memory 1005 to realize the personalized recommendation based on federated learning described in any of the above items. Recommended method steps.
本申请基于联邦学习的个性化推荐设备具体实施方式与上述基于联邦学习的个性化推荐方法各实施例基本相同,在此不再赘述。The specific implementation of the personalized recommendation device based on federated learning in this application is basically the same as the above-mentioned embodiments of the personalized recommendation method based on federated learning, and will not be repeated here.
本申请实施例还提供一种基于联邦学习的个性化推荐装置,所述基于联邦学习的个性化推荐装置包括:An embodiment of the present application also provides a personalized recommendation device based on federated learning. The personalized recommendation device based on federated learning includes:
提取模块,用于所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集;The extraction module is configured to receive the uploaded data, and extract the target recall set corresponding to the uploaded data from a preset recall set storage database;
预测模块,用于所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果;A prediction module, configured to obtain the data to be predicted corresponding to the uploaded data and the target recall set, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
筛选模块,用于所述对所述模型输出结果进行筛选,获得个性化推荐结果。The screening module is used to screen the output results of the model to obtain personalized recommendation results.
在一实施例中,所述预测模块包括:In an embodiment, the prediction module includes:
筛选单元,用于所述基于所述待推荐物品列表,对所述物品数据进行筛选,获得待推荐物品数据,并将所述用户数据、所述待推荐物品数据和所述行为数据设置为所述待预测数据;The screening unit is configured to screen the item data based on the list of items to be recommended to obtain the item data to be recommended, and set the user data, the item data to be recommended and the behavior data as all State the data to be predicted;
预测单元,用于所述将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果。The prediction unit is used for inputting the data to be predicted into the personalized recommendation model, so as to score and sort the recommended items in the list of to-be-recommended items, and obtain model output results.
在一实施例中,所述预测单元包括:In an embodiment, the prediction unit includes:
评分子单元,用于所述将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据和所述行为数据对所述待推荐物品进行评分,获得评分结果;A scoring subunit for inputting the data to be predicted into the personalized recommendation model to score the item to be recommended based on the user data and the behavior data to obtain a scoring result;
排序子单元,用于所述基于所述评分结果,对所述待推荐物品进行排序,获得模型输出结果。The sorting subunit is used for sorting the items to be recommended based on the scoring result to obtain a model output result.
在一实施例中,所述基于联邦学习的个性化推荐装置还包括:In an embodiment, the personalized recommendation device based on federated learning further includes:
样本匹配模块,用于所述与所述第一设备关联的第二设备进行样本匹配,获得公共训练样本ID;A sample matching module, configured to perform sample matching on the second device associated with the first device to obtain a public training sample ID;
第一联邦学习模块,用于所述基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型。The first federated learning module is used to obtain the personalized recommendation model by interacting with the second device to perform federated learning based on the public training sample ID.
在一实施例中,所述联邦学习模块包括:In an embodiment, the federated learning module includes:
第一计算单元,用于所述基于所述公共训练样本ID,提取所述公共训练样本ID对应的第一样本数据,并计算所述第一样本数据对应的第一权值;A first calculation unit, configured to extract the first sample data corresponding to the public training sample ID based on the public training sample ID, and calculate the first weight corresponding to the first sample data;
第二计算单元,用于所述接收所述第二设备发送的第二权值,并通过预设中间参数公式计算所述第一权值和所述第二权值共同对应的梯度辅助变量,并将所述梯度辅助变量发送至第二设备,其中,所述第二设备用于计算所述梯度辅助变量对应的第二梯度;The second calculation unit is configured to receive the second weight value sent by the second device, and calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value by using a preset intermediate parameter formula, And sending the gradient auxiliary variable to a second device, where the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
第三计算单元,用于所述基于所述梯度辅助变量,计算第一梯度,并将所述第一梯度发送至预设联邦服务器,其中,所述预设联邦服务器用于基于所述第一梯度和所述第二设备发送的第二梯度计算联邦模型总梯度;The third calculation unit is configured to calculate a first gradient based on the gradient auxiliary variable, and send the first gradient to a preset federated server, where the preset federated server is used to calculate the first gradient based on the first gradient. The gradient and the second gradient sent by the second device to calculate the total gradient of the federated model;
迭代更新单元,用于所述接收所述联邦服务器反馈的所述联邦模型总梯度,并基于所述联邦模型总梯度对所述第一设备的本地训练模型进行迭代更新,获得所述逻辑回归模型。An iterative update unit, configured to receive the total gradient of the federation model fed back by the federation server, and to iteratively update the local training model of the first device based on the total gradient of the federation model to obtain the logistic regression model .
在一实施例中,所述提取模块包括:In an embodiment, the extraction module includes:
提取单元,用于所述接收上传数据,并提取所述上传数据中的样本ID;An extraction unit, configured to receive the uploaded data and extract the sample ID in the uploaded data;
查询单元,用于所述基于所述样本ID,在所述预设召回集存储数据库中查询相对应的目标召回集。The query unit is configured to query the corresponding target recall set in the preset recall set storage database based on the sample ID.
在一实施例中,所述基于联邦学习的个性化推荐装置包括:In an embodiment, the personalized recommendation device based on federated learning includes:
第二联邦学习模块,用于所述与所述第一设备关联的第二设备进行联邦学习,获得联邦召回算法模型;A second federated learning module, configured to perform federated learning for the second device associated with the first device to obtain a federated recall algorithm model;
存储模块,用于所述获取样本上传数据,并将所述样本上传数据输入所述联邦召回算法模型,获得目标召回集集合,并将所述目标召回集集合存储于所述预设召回集存储数据库中。The storage module is used to obtain the sample upload data and input the sample upload data into the federated recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage In the database.
本申请基于联邦学习的个性化推荐装置的具体实施方式与上述基于联邦学习的个性化推荐方法各实施例基本相同,在此不再赘述。The specific implementation of the personalized recommendation device based on federated learning in this application is basically the same as the above-mentioned embodiments of the personalized recommendation method based on federated learning, and will not be repeated here.
本申请实施例提供了一种介质,所述介质为可读存储介质,且所述介质存储有一个或者一个以上程序,所述一个或者一个以上程序还可被一个或者一个以上的处理器执行以用于实现上述任一项所述的基于联邦学习的个性化推荐方法的步骤。The embodiment of the present application provides a medium, the medium is a readable storage medium, and the medium stores one or more programs, and the one or more programs may also be executed by one or more processors to It is used to implement the steps of any one of the above-mentioned personalized recommendation methods based on federated learning.
本申请介质具体实施方式与上述基于联邦学习的个性化推荐方法各实施例基本相同,在此不再赘述。The specific implementation of the medium of this application is basically the same as the above-mentioned embodiments of the personalized recommendation method based on federated learning, and will not be repeated here.
以上仅为本申请的优选实施例,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本申请的专利处理范围内。The above are only preferred embodiments of this application, and do not limit the scope of this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of this application, or directly or indirectly used in other related technical fields , The same reason is included in the scope of patent processing of this application.

Claims (20)

  1. 一种基于联邦学习的个性化推荐方法,其中,所述基于联邦学习的个性化推荐方法包括:A personalized recommendation method based on federated learning, wherein the personalized recommendation method based on federated learning includes:
    接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集;Receiving uploaded data, and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database;
    获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果;Acquiring the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
    对所述模型输出结果进行筛选,获得个性化推荐结果。The output results of the model are screened to obtain personalized recommendation results.
  2. 如权利要求1所述基于联邦学习的个性化推荐方法,其中,所述目标召回集包括待推荐物品列表,所述上传数据包括用户数据、物品数据和行为数据,The personalized recommendation method based on federated learning according to claim 1, wherein the target recall set includes a list of items to be recommended, and the uploaded data includes user data, item data, and behavior data,
    所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果的步骤包括:The step of obtaining the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted into a personalized recommendation model obtained based on the federated learning, and obtaining the model output result includes:
    基于所述待推荐物品列表,对所述物品数据进行筛选,获得待推荐物品数据,并将所述用户数据、所述待推荐物品数据和所述行为数据设置为所述待预测数据;Filtering the item data based on the list of items to be recommended to obtain item data to be recommended, and setting the user data, the item data to be recommended and the behavior data as the data to be predicted;
    将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果。The data to be predicted is input into the personalized recommendation model to score and sort the recommended items in the list of recommended items to obtain model output results.
  3. 如权利要求2所述基于联邦学习的个性化推荐方法,其中,所述将所述待预测数据输入所述个性化推荐模型,以对所述待推荐物品列表中的待推荐物品进行评分和排序,获得模型输出结果的步骤包括:The personalized recommendation method based on federated learning according to claim 2, wherein said inputting said data to be predicted into said personalized recommendation model to score and sort the items to be recommended in said list of items to be recommended , The steps to obtain model output results include:
    将所述待预测数据输入所述个性化推荐模型,以基于所述用户数据和所述行为数据对所述待推荐物品进行评分,获得评分结果;Input the to-be-predicted data into the personalized recommendation model to score the to-be-recommended item based on the user data and the behavior data to obtain a scoring result;
    基于所述评分结果,对所述待推荐物品进行排序,获得模型输出结果。Based on the scoring result, the items to be recommended are sorted, and the model output result is obtained.
  4. 如权利要求1所述基于联邦学习的个性化推荐方法,其中,所述基于联邦学习的个性化推荐方法应用于进行所述联邦学习的第一设备,The personalized recommendation method based on federated learning according to claim 1, wherein the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning,
    所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果的步骤之前包括:The step of obtaining the to-be-predicted data corresponding to the uploaded data and the target recall set, and inputting the to-be-predicted data into the personalized recommendation model obtained based on the federated learning, before the step of obtaining the model output result includes:
    与所述第一设备关联的第二设备进行样本匹配,获得公共训练样本ID;Performing sample matching on a second device associated with the first device to obtain a public training sample ID;
    基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型。Based on the public training sample ID, the personalized recommendation model is obtained by interacting with the second device for federated learning.
  5. 如权利要求4所述基于联邦学习的个性化推荐方法,其中,所述个性化推荐模型包括逻辑回归模型,The personalized recommendation method based on federated learning according to claim 4, wherein the personalized recommendation model includes a logistic regression model,
    所述基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型的步骤包括:The step of obtaining the personalized recommendation model by interacting with the second device for federated learning based on the public training sample ID includes:
    基于所述公共训练样本ID,提取所述公共训练样本ID对应的第一样本数据,并计算所述第一样本数据对应的第一权值;Based on the public training sample ID, extract the first sample data corresponding to the public training sample ID, and calculate the first weight corresponding to the first sample data;
    接收所述第二设备发送的第二权值,并通过预设中间参数公式计算所述第一权值和所述第二权值共同对应的梯度辅助变量,并将所述梯度辅助变量发送至第二设备,其中,所述第二设备用于计算所述梯度辅助变量对应的第二梯度;Receive the second weight value sent by the second device, calculate the gradient auxiliary variable corresponding to the first weight value and the second weight value through a preset intermediate parameter formula, and send the gradient auxiliary variable to A second device, wherein the second device is used to calculate a second gradient corresponding to the gradient auxiliary variable;
    基于所述梯度辅助变量,计算第一梯度,并将所述第一梯度发送至预设联邦服务器,其中,所述预设联邦服务器用于基于所述第一梯度和所述第二设备发送的第二梯度计算联邦模型总梯度;以及Based on the gradient auxiliary variable, the first gradient is calculated, and the first gradient is sent to a preset federation server, where the preset federation server is used to send data based on the first gradient and the second device The second gradient calculates the total gradient of the federated model; and
    接收所述联邦服务器反馈的所述联邦模型总梯度,并基于所述联邦模型总梯度对所述第一设备的本地训练模型进行迭代更新,获得所述逻辑回归模型。The total gradient of the federation model fed back by the federation server is received, and the local training model of the first device is iteratively updated based on the total gradient of the federation model to obtain the logistic regression model.
  6. 如权利要求1所述基于联邦学习的个性化推荐方法,其中,所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集的步骤包括:The personalized recommendation method based on federated learning according to claim 1, wherein the step of receiving uploaded data and extracting a target recall set corresponding to the uploaded data from a preset recall set storage database comprises:
    接收上传数据,并提取所述上传数据中的样本ID;Receiving uploaded data, and extracting the sample ID in the uploaded data;
    基于所述样本ID,在所述预设召回集存储数据库中查询相对应的目标召回集。Based on the sample ID, query the corresponding target recall set in the preset recall set storage database.
  7. 如权利要求1所述基于联邦学习的个性化推荐方法,其中,所述基于联邦学习的个性化推荐方法应用于进行所述联邦学习的第一设备,The personalized recommendation method based on federated learning according to claim 1, wherein the personalized recommendation method based on federated learning is applied to the first device that performs the federated learning,
    所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集的步骤之前包括:The step of receiving uploaded data and extracting the target recall set corresponding to the uploaded data from a preset recall set storage database includes:
    与所述第一设备关联的第二设备进行联邦学习,获得联邦召回算法模型;The second device associated with the first device performs federated learning to obtain a federated recall algorithm model;
    获取样本上传数据,并将所述样本上传数据输入所述联邦召回算法模型,获得目标召回集集合,并将所述目标召回集集合存储于所述预设召回集存储数据库中。Obtain sample upload data, and input the sample upload data into the federal recall algorithm model to obtain a target recall set set, and store the target recall set set in the preset recall set storage database.
  8. 如权利要求1所述基于联邦学习的个性化推荐方法,其中,接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集的步骤包括:The personalized recommendation method based on federated learning according to claim 1, wherein the step of receiving uploaded data and extracting the target recall set corresponding to the uploaded data from a preset recall set storage database comprises:
    接收上传数据,并基于所述上传数据中的用户数据,在所述预设召回集存储数据库中查询所述用户数据对应的目标召回集,其中Receiving uploaded data, and based on the user data in the uploaded data, query the target recall set corresponding to the user data in the preset recall set storage database, where
    所述目标召回集包括一个或者多个初始待推荐物品,所述初始待推荐物品包括用户可能感兴趣的物品。The target recall set includes one or more initial items to be recommended, and the initial items to be recommended include items that may be of interest to the user.
  9. 如权利要求6所述基于联邦学习的个性化推荐方法,其中,所述基于所述样本ID,在所述预设召回集存储数据库中查询相对应的目标召回集的步骤包括:The personalized recommendation method based on federated learning according to claim 6, wherein the step of querying the corresponding target recall set in the preset recall set storage database based on the sample ID comprises:
    将所述样本ID作为关键词,在所述预设召回集存储数据库进行检索,获得所述目标召回集。The sample ID is used as a keyword, and the preset recall set storage database is searched to obtain the target recall set.
  10. 如权利要求7所述基于联邦学习的个性化推荐方法,其中,所述与所述第一设备关联的第二设备进行联邦学习,获得联邦召回算法模型的步骤包括:8. The personalized recommendation method based on federated learning according to claim 7, wherein the step of performing federated learning for the second device associated with the first device to obtain a federated recall algorithm model comprises:
    将所述第一设备的样本ID和所述第二设备的样本ID进行样本对齐,获得公共训练样本ID;Align the sample ID of the first device with the sample ID of the second device to obtain a common training sample ID;
    基于所述公共训练样本ID对应的上传数据对第一设备的第一训练模型进行训练,获得第一训练结果,并计算所述第一训练结果与理论训练结果的误差;以及Training the first training model of the first device based on the uploaded data corresponding to the public training sample ID, obtaining the first training result, and calculating the error between the first training result and the theoretical training result; and
    基于所述误差和所述第一训练模型的网络权重对预设目标函数进行求偏导,获得第一梯度;Obtaining a partial derivative of a preset objective function based on the error and the network weight of the first training model to obtain the first gradient;
    将所述第一梯度发送至预设联邦服务器,以通过所述预设联邦服务器基于预设联邦规则对所述第一梯度和所述第二设备发送的第二梯度进行联邦,获得联邦梯度;以及Sending the first gradient to a preset federation server to federate the first gradient and the second gradient sent by the second device through the preset federation server based on a preset federation rule to obtain a federated gradient; as well as
    接收所述预设联邦服务器反馈的所述联邦梯度,并基于所述联邦梯度对所述第一训练模型进行迭代更新,获得所述联邦召回算法模型。The federated gradient fed back by the preset federated server is received, and the first training model is iteratively updated based on the federated gradient to obtain the federated recall algorithm model.
  11. 如权利要求10所述基于联邦学习的个性化推荐方法,其中,所述预设联邦规则包括加权平均,停止所述迭代更新的条件包括达到最大迭代次数、模型收敛于预设误差阀值。10. The personalized recommendation method based on federated learning according to claim 10, wherein the preset federated rule includes a weighted average, and the condition for stopping the iterative update includes reaching the maximum number of iterations and the model converging to a preset error threshold.
  12. 如权利要求1所述基于联邦学习的个性化推荐方法,其中,所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果的步骤包括:The personalized recommendation method based on federated learning according to claim 1, wherein said obtaining the data to be predicted corresponding to the uploaded data and the target recall set, and inputting the data to be predicted based on the federated learning The obtained personalized recommendation model, and the steps to obtain the output result of the model include:
    将所述上传数据和所述目标召回集共同对应的待预测数据输入基于所述联邦学习获取的个性化推荐模型;Inputting the data to be predicted corresponding to the uploaded data and the target recall set into a personalized recommendation model obtained based on the federated learning;
    对所述目标召回集对应的待推荐物品进行评分和排序,获得所述待推荐物品对应的物品排序列表及待推荐物品对应的评分;以及Scoring and sorting the items to be recommended corresponding to the target recall set, and obtaining a ranking list of items corresponding to the items to be recommended and a score corresponding to the items to be recommended; and
    获得模型输出结果。Obtain model output results.
  13. 如权利要求1所述基于联邦学习的个性化推荐方法,其中,所述目标召回集包括待推荐物品列表,所述上传数据包括用户数据、物品数据和行为数据。The personalized recommendation method based on federated learning according to claim 1, wherein the target recall set includes a list of items to be recommended, and the uploaded data includes user data, item data, and behavior data.
  14. 如权利要求4所述基于联邦学习的个性化推荐方法,其中,所述公共训练样本ID为所述第一设备中的第一训练样本ID与所述第二设备中的第二训练样本ID进行交集处理后的样本ID。The personalized recommendation method based on federated learning according to claim 4, wherein the public training sample ID is the first training sample ID in the first device and the second training sample ID in the second device. Sample ID after intersection processing.
  15. 如权利要求4所述基于联邦学习的个性化推荐方法,其中,所述基于所述公共训练样本ID,通过与所述第二设备交互以进行联邦学习,获得所述个性化推荐模型的步骤包括:The personalized recommendation method based on federated learning according to claim 4, wherein the step of obtaining the personalized recommendation model by interacting with the second device to perform federated learning based on the public training sample ID comprises :
    基于所述公共训练样本ID,提取所述公共训练样本ID对应的第一样本数据;Extract the first sample data corresponding to the public training sample ID based on the public training sample ID;
    基于所述第一样本数据获取第一梯度,通过与所述第二设备进行交互,协助所述第二设备获取第二梯度,并将所述第一梯度发送至预设联邦服务器;Acquiring a first gradient based on the first sample data, assisting the second device in acquiring the second gradient by interacting with the second device, and sending the first gradient to a preset federation server;
    接收预设联邦服务器反馈的联邦模型总梯度;以及Receive the total gradient of the federation model fed back by the preset federation server; and
    基于所述联邦模型总梯度对第一设备中的第一训练模型进行迭代更新,获得所述个性化推荐模型。The first training model in the first device is iteratively updated based on the total gradient of the federation model to obtain the personalized recommendation model.
  16. 如权利要求15所述基于联邦学习的个性化推荐方法,其中,The personalized recommendation method based on federated learning as claimed in claim 15, wherein:
    所述联邦模型总梯度由预设联邦服务器通过预设联邦规则对所述第一梯度和所述第二设备发送的第二梯度进行联邦获得,The total gradient of the federation model is obtained by federating the first gradient and the second gradient sent by the second device by a preset federation server through a preset federation rule,
    所述预设联邦规则包括加权平均、求和。The preset federal rules include weighted average and sum.
  17. 如权利要求5所述基于联邦学习的个性化推荐方法,其中,The personalized recommendation method based on federated learning according to claim 5, wherein:
    第一权值为所述第一样本数据对应的第一样本特征和所述第一训练模型中的第一权重之积,其中,所述第一权重为所述第一训练模型的网络权重;The first weight is the product of the first sample feature corresponding to the first sample data and the first weight in the first training model, where the first weight is the network of the first training model Weights;
    所述第二权值为所述第二样本数据对应的第二样本特征和第二设备中的第二训练模型中的第二权重之积,所述第二权重为所述第二训练模型的网络权重。The second weight is the product of the second sample feature corresponding to the second sample data and the second weight in the second training model in the second device, and the second weight is the product of the second training model Network weight.
  18. 一种基于联邦学习的个性化推荐装置,其中,所述基于联邦学习的个性化推荐装置包括:A personalized recommendation device based on federated learning, wherein the personalized recommendation device based on federated learning includes:
    提取模块,用于所述接收上传数据,并从预设召回集存储数据库中提取所述上传数据对应的目标召回集;The extraction module is configured to receive the uploaded data, and extract the target recall set corresponding to the uploaded data from a preset recall set storage database;
    预测模块,用于所述获取所述上传数据和所述目标召回集共同对应的待预测数据,并将所述待预测数据输入基于所述联邦学习获取的个性化推荐模型,获得模型输出结果;A prediction module, configured to obtain the data to be predicted corresponding to the uploaded data and the target recall set, and input the data to be predicted into a personalized recommendation model obtained based on the federated learning to obtain a model output result;
    筛选模块,用于所述对所述模型输出结果进行筛选,获得个性化推荐结果。The screening module is used to screen the output results of the model to obtain personalized recommendation results.
  19. 一种基于联邦学习的个性化推荐设备,其中,所述基于联邦学习的个性化推荐设备包括:存储器、处理器以及存储在存储器上的用于实现所述基于联邦学习的个性化推荐方法的程序,A personalized recommendation device based on federated learning, wherein the personalized recommendation device based on federated learning includes: a memory, a processor, and a program stored in the memory for implementing the method of personalized recommendation based on federated learning ,
    所述存储器用于存储实现基于联邦学习的个性化推荐方法的程序;The memory is used to store a program for implementing a personalized recommendation method based on federated learning;
    所述处理器用于执行实现所述基于联邦学习的个性化推荐方法的程序,以实现如权利要求1至17中任一项所述基于联邦学习的个性化推荐方法的步骤。The processor is configured to execute a program for implementing the personalized recommendation method based on federated learning, so as to realize the steps of the personalized recommendation method based on federated learning according to any one of claims 1 to 17.
  20. 一种介质,其中,所述介质上存储有实现基于联邦学习的个性化推荐方法的程序,所述实现基于联邦学习的个性化推荐方法的程序被处理器执行以实现如权利要求1至17中任一项所述基于联邦学习的个性化推荐方法的步骤。A medium, wherein a program for implementing a method of personalized recommendation based on federated learning is stored on the medium, and the program for implementing the method of personalized recommendation based on federated learning is executed by a processor to implement the method as in claims 1 to 17 Any of the steps of the personalized recommendation method based on federated learning.
PCT/CN2020/135030 2019-12-20 2020-12-09 Federated learning-based personalized recommendation method, apparatus and device, and medium WO2021121106A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911326853.3A CN111079022B (en) 2019-12-20 2019-12-20 Personalized recommendation method, device, equipment and medium based on federal learning
CN201911326853.3 2019-12-20

Publications (1)

Publication Number Publication Date
WO2021121106A1 true WO2021121106A1 (en) 2021-06-24

Family

ID=70316309

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/135030 WO2021121106A1 (en) 2019-12-20 2020-12-09 Federated learning-based personalized recommendation method, apparatus and device, and medium

Country Status (2)

Country Link
CN (1) CN111079022B (en)
WO (1) WO2021121106A1 (en)

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113569151A (en) * 2021-09-18 2021-10-29 平安科技(深圳)有限公司 Data recommendation method, device, equipment and medium based on artificial intelligence
CN113627496A (en) * 2021-07-27 2021-11-09 交控科技股份有限公司 Method, device, electronic equipment and readable storage medium for predicting fault of turnout switch machine
CN113642707A (en) * 2021-08-12 2021-11-12 深圳平安智汇企业信息管理有限公司 Model training method, device, equipment and storage medium based on federal learning
CN113722738A (en) * 2021-09-02 2021-11-30 脸萌有限公司 Data protection method, device, medium and electronic equipment
CN113762532A (en) * 2021-06-28 2021-12-07 京东科技控股股份有限公司 Federal learning model training method and device, electronic equipment and storage medium
CN113902473A (en) * 2021-09-29 2022-01-07 支付宝(杭州)信息技术有限公司 Training method and device of business prediction system
CN114186263A (en) * 2021-12-17 2022-03-15 大连理工大学 Data regression method based on longitudinal federal learning and electronic device
CN114363176A (en) * 2021-12-20 2022-04-15 中山大学 Network identification method, device, terminal and medium based on federal learning
CN114676851A (en) * 2022-04-08 2022-06-28 中国科学技术大学 Joint training method, device and storage medium for recall and ranking model
CN114741611A (en) * 2022-06-08 2022-07-12 杭州金智塔科技有限公司 Federal recommendation model training method and system
CN114863499A (en) * 2022-06-30 2022-08-05 广州脉泽科技有限公司 Finger vein and palm vein identification method based on federal learning
CN114969503A (en) * 2022-03-30 2022-08-30 贵州大学 Multi-data user portrait implementation method based on federal learning
CN114996317A (en) * 2022-07-05 2022-09-02 中国电信股份有限公司 Asynchronous optimization method and device based on longitudinal federal learning and storage medium
CN115169576A (en) * 2022-06-24 2022-10-11 上海富数科技有限公司广州分公司 Model training method and device based on federal learning and electronic equipment
CN116629374A (en) * 2023-04-13 2023-08-22 南京大学 Searching and multiplexing method for heterogeneous feature space learning piece
CN117076783A (en) * 2023-10-16 2023-11-17 广东省科技基础条件平台中心 Scientific and technological information recommendation method, device, medium and equipment based on data analysis
CN117421486A (en) * 2023-12-18 2024-01-19 杭州金智塔科技有限公司 Recommendation model updating system and method based on spherical tree algorithm and federal learning
CN117592042A (en) * 2024-01-17 2024-02-23 杭州海康威视数字技术股份有限公司 Privacy disclosure detection method and device for federal recommendation system
CN114186263B (en) * 2021-12-17 2024-05-03 大连理工大学 Data regression method based on longitudinal federal learning and electronic device

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111079022B (en) * 2019-12-20 2023-10-03 深圳前海微众银行股份有限公司 Personalized recommendation method, device, equipment and medium based on federal learning
CN111291417B (en) * 2020-05-09 2020-08-28 支付宝(杭州)信息技术有限公司 Method and device for protecting data privacy of multi-party combined training object recommendation model
CN111859136B (en) * 2020-07-23 2024-03-15 深圳前海微众银行股份有限公司 Personalized recommendation method, device, equipment and readable storage medium
CN113781134A (en) * 2020-07-28 2021-12-10 北京沃东天骏信息技术有限公司 Item recommendation method and device and computer-readable storage medium
CN111897800B (en) * 2020-08-05 2021-06-22 全球能源互联网研究院有限公司 Electric vehicle charging facility recommendation method and system based on federal learning
CN112287231B (en) * 2020-11-05 2024-04-05 深圳大学 Federal recommendation gradient acquisition method and device, intelligent terminal and storage medium
CN112449009B (en) * 2020-11-12 2023-01-10 深圳大学 SVD-based communication compression method and device for Federal learning recommendation system
CN112836830B (en) * 2021-02-01 2022-05-06 广西师范大学 Method for voting and training in parallel by using federated gradient boosting decision tree
CN112966182B (en) * 2021-03-09 2024-02-09 中国民航信息网络股份有限公司 Project recommendation method and related equipment
CN113313268A (en) * 2021-06-11 2021-08-27 杭州煋辰数智科技有限公司 Federal learning-based prediction method and device, storage medium and remote sensing equipment
CN113656681B (en) * 2021-07-08 2023-08-11 北京奇艺世纪科技有限公司 Object evaluation method, device, equipment and storage medium
CN113704555B (en) * 2021-07-16 2023-11-07 杭州医康慧联科技股份有限公司 Feature management method based on medical direction federal learning
CN114969486B (en) * 2022-08-02 2022-11-04 平安科技(深圳)有限公司 Corpus recommendation method, apparatus, device and storage medium
CN115238065B (en) * 2022-09-22 2022-12-20 太极计算机股份有限公司 Intelligent document recommendation method based on federal learning
CN117493662A (en) * 2023-10-09 2024-02-02 上海及未科技有限公司 Personalized recommendation method and system based on federal learning
CN117454185B (en) * 2023-12-22 2024-03-12 深圳市移卡科技有限公司 Federal model training method, federal model training device, federal model training computer device, and federal model training storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110110229A (en) * 2019-04-25 2019-08-09 深圳前海微众银行股份有限公司 A kind of information recommendation method and device
CN110297848A (en) * 2019-07-09 2019-10-01 深圳前海微众银行股份有限公司 Recommended models training method, terminal and storage medium based on federation's study
US20190303838A1 (en) * 2018-03-30 2019-10-03 Atlassian Pty Ltd Using a productivity index and collaboration index for validation of recommendation models in federated collaboration systems
CN110458663A (en) * 2019-08-06 2019-11-15 上海新共赢信息科技有限公司 A kind of vehicle recommended method, device, equipment and storage medium
CN111079022A (en) * 2019-12-20 2020-04-28 深圳前海微众银行股份有限公司 Personalized recommendation method, device, equipment and medium based on federal learning

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10102478B2 (en) * 2015-06-26 2018-10-16 Conduent Business Services, Inc. Distributed and privacy-preserving prediction method
CN109754105B (en) * 2017-11-07 2024-01-05 华为技术有限公司 Prediction method, terminal and server
US20190182059A1 (en) * 2017-12-12 2019-06-13 Facebook, Inc. Utilizing machine learning from exposed and non-exposed user recall to improve digital content distribution
CN110134868B (en) * 2019-05-14 2023-04-07 辽宁工程技术大学 Recommendation method based on user preference heterogeneity analysis
CN110443063B (en) * 2019-06-26 2023-03-28 电子科技大学 Adaptive privacy-protecting federal deep learning method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190303838A1 (en) * 2018-03-30 2019-10-03 Atlassian Pty Ltd Using a productivity index and collaboration index for validation of recommendation models in federated collaboration systems
CN110110229A (en) * 2019-04-25 2019-08-09 深圳前海微众银行股份有限公司 A kind of information recommendation method and device
CN110297848A (en) * 2019-07-09 2019-10-01 深圳前海微众银行股份有限公司 Recommended models training method, terminal and storage medium based on federation's study
CN110458663A (en) * 2019-08-06 2019-11-15 上海新共赢信息科技有限公司 A kind of vehicle recommended method, device, equipment and storage medium
CN111079022A (en) * 2019-12-20 2020-04-28 深圳前海微众银行股份有限公司 Personalized recommendation method, device, equipment and medium based on federal learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
FEI CHEN, MI LUO, ZHENHUA DONG, ZHENGUO LI, XIUQIANG HE: "Federated Meta-Learning with Fast Convergence and Efficient Communication", COMPUTER SCIENCE MACHINE LEARNING (ARXIV), 14 December 2019 (2019-12-14), pages 1 - 14, XP081556998 *

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113762532B (en) * 2021-06-28 2024-04-05 京东科技控股股份有限公司 Training method and device of federal learning model, electronic equipment and storage medium
CN113762532A (en) * 2021-06-28 2021-12-07 京东科技控股股份有限公司 Federal learning model training method and device, electronic equipment and storage medium
CN113627496A (en) * 2021-07-27 2021-11-09 交控科技股份有限公司 Method, device, electronic equipment and readable storage medium for predicting fault of turnout switch machine
CN113642707A (en) * 2021-08-12 2021-11-12 深圳平安智汇企业信息管理有限公司 Model training method, device, equipment and storage medium based on federal learning
CN113642707B (en) * 2021-08-12 2023-08-18 深圳平安智汇企业信息管理有限公司 Model training method, device, equipment and storage medium based on federal learning
CN113722738A (en) * 2021-09-02 2021-11-30 脸萌有限公司 Data protection method, device, medium and electronic equipment
CN113722738B (en) * 2021-09-02 2023-08-08 脸萌有限公司 Data protection method, device, medium and electronic equipment
CN113569151A (en) * 2021-09-18 2021-10-29 平安科技(深圳)有限公司 Data recommendation method, device, equipment and medium based on artificial intelligence
CN113902473A (en) * 2021-09-29 2022-01-07 支付宝(杭州)信息技术有限公司 Training method and device of business prediction system
CN114186263A (en) * 2021-12-17 2022-03-15 大连理工大学 Data regression method based on longitudinal federal learning and electronic device
CN114186263B (en) * 2021-12-17 2024-05-03 大连理工大学 Data regression method based on longitudinal federal learning and electronic device
CN114363176A (en) * 2021-12-20 2022-04-15 中山大学 Network identification method, device, terminal and medium based on federal learning
CN114363176B (en) * 2021-12-20 2023-08-08 中山大学 Network identification method, device, terminal and medium based on federal learning
CN114969503A (en) * 2022-03-30 2022-08-30 贵州大学 Multi-data user portrait implementation method based on federal learning
CN114969503B (en) * 2022-03-30 2024-04-02 贵州大学 Multi-data user portrait realizing method based on federal learning
CN114676851A (en) * 2022-04-08 2022-06-28 中国科学技术大学 Joint training method, device and storage medium for recall and ranking model
CN114676851B (en) * 2022-04-08 2024-03-29 中国科学技术大学 Combined training method, equipment and storage medium for recall and sequence model
CN114741611B (en) * 2022-06-08 2022-10-14 杭州金智塔科技有限公司 Federal recommendation model training method and system
CN114741611A (en) * 2022-06-08 2022-07-12 杭州金智塔科技有限公司 Federal recommendation model training method and system
CN115169576B (en) * 2022-06-24 2024-02-09 上海富数科技有限公司 Model training method and device based on federal learning and electronic equipment
CN115169576A (en) * 2022-06-24 2022-10-11 上海富数科技有限公司广州分公司 Model training method and device based on federal learning and electronic equipment
CN114863499B (en) * 2022-06-30 2022-12-13 广州脉泽科技有限公司 Finger vein and palm vein identification method based on federal learning
CN114863499A (en) * 2022-06-30 2022-08-05 广州脉泽科技有限公司 Finger vein and palm vein identification method based on federal learning
CN114996317A (en) * 2022-07-05 2022-09-02 中国电信股份有限公司 Asynchronous optimization method and device based on longitudinal federal learning and storage medium
CN114996317B (en) * 2022-07-05 2024-02-23 中国电信股份有限公司 Asynchronous optimization method and device based on longitudinal federal learning and storage medium
CN116629374B (en) * 2023-04-13 2023-12-26 南京大学 Searching and multiplexing method for heterogeneous feature space learning piece
CN116629374A (en) * 2023-04-13 2023-08-22 南京大学 Searching and multiplexing method for heterogeneous feature space learning piece
CN117076783B (en) * 2023-10-16 2023-12-26 广东省科技基础条件平台中心 Scientific and technological information recommendation method, device, medium and equipment based on data analysis
CN117076783A (en) * 2023-10-16 2023-11-17 广东省科技基础条件平台中心 Scientific and technological information recommendation method, device, medium and equipment based on data analysis
CN117421486A (en) * 2023-12-18 2024-01-19 杭州金智塔科技有限公司 Recommendation model updating system and method based on spherical tree algorithm and federal learning
CN117421486B (en) * 2023-12-18 2024-03-19 杭州金智塔科技有限公司 Recommendation model updating system and method based on spherical tree algorithm and federal learning
CN117592042B (en) * 2024-01-17 2024-04-05 杭州海康威视数字技术股份有限公司 Privacy disclosure detection method and device for federal recommendation system
CN117592042A (en) * 2024-01-17 2024-02-23 杭州海康威视数字技术股份有限公司 Privacy disclosure detection method and device for federal recommendation system

Also Published As

Publication number Publication date
CN111079022A (en) 2020-04-28
CN111079022B (en) 2023-10-03

Similar Documents

Publication Publication Date Title
WO2021121106A1 (en) Federated learning-based personalized recommendation method, apparatus and device, and medium
CN102043833B (en) Search method and device based on query word
JP5721818B2 (en) Use of model information group in search
WO2021143267A1 (en) Image detection-based fine-grained classification model processing method, and related devices
CN101984422B (en) Fault-tolerant text query method and equipment
CN101984420B (en) Method and equipment for searching pictures based on word segmentation processing
CN110134800A (en) A kind of document relationships visible processing method and device
WO2010022655A1 (en) A searching method and system
JP2013504118A (en) Information retrieval based on query semantic patterns
CN103123632B (en) Search center word defining method and device, searching method and search equipment
CN103136213B (en) Method and device for providing related words
WO2011137125A1 (en) Vertical search-based query method, system and apparatus
CN105531701A (en) Personalized trending image search suggestion
CN102236710A (en) Method and equipment for displaying news information in query result
WO2016078533A1 (en) Search method, apparatus, and device and non-volatile computer storage medium
WO2015081792A1 (en) Method, device, and system for correlative and personalized extended search
Babu et al. Improving Quality of Content Based Image Retrieval with Graph Based Ranking
US20160140173A1 (en) Systems and methods for representing search query rewrites
TW201220097A (en) capable of performing relevancy processing for at least one product corresponding to product identifiers referenced in relevant web pages
CN110968801A (en) Real estate product searching method, storage medium and electronic device
CN105117434A (en) Webpage classification method and webpage classification system
CN112825089A (en) Article recommendation method, article recommendation device, article recommendation equipment and storage medium
CN102609539A (en) Search method and search system
JP2011248762A (en) Classification device, content retrieval system, content classification method, content retrieval method, and program
CN102760127A (en) Method, device and equipment for determining resource type based on extended text information

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20903485

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20903485

Country of ref document: EP

Kind code of ref document: A1