CN110008399B - Recommendation model training method and device, and recommendation method and device - Google Patents

Recommendation model training method and device, and recommendation method and device Download PDF

Info

Publication number
CN110008399B
CN110008399B CN201910090621.6A CN201910090621A CN110008399B CN 110008399 B CN110008399 B CN 110008399B CN 201910090621 A CN201910090621 A CN 201910090621A CN 110008399 B CN110008399 B CN 110008399B
Authority
CN
China
Prior art keywords
recommended
user
sample
recommendation
application program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910090621.6A
Other languages
Chinese (zh)
Other versions
CN110008399A (en
Inventor
谢仁强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Advantageous New Technologies Co Ltd
Original Assignee
Advanced New Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Advanced New Technologies Co Ltd filed Critical Advanced New Technologies Co Ltd
Priority to CN201910090621.6A priority Critical patent/CN110008399B/en
Publication of CN110008399A publication Critical patent/CN110008399A/en
Application granted granted Critical
Publication of CN110008399B publication Critical patent/CN110008399B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The method for training the recommendation model comprises the steps of obtaining user characteristics of at least two sample users and attribute characteristics of at least two sample application programs; generating a positive sample clicked and purchased by the sample user on the exposed sample application program and a negative sample clicked but not purchased or clicked by the sample user on the exposed sample application program based on the user characteristics and the attribute characteristics; training a recommendation model based on a sample set comprising at least one positive sample and one negative sample to obtain the recommendation model, wherein the recommendation model outputs exposure conversion rate obtained by each sample user based on click rate and purchase rate of each exposed sample application program.

Description

Training method and device of recommendation model, and recommendation method and device
Technical Field
The present application relates to the field of computer distributed processing technologies, and in particular, to a training method and apparatus for a recommendation model, a recommendation method and apparatus, a computing device, and a computer-readable storage medium.
Background
In some internet products, in order to facilitate a user to quickly find a desired product (e.g., APP), personalized recommendation needs to be performed, and a product which the user most likely clicks to purchase is recommended to a user to be recommended.
At present, some platforms simply recommend the goods according to the user and the tag of the APP by using a traditional machine learning model (such as an LR or GBDT model), the method of recommending goods by using the tag needs a lot of manpower to establish a tag system, inappropriate APPs (such as APP of yellow gambling poison) are easily recommended to the user, and the traditional machine learning model only estimates the click rate of the APP, does not fully utilize purchase data after clicking, has high click rate, and has not necessarily high conversion rate after clicking.
Disclosure of Invention
In view of this, embodiments of the present disclosure provide a recommendation model training method and apparatus, a recommendation method and apparatus, a computing device, and a computer-readable storage medium, so as to solve technical defects in the prior art.
In a first aspect, an embodiment of the present specification discloses a training method for a recommendation model, including:
acquiring user characteristics of at least two sample users and attribute characteristics of at least two sample application programs;
generating a positive sample clicked and purchased by a sample user on the exposed sample application program and a negative sample clicked but not purchased or clicked by the sample user on the exposed sample application program based on the user characteristics and the attribute characteristics;
training a recommendation model based on a sample set comprising at least one positive sample and one negative sample to obtain the recommendation model, wherein the recommendation model outputs exposure conversion rate obtained by each sample user based on click rate and purchase rate of each exposed sample application program.
Optionally, before training the recommendation model based on a sample set including at least one positive sample and one negative sample, the method further includes:
and screening the sample set into a training sample set comprising at least one positive sample and at least one negative sample and a test sample set comprising at least one positive sample and at least one negative sample based on a preset screening rule.
Optionally, training the recommendation model based on a set of samples comprising at least one positive sample and one negative sample comprises:
training a recommendation model based on the set of training samples comprising at least one positive sample and one negative sample.
Optionally, after training the recommendation model based on a sample set including at least one positive sample and one negative sample, the method further includes:
the recommendation model is tested based on a set of test samples including at least one positive sample and one negative sample.
Optionally, the recommendation model includes a deep fm multitask learning model, and the deep fm multitask learning model includes a click rate estimation part and a purchase rate estimation part.
Optionally, the recommending model outputting the exposure conversion rate obtained by each sample user based on the click rate and the purchase rate of the sample application program for each exposure comprises:
the click rate estimation part of the recommendation model outputs the click rate of each sample user to each exposed sample application program;
a purchase rate estimation part of the recommendation model outputs the purchase rate of each sample user to each exposed sample application program;
determining an exposure conversion rate for each sample user for each exposed sample application based on the click through rate and the purchase rate.
Optionally, the user features and the attribute features each include offline features and real-time features, wherein the offline features include historical features of the collected sample users and the sample applications, and the real-time features include features of the collected sample users and the sample applications at the time of events.
In a second aspect, an embodiment of the present specification further provides a recommendation method, including:
receiving a recommendation request of a user to be recommended for an exposed application program, wherein the user to be recommended carries a user identifier;
determining at least two applications to be recommended which are matched with the user to be recommended based on the user identification;
extracting the user characteristics of the user to be recommended and the attribute characteristics of the at least two application programs to be recommended;
inputting the user characteristics and the attribute characteristics into a pre-trained recommendation model to obtain the exposure conversion rate of each matched application program to be recommended by the user to be recommended;
recommending at least one application program to be recommended in the at least two application programs to be recommended to a user to be recommended as an exposed application program based on the exposure conversion rate.
Optionally, before receiving an application recommendation request of an exposure of a user to be recommended, the method further includes:
acquiring a plurality of application programs carrying identifiers;
and screening the plurality of application programs based on a first preset condition, and determining at least two application programs to be recommended.
Optionally, after determining at least two applications to be recommended, the method further includes:
and matching the user to be recommended with the at least two application programs to be recommended based on a preset matching rule, wherein the user to be recommended carries a user identifier.
Optionally, the recommendation model includes a deep fm multitask learning model, and the deep fm multitask learning model includes a click rate estimation part and a purchase rate estimation part.
Optionally, recommending at least one application to be recommended of the at least two applications to be recommended as an exposed application to a user to be recommended based on the exposure conversion includes:
sorting the at least two applications to be recommended based on the exposure conversion rate;
and selecting at least one to-be-recommended application program in the at least two to-be-recommended application programs based on a preset recommendation condition to be recommended to the to-be-recommended user as an exposed application program.
Optionally, after the sorting the at least two applications to be recommended based on the exposure conversion rate, the method further includes:
screening the at least two applications to be recommended based on the second preset condition;
selecting at least one application program to be recommended from the at least two application programs to be recommended as an exposed application program to be recommended to a user to be recommended based on a preset recommendation condition, wherein the recommendation comprises the following steps:
and selecting at least one application program to be recommended from the at least two application programs to be recommended after screening as an exposed application program to be recommended to the user to be recommended based on a preset recommendation condition.
Optionally, the user features and the attribute features both include offline features and real-time features, where the offline features include collected historical features of the user to be recommended and the application to be recommended, and the real-time features include collected features of the user to be recommended and the application to be recommended at the current time.
In a third aspect, an embodiment of the present specification further provides a training apparatus for a recommendation model, including:
a first obtaining module configured to obtain user characteristics of at least two sample users and attribute characteristics of at least two sample applications;
a generating module configured to generate a positive sample clicked and purchased by a sample user on the exposed sample application program and a negative sample clicked but not purchased or clicked by the sample user on the exposed sample application program based on the user characteristics and the attribute characteristics;
the recommendation model is obtained by training a recommendation model based on a sample set comprising at least one positive sample and one negative sample, and the recommendation model outputs exposure conversion rate obtained by each sample user based on click rate and purchase rate of a sample application program of each exposure.
Optionally, the apparatus further comprises:
a first screening module configured to screen the sample set into a training sample set including at least one positive sample and one negative sample and a test sample set including at least one positive sample and one negative sample based on a preset screening rule.
Optionally, the training module is further configured to:
training a recommendation model based on the set of training samples comprising at least one positive sample and one negative sample.
Optionally, the apparatus further comprises:
a testing module configured to test the recommendation model based on a set of test samples comprising at least one positive sample and one negative sample.
Optionally, the recommendation model includes a deep fm multitask learning model, and the deep fm multitask learning model includes a click rate estimation part and a purchase rate estimation part.
Optionally, the training module comprises:
the first output sub-module is configured to output the click rate of each sample user to each exposed sample application program by the click rate estimation part of the recommendation model;
a second output sub-module configured to output a purchase rate of each sample user for each exposed sample application program by a purchase rate estimation part of the recommendation model;
a determination submodule configured to determine an exposure conversion rate for each sample user for each exposed sample application based on the click through rate and the purchase rate.
Optionally, the user features and the attribute features each include offline features and real-time features, wherein the offline features include historical features of the collected sample users and the sample applications, and the real-time features include features of the collected sample users and the sample applications at the time of events.
In a fourth aspect, an embodiment of the present specification further provides a recommendation device, including:
the recommendation system comprises a receiving module, a recommendation module and a recommendation module, wherein the receiving module is configured to receive a recommendation request of a user to be recommended for an exposed application program, and the user to be recommended carries a user identifier;
the determining module is configured to determine at least two applications to be recommended which are matched with the user to be recommended based on the user identification;
the extraction module is configured to extract the user characteristics of the user to be recommended and the attribute characteristics of the at least two application programs to be recommended;
the obtaining module is configured to input the user characteristics and the attribute characteristics into a pre-trained recommendation model to obtain exposure conversion rate of each matched application program to be recommended by the user to be recommended;
the first recommending module is configured to recommend at least one application program to be recommended in the at least two applications programs to be recommended to a user to be recommended as an exposed application program based on the exposure conversion rate.
Optionally, the apparatus further comprises:
the second acquisition module is configured to acquire a plurality of application programs carrying the identifiers;
the second screening module is configured to screen the plurality of application programs based on a first preset condition, and determine at least two application programs to be recommended.
Optionally, the apparatus further comprises:
the matching module is configured to match a user to be recommended with the at least two application programs to be recommended based on a preset matching rule, wherein the user to be recommended carries a user identifier.
Optionally, the recommendation model includes a deep fm multitask learning model, and the deep fm multitask learning model includes a click rate estimation part and a purchase rate estimation part.
Optionally, the first recommending module includes:
a ranking submodule configured to rank the at least two applications to be recommended based on the exposure conversion;
and the second recommending submodule is configured to select at least one to-be-recommended application program in the at least two to-be-recommended application programs to be recommended to the to-be-recommended user as an exposed application program based on a preset recommending condition.
Optionally, the apparatus further comprises:
the third screening module is configured to screen the at least two applications to be recommended based on the second preset condition;
a second recommendation sub-module further configured to:
and selecting at least one application program to be recommended from the at least two application programs to be recommended after screening as an exposed application program to be recommended to the user to be recommended based on a preset recommendation condition.
Optionally, the user characteristic and the attribute characteristic both include an offline characteristic and a real-time characteristic, where the offline characteristic includes collected historical characteristics of the user to be recommended and the application to be recommended, and the real-time characteristic includes collected characteristics of the user to be recommended and the application to be recommended at a current time.
In a fifth aspect, embodiments of the present specification disclose a computing device comprising a memory, a processor, and computer instructions stored on the memory and executable on the processor, the processor implementing a training method for a recommendation model or steps of the recommendation method as described above when executing the instructions.
In a sixth aspect, the present specification discloses a computer readable storage medium storing computer instructions, which when executed by a processor implement the training method of the recommendation model or the steps of the recommendation method as described above.
The application provides a training method and device of a recommendation model, a recommendation method and device, a computing device and a computer readable storage medium, wherein the recommendation method comprises the steps of receiving a recommendation request of a user to be recommended for an exposed application program, wherein the user to be recommended carries a user identifier; determining at least two applications to be recommended which are matched with the user to be recommended based on the user identification; extracting the user characteristics of the user to be recommended and the attribute characteristics of the at least two application programs to be recommended; inputting the user characteristics and the attribute characteristics into a pre-trained recommendation model to obtain the exposure conversion rate of each matched application program to be recommended by the user to be recommended; recommending at least one application program to be recommended in the at least two applications programs to be recommended to a user to be recommended as an exposed application program based on the exposure conversion rate. The recommendation model based on deep learning is adopted, wherein the recommendation model is a model of a deep FM structure adopting multi-task learning, online real-time recommendation of the exposure application program is carried out based on a user to be recommended, the automatic feature cross combination capability of the recommendation model and the conversion feature after the exposure application program is clicked are effectively utilized, the feature sparsity problem is solved, and the recommendation effect of the exposure application program is greatly improved.
Drawings
FIG. 1 is a block diagram of a computing device provided in one or more embodiments of the present description;
FIG. 2 is a flow diagram of a method for training a recommendation model provided in one or more embodiments of the present description;
fig. 3 is a schematic network structure diagram of a deep fm model provided in one or more embodiments of the present disclosure;
FIG. 4 is a schematic network structure diagram of a DeepFM multitask learning model provided by one or more embodiments of the present description;
FIG. 5 is a flow diagram of a recommendation method provided by one or more embodiments of the present specification;
FIG. 6 is a schematic structural diagram of a training apparatus for a recommendation model according to one or more embodiments of the present disclosure;
fig. 7 is a schematic structural diagram of a recommendation device provided in one or more embodiments of the present disclosure.
Detailed Description
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present application. This application is capable of implementation in many different ways than those herein set forth and of similar import by those skilled in the art without departing from the spirit of this application and is therefore not limited to the specific implementations disclosed below.
The terminology used in the description of the one or more embodiments is for the purpose of describing the particular embodiments only and is not intended to be limiting of the description of the one or more embodiments. As used in one or more embodiments of the present specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used in one or more embodiments of the present specification refers to and encompasses any and all possible combinations of one or more of the associated listed items.
It should be understood that although the terms first, second, etc. may be used herein in one or more embodiments to describe various information, these information should not be limited by these terms. These terms are only used to distinguish one type of information from another. For example, a first can also be referred to as a second and, similarly, a second can also be referred to as a first without departing from the scope of one or more embodiments of the present description. The word "if," as used herein, may be interpreted as "at \8230; \8230when" or "when 8230; \823030when" or "in response to a determination," depending on the context.
First, the noun terms to which one or more embodiments of the present invention relate are explained.
FM: english is called as a whole: factorization Machines, all Chinese called: a factorization machine is a machine learning algorithm based on matrix decomposition and is proposed by Steffen Rendle, and can predict any real-value vector. The method has the main advantages that 1) the method can be used for highly sparse data scenes; 2) With linear computational complexity. The main objective in this application is to solve the problem of how to combine features in case of sparse data.
DNN: english is called as a whole: deep Neural Network, which is known as Chinese: the deep neural network is divided from the DNN according to the positions of different layers, and the neural network layers in the DNN can be divided into three types, namely an input layer, a hidden layer and an output layer.
DeepFM model: a deep learning model is a neural network framework integrating FM and DNN, combines the advantages of DNN and FM, and can simultaneously extract the combined features of low order and high order.
CTR: english is called as a whole: click-Through-Rate, chinese full name: click through rate.
CVR: english is called as a whole: conversion Rate, chinese full name: the rate of purchase.
MTL: english is called as a whole: multi-Task Learning, chinese full name: multitask learning, an inductive migration mechanism, is mainly aimed at improving generalization capability by using domain-specific information implicit in the training signals of multiple related tasks, and is achieved by training multiple tasks in parallel using shared representation.
TL: english is called as a whole: transfer Learning, chinese full name: the transfer learning is to transfer the knowledge in one domain (i.e. the source domain) to another domain (i.e. the target domain) so that the target domain can obtain better learning effect.
LR: english is called as a whole: logitics Regression, chinese full name: and (6) performing logistic regression.
GBDT: english is called as a whole: the Gradient Boosting Decision Tree is named in Chinese: the decision tree is boosted by a gradient.
In the present application, a training method and apparatus for a recommendation model, a recommendation method and apparatus, a computing device and a computer readable storage medium are provided, which are described in detail in the following embodiments one by one.
Fig. 1 is a block diagram illustrating a configuration of a computing device 100 according to an embodiment of the present specification. The components of the computing device 100 include, but are not limited to, memory 110 and processor 120. The processor 120 is coupled to the memory 110 via a bus 130 and a database 150 is used to store data.
Computing device 100 also includes access device 140, access device 140 enabling computing device 100 to communicate via one or more networks 160. Examples of such networks include the Public Switched Telephone Network (PSTN), a Local Area Network (LAN), a Wide Area Network (WAN), a Personal Area Network (PAN), or a combination of communication networks such as the internet. Access device 140 may include one or more of any type of network interface (e.g., a Network Interface Card (NIC)) whether wired or wireless, such as an IEEE802.11 Wireless Local Area Network (WLAN) wireless interface, a worldwide interoperability for microwave access (Wi-MAX) interface, an ethernet interface, a Universal Serial Bus (USB) interface, a cellular network interface, a bluetooth interface, a Near Field Communication (NFC) interface, and so forth.
In one embodiment of the present description, the above components of the computing device 100 and other components not shown in fig. 1 may also be connected to each other, for example, through a bus. It should be understood that the block diagram of the computing device architecture shown in FIG. 1 is for purposes of example only and is not limiting as to the scope of the description. Those skilled in the art may add or replace other components as desired.
Computing device 100 may be any type of stationary or mobile computing device, including a mobile computer or mobile computing device (e.g., tablet, personal digital assistant, laptop, notebook, netbook, etc.), a mobile phone (e.g., smartphone), a wearable computing device (e.g., smartwatch, smartglasses, etc.), or other type of mobile device, or a stationary computing device such as a desktop computer or PC. Computing device 100 may also be a mobile or stationary server.
Wherein the processor 120 may perform the steps of the method shown in fig. 2. Fig. 2 shows a schematic flow chart of a training method of a recommendation model according to an embodiment of the present specification, including step 202 to step 206.
Step 202: user characteristics of at least two sample users and attribute characteristics of at least two sample applications are obtained.
For example, the user characteristics of at least two sample users and the attribute characteristics of at least two sample applications within a preset time period may be obtained, where the preset time period may include 60 days, 120 days, and the like, and the preset time period is set according to actual requirements, which is not limited in this application.
In practical applications, the sample user and the sample application may carry a user identifier and an application identifier, where the identifier is unique identification information for distinguishing each sample user from each sample application, for example, a unique special character string or a special mark set for each sample user or each sample application and playing a role in identification.
The sample user is a sample user of a recommended sample application, including but not limited to an office-type, game-type, entertainment-type sample application.
The user features and the attribute features each include offline features and real-time features, wherein the offline features include historical features of the collected sample users and the sample applications, and the real-time features include features of the collected sample users and the sample applications at the time of events.
Wherein, the offline characteristics of the user characteristics include but are not limited to basic portrait-like characteristics of the user, such as the user's age, sex, constellation, occupation, education level, life stage, etc.; the wealth characteristics of the user, such as income, purchasing power, the probability of having a room, the probability of having a car and the like of the user; location characteristics of the user, such as the user's place of birth, place of employment, place of home, and premise, etc.; user behavior characteristics such as the exposure number, click number and click rate of the user to the application program and other characteristics such as interest preference characteristics of the user, search question or query characteristics, activeness characteristics, historical transaction characteristics, real-time red envelope characteristics and the like. Real-time features of the user features include, but are not limited to: the scene characteristics of the user, such as the user channel source, etc., that is, in practical applications, the sample application program that the sample user likes can be determined according to the skip channel source of the sample user.
The offline features of the attribute features include, but are not limited to, basic attribute features of the application, such as classification, price, rating, number of comments, ranking, language, etc., statistical class features of the application, such as exposure pv (pageview, exposure), exposure uv (exposure), click pv (click count), click uv (click count), pv click rate, uv click rate (click count/exposure count), etc., for approximately 1/3/7/15/30/90 days of the application. Real-time features of the attribute features include, but are not limited to: scene characteristics such as the current hour and week of the application, that is, in practical applications, the sample application that is preferred by the sample user can be determined according to the current time and week.
In practical applications, the sample application may be obtained from an ODPS (Open Data Processing Service), in which the original full amount of applications is stored.
Step 204: based on the user features and the attribute features, positive samples clicked and purchased by the sample user on the exposed sample application and negative samples clicked but not purchased or clicked by the sample user on the exposed sample application are generated.
Wherein the exposed sample application is a sample application that is exposed to a sample user.
In order to estimate the exposure conversion rate of each sample user to each exposure sample application program, 60-day sample users and sample application programs clicked or not clicked after exposure for the sample users can be obtained, then analysis is performed according to user characteristics of the sample users and attribute characteristics of the exposure sample application programs, positive and negative samples (label) clicked after exposure and positive and negative samples (label) after exposure are formed, then a sample set comprising at least one positive sample and one negative sample is generated after the user characteristics and the attribute characteristics are spliced, and each piece of positive and negative sample data in the sample set can be expressed as (features, label).
In practical applications, the post-exposure click positive-negative sample and the post-exposure conversion positive-negative sample (label) may be generated by combining the user identifier (user _ id) of the sample user and the application identifier (item _ id) of the sample application, and the post-exposure click positive-negative sample (y 1= 1) is generated if the sample application is exposed and the sample user clicks, and the post-exposure click negative-sample (y 1= 0) is generated if the sample application is exposed but the sample user does not click, and the post-exposure conversion positive sample (y 2= 1) is generated if the sample application is exposed and the sample user clicks and purchases, and the post-exposure sample user does not click or purchases after clicking is generated as the exposure conversion negative-sample (y 2= 0), which may be expressed as (user _ id, item _ id, y1, y 2).
And then, after the user characteristic and the attribute characteristic are spliced according to the user _ id and the itemID, generating a sample set comprising at least one positive sample and one negative sample, wherein each piece of positive and negative sample data in the sample set can be represented as (user _ id, itemID, features, y1, y 2).
Step 206: training a recommendation model based on a sample set comprising at least one positive sample and one negative sample to obtain the recommendation model, wherein the recommendation model outputs exposure conversion rate obtained by each sample user based on click rate and purchase rate of each exposed sample application program.
In one or more embodiments of the present specification, before training the recommendation model based on a sample set including at least one positive sample and one negative sample, the method further includes:
and screening the sample set into a training sample set comprising at least one positive sample and one negative sample and a test sample set comprising at least one positive sample and one negative sample based on a preset screening rule.
The preset screening rule includes, but is not limited to, selecting positive and negative sample data of a preset number in a preset time length from the generated sample set as a test sample set, and taking the rest of the positive and negative sample data as a training sample set. For example, the preset screening rule may include extracting 20 ten thousand positive and negative sample data in the positive and negative sample data of the last day from the generated 1.4 hundred million sample sets as a test sample set, and using the rest positive and negative sample data as a training sample set. In practical application, because the ratio of exposure to conversion of the application program is low, if the number of positive and negative samples in the test sample set is too small, the number of positive samples converted by exposure of the application program is small, and the evaluation may be inaccurate, so that it is more appropriate to select 150 ten thousand positive and negative samples as the test sample set.
The recommendation model is obtained by training a recommendation model based on a sample set comprising at least one positive sample and one negative sample, the recommendation model outputs exposure conversion rate obtained by each sample user based on click rate and purchase rate of a sample application program for each exposure, namely the recommendation model is obtained by training the recommendation model based on the training sample set comprising at least one positive sample and one negative sample, and the recommendation model outputs exposure conversion rate obtained by each sample user based on click rate and purchase rate of the sample application program for each exposure.
In practical applications, after training the recommendation model based on the sample set including at least one positive sample and one negative sample, the method further includes:
the recommendation model is tested based on a set of test samples including at least one positive sample and one negative sample.
The recommendation model is tested based on a test sample set comprising at least one positive sample and one negative sample, namely, the test sample set is input into the recommendation model, so that each sample user of the recommendation model obtains exposure conversion rate based on click rate and purchase rate of each exposed sample application program. Wherein the recommending model outputs the exposure conversion rate obtained by each sample user based on the click rate and the purchase rate of the sample application program for each exposure, and comprises:
the click rate estimation part of the recommendation model outputs the click rate of each sample user to each exposed sample application program;
a purchase rate estimation part of the recommendation model outputs the purchase rate of each sample user to each exposed sample application program;
determining an exposure conversion rate for each sample user for each exposed sample application based on the click through rate and the purchase rate.
The recommendation model includes, but is not limited to, a deep fm multitask learning model, and in one or more embodiments of the present specification, the recommendation model is described as the deep fm multitask learning model.
The deep FM multi-task learning model is to apply multi-task learning MTL in the deep FM model, wherein the deep FM model combines the advantages of DNN and FM and can simultaneously extract the combined features of low order and high order. Wherein, the FM part extracts low order combination features, including: linear combination of first-order features (weight and feature dot product), second-order cross features (implicit vector inner product). The Deep part extracts high-order combination characteristics. Meanwhile, FM and Deep share the input and embedding vectors. Specifically, the prediction result of the deep fm model is expressed by formula (1):
Figure BDA0001963151300000161
wherein, the output formula of FM is expressed as formula (2):
Figure BDA0001963151300000162
the output formula of DNN is expressed as formula (3):
yDN N=σ(W |H|+1 ·a H +b |H|+1 ) (3)
as shown in fig. 3, a network structure of the Deep FM model is provided, the Deep FM model is first divided into a Deep neural network part and an FM factorizer part, the Deep neural network part may adopt a fully-connected feedforward neural network DNN, the DNN and the FM divide input user features and attribute features into a plurality of feature groups, each feature group corresponds to one embedding (embedding) vector, wherein a feature splicing layer (concat) of the Deep neural network part splices all the embedding vectors, and then a fully-connected layer (Fc (relu)) of two layers is added to realize combination of high-order features; the FM factorization machine carries out weighted summation (addition) on input original feature input such as user features and attribute features, and extracts feature combinations through an embedding vector inner product of each dimension to realize the combination of low-order features; and finally, combining the outputs of the Deep neural network and the FM factorization machine to obtain a prediction result (sigmoid), namely a click rate prediction result.
The multi-task learning MTL enables the DeepFM model to better summarize the original task by sharing the characterization between related tasks. It is also an inductive migration mechanism, the main goal being to improve generalization capability with domain-specific information implicit in the training signals of multiple related tasks, multi-task learning accomplishes this goal by training multiple tasks in parallel using a shared representation.
In practical applications, the recommended application program for the user may go through a plurality of steps, such as exposure, clicking, and final purchase. The goal of using the deep fm multitask learning model was to increase the final uv exposure conversion (CTCVR = number of purchases/number of exposures). The calculation formula of the exposure conversion rate (CTCVR) is shown in formula (4), and the calculation formula is obtained by multiplying the CTR (click rate estimation) and the CVR (purchase rate estimation).
Figure BDA0001963151300000171
If a recommendation method for estimating the click rate by adopting a positive and negative sample training click rate (CTR) recommendation model of clicking after exposure (namely, sorting is carried out according to pCTR during recommendation), conversion data after clicking is not fully utilized, the click rate of some application programs is high, but the purchase rate after clicking is very low, so that the total CTCVR is not high. In addition, a CVR model is trained to predict the purchase rate of the user after clicking, and then pCTCVR is obtained from pCTR pCVR for sorting. However, the direct training of the CVR model only uses a sample set clicked by a user, the positive and negative sample data volume is usually one to two orders of magnitude less than that of the exposed sample set, and the sample data sparsity problem is faced.
Referring to fig. 4, in an embodiment of the present description, a deep fm multitask learning model is used. The DeepFM multi-task learning model is mainly divided into a CTR part and a CVR part, the DeepFM multi-task learning model divides input bottom-layer feature features into a plurality of feature groups, each feature group corresponds to an embedding (embedding) vector, and the CTR part and the CVR part share the bottom-layer feature features and the embedding vectors. From fig. 4, it can be seen that the CTR and the CVR are each a deep fm model, and all the embedding vectors are embedded by using a feature embedding layer (concat), and then a full connection layer (Fc (relu)) of two layers is added to realize combination of higher-order features and feature combination is extracted by embedding vector inner product of each dimension to realize combination of lower-order features, so as to obtain a click rate prediction result and a purchase rate prediction result, wherein the purchase rate is a purchase rate after exposure, and finally the click rate prediction result and the purchase rate prediction result are multiplied to obtain an exposure conversion rate prediction result (i.e., pCTCVR). The penalty function for the DeepFM multitask learning model is shown in equation (5), where yi denotes click or not, zi denotes buy or not, which optimizes both the exposure click rate and the exposure purchase rate, and pCVR is only an intermediate node of the DeepFM multitask learning model. And finally, when recommendation is performed, sorting in a reverse order according to the pCTCVR, and recommending the top N exposed application programs to the user.
Figure BDA0001963151300000181
In one or more embodiments of the present specification, a method for training a recommendation model is provided, where user characteristics of at least two sample users and attribute characteristics of at least two sample applications are obtained; generating a positive sample clicked and purchased by the sample user on the exposed sample application program and a negative sample clicked but not purchased or clicked by the sample user on the exposed sample application program based on the user characteristics and the attribute characteristics; training a recommendation model based on a sample set comprising at least one positive sample and one negative sample to obtain the recommendation model, wherein the recommendation model outputs exposure conversion rate obtained by each sample user based on click rate and purchase rate of each exposed sample application program. When the deep learning-based DeepFM multi-task learning model is trained, data such as exposed application programs purchased or not clicked by sample users after clicking, automatic feature cross combination capability of the DeepFM multi-task learning model and conversion features of the exposed application programs after clicking are fully utilized, the problem of feature sparsity is solved, and the recommendation effect of the exposed application programs is greatly improved.
The processor 120 may also perform the steps of the method shown in fig. 5. Fig. 5 shows a schematic flow chart of a recommendation method according to an embodiment of the present specification, including steps 502 to 510.
Step 502: receiving a recommendation request of a user to be recommended for an exposed application program, wherein the user to be recommended carries a user identifier.
The user to be recommended is a user of the application program to be recommended and exposed; the exposed application is the application which is exposed and recommended to the user to be recommended.
The user identifier is unique identification information for distinguishing each user to be recommended, for example, a unique special character string or a special mark which is set for each user to be recommended and has an identification function.
In one or more embodiments of the present specification, before receiving an application recommendation request for exposure of a user to be recommended, the method further includes:
acquiring a plurality of application programs carrying identifications;
and screening the plurality of application programs based on a first preset condition, and determining at least two application programs to be recommended.
The application program to be recommended is an application program waiting to be recommended to the user. The first preset condition includes, but is not limited to, selecting an application program within a preset interval, for example, the preset interval is 1 to 200, and the first preset condition may be selecting an application program with a price within an interval of 1 to 200. The first preset condition may further include rejecting yellow gambling poison related applications or poor quality applications, and the like, wherein the poor quality applications may be determined by the scores and the number of comments of the applications, for example, lower limit values of the scores and the number of comments are set, and applications with a score less than 3 and a score less than 200 are identified as the poor quality applications.
The first preset condition may be set according to an actual requirement, which is not limited in this application. After the application programs are screened by the first preset condition, the number of the application programs is reduced, and the recommendation quality of the application programs is guaranteed while the subsequent workload is reduced.
Taking the first preset condition as an example of selecting the application programs with the price in the range of 1 to 200 yuan, screening the plurality of application programs based on the first preset condition, and determining at least two application programs to be recommended, namely screening the application programs with the price in the range of 1 to 200 yuan as the application programs to be recommended.
In practical application, all application programs are placed in the ODPS, the problem of mass data calculation of a user can be solved more quickly, a plurality of application programs to be recommended can be selected quickly from the plurality of application programs, enterprise cost can be effectively reduced, and data safety is guaranteed.
Step 504: and determining at least two applications to be recommended matched with the user to be recommended based on the user identification.
The application program to be recommended may be regarded as an initial application program displayed to the user, and the exposure conversion rate of each application program to be recommended may be obtained when performing recommendation model prediction subsequently.
Specifically, the method for screening the plurality of application programs based on a first preset condition and after determining at least two application programs to be recommended further includes:
and matching the user to be recommended with the at least two application programs to be recommended based on a preset matching rule, wherein the user to be recommended carries a user identifier.
The preset matching rules include, but are not limited to, recommending top-ranked popular applications to be recommended in each category (denoted hot), recommending applications to be recommended according to preference of each user for the Taobao category (denoted U2C 2I), grouping users according to gender/age/city/purchasing power/interest tags, recommending applications to be recommended clicked by the same group of users (denoted U2G 2I), and/or recommending applications to be recommended similar to applications clicked by the users (denoted Item-CF).
Taking the preset matching rule as an example of recommending top-ranked application programs of each category, and matching the user to be recommended with the at least two application programs to be recommended based on the preset matching rule; namely, based on the user identification and the identification of the application program, the popular application program named next ten in the game category recommended by the user to be recommended is the application program to be recommended, so that the user to be recommended is matched with the recommended application program to be recommended.
After the preset matching rules are matched, each user to be recommended corresponds to dozens or hundreds of application programs to be recommended, and then the application programs to be recommended matched with each user to be recommended are recorded in a database, such as an HBase database, based on the user identification of each user to be recommended and the identification of the application programs to be recommended. After a recommendation request of an exposure application program of a user to be recommended is received, the application program to be recommended matched with the user identification of the user to be recommended can be inquired in the HBase database based on the user identification of the user to be recommended.
Step 506: and extracting the user characteristics of the user to be recommended and the attribute characteristics of the at least two application programs to be recommended.
The user characteristics and the attribute characteristics comprise offline characteristics and real-time characteristics, wherein the offline characteristics comprise collected historical characteristics of the user to be recommended and the application program to be recommended, and the real-time characteristics comprise collected characteristics of the user to be recommended and the application program to be recommended at the current moment.
The extracted user features and the extracted attribute features may refer to the above embodiments, which are not described herein again.
Step 508: and inputting the user characteristics and the attribute characteristics into a pre-trained recommendation model to obtain the exposure conversion rate of the user to be recommended to each matched application program to be recommended.
The recommendation model comprises a deep FM multi-task learning model, and the deep FM multi-task learning model comprises a click rate estimation part and a purchase rate estimation part.
In practical application, the extracted user characteristics and the offline characteristics of the attribute characteristics are synchronized into an HBase database, when the recommendation model predicts the exposure conversion rate of the user to be recommended to each matched application program to be recommended, the user characteristics of the user to be recommended and the offline characteristics of each matched application program to be recommended can be directly extracted from the HBase database in real time, then the exposure conversion rate prediction is performed by combining the real-time extracted real-time characteristics, when the exposure conversion rate of the user to be recommended to each matched application program to be recommended is predicted by adopting the recommendation model, the attribute characteristics are recorded to form a characteristic log and flow back to an ODPS, then the recommendation model is trained in an offline manner based on the characteristic log, and the trained recommendation model is updated to realize continuous optimization of the recommendation model.
Step 510: recommending at least one application program to be recommended in the at least two applications programs to be recommended to a user to be recommended as an exposed application program based on the exposure conversion rate.
In one or more embodiments of the present specification, recommending at least one exposed application of the at least two exposed applications to the user to be recommended based on the exposure conversion includes:
in one or more embodiments of the present specification, recommending, to a user to be recommended, at least one application to be recommended of the at least two applications to be recommended as an exposed application based on the exposure conversion rate includes:
sorting the at least two applications to be recommended based on the exposure conversion rate;
and selecting at least one to-be-recommended application program in the at least two to-be-recommended application programs based on a preset recommendation condition to be recommended to the to-be-recommended user as an exposed application program.
The sorting comprises but is not limited to descending sorting, and the preset recommendation condition comprises but is not limited to selecting the application to be recommended with the top 30 rank.
In practical application, the at least two application programs to be recommended are sorted based on the exposure conversion rate;
and selecting at least one to-be-recommended application program in the at least two to-be-recommended application programs based on a preset recommendation condition to be recommended to the to-be-recommended user as an exposed application program.
That is, the at least two applications to be recommended may be sorted in a descending order based on the exposure conversion rate, and then the application to be recommended that is ranked first 30 is recommended to the user to be recommended as the exposed application, and the application to be recommended that is ranked first 30 is the application that is actually exposed and recommended to the user.
In another implementation manner, after the sorting the at least two applications to be recommended based on the exposure conversion rate, the method further includes:
and screening the at least two applications to be recommended based on the second preset condition.
The second preset condition may include, but is not limited to, selecting and eliminating applications in a preset blacklist.
Under the condition that the at least two applications to be recommended are screened based on the second preset condition, selecting at least one application to be recommended from the at least two applications to be recommended as an exposed application to be recommended to a user to be recommended based on a preset recommendation condition includes:
and selecting at least one application program to be recommended from the at least two application programs to be recommended after screening as an exposed application program to be recommended to the user to be recommended based on a preset recommendation condition.
The at least two application programs to be recommended are sorted in a descending order based on the exposure conversion rate, then the at least two application programs to be recommended after the sorting in the descending order are matched with the application programs in a preset blacklist, if the matched application programs to be recommended exist, the application programs to be recommended are removed from the queue in the descending order, then the first 30 or the first 20 application programs to be recommended with the highest exposure conversion rate are selected as the exposure application programs to be recommended to the users to be recommended, so that the optimal application programs to be recommended are recommended to the users to be recommended as the exposure application programs, and the user experience is improved.
In actual use, when an exposed application program recommendation request of a user to be recommended is received, firstly, an application program to be recommended which is matched with the user to be recommended after being screened is obtained, then, the attribute characteristics of the application program to be recommended and the user characteristics of the matched user to be recommended are input into a pre-trained recommendation model, and the exposure conversion rate of the user to be recommended to each application program to be recommended is obtained.
And finally, sequencing the application programs to be recommended corresponding to the users to be recommended based on the exposure conversion rate, and selecting the application programs to be recommended with the ranking of 50 th or 60 th as the final exposed application program to be recommended based on the requirement and finally displaying the final exposed application program to the users to be recommended.
According to the recommendation method provided by one or more embodiments of the specification, firstly, obtained application programs are screened, namely, selected products are filtered, high-quality application programs to be recommended are screened out, then strategies such as hot/U2C2I/U2G2I/Item-CF are utilized to match a user with the application programs to be recommended, the problem that a tag system is established by using a large amount of manpower is avoided, the application programs to be recommended which are possibly clicked by the user can be better covered by various matching strategies, online real-time exposure conversion rate prediction is carried out by adopting a deep learning recommendation model according to the user to be recommended and the matched application programs to be recommended, the appropriate application programs to be recommended are recommended to the user to be recommended as final exposed application programs through exposure conversion rate, real-time characteristics are effectively utilized, and the recommendation effect is improved.
In practical application, the trained recommendation model in the description needs to be deployed to an online server to perform online real-time scoring on an application program. Generally, an arks platform is adopted, which provides high-performance online sequencing and real-time pre-estimation service, is highly available, and realizes multiple functions such as load balancing, remote disaster recovery and the like. When a user requests application program recommendation, a retrieval module is required to retrieve hundreds of candidate application programs, namely application programs to be recommended, which are previously matched for the user in a matching stage from an HBase according to user identification user _ id. And the offline and real-time characteristics of the user and the application programs to be recommended are scored for the recommendation model in real time, so that the exposure conversion rate of the user to each application program to be recommended is obtained. And finally, sorting in a descending order according to the exposure conversion rate of the application programs to be recommended, and recommending and displaying the 30 application programs to be recommended which are ranked at the top as final exposed application programs to the user. In addition, in order to avoid recommending some unsuitable exposed applications to the user, a blacklist filtering mechanism can be further arranged, and some exposed applications of badcase can be filtered out urgently.
Referring to fig. 6, one or more embodiments of the present specification provide a training apparatus for a recommendation model, including:
a first obtaining module 602 configured to obtain user characteristics of at least two sample users and attribute characteristics of at least two sample applications;
a generating module 604 configured to generate a positive sample clicked and purchased by a sample user on the exposed sample application and a negative sample clicked but not purchased or clicked by the sample user on the exposed sample application based on the user characteristics and the attribute characteristics;
a training module 606 configured to train a recommendation model based on a sample set comprising at least one positive sample and one negative sample, resulting in the recommendation model, the recommendation model outputting an exposure conversion rate for each sample user based on a click through rate and a purchase rate for a sample application for each exposure.
Optionally, the apparatus further comprises:
a first screening module configured to screen the set of samples into a training sample set including at least one positive sample and one negative sample and a test sample set including at least one positive sample and one negative sample based on a preset screening rule.
Optionally, the training module is further configured to:
training a recommendation model based on the set of training samples comprising at least one positive sample and one negative sample.
Optionally, the apparatus further comprises:
a testing module configured to test the recommendation model based on a set of test samples including at least one positive sample and one negative sample.
Optionally, the recommendation model includes a deep fm multitask learning model, and the deep fm multitask learning model includes a click rate estimation part and a purchase rate estimation part.
Optionally, the training module 606 comprises:
the first output sub-module is configured to output the click rate of each sample user to each exposed sample application program by the click rate estimation part of the recommendation model;
a second output sub-module configured to output a purchase rate of each sample user for each exposed sample application program by a purchase rate estimation part of the recommendation model;
a determination sub-module configured to determine an exposure conversion rate for each sample user for a sample application for each exposure based on the click-through rate and the purchase rate.
Optionally, the user characteristics and the attribute characteristics each include offline characteristics and real-time characteristics, wherein the offline characteristics include historical characteristics of the collected sample users and the sample applications, and the real-time characteristics include characteristics of the collected sample users and the sample applications at the time of events.
When the training device for the recommendation model provided by one or more embodiments of the specification trains the multitask learning model based on the deep learning model deep fm, data such as an exposed application program purchased or not clicked by a sample user after clicking, automatic feature cross combination capability of the multitask learning model of deep fm and conversion features of the exposed application program after clicking are fully utilized, the problem of feature sparsity is solved, and the recommendation effect of the exposed application program is greatly improved.
The above is a schematic scheme of a training apparatus for recommending a model according to this embodiment. It should be noted that the technical solution of the training apparatus for the recommended model and the technical solution of the training method for the recommended model belong to the same concept, and details that are not described in detail in the technical solution of the training apparatus for the recommended model can be referred to the description of the technical solution of the training method for the recommended model.
Referring to fig. 7, one or more embodiments of the present specification further provide a recommendation apparatus including:
a receiving module 702, configured to receive a recommendation request of a user to be recommended for an exposed application program, where the user to be recommended carries a user identifier;
a determining module 704, configured to determine at least two applications to be recommended that match the user to be recommended based on the user identification;
an extracting module 706 configured to extract the user characteristics of the user to be recommended and the attribute characteristics of the at least two applications to be recommended;
an obtaining module 708, configured to input the user characteristics and the attribute characteristics into a pre-trained recommendation model, and obtain an exposure conversion rate of each matched application program to be recommended by the user to be recommended;
a first recommending module 710 configured to recommend at least one application to be recommended of the at least two applications to be recommended as an exposed application to a user to be recommended based on the exposure conversion rate.
Optionally, the apparatus further comprises:
the second acquisition module is configured to acquire a plurality of application programs carrying the identifiers;
the second screening module is configured to screen the plurality of application programs based on a first preset condition, and determine at least two application programs to be recommended.
Optionally, the apparatus further comprises:
the matching module is configured to match a user to be recommended with the at least two application programs to be recommended based on a preset matching rule, wherein the user to be recommended carries a user identifier.
Optionally, the recommendation model includes a deep fm multitask learning model, and the deep fm multitask learning model includes a click rate estimation part and a purchase rate estimation part.
Optionally, the first recommending module 710 includes:
a ranking submodule configured to rank the at least two applications to be recommended based on the exposure conversion;
and the second recommending submodule is configured to select at least one to-be-recommended application program in the at least two to-be-recommended application programs to be recommended to the to-be-recommended user as an exposed application program based on a preset recommending condition.
Optionally, the apparatus further comprises:
the third screening module is configured to screen the at least two applications to be recommended based on the second preset condition;
a second recommendation sub-module further configured to:
and selecting at least one application program to be recommended from the at least two application programs to be recommended after screening as an exposed application program to be recommended to the user to be recommended based on a preset recommendation condition.
Optionally, the user features and the attribute features both include offline features and real-time features, where the offline features include collected historical features of the user to be recommended and the application to be recommended, and the real-time features include collected features of the user to be recommended and the application to be recommended at the current time.
According to the recommendation device provided by one or more embodiments of the specification, firstly, obtained application programs are screened, namely, selected products are filtered, high-quality application programs to be recommended are screened out, then strategies such as hot/U2C2I/U2G2I/Item-CF are utilized to match a user with the application programs to be recommended, the problem that a tag system is established by using a large amount of manpower is avoided, the application programs to be recommended which are possibly clicked by the user can be better covered by various matching strategies, online real-time exposure conversion rate prediction is carried out by adopting a deep learning recommendation model according to the user to be recommended and the matched application programs to be recommended, the appropriate application programs to be recommended are recommended to the user to be recommended as final exposed application programs through exposure conversion rate, real-time characteristics are effectively utilized, and the recommendation effect is improved.
The above is a schematic solution of a recommendation device of the present embodiment. It should be noted that the technical solution of the recommendation apparatus and the technical solution of the recommendation method described above belong to the same concept, and for details that are not described in detail in the technical solution of the recommendation apparatus, reference may be made to the description of the technical solution of the recommendation method described above.
The matching rules, user characteristics and the deep fm multitask learning model provided in the specification were compared online using the AB test platform of the darwinian laboratory. The following comparisons are obtained as results, and the main comparison indexes are as follows: UV click rate (number of clicks/number of exposures) and UV exposure conversion rate (number of purchasers/number of exposures).
1. Matching rules experiment
Compared with the common matching by using Hot and U2C2I matching rules, the UV click rate is improved by 9.93% (15.11% - > 16.61%).
Compared with common matching of Hot and U2C2I, the UV click rate is improved by 10.86% (28.18% - > 31.25%) when the Hot, U2C2I, U2G2I and Item-CF matching rules are matched together.
Experiments fully show that the UV click rate can be obviously improved by adopting a plurality of matching rules for matching.
2. Characteristic experiment
With the increased interest-biased feature, the exposure conversion was increased by 1.93% (15.78% - > 16.08%) over the no interest-biased feature. The click purchase behavior of the user is explained to be closely related to the interest preference of the user.
3. Model experiment
Firstly, the CTR model and the fractional multiplication mode of two independent CTR models and the CVR model are compared in an experiment, both the CTR model and the CVR model adopt DeepFM, and the model structure and the input characteristics are completely the same. The data of the experimental results are shown in table 1, and it can be seen from table 1 that the UV click rate is decreased and the purchase rate after UV click is increased by multiplying the scores of two separate CTR and CVR models, but the final UV exposure conversion rate is worse than that of the CTR model alone.
TABLE 1
Figure BDA0001963151300000301
And performing an online comparison experiment on the CTR model and the DeepFM-based multitask learning model, wherein input characteristics, the DeepFM model structure and the sample data size of the two models are completely the same. The experimental results are shown in table 2 below, and based on the deepFM-based CTR model, it can be seen that the DeepFM-based multitask learning model is 3.73% worse in UV click rate than the CTR model, the purchase rate after UV click is increased by 5.01%, and the total UV exposure conversion rate is increased by 1.09%. Experiments fully show that the DeepFM-based multitask learning model can bring about the improvement of the conversion rate of UV exposure.
TABLE 2
Figure BDA0001963151300000302
An embodiment of the present application further provides a computer-readable storage medium, which stores computer instructions, when executed by a processor, for implementing the method for training a recommendation model or the steps of the recommendation method as described above.
The above is an illustrative scheme of a computer-readable storage medium of the embodiment. It should be noted that the technical solution of the storage medium belongs to the same concept as the above-mentioned training method of the recommendation model or the above-mentioned technical solution of the recommendation method, and details that are not described in detail in the technical solution of the storage medium can be referred to the above-mentioned description of the training method of the recommendation model or the above-mentioned technical solution of the recommendation method.
The foregoing description has been directed to specific embodiments of this disclosure. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims can be performed in a different order than in the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
The computer instructions comprise computer program code which may be in the form of source code, object code, an executable file or some intermediate form, or the like. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U.S. disk, removable hard disk, magnetic diskette, optical disk, computer Memory, read-Only Memory (ROM), random Access Memory (RAM), electrical carrier wave signal, telecommunications signal, and software distribution medium, etc. It should be noted that the computer readable medium may contain content that is subject to appropriate increase or decrease as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media does not include electrical carrier signals and telecommunications signals as is required by legislation and patent practice.
It should be noted that for simplicity and convenience of description, the above-described method embodiments are described as a series of combinations of acts, but those skilled in the art will appreciate that the present application is not limited by the order of acts, as some steps may, in accordance with the present application, occur in other orders and/or concurrently. Further, those skilled in the art should also appreciate that the embodiments described in the specification are preferred embodiments and that the acts and modules referred to are not necessarily required in this application.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
The preferred embodiments of the present application disclosed above are intended only to aid in the explanation of the application. Alternative embodiments are not exhaustive and do not limit the invention to the precise embodiments described. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the application and the practical application, to thereby enable others skilled in the art to best understand and utilize the application. The application is limited only by the claims and their full scope and equivalents.

Claims (26)

1. A training method of a recommendation model is characterized by comprising the following steps:
acquiring user characteristics of at least two sample users and attribute characteristics of at least two sample application programs;
generating a positive sample clicked and purchased by the sample user on the exposed sample application program and a negative sample clicked but not purchased or clicked by the sample user on the exposed sample application program based on the user characteristics and the attribute characteristics;
the method comprises the steps of training a recommendation model based on a sample set comprising at least one positive sample and at least one negative sample to obtain the recommendation model, outputting exposure conversion rate obtained by each sample user based on click rate and purchase rate of each exposed sample application program by the recommendation model, wherein the recommendation model is a DeepFM multi-task learning model, the DeepFM multi-task learning model comprises a click rate estimation part and a conversion rate estimation part, the click rate estimation part and the conversion rate estimation part share bottom layer features and embedded vectors, the click rate estimation part and the purchase rate estimation part adopt feature splicing layers to splice the embedded vectors, two full-connection layers are adopted to combine spliced high-order features, low-order feature combination is carried out on the embedded vector inner product of each dimension, and click rate prediction results and purchase rate prediction results are obtained according to the combination of the high-order features and the combination of the low-order features.
2. The method of claim 1, wherein prior to training the recommendation model based on a set of samples comprising at least one positive sample and one negative sample, further comprising:
and screening the sample set into a training sample set comprising at least one positive sample and one negative sample and a test sample set comprising at least one positive sample and one negative sample based on a preset screening rule.
3. The method of claim 2, wherein training the recommendation model based on a set of samples comprising at least one positive sample and one negative sample comprises:
training a recommendation model based on the set of training samples comprising at least one positive sample and one negative sample.
4. The method of claim 3, wherein after training the recommendation model based on a set of samples comprising at least one positive sample and one negative sample, further comprising:
the recommendation model is tested based on a set of test samples including at least one positive sample and one negative sample.
5. The method of claim 1, wherein the recommending model outputting the exposure conversion rate obtained by each sample user based on the click through rate and purchase rate of the sample application for each exposure comprises:
the click rate estimation part of the recommendation model outputs the click rate of each sample user to each exposed sample application program;
a conversion rate estimation part of the recommendation model outputs the purchase rate of each sample user to each exposed sample application program;
determining an exposure conversion rate for each sample user for each exposed sample application based on the click through rate and the purchase rate.
6. The method of claim 1, wherein the user characteristics and the attribute characteristics each comprise offline characteristics and real-time characteristics, wherein the offline characteristics comprise historical characteristics of the sample user and the sample application collected, and the real-time characteristics comprise characteristics of the sample user and the sample application collected at the time of the event.
7. A recommendation method, comprising:
receiving a recommendation request of a user to be recommended for an exposed application program, wherein the user to be recommended carries a user identifier;
determining at least two applications to be recommended which are matched with the user to be recommended based on the user identification;
extracting the user characteristics of the user to be recommended and the attribute characteristics of the at least two application programs to be recommended;
inputting the user characteristics and the attribute characteristics into a pre-trained recommendation model to obtain exposure conversion rate of each matched application program to be recommended by the user to be recommended, wherein the recommendation model is a deep FM multi-task learning model, the deep FM multi-task learning model comprises a click rate estimation part and a conversion rate estimation part, the click rate estimation part and the conversion rate estimation part share bottom layer characteristics and embedded vectors, the click rate estimation part and the purchase rate estimation part adopt a characteristic splicing layer to splice the embedded vectors, two layers of full connection layers are adopted to combine spliced high-order characteristics, and a click rate prediction result and a purchase rate prediction result are obtained according to the combination of the high-order characteristics and the combination of the low-order characteristics by combining the low-order characteristics in each dimension of the embedded vector;
recommending at least one application program to be recommended in the at least two applications programs to be recommended to a user to be recommended as an exposed application program based on the exposure conversion rate.
8. The method of claim 7, wherein before receiving the application recommendation request for exposure of the user to be recommended, further comprising:
acquiring a plurality of application programs carrying identifiers;
and screening the application programs carrying the identifiers based on a first preset condition, and determining at least two application programs to be recommended.
9. The method of claim 8, wherein after determining at least two applications to be recommended, further comprising:
and matching the user to be recommended with the at least two application programs to be recommended based on a preset matching rule, wherein the user to be recommended carries a user identifier.
10. The method of claim 7, wherein recommending at least one application to be recommended of the at least two applications to be recommended as an exposed application to a user to be recommended based on the exposure conversion comprises:
sorting the at least two applications to be recommended based on the exposure conversion rate;
and selecting at least one application program to be recommended from the at least two application programs to be recommended as an exposed application program to be recommended to the user to be recommended based on a preset recommendation condition.
11. The method of claim 10, further comprising, after sorting the at least two applications to be recommended based on the exposure conversion,:
screening the at least two application programs to be recommended based on a second preset condition;
selecting at least one application program to be recommended from the at least two application programs to be recommended as an exposed application program to be recommended to a user to be recommended based on a preset recommendation condition, wherein the recommendation comprises the following steps:
and selecting at least one application program to be recommended from the at least two application programs to be recommended after screening as an exposed application program to be recommended to the user to be recommended based on a preset recommendation condition.
12. The method according to claim 7, wherein the user characteristics and the attribute characteristics each include offline characteristics and real-time characteristics, wherein the offline characteristics include collected historical characteristics of the user to be recommended and the application to be recommended, and the real-time characteristics include collected characteristics of the user to be recommended and the application to be recommended at the current moment.
13. An apparatus for training a recommendation model, comprising:
the system comprises a first acquisition module, a second acquisition module and a third acquisition module, wherein the first acquisition module is configured to acquire user characteristics of at least two sample users and attribute characteristics of at least two sample application programs;
a generating module configured to generate a positive sample clicked and purchased by a sample user on the exposed sample application program and a negative sample clicked but not purchased or clicked by the sample user on the exposed sample application program based on the user characteristics and the attribute characteristics;
the recommendation model is a DeepFM multi-task learning model, the DeepFM multi-task learning model comprises a click rate estimation part and a conversion rate estimation part, the click rate estimation part and the conversion rate estimation part share bottom layer features and embedded vectors, the click rate estimation part and the purchase rate estimation part adopt feature splicing layers to splice the embedded vectors, two layers of full connection layers are adopted to combine spliced high-order features, and a click rate prediction result and a purchase rate prediction result are obtained according to the combination of the high-order features and the combination of the low-order features by combining the embedded vector inner products of each dimension.
14. The apparatus of claim 13, further comprising:
a first screening module configured to screen the set of samples into a training sample set including at least one positive sample and one negative sample and a test sample set including at least one positive sample and one negative sample based on a preset screening rule.
15. The apparatus of claim 14, wherein the training module is further configured to:
training a recommendation model based on the set of training samples comprising at least one positive sample and one negative sample.
16. The apparatus of claim 15, further comprising:
a testing module configured to test the recommendation model based on a set of test samples comprising at least one positive sample and one negative sample.
17. The apparatus of claim 13, wherein the training module comprises:
a first output sub-module configured to output a click rate of each sample user to each exposed sample application program by a click rate estimation part of the recommendation model;
a second output sub-module configured to output a purchase rate of each sample user for each exposed sample application program by a purchase rate estimation part of the recommendation model;
a determination sub-module configured to determine an exposure conversion rate for each sample user for a sample application for each exposure based on the click-through rate and the purchase rate.
18. The apparatus of claim 13, wherein the user characteristics and the attribute characteristics each comprise offline characteristics and real-time characteristics, wherein the offline characteristics comprise historical characteristics of the sample user and the sample application collected, and the real-time characteristics comprise characteristics of the sample user and the sample application collected at the time of the event.
19. A recommendation device, comprising:
the recommendation system comprises a receiving module, a recommendation module and a recommendation module, wherein the receiving module is configured to receive a recommendation request of a user to be recommended for an exposed application program, and the user to be recommended carries a user identifier;
the determining module is configured to determine at least two applications to be recommended which are matched with the users to be recommended based on the user identification;
the extraction module is configured to extract the user characteristics of the user to be recommended and the attribute characteristics of the at least two application programs to be recommended;
an obtaining module, configured to input the user characteristics and the attribute characteristics into a pre-trained recommendation model, and obtain exposure conversion rate of each matched application program to be recommended by the user to be recommended, wherein the recommendation model is a deep fm multi-task learning model, the deep fm multi-task learning model includes a click rate estimation part and a conversion rate estimation part, the click rate estimation part and the conversion rate estimation part share bottom layer characteristics and an embedded vector, the click rate estimation part and the purchase rate estimation part are spliced by using a characteristic splicing layer, the spliced high-order characteristics are combined by using two full-connection layers, and a click rate prediction result and a purchase rate prediction result are obtained according to the combination of the high-order characteristics and the combination of the low-order characteristics by combining the low-order characteristics for each dimension of the embedded vector inner products;
the first recommending module is configured to recommend at least one application program to be recommended in the at least two applications programs to be recommended to a user to be recommended as an exposed application program based on the exposure conversion rate.
20. The apparatus of claim 19, further comprising:
the second acquisition module is configured to acquire a plurality of application programs carrying the identifiers;
the second screening module is configured to screen the plurality of application programs carrying the identifiers based on a first preset condition, and determine at least two application programs to be recommended.
21. The apparatus of claim 19, further comprising:
the matching module is configured to match a user to be recommended with the at least two application programs to be recommended based on a preset matching rule, wherein the user to be recommended carries a user identifier.
22. The apparatus of claim 19, wherein the first recommending module comprises:
a ranking submodule configured to rank the at least two applications to be recommended based on the exposure conversion;
and the second recommending submodule is configured to select at least one application program to be recommended from the at least two application programs to be recommended as an exposed application program to be recommended to the user to be recommended based on a preset recommending condition.
23. The apparatus of claim 22, further comprising:
the third screening module is configured to screen the at least two applications to be recommended based on a second preset condition;
a second recommendation sub-module further configured to:
and selecting at least one application program to be recommended from the at least two application programs to be recommended after screening as an exposed application program to be recommended to the user to be recommended based on a preset recommendation condition.
24. The apparatus according to claim 19, wherein the user characteristic and the attribute characteristic each include an offline characteristic and a real-time characteristic, wherein the offline characteristic includes collected historical characteristics of the user to be recommended and the application to be recommended, and the real-time characteristic includes collected characteristics of the user to be recommended and the application to be recommended at a current time.
25. A computing device comprising a memory, a processor and computer instructions stored on the memory and executable on the processor, wherein the processor implements the steps of the method of any one of claims 1-12 when executing the instructions.
26. A computer-readable storage medium storing computer instructions, which when executed by a processor, perform the steps of the method of any one of claims 1 to 12.
CN201910090621.6A 2019-01-30 2019-01-30 Recommendation model training method and device, and recommendation method and device Active CN110008399B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910090621.6A CN110008399B (en) 2019-01-30 2019-01-30 Recommendation model training method and device, and recommendation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910090621.6A CN110008399B (en) 2019-01-30 2019-01-30 Recommendation model training method and device, and recommendation method and device

Publications (2)

Publication Number Publication Date
CN110008399A CN110008399A (en) 2019-07-12
CN110008399B true CN110008399B (en) 2023-04-07

Family

ID=67165588

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910090621.6A Active CN110008399B (en) 2019-01-30 2019-01-30 Recommendation model training method and device, and recommendation method and device

Country Status (1)

Country Link
CN (1) CN110008399B (en)

Families Citing this family (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110363590A (en) * 2019-07-16 2019-10-22 深圳乐信软件技术有限公司 A kind of advertisement recommended method, device, terminal and storage medium
CN110569427B (en) * 2019-08-07 2022-03-01 智者四海(北京)技术有限公司 Multi-target sequencing model training and user behavior prediction method and device
CN110569447B (en) * 2019-09-12 2022-03-15 腾讯音乐娱乐科技(深圳)有限公司 Network resource recommendation method and device and storage medium
CN110837596B (en) * 2019-09-16 2023-02-03 中国平安人寿保险股份有限公司 Intelligent recommendation method and device, computer equipment and storage medium
CN110796513A (en) * 2019-09-25 2020-02-14 北京三快在线科技有限公司 Multitask learning method and device, electronic equipment and storage medium
CN110880124A (en) * 2019-09-29 2020-03-13 清华大学 Conversion rate evaluation method and device
CN110910201B (en) * 2019-10-18 2023-08-29 中国平安人寿保险股份有限公司 Information recommendation control method and device, computer equipment and storage medium
CN112711643B (en) * 2019-10-25 2023-10-10 北京达佳互联信息技术有限公司 Training sample set acquisition method and device, electronic equipment and storage medium
CN110807693A (en) * 2019-11-04 2020-02-18 上海喜马拉雅科技有限公司 Album recommendation method, device, equipment and storage medium
CN110995529B (en) * 2019-11-14 2021-09-21 广州方硅信息技术有限公司 Configuration data testing method and device of recommendation system and electronic equipment
CN112818291B (en) * 2019-11-18 2023-09-01 百度在线网络技术(北京)有限公司 Conversion rate prediction method, conversion rate prediction device, conversion rate prediction equipment and conversion rate prediction medium
CN111027435B (en) * 2019-12-02 2022-12-02 清华大学 Identification system, device and method based on gradient lifting decision tree
CN110889747B (en) * 2019-12-02 2023-05-09 腾讯科技(深圳)有限公司 Commodity recommendation method, device, system, computer equipment and storage medium
CN111131227B (en) * 2019-12-20 2023-04-18 深圳前海微众银行股份有限公司 Data processing method and device
WO2021120226A1 (en) * 2019-12-20 2021-06-24 深圳前海微众银行股份有限公司 Conversion estimation method and apparatus
CN111080413A (en) * 2019-12-20 2020-04-28 深圳市华宇讯科技有限公司 E-commerce platform commodity recommendation method and device, server and storage medium
CN111179031B (en) * 2019-12-23 2023-09-26 第四范式(北京)技术有限公司 Training method, device and system for commodity recommendation model
CN111340522B (en) * 2019-12-30 2024-03-08 支付宝实验室(新加坡)有限公司 Resource recommendation method, device, server and storage medium
CN111259222B (en) * 2020-01-22 2023-08-22 北京百度网讯科技有限公司 Article recommendation method, system, electronic equipment and storage medium
CN113269577B (en) * 2020-02-17 2023-10-13 北京达佳互联信息技术有限公司 Data acquisition method, device, server and storage medium
CN113495985B (en) * 2020-03-18 2024-03-19 北京达佳互联信息技术有限公司 Article recommendation method and device, short video recommendation method and device and server
CN111401963B (en) * 2020-03-20 2022-06-07 支付宝(杭州)信息技术有限公司 Method and device for training user behavior prediction model
CN111242752B (en) * 2020-04-24 2020-08-14 支付宝(杭州)信息技术有限公司 Method and system for determining recommended object based on multi-task prediction
CN113688305A (en) * 2020-05-19 2021-11-23 腾讯科技(深圳)有限公司 Information processing method and device and computer readable storage medium
CN111859133B (en) * 2020-07-21 2023-11-14 有半岛(北京)信息科技有限公司 Recommendation method and release method and device of online prediction model
CN112163676B (en) * 2020-10-13 2024-04-05 北京百度网讯科技有限公司 Method, device, equipment and storage medium for training multitasking service prediction model
CN112381607B (en) * 2020-11-12 2023-11-24 杭州时趣信息技术有限公司 Network commodity ordering method, device, equipment and medium
CN112508613B (en) * 2020-12-09 2024-03-19 天津狮拓信息技术有限公司 Commodity recommendation method and device, electronic equipment and readable storage medium
CN112612768B (en) * 2020-12-11 2022-09-16 上海哔哩哔哩科技有限公司 Model training method and device
CN113190725B (en) * 2021-03-31 2023-12-12 北京达佳互联信息技术有限公司 Object recommendation and model training method and device, equipment, medium and product
CN113222656B (en) * 2021-04-29 2023-11-24 西安点告网络科技有限公司 Programmed advertisement putting method, system, device, equipment and storage medium
CN113240408A (en) * 2021-06-18 2021-08-10 中国银行股份有限公司 Mobile banking APP recommendation method and device
CN113962316A (en) * 2021-10-27 2022-01-21 维沃移动通信有限公司 Model training method and device and electronic equipment
CN116150470A (en) * 2021-11-19 2023-05-23 腾讯科技(深圳)有限公司 Content recommendation method, device, apparatus, storage medium and program product
CN115017400B (en) * 2021-11-30 2023-05-26 荣耀终端有限公司 Application APP recommendation method and electronic equipment
CN115618035B (en) * 2022-10-14 2023-07-18 北京京航计算通讯研究所 Picture recommendation method based on picture exposure conversion rate prediction model
CN116248903A (en) * 2022-12-30 2023-06-09 天翼云科技有限公司 Live broadcast recommendation method and device, electronic equipment and storage medium
CN116525121B (en) * 2023-07-05 2023-09-26 昆明同心医联科技有限公司 Method for establishing primary spring coil recommendation model of embolic aneurysm and application of primary spring coil recommendation model

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106485562A (en) * 2015-09-01 2017-03-08 苏宁云商集团股份有限公司 A kind of commodity information recommendation method based on user's history behavior and system
CN107613022A (en) * 2017-10-20 2018-01-19 广州优视网络科技有限公司 Content delivery method, device and computer equipment
CN108416625A (en) * 2018-02-28 2018-08-17 阿里巴巴集团控股有限公司 The recommendation method and apparatus of marketing product

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180374138A1 (en) * 2017-06-23 2018-12-27 Vufind Inc. Leveraging delayed and partial reward in deep reinforcement learning artificial intelligence systems to provide purchase recommendations

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106485562A (en) * 2015-09-01 2017-03-08 苏宁云商集团股份有限公司 A kind of commodity information recommendation method based on user's history behavior and system
CN107613022A (en) * 2017-10-20 2018-01-19 广州优视网络科技有限公司 Content delivery method, device and computer equipment
CN108416625A (en) * 2018-02-28 2018-08-17 阿里巴巴集团控股有限公司 The recommendation method and apparatus of marketing product

Also Published As

Publication number Publication date
CN110008399A (en) 2019-07-12

Similar Documents

Publication Publication Date Title
CN110008399B (en) Recommendation model training method and device, and recommendation method and device
CN110046952B (en) Recommendation model training method and device, and recommendation method and device
CN108229590B (en) Method and device for acquiring multi-label user portrait
CN109547814B (en) Video recommendation method and device, server and storage medium
CN110033342A (en) A kind of training method and device, a kind of recommended method and device of recommended models
CN109189904A (en) Individuation search method and system
CN110276067B (en) Text intention determining method and device
CN111061946A (en) Scenario content recommendation method and device, electronic equipment and storage medium
CN110363213A (en) The cognitive analysis of image of clothing and classification
CN112669096B (en) Object recommendation model training method and device
CN112348629A (en) Commodity information pushing method and device
US20220138770A1 (en) Method and apparatus for analyzing sales conversation based on voice recognition
CN110008397A (en) A kind of recommended models training method and device
US20190311415A1 (en) Adaptive Multi-Perceptual Similarity Detection and Resolution
CN113076423A (en) Data processing method and device and data query method and device
KR20210062522A (en) Control method, device and program of user participation keyword selection system
CN110555749B (en) Credit behavior prediction method and device based on neural network
CN110197196B (en) Question processing method and device, electronic equipment and storage medium
US20230308360A1 (en) Methods and systems for dynamic re-clustering of nodes in computer networks using machine learning models
CN110377803B (en) Information processing method and device
CN116955591A (en) Recommendation language generation method, related device and medium for content recommendation
CN116663909A (en) Provider risk identification data processing method and device
CN114048294B (en) Similar population extension model training method, similar population extension method and device
CN111915216B (en) Open source software project developer recommendation method based on secondary attention mechanism
CN112785391B (en) Recommendation processing method and device, intelligent equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20201015

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant after: Advanced innovation technology Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Applicant before: Alibaba Group Holding Ltd.

Effective date of registration: 20201015

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant before: Advanced innovation technology Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant