WO2024051707A1 - Recommendation model training method and apparatus, and resource recommendation method and apparatus - Google Patents

Recommendation model training method and apparatus, and resource recommendation method and apparatus Download PDF

Info

Publication number
WO2024051707A1
WO2024051707A1 PCT/CN2023/117102 CN2023117102W WO2024051707A1 WO 2024051707 A1 WO2024051707 A1 WO 2024051707A1 CN 2023117102 W CN2023117102 W CN 2023117102W WO 2024051707 A1 WO2024051707 A1 WO 2024051707A1
Authority
WO
WIPO (PCT)
Prior art keywords
conversion rate
user
resource
target resource
type
Prior art date
Application number
PCT/CN2023/117102
Other languages
French (fr)
Chinese (zh)
Inventor
王喆
梁嘉旺
何旭轩
谢水
洪福兴
田璐鑫
鹿宁
张拓宇
何海乾
Original Assignee
脸萌有限公司
北京有竹居网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 脸萌有限公司, 北京有竹居网络技术有限公司 filed Critical 脸萌有限公司
Publication of WO2024051707A1 publication Critical patent/WO2024051707A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Definitions

  • Embodiments of the present disclosure relate to the field of data processing, and more specifically, to methods of training recommendation models, methods, devices, electronic devices, and computer-readable storage media for recommending resources.
  • Embodiments of the present disclosure provide a recommendation model training solution.
  • a method of training a recommendation model may include obtaining a first set of training data including a real conversion rate of a first type of user to the target resource, the first type of user providing the real conversion rate.
  • the method may further include obtaining a second set of training data including a predicted conversion rate of a second type of user to the target resource, the second type of user not providing a true conversion rate.
  • the method may further include training the recommendation model using the first set of training data and the second set of training data.
  • a method of training a recommendation model may include determining an initial conversion rate for the target resource by multiple users, with a single user not providing a true conversion rate.
  • the method may further include determining a correction factor based on the total conversion rate and the initial conversion rate of the plurality of users to the target resource.
  • the method may further include determining a predicted conversion rate of the user to the target resource based on the correction factor and the initial conversion rate. Additionally, the method may further include training a recommendation model based at least on the predicted conversion rate.
  • a method of recommending resources may include obtaining user characteristics of the user and resource characteristics of the resource.
  • the method may further include using the conversion rate model trained according to the methods of the first aspect and the second aspect to determine the user's conversion rate to the resource based on the user characteristics and the resource characteristics.
  • the method may further include recommending resources to the user based on the conversion rate.
  • a device for training a recommendation model may include: a first training data acquisition module configured to acquire a first data including a real conversion rate of a first type of user to a target resource. A set of training data, the first type of user provides a true conversion rate; the second training data acquisition module is configured to obtain a second set of training data including the predicted conversion rate of the second type of user to the target resource, the second type of user does not provide a true conversion rate conversion rate; and a first model training module configured to train the recommendation model using the first set of training data and the second set of training data.
  • a device for training a recommendation model may include: an initial conversion rate determination module configured to determine the initial conversion rate of multiple users to the target resource, and a single user does not provide a real conversion. rate; the correction factor determination module is configured to determine the correction factor based on the total conversion rate and the initial conversion rate of multiple users to the target resource; the predicted conversion rate determination module is configured to determine the user's conversion rate based on the correction factor and the initial conversion rate. a predicted conversion rate of the target resource; and a second model training module configured to train the recommendation model based on at least the predicted conversion rate.
  • a device for recommending resources may include: a feature acquisition module configured to acquire user features of the user and resource features of the resource; a conversion rate determination module configured to utilize The conversion rate model trained by the method of the first aspect or the second aspect determines the user's response to the resource based on the user characteristics and resource characteristics. the conversion rate of the source; and the recommendation module is configured to recommend resources to users based on the conversion rate.
  • an electronic device including: a processor; and a memory coupled to the processor, the memory having instructions stored therein, the instructions when executed by the processor, cause the electronic device to perform according to the first Any step of the method of the first, second or third aspect.
  • a computer-readable storage medium having a computer program stored thereon, which when executed by a processor implements any steps of the method according to the first, second or third aspect.
  • FIG. 1 illustrates a schematic diagram of an example environment in which various embodiments of the present disclosure can be implemented
  • FIG. 2 illustrates a schematic diagram of a detailed example environment for training and applying models in accordance with embodiments of the present disclosure
  • FIG. 3 illustrates a flowchart of a process for training a recommendation model according to one embodiment of the present disclosure
  • FIG. 4 illustrates a flowchart of a process for training a recommendation model according to another embodiment of the present disclosure
  • Figure 5 shows an overall flow chart of a scheme for training a recommendation model according to an embodiment of the present disclosure
  • Figure 6 illustrates an apparatus for training a recommendation model according to one embodiment of the present disclosure. Schematic diagram of the installation
  • Figure 7 shows a schematic diagram of an apparatus for training a recommendation model according to another embodiment of the present disclosure.
  • Figure 8 shows a schematic block diagram of an example device that may be used to implement embodiments of the present disclosure.
  • a prompt message is sent to the user to clearly remind the user that the operation requested will require the acquisition and use of the user's personal information. Therefore, users can autonomously choose whether to provide personal information to software or hardware such as electronic devices, applications, servers or storage media that perform the operations of the technical solution of the present disclosure based on the prompt information.
  • the method of sending prompt information to the user may be, for example, a pop-up window, and the prompt information may be presented in the form of text in the pop-up window.
  • the pop-up window can also contain a selection control for the user to choose "agree” or "disagree” to provide personal information to the electronic device.
  • model can learn the association between the corresponding input and the output from the training data, so that after the training is completed, the given input is processed based on the parameter set obtained by the training. Generate the corresponding output.
  • a “model” may also sometimes be called a “neural network”, “learning model”, “learning network” or “network”. These terms are used interchangeably herein.
  • feature refers to a vector representation of a resource or user.
  • the nature of this feature vector makes objects corresponding to vectors with similar distances have similar meanings. For example, if two resources, cars and digital products, are both technological items, then the feature vectors of the car and the feature vectors of the digital products are relatively close in space. For another example, if user A and user B select entertainment information as tags of interest at the same time, then the characteristics of user A and user B are relatively close in space.
  • the concept of “features” can be used to encode objects with vectors and retain the characteristics of their meaning, which is very suitable for deep learning.
  • mendation refers to the action of presenting or exposing various resources or content to users in various appropriate forms.
  • a recommendation model is needed to achieve accurate recommendation of resources.
  • whether to recommend a user pair is usually determined based on the user's conversion rate of the resource.
  • some initial conversion rates can be used to train the recommendation model.
  • the training effect is poor and an accurate recommendation model cannot be obtained.
  • a recommendation model training scheme is proposed. This scheme divides the training data into two parts. For the first type of user who provides a real conversion rate, a first set of training data including the real conversion rate is obtained for recommendation model training. For the second type of users who do not provide a true conversion rate, a second set of training data including predicted conversion rates is obtained for recommendation model training. From this, data related to different types of users Used as different training data for recommendation model training, it can improve the generalization of the model. In addition, training the recommendation model based on the predicted conversion rate that is closest to the true conversion rate can improve the accuracy of the recommendation model in predicting the conversion rate, thereby solving the above problems and/or other potential problems.
  • Figure 1 illustrates a block diagram of an example system 100 for training a recommendation model in accordance with an embodiment of the present disclosure. It should be understood that the system 100 shown in FIG. 1 is only an example in which embodiments of the present disclosure can be implemented, and is not intended to limit the scope of the present disclosure. Embodiments of the present disclosure are equally applicable to other systems or architectures.
  • system 100 may include computing device 120 .
  • Computing device 120 may be configured to receive input 110, which may include data associated with users and resources, such as user characteristics and resource characteristics.
  • the computing device 120 generates a user conversion rate 130 for the resource based on the input 110 .
  • the computing device 120 may generate the conversion rate 130 through the recommendation model 140 disposed therein.
  • the user can be a user of various types of applications, and the application can be an application including a recommendation system, including but not limited to shopping applications, short video applications, music applications, dating applications, news applications, forum applications, cloud disk storage applications, Search applications, etc. This disclosure is not limited here.
  • Resources can be products, live broadcast rooms, short videos, pictures, music, character information, etc. in the above applications including recommendation systems. Users receive recommended videos, pictures, texts, voices or combinations thereof that are associated with resources in the above-mentioned applications. For example, after a user enters a news application, he or she receives recommended news cover images, news headline text information, or video information in the display interface.
  • “resources”, “content”, “objects”, etc. all refer to entities or virtual items that may need to be presented or exposed to users, and this disclosure is not limited here.
  • the user will go through the following process in the above application: first, the user sees the resource, and then the user may select (for example, click) the resource of interest. Then the user may perform further operations on the resource, such as purchasing, collecting, adding to shopping cart, and downloading. Uploading, forwarding, etc. This behavior is called conversion. Predicting the probability that the user will convert after the resource is shown to the user is called conversion rate estimation.
  • the real conversion rate of some users for the resource will be provided to the application (hereinafter, we call it the first type of user), while the real conversion rate of some users will be provided to the application party.
  • the real conversion rate of this resource will not be provided to the application side (hereinafter, we call it the second type of user).
  • the recommendation model 140 may be designed to perform recommendation tasks.
  • recommended models include, but are not limited to, various types of deep neural networks (DNN), convolutional neural networks (CNN), support vector machines (SVM), decision trees, random forest models, etc.
  • DNN deep neural networks
  • CNN convolutional neural networks
  • SVM support vector machines
  • decision trees random forest models
  • a recommendation model may also be referred to as a "neural network,” “learning model,” “learning network,” “model,” and “network” interchangeably.
  • computing device 120 may include, but is not limited to, a personal computer, a server computer, a handheld or laptop device, a mobile device (such as a mobile phone, personal digital assistant (PDA), media player, etc.), consumer electronics, small form factor device, etc. Computers, mainframe computers, cloud computing resources, etc.
  • system 100 may also include additional devices and/or units not shown.
  • the computing device 120 of the system 100 may further include a storage unit (not shown) for storing pre-input hyperparameters and the like.
  • example environment 200 may include a computing device 220 , an input 210 into the computing device 220 , and a conversion rate 230 output from the computing device 220 .
  • the example environment 200 may generally include a model training system 260 and a model application system 270 .
  • model training system 260 and/or model application system 270 may be implemented in computing device 120 as shown in FIG. 1 or computing device 220 as shown in FIG. 2 . Should be taken for granted It is understood that the structure and functionality of example environment 200 are described for illustrative purposes only and are not intended to limit the scope of the subject matter described herein. The subject matter described herein may be implemented in different structures and/or functions.
  • model training system 260 may utilize training data 250 to train model 240.
  • training data 250 may be a triplet of (user characteristics; resource characteristics; user-to-resource conversion rate).
  • training data 250 may include a first set of training data 252 associated with a first type of user, in which the user's true conversion rate for the resource is provided.
  • training data 250 may include a second set of training data 254 associated with a second type of user, where the user's true conversion rate for the resource is not provided. At this time, the true conversion rate needs to be predicted for training the model 240, and the prediction process will be described in detail below.
  • training data 250 may include both first set of training data 252 and second set of training data 242 .
  • the model application system 270 may receive the trained model 240 .
  • the model 240 loaded into the computing device 220 of the model application system 270 can determine the conversion rate 230 based on the input 210 .
  • model 240 may be constructed as a learning network.
  • the learning network may include multiple networks, where each network may be a multi-layer neural network, which may be composed of a large number of neurons. Through the training process, the corresponding parameters of each neuron in the network can be determined. The parameters of the neurons in these networks are collectively referred to as the parameters of the model 240 .
  • the training process of the model 240 may be performed in an iterative manner until at least some of the parameters of the model 240 converge or until a predetermined number of iterations is reached, thereby obtaining final model parameters.
  • Figure 3 illustrates a flow diagram of a process 300 for training a recommendation model in accordance with an embodiment of the present disclosure.
  • process 300 may be implemented in computing device 120 in FIG. 1 and computing device 220 in FIG. 2 .
  • a process 300 of training a recommendation model according to an embodiment of the present disclosure is now described with reference to FIG. 3 .
  • the specific examples mentioned in the following description are illustrative and are not intended to limit the scope of the present disclosure.
  • the computing device 120 may obtain a first set of training data including a true conversion rate of a first type of user to the target resource, the first type of user providing the true conversion rate. For example, according to the type of operating system, the user's settings, and the resource party's settings, the computing device 120 may obtain the true conversion rate of the first type of user to the target resource. Computing device 120 may then train the model based on this true conversion rate.
  • the computing device 120 may determine the user characteristics of the first type of user and the resource characteristics of the target resource, and then obtain the real conversion rate of the first type of user to the target resource.
  • the training data for user i can be expressed as (x i , y i , z i ).
  • x i represents user characteristics and resource characteristics
  • yi i uses different values to indicate whether the user has selected (clicked) the target resource displayed to the user.
  • yi i can be a value between 0 and 1, for example, 0 means no selection.
  • 1 represents selection;
  • z i represents the conversion rate of whether the user has made the choice, that is, whether the target behavior has been implemented.
  • the target behavior is the behavior that is considered to be converted by the user in the corresponding scenario.
  • z i can be a value between 0 and 1. For example, 0 means conversion has occurred and 1 means no conversion has occurred. Please note that the above training data is only exemplary, and different forms of training data may also exist, and the disclosure is not limited here.
  • computing device 120 may determine user characteristics and resource characteristics respectively. For example, computing device 120 may characterize the user based on the user's historical selection of resources. Computing device 120 may determine resource characteristics based on one or more of the resource category, the resource publisher, and user characteristics of users who have historically selected the resource. The above method of determining characteristics is only exemplary, and the computing device 120 may determine user characteristics and resource characteristics based on other suitable inherent characteristics of users and resources to accurately represent the complex non-linear relationship between users and resources.
  • the user's interaction information with the resource can be used to determine Determine the user characteristics of the user and the resource characteristics of the resource.
  • a node graph can be constructed based on the relationship between users' clicks, sharing, publishing and other operations on resources, where each node represents a user and a resource. Then determine the user characteristics and resource characteristics by walking in the node graph. It can be understood that by accurately representing the characteristics of users and resources, the feature capacity and generalization of the recommendation model to be trained can be improved.
  • the computing device 120 may obtain a second set of training data including predicted conversion rates for the target resource by a second type of user who does not provide a true conversion rate. It is understandable that for some types of operating systems, dual authorization from users and resource related parties is sometimes required to obtain the true conversion rate of a single user. At this time, the computing device 120 needs to predict the conversion rate based on existing data.
  • computing device 120 may first determine user characteristics of the second type of user and resource characteristics of the target resource. Regarding the method of determining user characteristics and resource characteristics, please refer to the above description and will not be repeated here. Computing device 120 may then determine a predicted conversion rate based on user behavior after the second type of user selects the target resource. For example, although the computing device 120 cannot directly obtain the user's true conversion rate, it can predict the conversion rate based on the user's interaction behavior with the target resource.
  • the computing device 120 may determine the initial conversion rate as the predicted conversion rate based on the action relationship between the second type user and the target resource. It can be understood that the more operations the user performs on the resource, or the longer it takes for the user to enter other interfaces and then jump back to the application after selecting the resource, it means that the user has a greater probability of implementing conversion behavior. For example, the computing device 120 may determine one or more of whether the user likes the target resource, whether the user forwards the target resource, the time when the user switches back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource. item to determine the initial conversion rate as the predicted conversion rate. The initial conversion rate can be a value between 0 and 1. Conversion rates can be accurately predicted based on the relationship between user actions and resources within the app.
  • computing device 120 may provide the above The action relationship between the first type of conversion rate user and the target resource and its true conversion rate are used to train the machine learning model to obtain a trained initial conversion rate model. Computing device 120 may then utilize the trained initial conversion rate model to predict the initial conversion rate for the second type of user. It can be understood that due to the difference between the training data used (i.e., first type users, target resources, and the real conversion rate between them) and the data used for prediction (i.e., second type users, target resources, and the initial conversion rate between them) Conversion rate) are all targeted at the same target resource, which allows the initial conversion rate to be accurately predicted, that is, closer to the true conversion rate.
  • the training data used i.e., first type users, target resources, and the real conversion rate between them
  • the data used for prediction i.e., second type users, target resources, and the initial conversion rate between them
  • Conversion rate are all targeted at the same target resource, which allows the initial conversion rate to be accurately predicted, that is, closer to the true conversion rate.
  • the initial conversion rate can also be corrected based on some other data to obtain a predicted conversion rate that is closer to the real conversion rate.
  • the computing device 120 may determine the correction factor based on the total conversion rate of the resource by the first type of users and the second type of users, the true conversion rate, and the initial conversion rate.
  • the total conversion rate T is the ratio between the number of converted users among the first type users and the second type users and the total number of the first type users and the second type users
  • the real conversion rate R is the ratio among the first type users The ratio between the number of converted users and the total number of first-type users and second-type users.
  • the initial conversion rate I may be the average of the predicted initial conversion rates of all second-type users.
  • TN is the total number of conversions
  • FN is the number of converted users among the first type of users
  • IN is the initial number of conversions corresponding to the initial conversion rate.
  • Computing device 120 then applies the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as the predicted conversion rate. For example, computing device 120 may multiply the initial conversion rate of each second type user by a correction factor as the predicted conversion rate. It can be understood that correcting the predicted initial conversion rate based on the obtained total conversion rate can make the predicted conversion rate more accurate, thereby improving the prediction accuracy of the subsequently trained recommendation model.
  • the computing device 120 may use the first set of training data and the second set of training data to train the recommendation model.
  • training data includes user characteristics, resource characteristics, and conversion rates.
  • the computing device 120 may, for example, input user characteristics and resource characteristics into the initial model to obtain a predicted conversion rate.
  • the error between the predicted conversion rate and the conversion rate as the ground-truth label is then determined, and the computing device 120 then propagates the error in the opposite direction (ie, from the output layer to the input layer of the model to be trained).
  • the error between the prediction and the actual value of the model to be trained will become smaller and smaller until the model converges and the training process is completed. From this, the computing device 120 obtains the recommended model.
  • the present disclosure can accurately predict the user's conversion rate through user behavior and the total conversion rate when the conversion rate of a single user for resources cannot be obtained.
  • the present disclosure uses the accurate conversion rate obtained above to train the recommendation model, which can improve the prediction accuracy and generalization of the recommendation model.
  • the trained recommendation model can be used to accurately recommend resources to users, improving user experience and reducing resource related party costs.
  • FIG. 4 illustrates a flowchart of a process for training a recommendation model according to another embodiment of the present disclosure.
  • computing device 120 determines multiple users' initial conversion rates to the target resource and that a single user does not provide a true conversion rate.
  • the process of determining the initial conversion rate is similar to the step described in 304 and will not be described again here.
  • the total conversion rate T is the ratio between the number of converted users among the second type users and the total number of second type users, and the initial conversion rate I can be the average of the initial conversion rates of all second type users predicted above. value.
  • computing device 120 determines the user's predicted conversion rate for the target resource based on the correction factor and the initial conversion rate.
  • Computing device 120 may apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as a predicted conversion rate. For example, computing device 120 may multiply the initial conversion rate of each second type user by a correction factor as the predicted conversion rate. It can be understood that correcting the predicted initial conversion rate based on the obtained total conversion rate can make the predicted conversion rate more accurate, thereby improving the prediction accuracy of the subsequently trained recommendation model.
  • computing device 120 trains a recommendation model based at least on the predicted conversion rate.
  • the process of training the model is similar to the steps described in 306 and will not be described again here.
  • the present disclosure can accurately predict the user's conversion rate through user behavior and the total conversion rate when only the second type of user exists.
  • the present disclosure uses the accurate conversion rate obtained above to train the recommendation model, which can improve the prediction accuracy and generalization of the recommendation model.
  • the trained recommendation model can be used to accurately recommend resources to users, improving user experience and reducing resource related party costs.
  • Figure 5 shows an overall flowchart of a scheme for training a recommendation model according to an embodiment of the present disclosure.
  • the training data 510 is divided into training data for the first type of users and training data for the second type of users.
  • the training data for the first type of users includes the true conversion rate of 540.
  • the conversion rate of the training data of the second type of user needs to be determined according to the above-mentioned processes 330 and 400, in which the incident conversion rate 550, the correction factor 570 and the predicted conversion rate 560 are determined respectively.
  • the training data of the first type of user and the training data of the second type of user can be respectively input into the value model 520 for training, and the output of the model 520 is the conversion rate 530.
  • the conversion rate 530 For specific steps, please refer to the above description and will not be repeated here.
  • the computing device 120 obtains the user characteristics of the user and the resource characteristics of the resource. For example, the computing device 120 may determine user characteristics and resource characteristics. For the determination process of user characteristics and resource characteristics, refer to the above description and will not be described again here. In some embodiments, the computing device 120 may also call pre-stored user characteristics and resource characteristics from the database according to the user identification and resource identification.
  • Computing device 120 may then determine the user's conversion rate to the resource based on the user characteristics and the resource characteristics according to the conversion rate model trained in processes 300 and 400 . For example, the computing device 120 may use user characteristics and resource characteristics as inputs to the recommendation model to derive a predicted conversion rate. The computing device 120 then recommends resources to the user based on the conversion rate. For example, the computing device 120 may rank a user's conversion rates for multiple resources and recommend resources that are in front of the predetermined sort order to the user.
  • FIG. 6 shows a schematic diagram of an apparatus 600 for training a recommendation model according to an embodiment of the present disclosure.
  • the device 600 may at least include: a first training data acquisition module 602 configured to acquire a first set of training data including the real conversion rate of a first type of user to the target resource, and the first type of user provides real conversion. rate; the second training data acquisition module 604 is configured to obtain a second set of training data including the predicted conversion rate of a second type of user to the target resource, the second type of user does not provide a true conversion rate; and the first model training module 606 , is configured to train the recommendation model using the first set of training data and the second set of training data.
  • the second training data acquisition module 604 may include: a first feature determination module configured to determine user features of the second type of user and resource features of the target resource; and a first prediction module configured to The predicted conversion rate is determined based on user behavior after the second type of user selects the target resource.
  • the first prediction module may include: a second prediction module configured to determine the initial conversion rate as the predicted conversion rate based on the action relationship between the second type user and the target resource.
  • the action relationship between the second type user and the target resource includes Including at least one of the following: whether the user likes the target resource, whether the user forwards the target resource, the time when the user switches back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource.
  • the apparatus 600 may further include: a correction factor module configured to determine the correction factor based on the total conversion rate, the real conversion rate and the initial conversion rate of the resource by the first type of users and the second type of users; and and a correction application module configured to apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as the predicted conversion rate.
  • a correction factor module configured to determine the correction factor based on the total conversion rate, the real conversion rate and the initial conversion rate of the resource by the first type of users and the second type of users
  • a correction application module configured to apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as the predicted conversion rate.
  • the first training data acquisition module 602 may include: a second characteristic determination module configured to determine user characteristics of the first type of user and resource characteristics of the target resource; and a conversion rate acquisition module configured to Get the true conversion rate.
  • the first feature determination module and the second feature determination module may include: a user feature determination module configured to determine user features based on the user's historical selection of resources.
  • the first feature determination module and the second feature determination module may include: a resource feature determination module configured to be based on at least one of resource categories, resource publishers, and user features of users who have historically selected the resource. Items determine resource characteristics.
  • FIG. 7 shows a schematic diagram of an apparatus 700 for training a recommendation model according to another embodiment of the present disclosure.
  • the device 700 may at least include: an initial conversion rate determination module 702, configured to determine the initial conversion rate of multiple users to the target resource, and a single user does not provide a true conversion rate; a correction factor determination module 704, configured to Determine a correction factor based on the total conversion rate and the initial conversion rate of multiple users to the target resource; the predicted conversion rate determination module 706 is configured to determine the predicted conversion rate of the user to the target resource based on the correction factor and the initial conversion rate; and
  • the second model training module 708 is configured to train the recommendation model based on at least the predicted conversion rate.
  • the present disclosure also provides a device for recommending resources, which may include: a feature acquisition module configured to acquire user features of the user and resource features of the resource; a conversion rate determination module configured to utilize According to the conversion rate model trained by the process 300 and 400 methods, based on user characteristics and resource characteristics, determine the user's response to the resource conversion rate; and a recommendation module configured to recommend resources to users based on the conversion rate.
  • a feature acquisition module configured to acquire user features of the user and resource features of the resource
  • a conversion rate determination module configured to utilize According to the conversion rate model trained by the process 300 and 400 methods, based on user characteristics and resource characteristics, determine the user's response to the resource conversion rate
  • a recommendation module configured to recommend resources to users based on the conversion rate.
  • FIG. 8 shows a schematic block diagram of an example device 800 that may be used to implement embodiments of the present disclosure.
  • computing device 120 shown in FIG. 1 and computing device 220 shown in FIG. 2 may be implemented by device 800.
  • the device 800 includes a central processing unit (CPU) 801 that can operate on a computer in accordance with computer program instructions stored in a read-only memory (ROM) 802 or loaded from a storage unit 808 into a random access memory (RAM) 803 Program instructions to perform various appropriate actions and processes.
  • ROM 802 read-only memory
  • RAM 803 Program instructions to perform various appropriate actions and processes.
  • various programs and data required for the operation of the device 900 can also be stored.
  • CPU 801, ROM 802 and RAM 803 are connected to each other via bus 804.
  • An input/output (I/O) interface 805 is also connected to bus 804.
  • the I/O interface 805 includes: an input unit 806, such as a keyboard, a mouse, etc.; an output unit 807, such as various types of displays, speakers, etc.; a storage unit 808, such as a magnetic disk, optical disk, etc. ; and communication unit 809, such as a network card, modem, wireless communication transceiver, etc.
  • the communication unit 809 allows the device 800 to exchange information/data with other devices through computer networks such as the Internet and/or various telecommunications networks. It should be understood that the present disclosure can use the output unit 807 to display real-time dynamic change information of user satisfaction, key factor identification information of satisfied group users or individual users, optimization strategy information, and strategy implementation effect evaluation information, etc.
  • the processing unit 801 may be implemented by one or more processing circuits.
  • the processing unit 801 may be configured to perform the various processes and processes described above, such as processes 300, 400, and 500.
  • processes 300, 400, and 500 may be implemented as a computer software program tangibly embodied in a machine-readable medium, such as storage unit 808.
  • part or all of the computer program may be loaded and/or installed onto device 800 via ROM 802 and/or communication unit 809.
  • the computer program is loaded into RAM 803 and executed by CPU 801, one or more steps in processes 300, 400, and 500 described above may be performed.
  • the present disclosure may be a system, method, and/or computer program product.
  • a computer program product may include a computer-readable storage medium having thereon computer-readable program instructions for performing various aspects of the present disclosure.
  • Computer-readable storage media may be tangible devices that can retain and store instructions for use by an instruction execution device.
  • the computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the above. More specific examples (non-exhaustive list) of computer-readable storage media include: portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM) or Flash memory), Static Random Access Memory (SRAM), Compact Disk Read Only Memory (CD-ROM), Digital Versatile Disk (DVD), Memory Stick, Floppy Disk, Mechanical Coding Device, such as a printer with instructions stored on it.
  • RAM random access memory
  • ROM read-only memory
  • EPROM erasable programmable read-only memory
  • Flash memory Static Random Access Memory
  • CD-ROM Compact Disk Read Only Memory
  • DVD Digital Versatile Disk
  • Memory Stick
  • Computer-readable storage media are not to be construed as transient signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., light pulses through fiber optic cables), or through electrical wires. transmitted electrical signals.
  • Computer-readable program instructions described herein may be downloaded from a computer-readable storage medium to various computing/processing devices, or to an external computer or external storage device over a network, such as the Internet, a local area network, a wide area network, and/or a wireless network.
  • the network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers.
  • a network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network and forwards the computer-readable program instructions for storage on a computer-readable storage medium in the respective computing/processing device .
  • Computer program instructions for performing operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-related instructions, microcode, firmware instructions, state setting data, or instructions in one or more programming languages.
  • the computer-readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer. be executed on a computer or server.
  • the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as an Internet service provider through the Internet). connect).
  • LAN local area network
  • WAN wide area network
  • an external computer such as an Internet service provider through the Internet. connect
  • an electronic circuit such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA)
  • the electronic circuit can Computer readable program instructions are executed to implement various aspects of the disclosure.
  • These computer-readable program instructions may be provided to a processing unit of a general-purpose computer, a special-purpose computer, or other programmable data processing apparatus, thereby producing a machine such that the instructions, when executed by a processing unit of the computer or other programmable data processing apparatus, , resulting in an apparatus that implements the functions/actions specified in one or more blocks in the flowchart and/or block diagram.
  • These computer-readable program instructions can also be stored in a computer-readable storage medium. These instructions cause the computer, programmable data processing device and/or other equipment to work in a specific manner. Therefore, the computer-readable medium storing the instructions includes An article of manufacture that includes instructions that implement aspects of the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
  • Computer-readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other equipment, causing a series of operating steps to be performed on the computer, other programmable data processing apparatus, or other equipment to produce a computer-implemented process , thereby causing instructions executed on a computer, other programmable data processing apparatus, or other equipment to implement the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.
  • each box in the flowchart or block diagram can represent a module, program segment Or a part of an instruction.
  • the module, program segment or part of the instruction contains one or more executable instructions for realizing the specified logical function.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two consecutive blocks may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved.
  • each block of the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration can be implemented by special purpose hardware-based systems that perform the specified functions or acts. , or can be implemented using a combination of specialized hardware and computer instructions.
  • Example 1 A method of training a recommendation model, including: obtaining a first set of training data including a first type of user's true conversion rate to a target resource, the first type of user providing the true conversion rate; obtaining a second set of training data including a second type of user's true conversion rate to a target resource. a second set of training data for the predicted conversion rate of a type of user to the target resource, the second type of user not providing the true conversion rate; and using the first set of training data and the second set of training data to Train the recommendation model.
  • Example 2 The method of Example 1, wherein obtaining the second set of training data includes: determining user characteristics of the second type of user and resource characteristics of the target resource; and selecting based on the second type of user User behavior following the target resource determines the predicted conversion rate.
  • Example 3 The method according to Example 1-2, wherein determining the predicted conversion rate includes: determining an initial conversion rate as the predicted conversion rate based on an action relationship between the second type user and the target resource.
  • Example 4 The method according to Example 1-3, wherein the action relationship between the second type user and the target resource includes at least one of the following: whether the user likes the target resource, whether the user forwards the target resource , the time for the user to switch back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource.
  • Example 5 The method according to examples 1-4, further comprising based on the total conversion rate of the resource by the first type of users and the second type of users, the true conversion rate and the initial conversion rate, determining a correction factor; and applying said correction factor to said The initial conversion rate is used to obtain the corrected initial conversion rate as the predicted conversion rate.
  • Example 6 The method according to examples 1-5, wherein obtaining the first set of training data includes: determining user characteristics of the first type of user and resource characteristics of the target resource; and obtaining the true conversion rate.
  • Example 7 The method of examples 1-6, wherein determining the user characteristics includes determining the user characteristics based on the user's historical selection of resources.
  • Example 8 The method of Examples 1-7, wherein determining the resource characteristics of the target resource includes determining the resource based on at least one of a resource category, a resource publisher, and user characteristics of a user who has historically selected the resource. feature.
  • Example 9 A method of training a recommendation model, including: determining the initial conversion rate of multiple users to the target resource, and a single user does not provide a real conversion rate; based on the total conversion rate of the multiple users to the target resource and the an initial conversion rate, determining a correction factor; based on the correction factor and the initial conversion rate, determining a user's predicted conversion rate for the target resource; and training the recommendation model based at least on the predicted conversion rate.
  • Example 10 A method of recommending resources, including: obtaining user characteristics of the user and resource characteristics of the resource; using a conversion rate model trained according to the method described in any one of Examples 1 to 9, based on the user characteristics and the resource characteristics The resource characteristics are used to determine the user's conversion rate of the resource; and based on the conversion rate, the resource is recommended to the user.
  • Example 11 A device for training a recommendation model, including: a first training data acquisition module configured to acquire a first set of training data including the true conversion rate of a first type of user to a target resource, the first type of user providing The real conversion rate; a second training data acquisition module configured to obtain a second set of training data including the predicted conversion rate of a second type of user to the target resource, the second type of user not providing the real conversion rate; and a first model training module configured to train the recommendation model using the first set of training data and the second set of training data.
  • Example 12 The apparatus according to Example 11, the obtaining the second training data acquisition module includes: a first feature determination module configured to determine user features of the second type user and resources of the target resource Features; and a first prediction module configured to determine based on user behavior after the second type user selects the target resource. Determine the predicted conversion rate.
  • Example 13 The apparatus according to Example 11 or 12, the first prediction module includes: a second prediction module configured to determine an initial conversion based on an action relationship between the second type user and the target resource. rate as the predicted conversion rate.
  • Example 14 The device according to examples 11-13, wherein the action relationship between the second type user and the target resource includes at least one of the following: whether the user likes the target resource, whether the user forwards the target resource , the time for the user to switch back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource.
  • Example 15 The apparatus according to examples 11-14, the apparatus further comprising: a correction factor module configured to be based on the total conversion rate of the resource by the first type of users and the second type of users, the The true conversion rate and the initial conversion rate are used to determine a correction factor; and a correction application module is configured to apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as a predicted conversion rate.
  • a correction factor module configured to be based on the total conversion rate of the resource by the first type of users and the second type of users, the The true conversion rate and the initial conversion rate are used to determine a correction factor
  • a correction application module is configured to apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as a predicted conversion rate.
  • the first training data acquisition module includes: a second feature determination module configured to determine user features of the first type of user and resource features of the target resource ; and a conversion rate acquisition module configured to obtain the true conversion rate.
  • Example 17 The apparatus according to examples 11-16, the first feature determination module and the second feature determination module may include: a user feature determination module configured to determine the user feature based on the user's historical selection of resources. .
  • Example 18 The apparatus according to Examples 11-17, the first feature determination module and the second feature determination module may include: a resource feature determination module configured to select resources based on resource categories, resource publishers, and historical selections of resources. At least one of the user characteristics of the user determines the resource characteristics.
  • Example 19 A device for training a recommendation model, including: an initial conversion rate determination configured to determine the initial conversion rate of multiple users to a target resource, and a single user does not provide a true conversion rate; a correction factor determination module configured to determine based on the Determine a correction factor based on the total conversion rate of the multiple users to the target resource and the initial conversion rate; the predicted conversion rate determination module is configured to determine the user based on the correction factor and the initial conversion rate. a user's predicted conversion rate for the target resource; and a second model training module configured to train the recommendation model based at least on the predicted conversion rate.
  • Example 20 A device for recommending resources, including: a feature acquisition module configured to acquire user characteristics of the user and resource characteristics of the resource; a conversion rate determination module configured to utilize the method according to any one of Examples 1 to 10 The conversion rate model trained by the method determines the user's conversion rate of resources based on the user characteristics and the resource characteristics; and a recommendation module is configured to recommend resources to the user based on the conversion rate.
  • Example 21 An electronic device, comprising: a processor; and a memory coupled to the processor, the memory having instructions stored therein that, when executed by the processor, cause the electronic device to perform actions, The actions include: obtaining a first set of training data including a real conversion rate of a first type user to the target resource, the first type user providing the real conversion rate; obtaining a first set of training data including a second type user to the target resource. Predicting a second set of training data for a conversion rate, the second type of user not providing the true conversion rate; and using the first set of training data and the second set of training data to train the recommendation model.
  • Example 22 An electronic device, comprising: a processor; and a memory coupled to the processor, the memory having instructions stored therein that, when executed by the processor, cause the electronic device to perform actions, The actions include: determining an initial conversion rate of multiple users to the target resource, and a single user does not provide a true conversion rate; determining a correction factor based on the total conversion rate of the multiple users to the target resource and the initial conversion rate; Determine a user's predicted conversion rate for the target resource based on the correction factor and the initial conversion rate; and train the recommendation model based on at least the predicted conversion rate.
  • Example 23 An electronic device, comprising: a processor; and a memory coupled to the processor, the memory having instructions stored therein that, when executed by the processor, cause the electronic device to perform actions,
  • the actions include: obtaining user characteristics of the user and resource characteristics of the resource; using a conversion rate model trained according to the method described in any one of Examples 1 to 9, based on the user characteristics and the resource characteristics, determining the The user's conversion rate of resources; and based on the conversion rate, recommendations to the user resource.
  • Example 24 A computer-readable storage medium having a computer program stored thereon, which when executed by a processor implements the method described in any one of Examples 1-10.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiments of the present disclosure provide a recommendation model training method and apparatus, and a resource recommendation method and apparatus. The recommendation model training method may comprise obtaining a first set of training data comprising a real conversion rate of a first-type user for a target resource, wherein the first-type user provides the real conversion rate. The method may further comprise obtaining a second set of training data comprising a predicted conversion rate of a second-type user for the target resource, wherein the second-type user does not provide a real conversion rate. In addition, the method may further comprise training a recommendation model using the first set of training data and the second set of training data. The recommendation model obtained according to the training mode of the present disclosure can accurately recommend resources to users, thereby improving user experience.

Description

训练推荐模型的方法、推荐资源的方法及其装置Method for training recommendation model, method for recommending resources and device thereof
本申请要求于2022年09月08日提交中国国家知识产权局、申请号为202211098044.3、发明名称为“训练推荐模型的方法、推荐资源的方法及其装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application requests the priority of the Chinese patent application submitted to the State Intellectual Property Office of China on September 8, 2022, with the application number 202211098044.3 and the invention title "Method for training recommendation model, method for recommending resources and device", all of which The contents are incorporated into this application by reference.
技术领域Technical field
本公开的实施例涉及数据处理领域,并且更具体地,涉及训练推荐模型的方法、推荐资源的方法、装置、电子设备和计算机可读存储介质。Embodiments of the present disclosure relate to the field of data processing, and more specifically, to methods of training recommendation models, methods, devices, electronic devices, and computer-readable storage media for recommending resources.
背景技术Background technique
随着互联网的快速发展,人们接受到的信息也在爆炸式增长,推荐系统需要在信息超载的情况下,为用户推荐其感兴趣的资源、从而提高用户体验并且提高资源的分发效率。推荐系统在面临海量资源的推荐分发时,可以应用各种不同类型的模型以实现在毫秒级时间内从千万量级资源库中为用户推荐出感兴趣的资源。因此,需要一种推荐模型来实现资源的准确推荐。With the rapid development of the Internet, the information people receive is also growing explosively. Recommendation systems need to recommend resources that users are interested in under the condition of information overload, thereby improving user experience and improving resource distribution efficiency. When the recommendation system faces the recommendation distribution of massive resources, it can apply various types of models to recommend interesting resources to users from tens of millions of resource libraries in milliseconds. Therefore, a recommendation model is needed to achieve accurate recommendation of resources.
发明内容Contents of the invention
本公开的实施例提供了推荐模型训练方案。Embodiments of the present disclosure provide a recommendation model training solution.
在本公开的第一方面中,提供了一种训练推荐模型的方法。该方法可以包括获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,第一类型用户提供真实转化率。该方法还可以包括获取包括第二类型用户对目标资源的预测转化率的第二组训练数据,第二类型用户未提供真实转化率。此外,该方法可以进一步包括使用第一组训练数据和第二组训练数据来训练推荐模型。 In a first aspect of the present disclosure, a method of training a recommendation model is provided. The method may include obtaining a first set of training data including a real conversion rate of a first type of user to the target resource, the first type of user providing the real conversion rate. The method may further include obtaining a second set of training data including a predicted conversion rate of a second type of user to the target resource, the second type of user not providing a true conversion rate. Additionally, the method may further include training the recommendation model using the first set of training data and the second set of training data.
在本公开的第二方面中,提供了一种训练推荐模型的方法。该方法可以包括确定多个用户对目标资源初始转化率,单个用户未提供真实转化率。该方法还可以包括基于多个用户对目标资源的总转化率和初始转化率,确定校正因子。该方法还可以包括基于校正因子和初始转化率,确定用户对目标资源的预测转化率。此外,该方法可以进一步包括至少基于预测转化率来训练推荐模型。In a second aspect of the present disclosure, a method of training a recommendation model is provided. The method may include determining an initial conversion rate for the target resource by multiple users, with a single user not providing a true conversion rate. The method may further include determining a correction factor based on the total conversion rate and the initial conversion rate of the plurality of users to the target resource. The method may further include determining a predicted conversion rate of the user to the target resource based on the correction factor and the initial conversion rate. Additionally, the method may further include training a recommendation model based at least on the predicted conversion rate.
在本公开的第三方面中,提供了一种推荐资源的方法。该方法可以包括获取用户的用户特征和资源的资源特征。该方法还可以包括利用根据第一方面和第二方面的方法训练的转化率模型,基于用户特征和资源特征,确定用户对资源的转化率。此外,该方法可以进一步包括基于转化率,向用户推荐资源。In a third aspect of the present disclosure, a method of recommending resources is provided. The method may include obtaining user characteristics of the user and resource characteristics of the resource. The method may further include using the conversion rate model trained according to the methods of the first aspect and the second aspect to determine the user's conversion rate to the resource based on the user characteristics and the resource characteristics. In addition, the method may further include recommending resources to the user based on the conversion rate.
在本公开的第四方面中,提供了一种训练推荐模型的装置,该装置可以包括:第一训练数据获取模块,被配置为获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,第一类型用户提供真实转化率;第二训练数据获取模块,被配置为获取包括第二类型用户对目标资源的预测转化率的第二组训练数据,第二类型用户未提供真实转化率;以及第一模型训练模块,被配置为使用第一组训练数据和第二组训练数据来训练推荐模型。In a fourth aspect of the present disclosure, a device for training a recommendation model is provided. The device may include: a first training data acquisition module configured to acquire a first data including a real conversion rate of a first type of user to a target resource. A set of training data, the first type of user provides a true conversion rate; the second training data acquisition module is configured to obtain a second set of training data including the predicted conversion rate of the second type of user to the target resource, the second type of user does not provide a true conversion rate conversion rate; and a first model training module configured to train the recommendation model using the first set of training data and the second set of training data.
在本公开的第五方面中,提供了一种训练推荐模型的装置,该装置可以包括:初始转化率确定模块,被配置为确定多个用户对目标资源初始转化率,单个用户未提供真实转化率;校正因子确定模块,被配置为基于多个用户对目标资源的总转化率和初始转化率,确定校正因子;预测转率确定模块,被配置为基于校正因子和初始转化率,确定用户对目标资源的预测转化率;以及第二模型训练模块,被配置为至少基于预测转化率来训练推荐模型。In a fifth aspect of the present disclosure, a device for training a recommendation model is provided. The device may include: an initial conversion rate determination module configured to determine the initial conversion rate of multiple users to the target resource, and a single user does not provide a real conversion. rate; the correction factor determination module is configured to determine the correction factor based on the total conversion rate and the initial conversion rate of multiple users to the target resource; the predicted conversion rate determination module is configured to determine the user's conversion rate based on the correction factor and the initial conversion rate. a predicted conversion rate of the target resource; and a second model training module configured to train the recommendation model based on at least the predicted conversion rate.
在本公开的第六方面中,提供了一种推荐资源的装置,该装置可以包括:特征获取模块,被配置为获取用户的用户特征和资源的资源特征;转化率确定模块,被配置为利用第一方面或第二方面的方法训练的转化率模型,基于用户特征和资源特征,确定用户对资 源的转化率;以及推荐模块,被配置为基于转化率,向用户推荐资源。In a sixth aspect of the present disclosure, a device for recommending resources is provided. The device may include: a feature acquisition module configured to acquire user features of the user and resource features of the resource; a conversion rate determination module configured to utilize The conversion rate model trained by the method of the first aspect or the second aspect determines the user's response to the resource based on the user characteristics and resource characteristics. the conversion rate of the source; and the recommendation module is configured to recommend resources to users based on the conversion rate.
在本公开的第七方面中,提供了一种电子设备,包括:处理器;以及与处理器耦合的存储器,存储器具有存储于其中的指令,指令在被处理器执行时使电子设备执行根据第一方面、第二方面或第三方面的方法的任意步骤。In a seventh aspect of the present disclosure, an electronic device is provided, including: a processor; and a memory coupled to the processor, the memory having instructions stored therein, the instructions when executed by the processor, cause the electronic device to perform according to the first Any step of the method of the first, second or third aspect.
在本公开的第八方面中,提供了计算机可读存储介质,其上存储有计算机程序,程序被处理器执行时实现根据第一方面、第二方面或第三方面的方法的任意步骤。In an eighth aspect of the present disclosure, there is provided a computer-readable storage medium having a computer program stored thereon, which when executed by a processor implements any steps of the method according to the first, second or third aspect.
提供该内容部分是为了简化的形式来介绍对概念的选择,它们在下文的具体实施方式中将被进一步描述。该内容部分无意标识本公开的关键特征或主要特征,也无意限制本公开的范围。This Content is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features or key features of the disclosure or to limit the scope of the disclosure.
附图说明Description of the drawings
通过结合附图对本公开示例性实施例进行更详细的描述,本公开的上述以及其它目的、特征和优势将变得更加明显,其中,在本公开示例性实施例中,相同或相似的参考标号通常代表相同或相似的部件。在附图中:The above and other objects, features and advantages of the present disclosure will become more apparent by describing the exemplary embodiments of the present disclosure in more detail with reference to the accompanying drawings, in which the same or similar reference numerals are used in the exemplary embodiments of the present disclosure. Usually represents the same or similar parts. In the attached picture:
图1示出了本公开的多个实施例能够在其中实现的示例环境的示意图;1 illustrates a schematic diagram of an example environment in which various embodiments of the present disclosure can be implemented;
图2示出了根据本公开的实施例的用于训练和应用模型的详细示例环境的示意图;2 illustrates a schematic diagram of a detailed example environment for training and applying models in accordance with embodiments of the present disclosure;
图3示出了根据本公开的一个实施例的用于训练推荐模型的过程的流程图;3 illustrates a flowchart of a process for training a recommendation model according to one embodiment of the present disclosure;
图4示出了根据本公开的另一实施例的用于训练推荐模型的过程的流程图;4 illustrates a flowchart of a process for training a recommendation model according to another embodiment of the present disclosure;
图5示出了根据本公开的实施例的用于训练推荐模型的方案的总体流程图;Figure 5 shows an overall flow chart of a scheme for training a recommendation model according to an embodiment of the present disclosure;
图6示出了根据本公开的一个实施例的用于训练推荐模型的装 置的示意图;Figure 6 illustrates an apparatus for training a recommendation model according to one embodiment of the present disclosure. Schematic diagram of the installation;
图7示出了根据本公开的另一实施例的用于训练推荐模型的装置的示意图;以及Figure 7 shows a schematic diagram of an apparatus for training a recommendation model according to another embodiment of the present disclosure; and
图8示出了可以用来实施本公开的实施例的示例设备的示意性框图。Figure 8 shows a schematic block diagram of an example device that may be used to implement embodiments of the present disclosure.
具体实施方式Detailed ways
可以理解的是,在使用本公开各实施例公开的技术方案之前,均应当依据相关法律法规通过恰当的方式对本公开所涉及个人信息的类型、使用范围、使用场景等告知用户并获得用户的授权。It can be understood that before using the technical solutions disclosed in the embodiments of this disclosure, users should be informed of the type, scope of use, usage scenarios, etc. of the personal information involved in this disclosure in an appropriate manner in accordance with relevant laws and regulations and obtain the user's authorization. .
例如,在响应于接收到用户的主动请求时,向用户发送提示信息,以明确地提示用户,其请求执行的操作将需要获取和使用到用户的个人信息。从而,使得用户可以根据提示信息来自主地选择是否向执行本公开技术方案的操作的电子设备、应用程序、服务器或存储介质等软件或硬件提供个人信息。For example, in response to receiving an active request from a user, a prompt message is sent to the user to clearly remind the user that the operation requested will require the acquisition and use of the user's personal information. Therefore, users can autonomously choose whether to provide personal information to software or hardware such as electronic devices, applications, servers or storage media that perform the operations of the technical solution of the present disclosure based on the prompt information.
作为一种可选的但非限定性的实现方式,响应于接收到用户的主动请求,向用户发送提示信息的方式例如可以是弹窗的方式,弹窗中可以以文字的方式呈现提示信息。此外,弹窗中还可以承载供用户选择“同意”或者“不同意”向电子设备提供个人信息的选择控件。As an optional but non-limiting implementation method, in response to receiving the user's active request, the method of sending prompt information to the user may be, for example, a pop-up window, and the prompt information may be presented in the form of text in the pop-up window. In addition, the pop-up window can also contain a selection control for the user to choose "agree" or "disagree" to provide personal information to the electronic device.
可以理解的是,上述通知和获取用户授权过程仅是示意性的,不对本公开的实现方式构成限定,其它满足相关法律法规的方式也可应用于本公开的实现方式中。It can be understood that the above process of notifying and obtaining user authorization is only illustrative and does not limit the implementation of the present disclosure. Other methods that satisfy relevant laws and regulations can also be applied to the implementation of the present disclosure.
可以理解的是,本技术方案所涉及的数据(包括但不限于数据本身、数据的获取或使用)应当遵循相应法律法规及相关规定的要求。It can be understood that the data involved in this technical solution (including but not limited to the data itself, the acquisition or use of the data) should comply with the requirements of corresponding laws, regulations and related regulations.
下面将参考附图中示出的若干示例实施例来描述本公开的原理。在本公开的实施例的描述中,术语“包括”及其类似用语应当理解为开放性包含,即“包括但不限于”。术语“基于”应当理解为 “至少部分地基于”。术语“一个实施例”或“该实施例”应当理解为“至少一个实施例”。术语“第一”、“第二”等可以指代不同的或相同的对象。下文还可能包括其他明确的和隐含的定义。The principles of the present disclosure will be described below with reference to several example embodiments illustrated in the accompanying drawings. In the description of embodiments of the present disclosure, the term "including" and similar expressions shall be understood as an open inclusion, that is, "including but not limited to." The term "based on" shall be understood to mean "Based at least in part on". The terms "one embodiment" or "the embodiment" should be understood to mean "at least one embodiment". The terms "first", "second", etc. may refer to different or the same object. Other explicit and implicit definitions may be included below.
在本公开的实施例的描述中,术语“模型”可以从训练数据中学习到相应的输入与输出之间的关联,从而在训练完成后基于训练得到的参数集对给定的输入进行处理以生成对应的输出。“模型”有时也可以被称为“神经网络”、“学习模型”、“学习网络”或“网络”。这些术语在本文中可互换地使用。In the description of the embodiments of the present disclosure, the term "model" can learn the association between the corresponding input and the output from the training data, so that after the training is completed, the given input is processed based on the parameter set obtained by the training. Generate the corresponding output. A "model" may also sometimes be called a "neural network", "learning model", "learning network" or "network". These terms are used interchangeably herein.
术语“特征”是指通过向量表示资源或用户。该特征向量的性质使距离相近的向量对应的物体有相近的含义。例如,两个资源汽车和数码产品都属于科技类物品,则汽车的特征向量和数码产品的特征向量在空间上距离比较接近。又比如用户A和用户B同时选择娱乐类信息作为感兴趣的标签,则用户A和用户B的特征在空间上距离比较接近。利用“特征”概念能够用向量对物体进行编码还能保留其含义的特点,这非常适合深度学习。术语“推荐”是指将各种资源或者内容通过各种合适的形式呈现给用户或者曝光给用户的动作。The term "feature" refers to a vector representation of a resource or user. The nature of this feature vector makes objects corresponding to vectors with similar distances have similar meanings. For example, if two resources, cars and digital products, are both technological items, then the feature vectors of the car and the feature vectors of the digital products are relatively close in space. For another example, if user A and user B select entertainment information as tags of interest at the same time, then the characteristics of user A and user B are relatively close in space. The concept of "features" can be used to encode objects with vectors and retain the characteristics of their meaning, which is very suitable for deep learning. The term "recommendation" refers to the action of presenting or exposing various resources or content to users in various appropriate forms.
如上文所描述,需要一种推荐模型来实现资源的准确推荐。在传统上,通常根据用户对资源的转化率来确定是否向用户对进行推荐。然而根据不同操作系统的要求和用户的设置,往往无法获得真实转化率作为真值标签来对推荐模型进行训练。在常规操作中,可以利用一些初始转化率来对推荐模型进行训练。然而由于初始转化率与真实转化率之间具有较大的差异,训练效果不佳,从而无法得到准确的推荐模型。这一方面导致用户体验感差,另一方面导致推荐方投入的推荐成本过高。As described above, a recommendation model is needed to achieve accurate recommendation of resources. Traditionally, whether to recommend a user pair is usually determined based on the user's conversion rate of the resource. However, depending on the requirements of different operating systems and user settings, it is often impossible to obtain the true conversion rate as a true value label to train the recommendation model. In normal operation, some initial conversion rates can be used to train the recommendation model. However, due to the large difference between the initial conversion rate and the real conversion rate, the training effect is poor and an accurate recommendation model cannot be obtained. On the one hand, this results in a poor user experience, and on the other hand, the recommendation cost invested by the recommender is too high.
根据本公开的实施例,提出了一种推荐模型训练方案。该方案在将训练数据分为两部分。对于提供真实转化率的第一类型用户,获取包括真实转化率的第一组训练数据用于推荐模型训练。对于未提供真实转化率的第二类型用户,获取包括预测转化率的第二组训练数据用于推荐模型训练。由此,将与不同类型的用户相关的数据 作为不同的训练数据用于推荐模型训练,可以提高模型的泛化性。此外,根据最接近真实转化率的预测转化率来训练推荐模型,可以提高推荐模型预测转化率的准确性,从而能够解决上述问题和/或其他潜在问题。According to embodiments of the present disclosure, a recommendation model training scheme is proposed. This scheme divides the training data into two parts. For the first type of user who provides a real conversion rate, a first set of training data including the real conversion rate is obtained for recommendation model training. For the second type of users who do not provide a true conversion rate, a second set of training data including predicted conversion rates is obtained for recommendation model training. From this, data related to different types of users Used as different training data for recommendation model training, it can improve the generalization of the model. In addition, training the recommendation model based on the predicted conversion rate that is closest to the true conversion rate can improve the accuracy of the recommendation model in predicting the conversion rate, thereby solving the above problems and/or other potential problems.
以下将结合示例场景来详细描述本公开的各实施例。应当理解,这仅仅是出于说明的目的,不旨在以任何方式限制本公开的范围。Various embodiments of the present disclosure will be described in detail below in conjunction with example scenarios. It should be understood that this is for illustrative purposes only and is not intended to limit the scope of the present disclosure in any way.
图1示出了根据本公开的实施例的用于训练推荐模型的示例系统100的框图。应当理解,图1所示的系统100仅仅是本公开的实施例可实现于其中的一种示例,不旨在限制本公开的范围。本公开的实施例同样适用于其他系统或架构。Figure 1 illustrates a block diagram of an example system 100 for training a recommendation model in accordance with an embodiment of the present disclosure. It should be understood that the system 100 shown in FIG. 1 is only an example in which embodiments of the present disclosure can be implemented, and is not intended to limit the scope of the present disclosure. Embodiments of the present disclosure are equally applicable to other systems or architectures.
如图1所示,系统100可以包括计算设备120。计算设备120可以被配置为接收输入110,输入可以包括与用户和资源相关联的数据,例如用户特征和资源特征。计算设备120基于输入110生成用户对资源的转化率130。具体地,计算设备120可以通过布置在其中的推荐模型140来生成转化率130。As shown in FIG. 1 , system 100 may include computing device 120 . Computing device 120 may be configured to receive input 110, which may include data associated with users and resources, such as user characteristics and resource characteristics. The computing device 120 generates a user conversion rate 130 for the resource based on the input 110 . Specifically, the computing device 120 may generate the conversion rate 130 through the recommendation model 140 disposed therein.
用户可以是各种类型的应用的用户,该应用可以是包括推荐系统的应用,包括但不限于购物应用、短视频应用、音乐应用、婚恋交友应用、新闻应用、论坛应用、云盘存储应用、搜索应用等。本公开在此不做限制。The user can be a user of various types of applications, and the application can be an application including a recommendation system, including but not limited to shopping applications, short video applications, music applications, dating applications, news applications, forum applications, cloud disk storage applications, Search applications, etc. This disclosure is not limited here.
资源可以是包括推荐系统的上述应用中的商品、直播间、短视频、图片、音乐、人物信息等。用户在上述应用中接收被推荐的、与资源相关联的视频、图片、文字、语音或其组合。例如,用户进入新闻应用后,在显示界面中收到推荐的新闻的封面图片、新闻头条文字信息或视频信息。在本文中,“资源”、“内容”、“对象”等均指可能需要呈现或者曝光给用户的实体或者虚拟物品,本公开在此不做限制。Resources can be products, live broadcast rooms, short videos, pictures, music, character information, etc. in the above applications including recommendation systems. Users receive recommended videos, pictures, texts, voices or combinations thereof that are associated with resources in the above-mentioned applications. For example, after a user enters a news application, he or she receives recommended news cover images, news headline text information, or video information in the display interface. In this article, "resources", "content", "objects", etc. all refer to entities or virtual items that may need to be presented or exposed to users, and this disclosure is not limited here.
用户在上述应用中会经历以下流程:首先是用户看到资源,之后用户有可能会选择(例如,点击)感兴趣的资源。接着用户有可能会对该资源进行进一步操作,例如购买、收藏、加入购物车、下 载、转发等,这一行为称为转化。预测资源被展示给用户后用户会产生转化行为的概率称为转化率预估。The user will go through the following process in the above application: first, the user sees the resource, and then the user may select (for example, click) the resource of interest. Then the user may perform further operations on the resource, such as purchasing, collecting, adding to shopping cart, and downloading. Uploading, forwarding, etc. This behavior is called conversion. Predicting the probability that the user will convert after the resource is shown to the user is called conversion rate estimation.
可以理解,在用户选择该资源后,会离开应用的界面而进入资源相关的页面,之后用户的动作不可见。而根据用户设置、操作系统的类型以及资源相关方的设置,一些用户对该资源的真实转化率会被提供给应用方(在下文中,我们称其为第一类型用户),而另一些用户对该资源的真实转化率不会被提供给应用方(在下文中,我们称其为第二类型用户)。It is understandable that after the user selects the resource, he will leave the application interface and enter the resource-related page, and the user's actions thereafter will not be visible. Depending on the user settings, the type of operating system, and the settings of the resource parties, the real conversion rate of some users for the resource will be provided to the application (hereinafter, we call it the first type of user), while the real conversion rate of some users will be provided to the application party. The real conversion rate of this resource will not be provided to the application side (hereinafter, we call it the second type of user).
在本公开中,推荐模型140可以被设计用于执行推荐任务。推荐模型的示例包括但不限于各类深度神经网络(DNN)、卷积神经网络(CNN)、支持向量机(SVM)、决策树、随机森林模型等等。在本公开的实现中,推荐模型也可以被称为“神经网络”、“学习模型”、“学习网络”、“模型”和“网络”可替换地使用。In the present disclosure, the recommendation model 140 may be designed to perform recommendation tasks. Examples of recommended models include, but are not limited to, various types of deep neural networks (DNN), convolutional neural networks (CNN), support vector machines (SVM), decision trees, random forest models, etc. In implementations of the present disclosure, a recommendation model may also be referred to as a "neural network," "learning model," "learning network," "model," and "network" interchangeably.
在一些实施例中,计算设备120可以包括但不限于个人计算机、服务器计算机、手持或膝上型设备、移动设备(诸如移动电话、个人数字助理PDA、媒体播放器等)、消费电子产品、小型计算机、大型计算机、云计算资源等。In some embodiments, computing device 120 may include, but is not limited to, a personal computer, a server computer, a handheld or laptop device, a mobile device (such as a mobile phone, personal digital assistant (PDA), media player, etc.), consumer electronics, small form factor device, etc. Computers, mainframe computers, cloud computing resources, etc.
应当理解,系统100中所包括的这些装置和/或装置中的单元仅是示例性的,而不旨在限制本公开的范围。应当理解的是,系统100还可以包括未示出的附加装置和/或单元。例如,在一些实施例中,系统100的计算设备120中还可以进一步包括用于存储预先输入的超参数等的存储单元(未示出)。It should be understood that the devices and/or elements within the devices included in the system 100 are exemplary only and are not intended to limit the scope of the present disclosure. It should be understood that system 100 may also include additional devices and/or units not shown. For example, in some embodiments, the computing device 120 of the system 100 may further include a storage unit (not shown) for storing pre-input hyperparameters and the like.
下文将参考图2对计算设备120中的模型的训练和使用进行描述。图2示出了根据本公开的实施例的详细示例环境200的示意图。与图1类似地,示例环境200可以包含计算设备220、输入计算设备220的输入210和从计算设备220输出的转化率230。区别在于,示例环境200总体上可以包括模型训练系统260和模型应用系统270。作为示例,模型训练系统260和/或模型应用系统270可以在如图1所示的计算设备120或如图2所示的计算设备220中实现。应当理 解,仅出于示例性的目的描述示例环境200的结构和功能并不旨在限制本文所描述主题的范围。本文所描述主题可以在不同的结构和/或功能中实施。Training and use of the model in computing device 120 is described below with reference to FIG. 2 . Figure 2 shows a schematic diagram of a detailed example environment 200 in accordance with embodiments of the present disclosure. Similar to FIG. 1 , example environment 200 may include a computing device 220 , an input 210 into the computing device 220 , and a conversion rate 230 output from the computing device 220 . The difference is that the example environment 200 may generally include a model training system 260 and a model application system 270 . As examples, model training system 260 and/or model application system 270 may be implemented in computing device 120 as shown in FIG. 1 or computing device 220 as shown in FIG. 2 . Should be taken for granted It is understood that the structure and functionality of example environment 200 are described for illustrative purposes only and are not intended to limit the scope of the subject matter described herein. The subject matter described herein may be implemented in different structures and/or functions.
如前所述,对模型的输入进行处理以确定用户对资源的转化率130的过程可以分为两个阶段:模型训练阶段和模型应用阶段。作为示例,如图2所示,在模型训练阶段中,模型训练系统260可以利用训练数据250来训练模型240。应理解,训练数据250可以是(用户特征;资源特征;用户对资源转化率)的三元组。在一些实施例中,训练数据250可以包括与第一类型用户相关联的第一组训练数据252,其中用户对资源的真实转化率被提供。在一些其他实施例中,训练数据250可以包括与第二类型用户相关联的第二组训练数据254,其中用户对资源的真实转化率不被提供。此时,需要对该真实转化率进行预测以用于训练模型240,将在下文详细描述预测过程。备选地,在一些实施例中,训练数据250可以包括第一组训练数据252和第二组训练数据242两者。As mentioned before, the process of processing the input of the model to determine the user's conversion rate 130 of the resource can be divided into two stages: the model training stage and the model application stage. As an example, as shown in Figure 2, in the model training phase, model training system 260 may utilize training data 250 to train model 240. It should be understood that the training data 250 may be a triplet of (user characteristics; resource characteristics; user-to-resource conversion rate). In some embodiments, training data 250 may include a first set of training data 252 associated with a first type of user, in which the user's true conversion rate for the resource is provided. In some other embodiments, training data 250 may include a second set of training data 254 associated with a second type of user, where the user's true conversion rate for the resource is not provided. At this time, the true conversion rate needs to be predicted for training the model 240, and the prediction process will be described in detail below. Alternatively, in some embodiments, training data 250 may include both first set of training data 252 and second set of training data 242 .
在模型应用阶段中,模型应用系统270可以接收经训练的模型240。由此,载入到模型应用系统270的计算设备220中的模型240可以基于输入210来确定转化率230。In the model application phase, the model application system 270 may receive the trained model 240 . Thus, the model 240 loaded into the computing device 220 of the model application system 270 can determine the conversion rate 230 based on the input 210 .
在其他实施例中,模型240可以被构建为学习网络。在一些实施例中,该学习网络可以包括多个网络,其中每个网络可以是一个多层神经网络,其可以由大量的神经元组成。通过训练过程,每个网络中的神经元的相应参数能够被确定。这些网络中的神经元的参数被统称为模型240的参数。In other embodiments, model 240 may be constructed as a learning network. In some embodiments, the learning network may include multiple networks, where each network may be a multi-layer neural network, which may be composed of a large number of neurons. Through the training process, the corresponding parameters of each neuron in the network can be determined. The parameters of the neurons in these networks are collectively referred to as the parameters of the model 240 .
模型240的训练过程可以以迭代方式来被执行,直至模型240的参数中的至少部分参数收敛或者直至达到预定迭代次数,由此获得最终的模型参数。The training process of the model 240 may be performed in an iterative manner until at least some of the parameters of the model 240 converge or until a predetermined number of iterations is reached, thereby obtaining final model parameters.
上文描述的技术方案仅用于示例,而非限制本公开。应理解,还可以按照其他方式和连接关系来布置各个网络。为了更清楚地解释上述方案的原理,下文将参考图3来更详细描述推荐模型训练的 过程。The technical solutions described above are only used as examples and do not limit the present disclosure. It should be understood that each network can also be arranged in other ways and connection relationships. In order to explain the principle of the above solution more clearly, the following will describe the recommended model training in more detail with reference to Figure 3. process.
图3示出了根据本公开的实施例的用于训练推荐模型的过程300的流程图。在某些实施例中,过程300可以在图1中的计算设备120和图2中的计算设备220中实现。现参照图3描述根据本公开实施例的训练推荐模型的过程300。为了便于理解,在下文描述中提及的具体实例均是示例性的,并不用于限定本公开的保护范围。Figure 3 illustrates a flow diagram of a process 300 for training a recommendation model in accordance with an embodiment of the present disclosure. In some embodiments, process 300 may be implemented in computing device 120 in FIG. 1 and computing device 220 in FIG. 2 . A process 300 of training a recommendation model according to an embodiment of the present disclosure is now described with reference to FIG. 3 . For ease of understanding, the specific examples mentioned in the following description are illustrative and are not intended to limit the scope of the present disclosure.
在302,计算设备120可以获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,第一类型用户提供真实转化率。例如,根据操作系统的类别、用户的设置以及资源方的设置,计算设备120可以获取第一类型用户对目标资源的真实转化率。计算设备120然后可以根据该真实转化率训练模型。At 302, the computing device 120 may obtain a first set of training data including a true conversion rate of a first type of user to the target resource, the first type of user providing the true conversion rate. For example, according to the type of operating system, the user's settings, and the resource party's settings, the computing device 120 may obtain the true conversion rate of the first type of user to the target resource. Computing device 120 may then train the model based on this true conversion rate.
在一些实施例中,计算设备120可以确定第一类型用户的用户特征和目标资源的资源特征,然后获取第一类型用户对目标资源真实转化率。例如针对用户i的训练数据可以表示为(xi,yi,zi)。其中xi表示用户特征和资源特征;yi以不同的值表示用户是否选择(点击)了展示给该用户的目标资源,yi可以为0至1之间的一个数值,例如0表示没有选择,1表示选择;zi则表示用户是否在选择之后的转化率,即是否实施了目标行为,该目标行为是在对应场景下被认为该用户得到转化的行为。zi可以为0至1之间的一个数值,例如0表示发生转化,1表示未发生转化。请注意,上述训练数据仅仅使示例性的,还可以存在不同形式的训练数据,本公开在此不做限制。In some embodiments, the computing device 120 may determine the user characteristics of the first type of user and the resource characteristics of the target resource, and then obtain the real conversion rate of the first type of user to the target resource. For example, the training data for user i can be expressed as (x i , y i , z i ). Among them, x i represents user characteristics and resource characteristics; yi i uses different values to indicate whether the user has selected (clicked) the target resource displayed to the user. yi i can be a value between 0 and 1, for example, 0 means no selection. , 1 represents selection; z i represents the conversion rate of whether the user has made the choice, that is, whether the target behavior has been implemented. The target behavior is the behavior that is considered to be converted by the user in the corresponding scenario. z i can be a value between 0 and 1. For example, 0 means conversion has occurred and 1 means no conversion has occurred. Please note that the above training data is only exemplary, and different forms of training data may also exist, and the disclosure is not limited here.
对于用户特征和资源特征的确定,在一些实施例中,计算设备120可以分别确定用户特征和资源特征。例如,计算设备120可以基于用户对资源的历史选择来用户特征。计算设备120可以基于资源类别、资源发布者、历史上选择资源的用户的用户特征中的一项或者多项来确定资源特征。上述确定特征的方法仅仅是示例性的,计算设备120可以根据用户和资源的其他合适的固有特征来确定用户特征和资源特征,以准确地表示用户与资源间复杂的非线性关系。For the determination of user characteristics and resource characteristics, in some embodiments, computing device 120 may determine user characteristics and resource characteristics respectively. For example, computing device 120 may characterize the user based on the user's historical selection of resources. Computing device 120 may determine resource characteristics based on one or more of the resource category, the resource publisher, and user characteristics of users who have historically selected the resource. The above method of determining characteristics is only exemplary, and the computing device 120 may determine user characteristics and resource characteristics based on other suitable inherent characteristics of users and resources to accurately represent the complex non-linear relationship between users and resources.
备选地,在一些实施例中,可以通过用户与资源的交互信息来 确定用户的用户特征以及资源的资源特征。例如可以通过用户对资源的点击、分享、发布等操作的关系来构建节点图,其中每个节点表示用户和资源。再通过在该节点图中游走来确定用户特征以及资源特征。可以理解,通过准确地表示用户与资源的特征,可以提升待训练的推荐模型的特征容量和泛化性。Alternatively, in some embodiments, the user's interaction information with the resource can be used to determine Determine the user characteristics of the user and the resource characteristics of the resource. For example, a node graph can be constructed based on the relationship between users' clicks, sharing, publishing and other operations on resources, where each node represents a user and a resource. Then determine the user characteristics and resource characteristics by walking in the node graph. It can be understood that by accurately representing the characteristics of users and resources, the feature capacity and generalization of the recommendation model to be trained can be improved.
上面描述了对已知真实预测率的用户的训练数据的获取和确定,下面描述对位未知真实预测率的用户的训练数据的获取和确定。在304,计算设备120可以获取包括第二类型用户对目标资源的预测转化率的第二组训练数据,第二类型用户未提供真实转化率。可以理解,对于一些类型的操作系统,有时需要用户和资源关联方的双重授权,才可以得到单个用户的真实转化率。此时,计算设备120需要根据已有数据来对转化率进行预测。The above describes the acquisition and determination of training data for users whose true prediction rates are known, and the following describes the acquisition and determination of training data for users whose true prediction rates are unknown. At 304, the computing device 120 may obtain a second set of training data including predicted conversion rates for the target resource by a second type of user who does not provide a true conversion rate. It is understandable that for some types of operating systems, dual authorization from users and resource related parties is sometimes required to obtain the true conversion rate of a single user. At this time, the computing device 120 needs to predict the conversion rate based on existing data.
在一个示例中,计算设备120可以首先确定第二类型用户的用户特征和目标资源的资源特征。关于用户特征和资源特征的确定方法参见上文的描述,在此不再赘述。计算设备120然后可以基于第二类型用户选择目标资源之后的用户行为确定预测转化率。例如,虽然计算设备120无法直接获取用户的真实转化率,其可以根据用户与目标资源的交互行为对转化率进行预测。In one example, computing device 120 may first determine user characteristics of the second type of user and resource characteristics of the target resource. Regarding the method of determining user characteristics and resource characteristics, please refer to the above description and will not be repeated here. Computing device 120 may then determine a predicted conversion rate based on user behavior after the second type of user selects the target resource. For example, although the computing device 120 cannot directly obtain the user's true conversion rate, it can predict the conversion rate based on the user's interaction behavior with the target resource.
在一些实施例中,计算设备120可以基于第二类型用户与目标资源之间的动作关系,确定初始转化率作为预测转化率。可以理解,用户对该资源的操作越多,或者用户应用中选择该资源后进入其他界面又跳回应用的时间越长,则表明用户存在更大概率实施了转化行为。例如,计算设备120可以根据用户是否对目标资源点赞、用户是否将目标资源转发、用户对目标资源点击后切换回资源展示界面的时间、以及用户是否将目标资源进行下载中的一项或者多项来确定初始转化率作为预测转化率。初始转化率可以是0到1之间的一个数值。根据用户在应用内与资源之间的动作关系,可以准确地预测转化率。In some embodiments, the computing device 120 may determine the initial conversion rate as the predicted conversion rate based on the action relationship between the second type user and the target resource. It can be understood that the more operations the user performs on the resource, or the longer it takes for the user to enter other interfaces and then jump back to the application after selecting the resource, it means that the user has a greater probability of implementing conversion behavior. For example, the computing device 120 may determine one or more of whether the user likes the target resource, whether the user forwards the target resource, the time when the user switches back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource. item to determine the initial conversion rate as the predicted conversion rate. The initial conversion rate can be a value between 0 and 1. Conversion rates can be accurately predicted based on the relationship between user actions and resources within the app.
备选地,在一些实施例中,计算设备120可以将上述提供真实 转化率的第一类型用户与目标资源之间的动作关系以及其真实转化率来训练机器学习模型以得到经训练的初始转化率模型。然后计算设备120可以利用该经训练的初始转化率模型来预测第二类型用户的初始转化率。可以理解,由于所使用的训练数据(即第一类型用户、目标资源、以及其之间的真实转化率)与用于预测的数据(即第二类型用户、目标资源、以及其之间的初始转化率)都是针对同一目标资源,这可以使得准确地预测初始转化率,即更加接近真实转化率。Alternatively, in some embodiments, computing device 120 may provide the above The action relationship between the first type of conversion rate user and the target resource and its true conversion rate are used to train the machine learning model to obtain a trained initial conversion rate model. Computing device 120 may then utilize the trained initial conversion rate model to predict the initial conversion rate for the second type of user. It can be understood that due to the difference between the training data used (i.e., first type users, target resources, and the real conversion rate between them) and the data used for prediction (i.e., second type users, target resources, and the initial conversion rate between them) Conversion rate) are all targeted at the same target resource, which allows the initial conversion rate to be accurately predicted, that is, closer to the true conversion rate.
可以理解,有时仅仅基于用户的交互行为往往还是不够准确,还可以根据一些其他数据对初始转化率进行校正,以得到更加接近于真实转化率的预测转化率。It is understandable that sometimes it is not accurate enough based only on user interaction behavior. The initial conversion rate can also be corrected based on some other data to obtain a predicted conversion rate that is closer to the real conversion rate.
附加地或者备选地,在一些实施例中,虽然出于多种因素,未提供单个用户对目标资源的转化率,但一个计划中的所有(或者一部分)用户对目标资源的总转化率可以被获得。在这种情况下,计算设备120可以基于第一类型用户和第二类型用户对资源的总转化率、真实转化率和初始转化率,确定校正因子。例如,可以根据以下等式(1)来确定校正因子C:
C=(T-R)/I   等式(1)
Additionally or alternatively, in some embodiments, although the conversion rate of an individual user to the target resource is not provided due to various factors, the total conversion rate of all (or a portion) of the users in a plan to the target resource may be given. In this case, the computing device 120 may determine the correction factor based on the total conversion rate of the resource by the first type of users and the second type of users, the true conversion rate, and the initial conversion rate. For example, the correction factor C can be determined according to the following equation (1):
C=(TR)/I Equation (1)
其中总转化率T为第一类型用户和第二类型用户中的发生转化的用户数目与第一类型用户和第二类型用户的总数目之间的比率,真实转化率R为第一类型用户中发生转化的用户数目与第一类型用户和第二类型用户的总数目之间的比率,初始转化率I可以为上述预测的所有第二类型用户的初始转化率的平均值。对等式(1)中的分子和分母同时乘以第一类型用户和第二类型用户的总数目,上述等式(1)还可以如下等式(2)的形式呈现:
C=(TN-FN)/IN   等式(2)
The total conversion rate T is the ratio between the number of converted users among the first type users and the second type users and the total number of the first type users and the second type users, and the real conversion rate R is the ratio among the first type users The ratio between the number of converted users and the total number of first-type users and second-type users. The initial conversion rate I may be the average of the predicted initial conversion rates of all second-type users. By multiplying the numerator and denominator in equation (1) by the total number of first type users and second type users at the same time, the above equation (1) can also be presented in the form of the following equation (2):
C=(TN-FN)/IN Equation (2)
其中TN为总转化数,FN为第一类型用户中转化的用户数,IN为与初始转化率对应的初始转化数。 Among them, TN is the total number of conversions, FN is the number of converted users among the first type of users, and IN is the initial number of conversions corresponding to the initial conversion rate.
计算设备120然后将校正因子应用于初始转化率,以得到经校正的初始转化率作为预测转化率。例如,计算设备120可以将每个第二类型用户的初始转化率与校正因子相乘以作为预测转化率。可以理解,根据所获取的总转化率来对所预测的初始转化率进行校正,可以使得预测的转化率更加准确,从而使得随后训练的推荐模型的预测准确度提升。Computing device 120 then applies the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as the predicted conversion rate. For example, computing device 120 may multiply the initial conversion rate of each second type user by a correction factor as the predicted conversion rate. It can be understood that correcting the predicted initial conversion rate based on the obtained total conversion rate can make the predicted conversion rate more accurate, thereby improving the prediction accuracy of the subsequently trained recommendation model.
在确定以及获取了不同类型用户的训练数据后,在306,计算设备120可以使用第一组训练数据和第二组训练数据来训练推荐模型。如上文所述的,训练数据包括用户特征、资源特征和转化率。计算设备120例如可以将用户特征和资源特征输入初始模型,得到预测的转化率。然后确定预测的转化率与作为真值标签的转化率之间的误差,接着计算设备120将该误差沿着从相反的方向(即从待训练模型的输出层到输入层的方向)传播。在反向传播过程中,可以依赖梯度下降算法,调整待训练模型中各个层的参数的值。根据多轮训练,待训练模型的预测与实际值之间的误差会越来越小,直到模型收敛,训练过程完成。由此,计算设备120得到推荐模型。After determining and obtaining training data for different types of users, at 306, the computing device 120 may use the first set of training data and the second set of training data to train the recommendation model. As mentioned above, training data includes user characteristics, resource characteristics, and conversion rates. The computing device 120 may, for example, input user characteristics and resource characteristics into the initial model to obtain a predicted conversion rate. The error between the predicted conversion rate and the conversion rate as the ground-truth label is then determined, and the computing device 120 then propagates the error in the opposite direction (ie, from the output layer to the input layer of the model to be trained). During the backpropagation process, you can rely on the gradient descent algorithm to adjust the values of parameters of each layer in the model to be trained. According to multiple rounds of training, the error between the prediction and the actual value of the model to be trained will become smaller and smaller until the model converges and the training process is completed. From this, the computing device 120 obtains the recommended model.
通过上述多个实施例,本公开可以在无法获得单个用户针对资源的转化率的情况下,通过用户行为以及总转化率来准确地预测用户的转化率。此外,本公开利用上述获得的准确转化率来训练推荐模型,可以提升推荐模型的预测准确度和泛化性。进而,可以使用经训练的推荐模型精准地向用户推荐资源,提高用户体验并且降低资源关联方成本。Through the above-mentioned embodiments, the present disclosure can accurately predict the user's conversion rate through user behavior and the total conversion rate when the conversion rate of a single user for resources cannot be obtained. In addition, the present disclosure uses the accurate conversion rate obtained above to train the recommendation model, which can improve the prediction accuracy and generalization of the recommendation model. Furthermore, the trained recommendation model can be used to accurately recommend resources to users, improving user experience and reducing resource related party costs.
上述介绍了同时存在两种类型的用户的方案,下面描述仅存在第二类型用户的方案。图4示出了根据本公开的另一实施例的用于训练推荐模型的过程的流程图。The above describes the solution in which two types of users exist at the same time. The following describes the solution in which only the second type of users exists. 4 illustrates a flowchart of a process for training a recommendation model according to another embodiment of the present disclosure.
在402,计算设备120确定多个用户对目标资源初始转化率,单个用户未提供真实转化率。确定初始转化率的过程与在304中描述的步骤类似,在此不再赘述。At 402, computing device 120 determines multiple users' initial conversion rates to the target resource and that a single user does not provide a true conversion rate. The process of determining the initial conversion rate is similar to the step described in 304 and will not be described again here.
在404,计算设备120基于多个用户对目标资源的总转化率和初 始转化率,确定校正因子。与在304中描述的步骤不同的是,不存在第一类型用户。计算设备120可以根据如下等式(3)确定校正因子C:
C=T/I   等式(3)
At 404, the computing device 120 based on the total conversion rate of the plurality of users to the target resource and the initial Initial conversion rate, determine the correction factor. Unlike the step described in 304, there is no first type of user. Computing device 120 may determine the correction factor C according to equation (3) below:
C=T/I Equation (3)
其中总转化率T为第二类型用户中的发生转化的用户数目与第二类型用户的总数目之间的比率,初始转化率I可以为上述预测的所有第二类型用户的初始转化率的平均值。The total conversion rate T is the ratio between the number of converted users among the second type users and the total number of second type users, and the initial conversion rate I can be the average of the initial conversion rates of all second type users predicted above. value.
在406,计算设备120基于校正因子和初始转化率,确定用户对目标资源的预测转化率。计算设备120可以将校正因子应用于初始转化率,以得到经校正的初始转化率作为预测转化率。例如,计算设备120可以将每个第二类型用户的初始转化率与校正因子相乘以作为预测转化率。可以理解,根据所获取的总转化率来对所预测的初始转化率进行校正,可以使得预测的转化率更加准确,从而使得随后训练的推荐模型的预测准确度提升。At 406, computing device 120 determines the user's predicted conversion rate for the target resource based on the correction factor and the initial conversion rate. Computing device 120 may apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as a predicted conversion rate. For example, computing device 120 may multiply the initial conversion rate of each second type user by a correction factor as the predicted conversion rate. It can be understood that correcting the predicted initial conversion rate based on the obtained total conversion rate can make the predicted conversion rate more accurate, thereby improving the prediction accuracy of the subsequently trained recommendation model.
在408,计算设备120至少基于预测转化率来训练推荐模型。训练模型的过程与在306中描述的步骤类似,在此不再赘述。At 408, computing device 120 trains a recommendation model based at least on the predicted conversion rate. The process of training the model is similar to the steps described in 306 and will not be described again here.
通过上述过程,本公开可以在仅存在第二类型用户的情况下,通过用户行为以及总转化率来准确地预测用户的转化率。此外,本公开利用上述获得的准确转化率来训练推荐模型,可以提升推荐模型的预测准确度和泛化性。进而,可以使用经训练的推荐模型精准地向用户推荐资源,提高用户体验并且降低资源关联方成本。Through the above process, the present disclosure can accurately predict the user's conversion rate through user behavior and the total conversion rate when only the second type of user exists. In addition, the present disclosure uses the accurate conversion rate obtained above to train the recommendation model, which can improve the prediction accuracy and generalization of the recommendation model. Furthermore, the trained recommendation model can be used to accurately recommend resources to users, improving user experience and reducing resource related party costs.
图5示出了根据本公开的实施例的用于训练推荐模型的方案的总体流程图。首先将训练数据510划分为与第一类型用户的训练数据和第二类型用户的训练数据。第一类型用户的训练数据包括真实转化率540。第二类型用户的训练数据的转化率需要根据上述330和400的过程来确定,其中分别确定出事转化率550、校正因子570和预测转化率560。然后可以将第一类型用户的训练数据和第二类型用户的训练数据分别输入值模型520进行训练,模型520的输出位转化率530。具体步骤参见上文描述,在此不再赘述。 Figure 5 shows an overall flowchart of a scheme for training a recommendation model according to an embodiment of the present disclosure. First, the training data 510 is divided into training data for the first type of users and training data for the second type of users. The training data for the first type of users includes the true conversion rate of 540. The conversion rate of the training data of the second type of user needs to be determined according to the above-mentioned processes 330 and 400, in which the incident conversion rate 550, the correction factor 570 and the predicted conversion rate 560 are determined respectively. Then, the training data of the first type of user and the training data of the second type of user can be respectively input into the value model 520 for training, and the output of the model 520 is the conversion rate 530. For specific steps, please refer to the above description and will not be repeated here.
上文详细描述了推荐模型的训练过程,下面描述推荐模型应用过程。首先计算设备120获取用户的用户特征和资源的资源特征。例如,计算设备120可以确定用户特征和资源特征,户特征和资源特征的确定过程参见上文描述,在此不再赘述。在一些实施例中,计算设备120还可以根据用户标识和资源标识从数据库中调用预先存储的用户特征和资源特征。The training process of the recommendation model is described in detail above, and the application process of the recommendation model is described below. First, the computing device 120 obtains the user characteristics of the user and the resource characteristics of the resource. For example, the computing device 120 may determine user characteristics and resource characteristics. For the determination process of user characteristics and resource characteristics, refer to the above description and will not be described again here. In some embodiments, the computing device 120 may also call pre-stored user characteristics and resource characteristics from the database according to the user identification and resource identification.
计算设备120然后可以根据过程300和400训练的转化率模型,基于用户特征和资源特征,确定用户对资源的转化率。例如,计算设备120可以将用户特征和资源特征作为推荐模型的输入,从而得出预测转化率。之后计算设备120基于转化率,向用户推荐资源。例如,计算设备120可以将一个用户对多个资源的转化率进行排序,并将处于预定排序顺序之前的资源推荐给用户。Computing device 120 may then determine the user's conversion rate to the resource based on the user characteristics and the resource characteristics according to the conversion rate model trained in processes 300 and 400 . For example, the computing device 120 may use user characteristics and resource characteristics as inputs to the recommendation model to derive a predicted conversion rate. The computing device 120 then recommends resources to the user based on the conversion rate. For example, the computing device 120 may rank a user's conversion rates for multiple resources and recommend resources that are in front of the predetermined sort order to the user.
本公开还提供了一种模型训练装置。具体地,图6示出了根据本公开的实施例的用于训练推荐模型的装置600的示意图。如图6所示,装置600至少可以包括:第一训练数据获取模块602,被配置为获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,第一类型用户提供真实转化率;第二训练数据获取模块604,被配置为获取包括第二类型用户对目标资源的预测转化率的第二组训练数据,第二类型用户未提供真实转化率;以及第一模型训练模块606,被配置为使用第一组训练数据和第二组训练数据来训练推荐模型。The present disclosure also provides a model training device. Specifically, FIG. 6 shows a schematic diagram of an apparatus 600 for training a recommendation model according to an embodiment of the present disclosure. As shown in Figure 6, the device 600 may at least include: a first training data acquisition module 602 configured to acquire a first set of training data including the real conversion rate of a first type of user to the target resource, and the first type of user provides real conversion. rate; the second training data acquisition module 604 is configured to obtain a second set of training data including the predicted conversion rate of a second type of user to the target resource, the second type of user does not provide a true conversion rate; and the first model training module 606 , is configured to train the recommendation model using the first set of training data and the second set of training data.
在某些实施例中,第二训练数据获取模块604可以包括:第一特征确定模块,被配置为确定第二类型用户的用户特征和目标资源的资源特征;以及第一预测模块,被配置为基于第二类型用户选择目标资源之后的用户行为确定预测转化率。In some embodiments, the second training data acquisition module 604 may include: a first feature determination module configured to determine user features of the second type of user and resource features of the target resource; and a first prediction module configured to The predicted conversion rate is determined based on user behavior after the second type of user selects the target resource.
在某些实施例中,第一预测模块可以包括:第二预测模块,被配置为基于第二类型用户与目标资源之间的动作关系,确定初始转化率作为预测转化率。In some embodiments, the first prediction module may include: a second prediction module configured to determine the initial conversion rate as the predicted conversion rate based on the action relationship between the second type user and the target resource.
在某些实施例中,第二类型用户与目标资源之间的动作关系包 括以下至少一项:用户是否对目标资源点赞、用户是否将目标资源转发、用户对目标资源点击后切换回资源展示界面的时间、以及用户是否将目标资源进行下载。In some embodiments, the action relationship between the second type user and the target resource includes Including at least one of the following: whether the user likes the target resource, whether the user forwards the target resource, the time when the user switches back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource.
在某些实施例中,装置600还可以包括:校正因子模块,被配置为基于第一类型用户和第二类型用户对资源的总转化率、真实转化率和初始转化率,确定校正因子;以及校正应用模块,被配置为将校正因子应用于初始转化率,以得到经校正的初始转化率作为预测转化率。In some embodiments, the apparatus 600 may further include: a correction factor module configured to determine the correction factor based on the total conversion rate, the real conversion rate and the initial conversion rate of the resource by the first type of users and the second type of users; and and a correction application module configured to apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as the predicted conversion rate.
在某些实施例中,第一训练数据获取模块602可以包括:第二特征确定模块,被配置为确定第一类型用户的用户特征和目标资源的资源特征;以及转化率获取模块,被配置为获取真实转化率。In some embodiments, the first training data acquisition module 602 may include: a second characteristic determination module configured to determine user characteristics of the first type of user and resource characteristics of the target resource; and a conversion rate acquisition module configured to Get the true conversion rate.
在某些实施例中,第一特征确定模块和第二特征确定模块可以包括:用户特征确定模块,被配置为基于用户对资源的历史选择,确定用户特征。In some embodiments, the first feature determination module and the second feature determination module may include: a user feature determination module configured to determine user features based on the user's historical selection of resources.
在某些实施例中,第一特征确定模块和第二特征确定模块可以包括:资源特征确定模块,被配置为基于资源类别、资源发布者、历史上选择资源的用户的用户特征中的至少一项确定资源特征。In some embodiments, the first feature determination module and the second feature determination module may include: a resource feature determination module configured to be based on at least one of resource categories, resource publishers, and user features of users who have historically selected the resource. Items determine resource characteristics.
图7示出了根据本公开的另一实施例的用于训练推荐模型的装置700的示意图。如图7所示,装置700至少可以包括:初始转化率确定模块702,被配置为确定多个用户对目标资源初始转化率,单个用户未提供真实转化率;校正因子确定模块704,被配置为基于多个用户对目标资源的总转化率和初始转化率,确定校正因子;预测转率确定模块706,被配置为基于校正因子和初始转化率,确定用户对目标资源的预测转化率;以及第二模型训练模块708,被配置为至少基于预测转化率来训练推荐模型。FIG. 7 shows a schematic diagram of an apparatus 700 for training a recommendation model according to another embodiment of the present disclosure. As shown in Figure 7, the device 700 may at least include: an initial conversion rate determination module 702, configured to determine the initial conversion rate of multiple users to the target resource, and a single user does not provide a true conversion rate; a correction factor determination module 704, configured to Determine a correction factor based on the total conversion rate and the initial conversion rate of multiple users to the target resource; the predicted conversion rate determination module 706 is configured to determine the predicted conversion rate of the user to the target resource based on the correction factor and the initial conversion rate; and The second model training module 708 is configured to train the recommendation model based on at least the predicted conversion rate.
此外,虽然未示出,但本公开还提供了一种推荐资源的装置,可以包括:特征获取模块,被配置为获取用户的用户特征和资源的资源特征;转化率确定模块,被配置为利用根据过程300和400方法训练的转化率模型,基于用户特征和资源特征,确定用户对资源 的转化率;以及推荐模块,被配置为基于转化率,向用户推荐资源。In addition, although not shown, the present disclosure also provides a device for recommending resources, which may include: a feature acquisition module configured to acquire user features of the user and resource features of the resource; a conversion rate determination module configured to utilize According to the conversion rate model trained by the process 300 and 400 methods, based on user characteristics and resource characteristics, determine the user's response to the resource conversion rate; and a recommendation module configured to recommend resources to users based on the conversion rate.
图8示出了可以用来实施本公开的实施例的示例设备800的示意性框图。例如,如图1所示的计算设备120以及图2所示的计算设备220可以由设备800来实施。如图所示,设备800包括中央处理单元(CPU)801,其可以根据存储在只读存储器(ROM)802中的计算机程序指令或者从存储单元808加载到随机访问存储器(RAM)803中的计算机程序指令,来执行各种适当的动作和处理。在RAM 803中,还可存储设备900操作所需的各种程序和数据。CPU 801、ROM802以及RAM 803通过总线804彼此相连。输入/输出(I/O)接口805也连接至总线804。Figure 8 shows a schematic block diagram of an example device 800 that may be used to implement embodiments of the present disclosure. For example, computing device 120 shown in FIG. 1 and computing device 220 shown in FIG. 2 may be implemented by device 800. As shown, the device 800 includes a central processing unit (CPU) 801 that can operate on a computer in accordance with computer program instructions stored in a read-only memory (ROM) 802 or loaded from a storage unit 808 into a random access memory (RAM) 803 Program instructions to perform various appropriate actions and processes. In the RAM 803, various programs and data required for the operation of the device 900 can also be stored. CPU 801, ROM 802 and RAM 803 are connected to each other via bus 804. An input/output (I/O) interface 805 is also connected to bus 804.
设备800中的多个部件连接至I/O接口805,包括:输入单元806,例如键盘、鼠标等;输出单元807,例如各种类型的显示器、扬声器等;存储单元808,例如磁盘、光盘等;以及通信单元809,例如网卡、调制解调器、无线通信收发机等。通信单元809允许设备800通过诸如因特网的计算机网络和/或各种电信网络与其他设备交换信息/数据。应理解,本公开可以利用输出单元807显示用户满意度的实时动态变化信息、满意度的群体用户或个体用户的关键因素识别信息、优化策略信息、以及策略实施效果评估信息等。Multiple components in the device 800 are connected to the I/O interface 805, including: an input unit 806, such as a keyboard, a mouse, etc.; an output unit 807, such as various types of displays, speakers, etc.; a storage unit 808, such as a magnetic disk, optical disk, etc. ; and communication unit 809, such as a network card, modem, wireless communication transceiver, etc. The communication unit 809 allows the device 800 to exchange information/data with other devices through computer networks such as the Internet and/or various telecommunications networks. It should be understood that the present disclosure can use the output unit 807 to display real-time dynamic change information of user satisfaction, key factor identification information of satisfied group users or individual users, optimization strategy information, and strategy implementation effect evaluation information, etc.
处理单元801可通过一个或多个处理电路来实现。处理单元801可被配置为执行上文所描述的各个过程和处理,例如过程300、400和500。例如,在一些实施例中,过程300、400和500可以被实现为计算机软件程序,其被有形地包含于机器可读介质,例如存储单元808。在一些实施例中,计算机程序的部分或者全部可以经由ROM 802和/或通信单元809而被载入和/或安装到设备800上。当计算机程序被加载到RAM 803并由CPU801执行时,可以执行上文描述的过程300、400和500中的一个或多个步骤。The processing unit 801 may be implemented by one or more processing circuits. The processing unit 801 may be configured to perform the various processes and processes described above, such as processes 300, 400, and 500. For example, in some embodiments, processes 300, 400, and 500 may be implemented as a computer software program tangibly embodied in a machine-readable medium, such as storage unit 808. In some embodiments, part or all of the computer program may be loaded and/or installed onto device 800 via ROM 802 and/or communication unit 809. When the computer program is loaded into RAM 803 and executed by CPU 801, one or more steps in processes 300, 400, and 500 described above may be performed.
本公开可以是系统、方法和/或计算机程序产品。计算机程序产品可以包括计算机可读存储介质,其上载有用于执行本公开的各个方面的计算机可读程序指令。 The present disclosure may be a system, method, and/or computer program product. A computer program product may include a computer-readable storage medium having thereon computer-readable program instructions for performing various aspects of the present disclosure.
计算机可读存储介质可以是可以保持和存储由指令执行设备使用的指令的有形设备。计算机可读存储介质例如可以是――但不限于――电存储设备、磁存储设备、光存储设备、电磁存储设备、半导体存储设备或者上述的任意合适的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、静态随机存取存储器(SRAM)、便携式压缩盘只读存储器(CD-ROM)、数字多功能盘(DVD)、记忆棒、软盘、机械编码设备、例如其上存储有指令的打孔卡或凹槽内凸起结构、以及上述的任意合适的组合。这里所使用的计算机可读存储介质不被解释为瞬时信号本身,诸如无线电波或者其他自由传播的电磁波、通过波导或其他传输媒介传播的电磁波(例如,通过光纤电缆的光脉冲)、或者通过电线传输的电信号。Computer-readable storage media may be tangible devices that can retain and store instructions for use by an instruction execution device. The computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the above. More specific examples (non-exhaustive list) of computer-readable storage media include: portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM) or Flash memory), Static Random Access Memory (SRAM), Compact Disk Read Only Memory (CD-ROM), Digital Versatile Disk (DVD), Memory Stick, Floppy Disk, Mechanical Coding Device, such as a printer with instructions stored on it. Protruding structures in hole cards or grooves, and any suitable combination of the above. As used herein, computer-readable storage media are not to be construed as transient signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (e.g., light pulses through fiber optic cables), or through electrical wires. transmitted electrical signals.
这里所描述的计算机可读程序指令可以从计算机可读存储介质下载到各个计算/处理设备,或者通过网络、例如因特网、局域网、广域网和/或无线网下载到外部计算机或外部存储设备。网络可以包括铜传输电缆、光纤传输、无线传输、路由器、防火墙、交换机、网关计算机和/或边缘服务器。每个计算/处理设备中的网络适配卡或者网络接口从网络接收计算机可读程序指令,并转发该计算机可读程序指令,以供存储在各个计算/处理设备中的计算机可读存储介质中。Computer-readable program instructions described herein may be downloaded from a computer-readable storage medium to various computing/processing devices, or to an external computer or external storage device over a network, such as the Internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network and forwards the computer-readable program instructions for storage on a computer-readable storage medium in the respective computing/processing device .
用于执行本公开操作的计算机程序指令可以是汇编指令、指令集架构(ISA)指令、机器指令、机器相关指令、微代码、固件指令、状态设置数据、或者以一种或多种编程语言的任意组合编写的源代码或目标代码,所述编程语言包括面向对象的编程语言—诸如Smalltalk、C++等,以及常规的过程式编程语言—诸如“C”语言或类似的编程语言。计算机可读程序指令可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计 算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络—包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。在一些实施例中,通过利用计算机可读程序指令的状态信息来个性化定制电子电路,例如可编程逻辑电路、现场可编程门阵列(FPGA)或可编程逻辑阵列(PLA),该电子电路可以执行计算机可读程序指令,从而实现本公开的各个方面。Computer program instructions for performing operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-related instructions, microcode, firmware instructions, state setting data, or instructions in one or more programming languages. Source code or object code written in any combination of object-oriented programming languages - such as Smalltalk, C++, etc., and conventional procedural programming languages - such as the "C" language or similar programming languages. The computer-readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer. be executed on a computer or server. In situations involving remote computers, the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (such as an Internet service provider through the Internet). connect). In some embodiments, by utilizing state information of computer-readable program instructions to personalize an electronic circuit, such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), the electronic circuit can Computer readable program instructions are executed to implement various aspects of the disclosure.
这里参照根据本公开实施例的方法、装置(系统)和计算机程序产品的流程图和/或框图描述了本公开的各个方面。应当理解,流程图和/或框图的每个方框以及流程图和/或框图中各方框的组合,都可以由计算机可读程序指令实现。Aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions.
这些计算机可读程序指令可以提供给通用计算机、专用计算机或其它可编程数据处理装置的处理单元,从而生产出一种机器,使得这些指令在通过计算机或其它可编程数据处理装置的处理单元执行时,产生了实现流程图和/或框图中的一个或多个方框中规定的功能/动作的装置。也可以把这些计算机可读程序指令存储在计算机可读存储介质中,这些指令使得计算机、可编程数据处理装置和/或其他设备以特定方式工作,从而,存储有指令的计算机可读介质则包括一个制造品,其包括实现流程图和/或框图中的一个或多个方框中规定的功能/动作的各个方面的指令。These computer-readable program instructions may be provided to a processing unit of a general-purpose computer, a special-purpose computer, or other programmable data processing apparatus, thereby producing a machine such that the instructions, when executed by a processing unit of the computer or other programmable data processing apparatus, , resulting in an apparatus that implements the functions/actions specified in one or more blocks in the flowchart and/or block diagram. These computer-readable program instructions can also be stored in a computer-readable storage medium. These instructions cause the computer, programmable data processing device and/or other equipment to work in a specific manner. Therefore, the computer-readable medium storing the instructions includes An article of manufacture that includes instructions that implement aspects of the functions/acts specified in one or more blocks of the flowcharts and/or block diagrams.
也可以把计算机可读程序指令加载到计算机、其它可编程数据处理装置、或其它设备上,使得在计算机、其它可编程数据处理装置或其它设备上执行一系列操作步骤,以产生计算机实现的过程,从而使得在计算机、其它可编程数据处理装置、或其它设备上执行的指令实现流程图和/或框图中的一个或多个方框中规定的功能/动作。Computer-readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other equipment, causing a series of operating steps to be performed on the computer, other programmable data processing apparatus, or other equipment to produce a computer-implemented process , thereby causing instructions executed on a computer, other programmable data processing apparatus, or other equipment to implement the functions/actions specified in one or more blocks in the flowcharts and/or block diagrams.
附图中的流程图和框图显示了根据本公开的多个实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段 或指令的一部分,所述模块、程序段或指令的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each box in the flowchart or block diagram can represent a module, program segment Or a part of an instruction. The module, program segment or part of the instruction contains one or more executable instructions for realizing the specified logical function. In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two consecutive blocks may actually execute substantially in parallel, or they may sometimes execute in the reverse order, depending on the functionality involved. It will also be noted that each block of the block diagram and/or flowchart illustration, and combinations of blocks in the block diagram and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts. , or can be implemented using a combination of specialized hardware and computer instructions.
根据本公开的一个或多个实施例。示例1.一种训练推荐模型的方法,包括:获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,所述第一类型用户提供所述真实转化率;获取包括第二类型用户对所述目标资源的预测转化率的第二组训练数据,所述第二类型用户未提供所述真实转化率;以及使用所述第一组训练数据和所述第二组训练数据来训练所述推荐模型。In accordance with one or more embodiments of the present disclosure. Example 1. A method of training a recommendation model, including: obtaining a first set of training data including a first type of user's true conversion rate to a target resource, the first type of user providing the true conversion rate; obtaining a second set of training data including a second type of user's true conversion rate to a target resource. a second set of training data for the predicted conversion rate of a type of user to the target resource, the second type of user not providing the true conversion rate; and using the first set of training data and the second set of training data to Train the recommendation model.
示例2.根据示例1所述的方法,其中获取所述第二组训练数据包括:确定所述第二类型用户的用户特征和所述目标资源的资源特征;以及基于所述第二类型用户选择所述目标资源之后的用户行为确定所述预测转化率。Example 2. The method of Example 1, wherein obtaining the second set of training data includes: determining user characteristics of the second type of user and resource characteristics of the target resource; and selecting based on the second type of user User behavior following the target resource determines the predicted conversion rate.
示例3.根据示例1-2所述的方法,其中确定所述预测转化率包括:基于所述第二类型用户与所述目标资源之间的动作关系,确定初始转化率作为预测转化率。Example 3. The method according to Example 1-2, wherein determining the predicted conversion rate includes: determining an initial conversion rate as the predicted conversion rate based on an action relationship between the second type user and the target resource.
示例4.根据示例1-3所述的方法,其中所述第二类型用户与所述目标资源之间的动作关系包括以下至少一项:用户是否对目标资源点赞、用户是否将目标资源转发、用户对目标资源点击后切换回资源展示界面的时间、以及用户是否将目标资源进行下载。Example 4. The method according to Example 1-3, wherein the action relationship between the second type user and the target resource includes at least one of the following: whether the user likes the target resource, whether the user forwards the target resource , the time for the user to switch back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource.
示例5.根据示例1-4所述的方法,还包括基于所述第一类型用户和所述第二类型用户对所述资源的总转化率、所述真实转化率和所述初始转化率,确定校正因子;以及将所述校正因子应用于所述 初始转化率,以得到经校正的初始转化率作为预测转化率。Example 5. The method according to examples 1-4, further comprising based on the total conversion rate of the resource by the first type of users and the second type of users, the true conversion rate and the initial conversion rate, determining a correction factor; and applying said correction factor to said The initial conversion rate is used to obtain the corrected initial conversion rate as the predicted conversion rate.
示例6.根据示例1-5所述的方法,其中获取第一组训练数据包括:确定所述第一类型用户的用户特征和所述目标资源的资源特征;以及获取所述真实转化率。Example 6. The method according to examples 1-5, wherein obtaining the first set of training data includes: determining user characteristics of the first type of user and resource characteristics of the target resource; and obtaining the true conversion rate.
示例7.根据示例1-6所述的方法,其中确定用户特征包括:基于用户对资源的历史选择,确定所述用户特征。Example 7. The method of examples 1-6, wherein determining the user characteristics includes determining the user characteristics based on the user's historical selection of resources.
示例8.根据示例1-7所述的方法,其中确定所述目标资源的资源特征包括:基于资源类别、资源发布者、历史上选择资源的用户的用户特征中的至少一项确定所述资源特征。Example 8. The method of Examples 1-7, wherein determining the resource characteristics of the target resource includes determining the resource based on at least one of a resource category, a resource publisher, and user characteristics of a user who has historically selected the resource. feature.
示例9.一种训练推荐模型的方法,包括:确定多个用户对目标资源初始转化率,单个用户未提供真实转化率;基于所述多个用户对所述目标资源的总转化率和所述初始转化率,确定校正因子;基于所述校正因子和所述初始转化率,确定用户对所述目标资源的预测转化率;以及至少基于所述预测转化率来训练所述推荐模型。Example 9. A method of training a recommendation model, including: determining the initial conversion rate of multiple users to the target resource, and a single user does not provide a real conversion rate; based on the total conversion rate of the multiple users to the target resource and the an initial conversion rate, determining a correction factor; based on the correction factor and the initial conversion rate, determining a user's predicted conversion rate for the target resource; and training the recommendation model based at least on the predicted conversion rate.
示例10.一种推荐资源的方法,包括:获取用户的用户特征和资源的资源特征;利用根据示例1至9中任一项所述的方法训练的转化率模型,基于所述用户特征和所述资源特征,确定所述用户对资源的转化率;以及基于所述转化率,向所述用户推荐资源。Example 10. A method of recommending resources, including: obtaining user characteristics of the user and resource characteristics of the resource; using a conversion rate model trained according to the method described in any one of Examples 1 to 9, based on the user characteristics and the resource characteristics The resource characteristics are used to determine the user's conversion rate of the resource; and based on the conversion rate, the resource is recommended to the user.
示例11.一种训练推荐模型的装置,包括:第一训练数据获取模块,被配置为获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,所述第一类型用户提供所述真实转化率;第二训练数据获取模块,被配置为获取包括第二类型用户对所述目标资源的预测转化率的第二组训练数据,所述第二类型用户未提供所述真实转化率;以及第一模型训练模块,被配置为使用所述第一组训练数据和所述第二组训练数据来训练所述推荐模型。Example 11. A device for training a recommendation model, including: a first training data acquisition module configured to acquire a first set of training data including the true conversion rate of a first type of user to a target resource, the first type of user providing The real conversion rate; a second training data acquisition module configured to obtain a second set of training data including the predicted conversion rate of a second type of user to the target resource, the second type of user not providing the real conversion rate; and a first model training module configured to train the recommendation model using the first set of training data and the second set of training data.
示例12.根据示例11所述的装置,所述获取所述第二训练数据获取模块包括:第一特征确定模块,被配置为确定所述第二类型用户的用户特征和所述目标资源的资源特征;以及第一预测模块,被配置为基于所述第二类型用户选择所述目标资源之后的用户行为确 定所述预测转化率。Example 12. The apparatus according to Example 11, the obtaining the second training data acquisition module includes: a first feature determination module configured to determine user features of the second type user and resources of the target resource Features; and a first prediction module configured to determine based on user behavior after the second type user selects the target resource. Determine the predicted conversion rate.
示例13.根据示例11或12所述的装置,所述第一预测模块包括:第二预测模块,被配置为基于所述第二类型用户与所述目标资源之间的动作关系,确定初始转化率作为预测转化率。Example 13. The apparatus according to Example 11 or 12, the first prediction module includes: a second prediction module configured to determine an initial conversion based on an action relationship between the second type user and the target resource. rate as the predicted conversion rate.
示例14.根据示例11-13所述的装置,其中所述第二类型用户与所述目标资源之间的动作关系包括以下至少一项:用户是否对目标资源点赞、用户是否将目标资源转发、用户对目标资源点击后切换回资源展示界面的时间、以及用户是否将目标资源进行下载。Example 14. The device according to examples 11-13, wherein the action relationship between the second type user and the target resource includes at least one of the following: whether the user likes the target resource, whether the user forwards the target resource , the time for the user to switch back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource.
示例15.根据示例11-14所述的装置,所述装置还包括:校正因子模块,被配置为基于所述第一类型用户和所述第二类型用户对所述资源的总转化率、所述真实转化率和所述初始转化率,确定校正因子;以及校正应用模块,被配置为将所述校正因子应用于所述初始转化率,以得到经校正的初始转化率作为预测转化率。Example 15. The apparatus according to examples 11-14, the apparatus further comprising: a correction factor module configured to be based on the total conversion rate of the resource by the first type of users and the second type of users, the The true conversion rate and the initial conversion rate are used to determine a correction factor; and a correction application module is configured to apply the correction factor to the initial conversion rate to obtain a corrected initial conversion rate as a predicted conversion rate.
示例16.根据示例11-15所述的装置,所述第一训练数据获取模块包括:第二特征确定模块,被配置为确定所述第一类型用户的用户特征和所述目标资源的资源特征;以及转化率获取模块,被配置为获取所述真实转化率。Example 16. The apparatus according to examples 11-15, the first training data acquisition module includes: a second feature determination module configured to determine user features of the first type of user and resource features of the target resource ; and a conversion rate acquisition module configured to obtain the true conversion rate.
示例17.根据示例11-16所述的装置,所述第一特征确定模块和第二特征确定模块可以包括:用户特征确定模块,被配置为基于用户对资源的历史选择,确定所述用户特征。Example 17. The apparatus according to examples 11-16, the first feature determination module and the second feature determination module may include: a user feature determination module configured to determine the user feature based on the user's historical selection of resources. .
示例18.根据示例11-17所述的装置,所述第一特征确定模块和第二特征确定模块可以包括:资源特征确定模块,被配置为基于资源类别、资源发布者、历史上选择资源的用户的用户特征中的至少一项确定所述资源特征。Example 18. The apparatus according to Examples 11-17, the first feature determination module and the second feature determination module may include: a resource feature determination module configured to select resources based on resource categories, resource publishers, and historical selections of resources. At least one of the user characteristics of the user determines the resource characteristics.
示例19.一种训练推荐模型的装置,包括:初始转化率确定,被配置为确定多个用户对目标资源初始转化率,单个用户未提供真实转化率;校正因子确定模块,被配置为基于所述多个用户对所述目标资源的总转化率和所述初始转化率,确定校正因子;预测转率确定模块,被配置为基于所述校正因子和所述初始转化率,确定用 户对所述目标资源的预测转化率;以及第二模型训练模块,被配置为至少基于所述预测转化率来训练所述推荐模型。Example 19. A device for training a recommendation model, including: an initial conversion rate determination configured to determine the initial conversion rate of multiple users to a target resource, and a single user does not provide a true conversion rate; a correction factor determination module configured to determine based on the Determine a correction factor based on the total conversion rate of the multiple users to the target resource and the initial conversion rate; the predicted conversion rate determination module is configured to determine the user based on the correction factor and the initial conversion rate. a user's predicted conversion rate for the target resource; and a second model training module configured to train the recommendation model based at least on the predicted conversion rate.
示例20.一种推荐资源的装置,包括:特征获取模块,被配置为获取用户的用户特征和资源的资源特征;转化率确定模块,被配置为利用根据示例1至10中任一项所述的方法训练的转化率模型,基于所述用户特征和所述资源特征,确定所述用户对资源的转化率;以及推荐模块,被配置为基于所述转化率,向所述用户推荐资源。Example 20. A device for recommending resources, including: a feature acquisition module configured to acquire user characteristics of the user and resource characteristics of the resource; a conversion rate determination module configured to utilize the method according to any one of Examples 1 to 10 The conversion rate model trained by the method determines the user's conversion rate of resources based on the user characteristics and the resource characteristics; and a recommendation module is configured to recommend resources to the user based on the conversion rate.
示例21.一种电子设备,包括:处理器;以及与所述处理器耦合的存储器,所述存储器具有存储于其中的指令,所述指令在被处理器执行时使所述电子设备执行动作,所述动作包括:获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,所述第一类型用户提供所述真实转化率;获取包括第二类型用户对所述目标资源的预测转化率的第二组训练数据,所述第二类型用户未提供所述真实转化率;以及使用所述第一组训练数据和所述第二组训练数据来训练所述推荐模型。Example 21. An electronic device, comprising: a processor; and a memory coupled to the processor, the memory having instructions stored therein that, when executed by the processor, cause the electronic device to perform actions, The actions include: obtaining a first set of training data including a real conversion rate of a first type user to the target resource, the first type user providing the real conversion rate; obtaining a first set of training data including a second type user to the target resource. Predicting a second set of training data for a conversion rate, the second type of user not providing the true conversion rate; and using the first set of training data and the second set of training data to train the recommendation model.
示例22.一种电子设备,包括:处理器;以及与所述处理器耦合的存储器,所述存储器具有存储于其中的指令,所述指令在被处理器执行时使所述电子设备执行动作,所述动作包括:确定多个用户对目标资源初始转化率,单个用户未提供真实转化率;基于所述多个用户对所述目标资源的总转化率和所述初始转化率,确定校正因子;基于所述校正因子和所述初始转化率,确定用户对所述目标资源的预测转化率;以及至少基于所述预测转化率来训练所述推荐模型。Example 22. An electronic device, comprising: a processor; and a memory coupled to the processor, the memory having instructions stored therein that, when executed by the processor, cause the electronic device to perform actions, The actions include: determining an initial conversion rate of multiple users to the target resource, and a single user does not provide a true conversion rate; determining a correction factor based on the total conversion rate of the multiple users to the target resource and the initial conversion rate; Determine a user's predicted conversion rate for the target resource based on the correction factor and the initial conversion rate; and train the recommendation model based on at least the predicted conversion rate.
示例23.一种电子设备,包括:处理器;以及与所述处理器耦合的存储器,所述存储器具有存储于其中的指令,所述指令在被处理器执行时使所述电子设备执行动作,所述动作包括:获取用户的用户特征和资源的资源特征;利用根据示例1至9中任一项所述的方法训练的转化率模型,基于所述用户特征和所述资源特征,确定所述用户对资源的转化率;以及基于所述转化率,向所述用户推荐 资源。Example 23. An electronic device, comprising: a processor; and a memory coupled to the processor, the memory having instructions stored therein that, when executed by the processor, cause the electronic device to perform actions, The actions include: obtaining user characteristics of the user and resource characteristics of the resource; using a conversion rate model trained according to the method described in any one of Examples 1 to 9, based on the user characteristics and the resource characteristics, determining the The user's conversion rate of resources; and based on the conversion rate, recommendations to the user resource.
示例24.一种计算机可读存储介质,其上存储有计算机程序,所述程序被处理器执行时实现如示例1-10中任一项所述的方法。Example 24. A computer-readable storage medium having a computer program stored thereon, which when executed by a processor implements the method described in any one of Examples 1-10.
以上已经描述了本公开的各实施例,上述说明是示例性的,并非穷尽性的,并且也不限于所披露的各实施例。在不偏离所说明的各实施例的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。本文中所用术语的选择,旨在最好地解释各实施例的原理、实际应用或对市场中技术的技术改进,或者使本技术领域的其它普通技术人员能理解本文披露的各实施例。 The embodiments of the present disclosure have been described above. The above description is illustrative, not exhaustive, and is not limited to the disclosed embodiments. Many modifications and variations will be apparent to those skilled in the art without departing from the scope and spirit of the described embodiments. The terminology used herein is chosen to best explain the principles of the embodiments, practical applications, or technical improvements to the technology in the market, or to enable other persons of ordinary skill in the art to understand the embodiments disclosed herein.

Claims (15)

  1. 一种训练推荐模型的方法,包括:A method of training a recommendation model, including:
    获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,所述第一类型用户提供所述真实转化率;Obtaining a first set of training data including a true conversion rate of a first type of user to the target resource, the first type of user providing the true conversion rate;
    获取包括第二类型用户对所述目标资源的预测转化率的第二组训练数据,所述第二类型用户未提供所述真实转化率;以及Obtaining a second set of training data including a predicted conversion rate of a second type of user to the target resource, the second type of user not providing the true conversion rate; and
    使用所述第一组训练数据和所述第二组训练数据来训练所述推荐模型。The recommendation model is trained using the first set of training data and the second set of training data.
  2. 根据权利要求1所述的方法,其中获取所述第二组训练数据包括:The method of claim 1, wherein obtaining the second set of training data includes:
    确定所述第二类型用户的用户特征和所述目标资源的资源特征;以及Determining user characteristics of the second type user and resource characteristics of the target resource; and
    基于所述第二类型用户选择所述目标资源之后的用户行为确定所述预测转化率。The predicted conversion rate is determined based on user behavior after the second type user selects the target resource.
  3. 根据权利要求2所述的方法,其中确定所述预测转化率包括:The method of claim 2, wherein determining the predicted conversion rate includes:
    基于所述第二类型用户与所述目标资源之间的动作关系,确定初始转化率作为所述预测转化率。Based on the action relationship between the second type user and the target resource, an initial conversion rate is determined as the predicted conversion rate.
  4. 根据权利要求3所述的方法,其中所述第二类型用户与所述目标资源之间的动作关系包括以下至少一项:The method according to claim 3, wherein the action relationship between the second type user and the target resource includes at least one of the following:
    用户是否对目标资源点赞、用户是否将目标资源转发、用户对目标资源点击后切换回资源展示界面的时间、以及用户是否将目标资源进行下载。Whether the user likes the target resource, whether the user forwards the target resource, the time when the user switches back to the resource display interface after clicking on the target resource, and whether the user downloads the target resource.
  5. 根据权利要求3所述的方法,还包括The method of claim 3, further comprising
    基于所述第一类型用户和所述第二类型用户对所述资源的总转化率、所述真实转化率和所述初始转化率,确定校正因子;以及determining a correction factor based on the total conversion rate of the resource by the first type of users and the second type of users, the true conversion rate and the initial conversion rate; and
    将所述校正因子应用于所述初始转化率,以得到经校正的初始转化率作为预测转化率。The correction factor is applied to the initial conversion rate to obtain a corrected initial conversion rate as the predicted conversion rate.
  6. 根据权利要求1所述的方法,其中获取第一组训练数据包括: The method of claim 1, wherein obtaining the first set of training data includes:
    确定所述第一类型用户的用户特征和所述目标资源的资源特征;以及Determining user characteristics of the first type user and resource characteristics of the target resource; and
    获取所述真实转化率。Get said true conversion rate.
  7. 根据权利要求2或6所述的方法,其中确定所述用户特征包括:The method of claim 2 or 6, wherein determining the user characteristics includes:
    基于用户对资源的历史选择,确定所述用户特征。The user characteristics are determined based on the user's historical selection of resources.
  8. 根据权利要求2或6所述的方法,其中确定所述目标资源的资源特征包括:The method according to claim 2 or 6, wherein determining the resource characteristics of the target resource includes:
    基于资源类别、资源发布者、历史上选择资源的用户的用户特征中的至少一项确定所述资源特征。The resource characteristics are determined based on at least one of a resource category, a resource publisher, and user characteristics of users who have historically selected the resource.
  9. 一种训练推荐模型的方法,包括:A method of training a recommendation model, including:
    确定多个用户对目标资源初始转化率,所述多个用户中的用户未提供真实转化率;Determine the initial conversion rate of multiple users to the target resource, and users among the multiple users do not provide a true conversion rate;
    基于所述多个用户对所述目标资源的总转化率和所述初始转化率,确定校正因子;Determine a correction factor based on the total conversion rate of the multiple users to the target resource and the initial conversion rate;
    基于所述校正因子和所述初始转化率,确定用户对所述目标资源的预测转化率;以及determining a user's predicted conversion rate for the target resource based on the correction factor and the initial conversion rate; and
    至少基于所述预测转化率来训练所述推荐模型。The recommendation model is trained based on at least the predicted conversion rate.
  10. 一种推荐资源的方法,包括:A method of recommending resources including:
    获取用户的用户特征和资源的资源特征;Obtain the user characteristics of the user and the resource characteristics of the resource;
    利用根据权利要求1至9中任一项所述的方法训练的转化率模型,基于所述用户特征和所述资源特征,确定所述用户对资源的转化率;以及Using a conversion rate model trained according to the method of any one of claims 1 to 9, based on the user characteristics and the resource characteristics, determining the user's conversion rate of resources; and
    基于所述转化率,向所述用户推荐资源。Based on the conversion rate, resources are recommended to the user.
  11. 一种训练推荐模型的装置,包括:A device for training recommendation models, including:
    第一训练数据获取模块,被配置为获取包括第一类型用户对目标资源的真实转化率的第一组训练数据,所述第一类型用户提供所述真实转化率;A first training data acquisition module configured to acquire a first set of training data including a real conversion rate of a first type of user to the target resource, the first type of user providing the real conversion rate;
    第二训练数据获取模块,被配置为获取包括第二类型用户对所述目标资源的预测转化率的第二组训练数据,所述第二类型用户未提 供所述真实转化率;以及The second training data acquisition module is configured to acquire a second set of training data including the predicted conversion rate of a second type of user to the target resource. The second type of user has not mentioned to provide stated true conversion rates; and
    第一模型训练模块,被配置为使用所述第一组训练数据和所述第二组训练数据来训练所述推荐模型。A first model training module configured to train the recommendation model using the first set of training data and the second set of training data.
  12. 一种训练推荐模型的装置,包括:A device for training recommendation models, including:
    初始转化率确定,被配置为确定多个用户对目标资源初始转化率,单个用户未提供真实转化率;Initial conversion rate determination is configured to determine the initial conversion rate of multiple users to the target resource, and a single user does not provide a true conversion rate;
    校正因子确定模块,被配置为基于所述多个用户对所述目标资源的总转化率和所述初始转化率,确定校正因子;a correction factor determination module configured to determine a correction factor based on the total conversion rate of the multiple users to the target resource and the initial conversion rate;
    预测转率确定模块,被配置为基于所述校正因子和所述初始转化率,确定用户对所述目标资源的预测转化率;以及a predicted conversion rate determination module configured to determine a user's predicted conversion rate for the target resource based on the correction factor and the initial conversion rate; and
    第二模型训练模块,被配置为至少基于所述预测转化率来训练所述推荐模型。A second model training module is configured to train the recommendation model based on at least the predicted conversion rate.
  13. 一种推荐资源的装置,包括:A device for recommending resources, including:
    特征获取模块,被配置为获取用户的用户特征和资源的资源特征;a feature acquisition module configured to acquire user features of the user and resource features of the resource;
    转化率确定模块,被配置为利用根据权利要求1至10中任一项所述的方法训练的转化率模型,基于所述用户特征和所述资源特征,确定所述用户对资源的转化率;以及A conversion rate determination module configured to determine the user's conversion rate of resources based on the user characteristics and the resource characteristics using a conversion rate model trained according to the method of any one of claims 1 to 10; as well as
    推荐模块,被配置为基于所述转化率,向所述用户推荐资源。A recommendation module configured to recommend resources to the user based on the conversion rate.
  14. 一种电子设备,包括:An electronic device including:
    处理器;以及processor; and
    与所述处理器耦合的存储器,所述存储器具有存储于其中的指令,所述指令在被处理器执行时使所述电子设备执行如权利要求1-10中任一项所述的方法。A memory coupled to the processor, the memory having instructions stored therein that when executed by the processor cause the electronic device to perform the method of any one of claims 1-10.
  15. 一种计算机可读存储介质,其上存储有计算机程序,所述程序被处理器执行时实现如权利要求1-10中任一项所述的方法。 A computer-readable storage medium on which a computer program is stored. When the program is executed by a processor, the method according to any one of claims 1-10 is implemented.
PCT/CN2023/117102 2022-09-08 2023-09-05 Recommendation model training method and apparatus, and resource recommendation method and apparatus WO2024051707A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202211098044.3 2022-09-08
CN202211098044.3A CN115391661A (en) 2022-09-08 2022-09-08 Method for training recommendation model, method for recommending resources and device thereof

Publications (1)

Publication Number Publication Date
WO2024051707A1 true WO2024051707A1 (en) 2024-03-14

Family

ID=84126261

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/117102 WO2024051707A1 (en) 2022-09-08 2023-09-05 Recommendation model training method and apparatus, and resource recommendation method and apparatus

Country Status (2)

Country Link
CN (1) CN115391661A (en)
WO (1) WO2024051707A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115391661A (en) * 2022-09-08 2022-11-25 脸萌有限公司 Method for training recommendation model, method for recommending resources and device thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3136265A1 (en) * 2015-08-28 2017-03-01 Yandex Europe AG Method and apparatus for generating a recommended content list
CN113688304A (en) * 2020-05-19 2021-11-23 华为技术有限公司 Training method for search recommendation model, and method and device for sequencing search results
CN114461871A (en) * 2021-12-21 2022-05-10 北京达佳互联信息技术有限公司 Recommendation model training method, object recommendation device and storage medium
CN115391661A (en) * 2022-09-08 2022-11-25 脸萌有限公司 Method for training recommendation model, method for recommending resources and device thereof

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3136265A1 (en) * 2015-08-28 2017-03-01 Yandex Europe AG Method and apparatus for generating a recommended content list
CN113688304A (en) * 2020-05-19 2021-11-23 华为技术有限公司 Training method for search recommendation model, and method and device for sequencing search results
CN114461871A (en) * 2021-12-21 2022-05-10 北京达佳互联信息技术有限公司 Recommendation model training method, object recommendation device and storage medium
CN115391661A (en) * 2022-09-08 2022-11-25 脸萌有限公司 Method for training recommendation model, method for recommending resources and device thereof

Also Published As

Publication number Publication date
CN115391661A (en) 2022-11-25

Similar Documents

Publication Publication Date Title
US11093818B2 (en) Customer profile learning based on semi-supervised recurrent neural network using partially labeled sequence data
CN110688528B (en) Method, apparatus, electronic device, and medium for generating classification information of video
US11151324B2 (en) Generating completed responses via primal networks trained with dual networks
CN112035743B (en) Data recommendation method and device, computer equipment and storage medium
CN112100489B (en) Object recommendation method, device and computer storage medium
CN114117216A (en) Recommendation probability prediction method and device, computer storage medium and electronic equipment
CN114036398B (en) Content recommendation and ranking model training method, device, equipment and storage medium
WO2024051707A1 (en) Recommendation model training method and apparatus, and resource recommendation method and apparatus
CN112036954A (en) Item recommendation method and device, computer-readable storage medium and electronic device
CN112650841A (en) Information processing method and device and electronic equipment
CN111340220A (en) Method and apparatus for training a predictive model
US11281867B2 (en) Performing multi-objective tasks via primal networks trained with dual networks
WO2022001887A1 (en) Method and apparatus for training item coding model
CN116910373B (en) House source recommendation method and device, electronic equipment and storage medium
CN114119123A (en) Information pushing method and device
WO2023185125A1 (en) Product resource data processing method and apparatus, electronic device and storage medium
CN117056595A (en) Interactive project recommendation method and device and computer readable storage medium
US20230215174A1 (en) Dynamic network quantization for efficient video inference
AU2019200721A1 (en) Online training and update of factorization machines using alternating least squares optimization
CN112100507B (en) Object recommendation method, computing device and computer-readable storage medium
CN114611015A (en) Interactive information processing method and device and cloud server
CN114926234A (en) Article information pushing method and device, electronic equipment and computer readable medium
CN114139059A (en) Resource recommendation model training method, resource recommendation method and device
CN112905885A (en) Method, apparatus, device, medium, and program product for recommending resources to a user
CN114926192A (en) Information processing method and device and computer readable storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23862401

Country of ref document: EP

Kind code of ref document: A1