WO2019076173A1 - Content pushing method and apparatus, and computer device - Google Patents

Content pushing method and apparatus, and computer device Download PDF

Info

Publication number
WO2019076173A1
WO2019076173A1 PCT/CN2018/105810 CN2018105810W WO2019076173A1 WO 2019076173 A1 WO2019076173 A1 WO 2019076173A1 CN 2018105810 W CN2018105810 W CN 2018105810W WO 2019076173 A1 WO2019076173 A1 WO 2019076173A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
application
sample
click rate
model
Prior art date
Application number
PCT/CN2018/105810
Other languages
French (fr)
Chinese (zh)
Inventor
潘岸腾
Original Assignee
广州优视网络科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州优视网络科技有限公司 filed Critical 广州优视网络科技有限公司
Publication of WO2019076173A1 publication Critical patent/WO2019076173A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications

Definitions

  • the present invention relates to the field of Internet technologies, and in particular, to a content pushing method, apparatus, and computer device.
  • LR Logistic Regression
  • logistic regression model is one of the discrete selection method models, and the logistic regression model is the earliest discrete selection model, and it is also the most widely used model.
  • the logistic regression model is also a classification model in machine learning. Due to its simplicity and efficiency, it is widely used in practice.
  • the most frequently used model for app (APP) click rate estimation is the LR model.
  • the idea is to cross the user feature and the application feature as the input feature of the final model, and train the LR model.
  • a big drawback of this approach is that it is a model for all users.
  • the optimal LR model of the A user group is X.
  • the optimal LR model of the B user group is Y.
  • X and Y are likely to be conflicts. of. That is, the X model is a positive feature on a feature and the Y model is a negative feature.
  • the model Z is not the optimal model. Therefore, the traditional LR model is not accurate for the user's click rate estimation of the application, which makes it impossible to accurately push the relevant content to the user according to the traditional LR model.
  • the object of the present invention is to provide a content pushing method, device and computer device, which can determine an estimated click rate according to a click rate prediction model of the group after determining the group to which the user belongs, and improve the accuracy of the estimated click rate, and accurately Users push relevant content.
  • the present invention provides the following technical solutions:
  • a content pushing method includes the steps of: acquiring a feature of a user of an exposure application and a click behavior on the application; generating a sample according to the feature of the user and a click behavior of the application; inputting the sample into the group In the model, determining a group to which the sample belongs in the group model; inputting the sample into a click rate estimation model corresponding to the group, and obtaining an estimated click rate of the user to the application; Push content to the user based on the estimated clickthrough rate of the app.
  • the pushing content to the user based on an estimated click rate of the application comprises: pushing the application to the user based on an estimated click rate of the application.
  • the pushing the application to the user according to an estimated click rate of the application comprises: confirming that the estimated click rate of the application is greater than a threshold, and pushing the application to the user .
  • the pushing the application to the user according to an estimated click rate of the application including calculating the estimated click rate of the user to all applications in the application mall, according to the estimated click The rate is sorted from high to low, and the user is pushed to the top of the preset number of applications.
  • the method before acquiring the feature of the user of the exposure application and the click behavior on the application, the method further includes: acquiring a feature of the historical user of the exposure application and a click behavior on the application; according to the history The user's characteristics and the click behavior of the application generate samples; the sample is used to train a cluster model containing several groups and a population-specific click rate prediction model.
  • the training the clustering model comprising a plurality of populations using the sample comprises: training the clustering model using the sample according to a decision tree algorithm.
  • the training the clustering model comprising a plurality of groups and the population corresponding to the click rate prediction model using the sample comprises: training the clustering model comprising several groups using the sample, and grouping the samples into groups To classify, use the sample of the group corresponding classification to train the group's click rate prediction model.
  • the using the population corresponding to the classified sample to train the population's click rate prediction model comprises: using the sample of the group corresponding classification, using the logistic algorithm to train the click rate prediction model.
  • the click rate prediction model is a click rate estimation model obtained according to a formula of logistic regression; the formula of the logistic regression is:
  • the using the population corresponding to the classified samples, using the logistic algorithm, training the click rate prediction model including: training the group by using a logistic algorithm according to a gradient descent method, training The click rate prediction model in the corresponding group.
  • the characteristics of the user include a natural attribute, a social attribute, or a preference attribute of the user.
  • the generating a sample according to the feature of the user and the click behavior of the application includes: generating a sample by using the feature of the user as an input variable and using the click behavior as a target variable.
  • a content pushing device comprising: a first obtaining module, configured to acquire a feature of a user exposing an application and a click behavior on the application; and a generating module, configured to: according to the feature of the user and the click behavior of the application, Generating a sample; a determining module, configured to input the sample into a grouping model, determining a group to which the sample belongs in the grouping model; and a second obtaining module, configured to input the sample to the group corresponding to the group
  • the click rate estimation model the estimated click rate of the user to the application is obtained; and the pushing module is configured to push content to the user according to the estimated click rate of the application.
  • a computer device comprising: one or more processors; a memory; one or more applications, wherein the one or more applications are stored in the memory and configured to be by the one or more Executing by the processor, the one or more applications are configured to perform the content push method described in any of the above embodiments.
  • a content pushing method which generates a sample according to a feature of a user and a click behavior of the application, and inputs the sample into the grouping model to confirm that the sample belongs to a group in the grouping model. Then, according to the click rate estimation model in the group, the estimated click rate of the user is obtained, and then the corresponding content is pushed to the user according to the estimated click rate of the application.
  • the content pushing method determines the group to which the user belongs, the estimated click rate is obtained according to the click rate estimation model of the group, the accuracy of the estimated click rate is improved, and the relevant content is accurately pushed to the user.
  • relevant content may include applications, similar applications, application introductions, and advertisements and information related to the applications.
  • the content push method performs group model training according to different user groups, and trains a click rate prediction model for different groups in the group model, so that different user groups correspond to different click rate estimation models, according to corresponding clicks.
  • the rate prediction model obtains a more accurate estimated click rate for the user to accurately push relevant content to the user.
  • FIG. 1 is a flowchart of a method for pushing a content according to an embodiment of the present invention
  • FIG. 2 is a flowchart of a method for pushing a content according to another embodiment of the present invention.
  • FIG. 3 is a flow chart of a method for establishing a clustered logistic regression click rate prediction model method according to an embodiment of the present invention
  • FIG. 4 is a flowchart of a usage scenario of a model of a content pushing method according to an embodiment of the present invention
  • FIG. 5 is a flowchart of a method for a content push method according to still another embodiment of the present invention.
  • FIG. 6 is a flowchart of a method for a content push method according to still another embodiment of the present invention.
  • FIG. 7 is a schematic structural diagram of a content pushing apparatus according to an embodiment of the present invention.
  • FIG. 8 is a schematic structural diagram of a computer device according to an embodiment of the present invention.
  • the content in the content pushing method of the present invention includes: an application, a similar application, an application introduction, an advertisement related to the application, information, and the like.
  • FIG. 1 is a flowchart of a method for a content push method according to an embodiment of the present invention. As shown in FIG. 1, a content pushing method of the present invention includes the steps of: S10-S50.
  • the server When the server detects the user exposing the application, it acquires the characteristics of the user who exposed the application and obtains the user's click behavior on the application.
  • the user's click behavior on the application includes the user clicking on the application and the user not clicking on the application.
  • the application can be various applications in the app store, including game apps, social apps, reading apps, system apps, and shopping apps.
  • the characteristics of the user include the user's natural attributes, the user's social attributes, or the user's preferred attributes.
  • the user's natural attributes such as the age of the user, the gender of the user, and so on.
  • the social attributes of the user such as the user's cultural level, the user's occupation, and the user's geographic area.
  • User preferences such as “military fans”, “technology enthusiasts”, “football fans” and “game fans”.
  • the server obtains features of the user exposing the application and builds a mathematical model that characterizes the user based on the characteristics of the user. The portrayal of the user is in fact to find out the characteristics of the user.
  • the methods for obtaining user characteristics include the following methods:
  • Method 1 Characterize the user's mathematical model through the user's natural attributes.
  • the user's mathematical model is characterized according to the age of the user and the gender of the user. Among them, discretization processing is performed for continuous quantities. For example, the age is discretized into “children”, “youth”, “youth”, “middle age” and “old age”.
  • Method 2 Characterize the user's mathematical model through the user's social attributes. For example, the user's mathematical model is characterized according to the user's cultural level, the user's occupation, and the user's geographic location.
  • Method 3 Characterize the user's mathematical model through the user's preference attributes.
  • the user's mathematical model is characterized according to the user's preference for the article, such as the user's favorite military article, the user is portrayed as "military fans.” Users like technology articles and portray users as “tech enthusiasts.” Users like to watch football news and portray users as “football fans.”
  • a user portrait is formed by summarizing the users depicted by different methods and different dimensions.
  • the user image is used as a mathematical model for depicting the user. For example, the characteristics of user a are “youth”, “college students”, “residents living in Guangzhou” and “technical enthusiasts”. These four characteristics are the mathematical characterization model of user a.
  • the sample is generated according to the characteristics of the user and the click behavior of the application. Specifically, the user's feature is taken as an input variable, and the user's click behavior of the application is used as a target variable to generate a sample.
  • the server After the server generates a sample based on the characteristics of the user and the user's click behavior on the application, the server inputs the sample into the cluster model. Also determine the population to which the sample belongs in the clustering model. Specifically, there are multiple groups in the clustering model, and the sample values are obtained in the plurality of groups of the group model, and the result values of the samples in the plurality of groups are obtained, and the sample belongs to the group belonging to the group model according to the result value.
  • the clustering model can generate samples by collecting features of several users and corresponding user click behaviors to the application. According to the sample, the clustering model is trained to obtain an accurate clustering model for multiple user groups, and then an application click rate prediction model is established for different user groups.
  • the application click rate estimation model established by different user groups can be the same click rate estimation model or different click rate estimation models.
  • the clustering model can also be a predictive clustering model. That is, a clustering model that has been formed for different user groups.
  • the server After the server obtains the sample belonging to the group to which the cluster model belongs, the sample is input into the click rate estimation model corresponding to the group, thereby obtaining the estimated click rate of the user to the application.
  • the click rate prediction model corresponding to each group is the click rate estimation model obtained by the standard formula of logistic regression.
  • the click rate prediction model corresponding to each group may also be a click rate estimation model obtained by other formulas.
  • the click rate prediction model corresponding to each group in the cluster model may not be identical.
  • S50 Push content to the user according to an estimated click rate of the application.
  • the server may obtain the estimated click rate of the user for each application in the application mall. According to the estimated click rate of the user for each application, the corresponding content can be pushed to the user.
  • the content here can include the application itself and the content related to the application.
  • similar applications and application introductions, etc. may also include corresponding advertisements and information content obtained according to the application.
  • the server may obtain the estimated click rate of the user for each application in the application mall.
  • the application can be pushed to the user based on the estimated click rate of the user for each application.
  • the above content pushing method generates a sample according to the characteristics of the user and the click behavior of the application, and inputs the sample into the group model to confirm that the sample belongs to the group in the group model.
  • the estimated click rate of the user to the application is obtained according to the click rate estimation model in the group.
  • the content is pushed to the user based on the estimated click rate of the application to the user.
  • the content pushing method obtains the estimated click rate according to the click rate estimation model of the group, improves the accuracy of the estimated click rate, and can accurately push the corresponding user according to the estimated click rate of the application.
  • Content For example, the application can be pushed to the user accurately based on the estimated click rate of the application.
  • FIG. 2 is a flowchart of a method for a content push method according to another embodiment of the present invention. As shown in FIG. 2, before step S10, the method further includes: steps S101-S105.
  • the server obtains the characteristics of all users exposing the history of the application, as well as the click behavior of each user to the application.
  • the user's click behavior on the application includes the user clicking on the application and the user not clicking on the application.
  • the application can be various applications in the app store, including game apps, social apps, reading apps, system apps, and shopping apps.
  • the characteristics of the user include the user's natural attributes, the user's social attributes, or the user's preferred attributes.
  • the user's natural attributes such as the age of the user, the gender of the user, and so on.
  • the social attributes of the user such as the user's cultural level, the user's occupation, and the user's geographic area.
  • User preferences such as “military fans”, “technology enthusiasts”, “football fans” and “game fans”.
  • the server obtains features of the user exposing the application and builds a mathematical model that characterizes the user based on the characteristics of the user. The portrayal of the user is in fact to find out the characteristics of the user.
  • the methods for obtaining user characteristics include the following methods:
  • Method 1 Characterize the user's mathematical model through the user's natural attributes.
  • the user's mathematical model is characterized according to the age of the user and the gender of the user. Among them, discretization processing is performed for continuous quantities. For example, the age is discretized into “children”, “youth”, “youth”, “middle age” and “old age”.
  • Method 2 Characterize the user's mathematical model through the user's social attributes. For example, the user's mathematical model is characterized according to the user's cultural level, the user's occupation, and the user's geographic location.
  • Method 3 Characterize the user's mathematical model through the user's preference attributes.
  • the user's mathematical model is characterized according to the user's preference for the article, such as the user's favorite military article, the user is portrayed as "military fans.” Users like technology articles and portray users as “tech enthusiasts.” Users like to watch football news and portray users as “football fans.”
  • users drawn according to different methods and different dimensions are aggregated to form a user portrait.
  • the user image is used as a mathematical model for depicting the user. For example, the characteristics of user a are “youth”, “college students”, “residents living in Guangzhou” and “technical enthusiasts”. These four characteristics are the mathematical characterization model of user a.
  • the server After the server obtains the characteristics of all the users of the history and the click behavior of each user to the application, the sample is generated according to the characteristics of each user and the click behavior of the application. Specifically, the user's feature is taken as an input variable, and the user's click behavior of the application is used as a target variable to generate a sample.
  • the sample corresponds to a sample generated based on the characteristics of all users of the history and the click behavior of the application. Based on this sample, a clustering model containing several populations can be trained. At the same time, according to the sample, the corresponding click rate estimation model of each group in the cluster model can also be trained. Specifically, the sample training model including several groups is used to classify the samples by group, and the group corresponding to the group is used to train the group's click rate estimation model.
  • the sample rate prediction model of the group is trained by using the sample corresponding to the group, including: using the sample corresponding to the group, and using the logistic method to train the click rate estimation model according to the gradient descent method.
  • the click rate prediction model is a click rate estimation model obtained from the standard formula of logistic regression.
  • the standard formula of logistic regression is: Where X represents the input variable and ⁇ represents the weight vector of the different features.
  • the above-mentioned statistic algorithm is used to obtain the click rate prediction model of the group, which is only one of the methods for obtaining the click rate estimation model.
  • other algorithms can be used to obtain the click rate of the group. Estimate the model.
  • the sample training clustering model is used according to a decision tree algorithm.
  • the sample training clustering model may also be used according to the Naive Bayes algorithm, the SVM (Support Vector Machines) algorithm, or the neural network algorithm. That is to say, the algorithm for using the sample training cluster model is not limited, and the present scheme merely provides several algorithms as explanations.
  • samples are generated by collecting the characteristics of the historical user and the corresponding user's click behavior on the application.
  • the grouping model including several groups is trained, and the corresponding click rate estimation model of each group in the group model is trained to generate a click rate estimation model for different user groups, and then clicks according to different user groups.
  • the rate estimation model obtains the estimated click rate of the corresponding user to the application, and according to the estimated click rate, the corresponding content can be accurately pushed to the user. For example, based on the estimated click rate, the corresponding application can be accurately pushed to the user.
  • the input variables of the training samples are generated for all exposure events of the most recent day as described above.
  • each sample is assigned to a group during each iteration.
  • group number 3 sample x is a negative sample
  • the value calculated by the LR model of "group 1" is 0.3 in a certain iteration
  • the value calculated by the LR model of "group 2” is 0.2
  • group The value calculated by the LR model of 3" is 0.1
  • the error of the LR model of "group 3” is the smallest
  • the category of the sample x is "group 3”.
  • a clustering model is established using a decision tree algorithm.
  • Card(X) is a decision tree algorithm.
  • the training of this method uses the industry-wide model Card classification algorithm, which will not be described here.
  • the method for establishing a clustered logistic regression click rate prediction model includes: steps S301-S311.
  • samples of all users in history are randomly divided into n groups.
  • the number of iterations of the group LR is i.
  • the number of iterations i 0.
  • the maximum number of iterations is m and the iteration error threshold is c.
  • a corresponding LR model is trained based on samples of each group. Among them, the positive sample is 1 and the negative sample is 0.
  • each sample is substituted into each LR model, and the click probability is predicted, and the sample is divided into the group closest to the target value of the sample (the negative sample is closest to 0, and the positive sample is closest to 1).
  • each sample in the sample is substituted into the LR model of each group, and each sample is obtained corresponding to each group according to the group.
  • the predicted click rate obtained by the LR model is compared according to the predicted click rate obtained by each group, and the sample is divided into the group closest to the target value of the sample (where the negative sample is closest to 0 and the positive sample is closest to 1).
  • Msm(i) sum(
  • a judgment condition is established for the iterative process.
  • the judgment condition is
  • ⁇ c or i> m.
  • the clustering model is trained to predict the population to which the unknown sample belongs. Specifically, each group is regarded as a class, assuming that there are a total of n categories, in which the group has obtained which group each sample belongs to, as the target variable, each sample feature is used as a classification input variable and substituted into the classification.
  • the clustering model is trained.
  • the classification algorithm selects a decision tree algorithm. In other embodiments, a naive Bayesian algorithm, an SVM algorithm, or a neural network algorithm may also be selected.
  • FIG. 4 provides a usage scenario of a model of a content push method.
  • the content here is a sample.
  • Sample x can be an application. As shown in FIG. 4, the following steps are included: S401-407.
  • the input feature data of the unknown sample x is acquired first.
  • the feature data of the unknown sample x is substituted into the cluster model, thereby obtaining the unknown sample x belonging to the group in the cluster model (K-group).
  • the feature data of the unknown sample x is then substituted into the click rate prediction model (LR model) of the group to obtain the estimated click rate of the unknown sample x.
  • step S401 to step S405 the estimated click rate of the user on the sample x can be obtained.
  • the sample x is pushed to the user based on the estimated click rate of the user on the sample x.
  • step S50 includes step A: pushing the application to the user according to an estimated click rate of the application. Specifically, as shown in FIG. 5, step A includes: S501, confirming that the estimated click rate of the application is greater than a threshold, and pushing the application to the user.
  • the server may obtain the estimated click rate of the user for each application in the application store according to step S10 to step S40.
  • the server pushes the application to the user. Therefore, the user can be pushed to the application of interest to the user in the application mall.
  • step A includes the steps of:
  • S503 Calculate the estimated click rate of the user for all applications in the application store, sort the application from high to low according to the estimated click rate, and push the preset number of applications to the user.
  • the server respectively obtains the estimated click rate of the user to the plurality of applications in the application store according to step S10 to step S40. Therefore, the server can calculate the estimated click rate of the user for all applications in the app store. And, the application is sorted from high to low according to the estimated click rate of each application, thereby obtaining the top-preset number of applications, and pushing the preset number of applications to the user. For example, in an app store, you need to push 8 apps to your users.
  • the server obtains an estimated click rate for all applications of the application store in accordance with the scheme of the present invention. Sort all applications from high to low based on the estimated clickthrough rate of each app. Further, get the top 8 apps and push the 8 apps to the user in the app store.
  • the application pushing method provided by the embodiment may filter the application of the mobile application store according to the estimation of the click rate of the application, so as to push the application that the user is interested in to the user.
  • the invention also provides a content pushing device, as shown in FIG.
  • the content pushing device includes a first obtaining module 701, a generating module 703, a determining module 705, a second obtaining module 707, and a pushing module 709.
  • the first obtaining module 701 is configured to acquire a feature of a user of the exposure application and a click behavior to the application.
  • the server detects the user exposing the application, it acquires the characteristics of the user who exposed the application and obtains the user's click behavior on the application.
  • the user's click behavior on the application includes the user clicking on the application and the user not clicking on the application.
  • the application can be various applications in the app store, including game apps, social apps, reading apps, system apps, and shopping apps.
  • the characteristics of the user include the user's natural attributes, the user's social attributes, or the user's preferred attributes.
  • the user's natural attributes such as the age of the user, the gender of the user, and so on.
  • the social attributes of the user such as the user's cultural level, the user's occupation, and the user's geographic area.
  • User preferences such as “military fans”, “technology enthusiasts”, “football fans” and “game fans”.
  • the server obtains features of the user exposing the application and builds a mathematical model that characterizes the user based on the characteristics of the user. The portrayal of the user is in fact to find out the characteristics of the user.
  • the methods for obtaining user characteristics include the following methods:
  • Method 1 Characterize the user's mathematical model through the user's natural attributes.
  • the user's mathematical model is characterized according to the age of the user and the gender of the user. Among them, discretization processing is performed for continuous quantities. For example, the age is discretized into “children”, “youth”, “youth”, “middle age” and “old age”.
  • Method 2 Characterize the user's mathematical model through the user's social attributes. For example, the user's mathematical model is characterized according to the user's cultural level, the user's occupation, and the user's geographic location.
  • Method 3 Characterize the user's mathematical model through the user's preference attributes.
  • the user's mathematical model is characterized according to the user's preference for the article, such as the user's favorite military article, the user is portrayed as "military fans.” Users like technology articles and portray users as “tech enthusiasts.” Users like to watch football news and portray users as “football fans.”
  • users drawn according to different methods and different dimensions are aggregated to form a user portrait.
  • the user image is used as a mathematical model for depicting the user. For example, the characteristics of user a are “youth”, “college students”, “residents living in Guangzhou” and “technical enthusiasts”. These four characteristics are the mathematical characterization model of user a.
  • the generating module 703 is configured to generate a sample according to the characteristics of the user and the click behavior of the application. After the server obtains the characteristics of the user and the user's click behavior on the application, the sample is generated according to the characteristics of the user and the click behavior of the application. Specifically, the user's feature is taken as an input variable, and the user's click behavior of the application is used as a target variable to generate a sample.
  • a determination module 705 is configured to input the sample into a cluster model to determine a population to which the sample belongs in the cluster model. After the server generates a sample based on the characteristics of the user and the user's click behavior on the application, the server inputs the sample into the cluster model. Also determine the population to which the sample belongs in the clustering model. Specifically, there are multiple groups in the clustering model, and the sample values are obtained in the plurality of groups of the group model, and the result values of the samples in the plurality of groups are obtained, and the sample belongs to the group belonging to the group model according to the result value.
  • the clustering model can generate samples by collecting features of several users and corresponding user click behaviors to the application. According to the sample, the clustering model is trained to obtain an accurate clustering model for multiple user groups, and then an application click rate prediction model is established for different user groups.
  • the application click rate estimation model established by different user groups can be the same click rate estimation model or different click rate estimation models.
  • the clustering model can also be a predictive clustering model. That is, a clustering model that has been formed for different user groups.
  • the second obtaining module 707 is configured to input the sample into a click rate estimation model corresponding to the group, and obtain an estimated click rate of the user for the application. After the server obtains the sample belonging to the group to which the cluster model belongs, the sample is input into the click rate estimation model corresponding to the group, thereby obtaining the estimated click rate of the user to the application.
  • the click rate prediction model corresponding to each group is the click rate estimation model obtained by the standard formula of logistic regression.
  • the click rate prediction model corresponding to each group may also be a click rate estimation model obtained by other formulas.
  • the click rate prediction model corresponding to each group in the cluster model may not be identical.
  • the push module 709 is configured to push content to the user based on the estimated click rate of the application.
  • the server may obtain the estimated click rate of the user for each application in the application mall. According to the estimated click rate of the user for each application, the corresponding content can be pushed to the user.
  • the content here can include the application itself and the content related to the application.
  • similar applications and application introductions, etc. may also include corresponding advertisements and information content obtained according to the application.
  • the server may obtain the estimated click rate of the user for each application in the application mall.
  • the application can be pushed to the user based on the estimated click rate of the user for each application.
  • the invention also provides a computer device.
  • the computer device includes one or more processors, memory, and one or more applications. Wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more applications configured to perform any of the embodiments described above Content push method.
  • FIG. 8 is a schematic structural diagram of a computer device according to an embodiment of the present invention.
  • the device includes a processor 803, a memory 805, an input unit 807, and a display unit 809.
  • the memory 805 can be used to store the application 801 and the various functional modules, and the processor 803 runs the application 801 stored in the memory 805 to perform various functional applications and data processing of the device.
  • the memory can be internal or external, or both internal and external.
  • the internal memory may include a read only memory (ROM), a programmable ROM (PROM), an electrically programmable ROM (EPROM), an electrically erasable programmable ROM (EEPROM), a flash memory, or a random access memory.
  • ROM read only memory
  • PROM programmable ROM
  • EPROM electrically programmable ROM
  • EEPROM electrically erasable programmable ROM
  • flash memory or a random access memory.
  • the external storage may include a hard disk, a floppy disk, a ZIP disk, a USB disk, a magnetic tape, and the like.
  • the memories disclosed herein include, but are not limited to, these types of memories.
  • the memory disclosed herein is by way of example only, and not limitation.
  • the input unit 807 is for receiving an input of a signal and receiving a keyword input by the user.
  • the input unit 807 can include a touch panel as well as other input devices.
  • the touch panel can collect touch operations on or near the user (such as the user using any suitable object or accessory such as a finger or a stylus on the touch panel or near the touch panel), and according to a preset
  • the program drives the corresponding connection device; other input devices may include, but are not limited to, one or more of a physical keyboard, function keys (such as play control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like.
  • the display unit 809 can be used to display information input by the user or information provided to the user as well as various menus of the computer device.
  • the display unit 809 can take the form of a liquid crystal display, an organic light emitting diode, or the like.
  • the processor 803 is a control center of the computer device that connects various parts of the entire computer using various interfaces and lines, executes or executes software programs and/or modules stored in the memory 803, and calls data stored in the memory to execute Various functions and processing data.
  • the computer device includes one or more processors 803, and one or more memories 805, one or more applications 801. Wherein the one or more applications 801 are stored in a memory 805 and configured to be executed by the one or more processors 803 configured to perform the above embodiments Content push method.
  • each functional unit in each embodiment of the present invention may be integrated into one processing module, or each unit may exist physically separately, or two or more units may be integrated into one module.
  • the above integrated modules can be implemented in the form of hardware or in the form of software functional modules.
  • the integrated modules, if implemented in the form of software functional modules and sold or used as stand-alone products, may also be stored in a computer readable storage medium.

Abstract

The present invention provides a content pushing method and apparatus, and a computer device. The content pushing method comprises: obtaining characteristics of a user exposing an application and a click behavior of the application; generating a sample according to the characteristics of the user and the click behavior of the application; inputting the sample into a grouping model to determine a group to which the sample belongs in the grouping model; inputting the sample into a click rate prediction model corresponding to the group to obtain the predicted click rate of the application by a user; and pushing content to the user according to the predicted click rate of the application. According to the content pushing method, after a group to which a user belongs is determined, a predicted click rate is obtained according to the click rate prediction model of the group, the accuracy of the predicted click rate is improved, and content is accurately pushed to the user.

Description

内容推送方法、装置及计算机设备Content pushing method, device and computer device 技术领域Technical field
本发明涉及互联网技术领域,具体而言,本发明涉及一种内容推送方法、装置及计算机设备。The present invention relates to the field of Internet technologies, and in particular, to a content pushing method, apparatus, and computer device.
背景技术Background technique
LR(Logistic Regression,逻辑回归)模型是离散选择法模型之一,同时逻辑回归模型是最早的离散选择模型,也是目前应用最广的模型。逻辑回归模型也是机器学习中的一种分类模型,由于算法的简单和高效,在实际中应用非常广泛。LR (Logistic Regression) model is one of the discrete selection method models, and the logistic regression model is the earliest discrete selection model, and it is also the most widely used model. The logistic regression model is also a classification model in machine learning. Due to its simplicity and efficiency, it is widely used in practice.
目前在应用商店中,应用程序(APP)的点击率预估用得最多的模型是LR模型,思路大致为把用户特征与应用特征交叉,作为最后模型的输入特征,训练得到LR模型。这样的做法有一个很大的弊端就是对所有用户都是一个模型,在很多情况A用户群的最优LR模型是X,B用户群的最优LR模型是Y,X与Y很可能是冲突的。即在某个特征上X模型是正向特征,Y模型是负向特征。而如果只有一个LR模型的情况是为了同时迎合A用户群和B用户群生成一个折衷的模型Z,但模型Z并不是最优模型。因此,采用传统的LR模型对用户对应用程序的点击率预估值并不准确,导致根据传统的LR模型并不能准确地给用户推送相关的内容。At present, in the application store, the most frequently used model for app (APP) click rate estimation is the LR model. The idea is to cross the user feature and the application feature as the input feature of the final model, and train the LR model. A big drawback of this approach is that it is a model for all users. In many cases, the optimal LR model of the A user group is X. The optimal LR model of the B user group is Y. X and Y are likely to be conflicts. of. That is, the X model is a positive feature on a feature and the Y model is a negative feature. However, if there is only one LR model in order to simultaneously satisfy the A user group and the B user group to generate a compromise model Z, the model Z is not the optimal model. Therefore, the traditional LR model is not accurate for the user's click rate estimation of the application, which makes it impossible to accurately push the relevant content to the user according to the traditional LR model.
发明内容Summary of the invention
本发明的目的是提供一种内容推送方法、装置及计算机设备,实现确定用户所属群体后根据该群体的点击率预估模型获取预估点击率,提高预估点击率的准确率,准确地给用户推送相关内容。The object of the present invention is to provide a content pushing method, device and computer device, which can determine an estimated click rate according to a click rate prediction model of the group after determining the group to which the user belongs, and improve the accuracy of the estimated click rate, and accurately Users push relevant content.
为了实现上述目的,本发明提供以下技术方案:In order to achieve the above object, the present invention provides the following technical solutions:
一种内容推送方法,包括以下步骤:获取曝光应用程序的用户的特征和对应用程序的点击行为;根据所述用户的特征及对应用程序的点击行为,生成样本;将所述样本输入到分群模型中,确定所述样本在所述分群模型中所属的群体;将所述样本输入到所述群体对应的点击率预估模型中,获得所述用户对所述应用程序的预估点击率;根据应用程序的预估点击率,向所述用户推送内容。A content pushing method includes the steps of: acquiring a feature of a user of an exposure application and a click behavior on the application; generating a sample according to the feature of the user and a click behavior of the application; inputting the sample into the group In the model, determining a group to which the sample belongs in the group model; inputting the sample into a click rate estimation model corresponding to the group, and obtaining an estimated click rate of the user to the application; Push content to the user based on the estimated clickthrough rate of the app.
在其中一个实施例中,所述根据应用程序的预估点击率,向所述用户推送内容,包括:根据应用程序的预估点击率,向所述用户推送所述应用程序。In one embodiment, the pushing content to the user based on an estimated click rate of the application comprises: pushing the application to the user based on an estimated click rate of the application.
在其中一个实施例中,所述根据应用程序的预估点击率,向所述用户推送所述应用程序,包括:确认应用程序的预估点击率大于阈值,向所述用户推送所述应用程序。In one embodiment, the pushing the application to the user according to an estimated click rate of the application comprises: confirming that the estimated click rate of the application is greater than a threshold, and pushing the application to the user .
在其中一个实施例中,所述根据应用程序的预估点击率,向所述用户推送所述应用程序,包括计算所述用户对应用商场中全部应用程序的预估点击率,按预估点击率从高到低对应用程序进行排序,向所述用户推送排在最前面的预置数量的应用程序。In one embodiment, the pushing the application to the user according to an estimated click rate of the application, including calculating the estimated click rate of the user to all applications in the application mall, according to the estimated click The rate is sorted from high to low, and the user is pushed to the top of the preset number of applications.
在其中一个实施例中,所述获取曝光应用程序的用户的特征和对应用程序的点击行为之前,还包括:获取曝光应用程序的历史用户的特征和对应用程序的点击行为;根据所述历史用户的特征及对应用程序的点击行为,生成样本;使用所述样本训练包含若干群体的分群模型和群体对应的点击率预估模型。In one embodiment, before acquiring the feature of the user of the exposure application and the click behavior on the application, the method further includes: acquiring a feature of the historical user of the exposure application and a click behavior on the application; according to the history The user's characteristics and the click behavior of the application generate samples; the sample is used to train a cluster model containing several groups and a population-specific click rate prediction model.
在其中一个实施例中,所述使用所述样本训练包含若干群体的分群模型,包括:根据决策树算法使用所述样本训练分群模型。In one of the embodiments, the training the clustering model comprising a plurality of populations using the sample comprises: training the clustering model using the sample according to a decision tree algorithm.
在其中一个实施例中,所述使用所述样本训练包含若干群体的分群模型和群体对应的点击率预估模型,包括:使用所述样本训练包含若干群体的分群模型,将所述样本按群体进行分类,使用群体对应分类的样本训练该群体的点击率预估模型。In one embodiment, the training the clustering model comprising a plurality of groups and the population corresponding to the click rate prediction model using the sample comprises: training the clustering model comprising several groups using the sample, and grouping the samples into groups To classify, use the sample of the group corresponding classification to train the group's click rate prediction model.
在其中一个实施例中,所述使用群体对应分类的样本训练该群体的点击率预估模型,包括:使用群体对应分类的样本,利用逻辑斯蒂算法,训 练所述点击率预估模型。In one embodiment, the using the population corresponding to the classified sample to train the population's click rate prediction model comprises: using the sample of the group corresponding classification, using the logistic algorithm to train the click rate prediction model.
在其中一个实施例中,所述点击率预估模型为根据逻辑斯蒂回归的公式获得的点击率预估模型;所述逻辑斯蒂回归的公式为:In one embodiment, the click rate prediction model is a click rate estimation model obtained according to a formula of logistic regression; the formula of the logistic regression is:
Figure PCTCN2018105810-appb-000001
Figure PCTCN2018105810-appb-000001
其中X表示输入变量,β表示不同特征的权重向量。Where X represents the input variable and β represents the weight vector of the different features.
在其中一个实施例中,所述使用群体对应分类的样本,利用逻辑斯蒂算法,训练所述点击率预估模型,包括:根据梯度下降法利用逻辑斯蒂算法对所述群体进行训练,训练对应群体中的点击率预估模型。In one embodiment, the using the population corresponding to the classified samples, using the logistic algorithm, training the click rate prediction model, including: training the group by using a logistic algorithm according to a gradient descent method, training The click rate prediction model in the corresponding group.
在其中一个实施例中,所述用户的特性包括用户的自然属性、社会属性或偏好属性。In one of the embodiments, the characteristics of the user include a natural attribute, a social attribute, or a preference attribute of the user.
在其中一个实施例中,所述根据所述用户的特征及对应用程序的点击行为,生成样本,包括:以所述用户的特征作为输入变量,以所述点击行为作为目标变量,生成样本。In one embodiment, the generating a sample according to the feature of the user and the click behavior of the application includes: generating a sample by using the feature of the user as an input variable and using the click behavior as a target variable.
一种内容推送装置,包括:第一获取模块,用于获取曝光应用程序的用户的特征和对应用程序的点击行为;生成模块,用于根据所述用户的特征及对应用程序的点击行为,生成样本;确定模块,用于将所述样本输入到分群模型中,确定所述样本在所述分群模型中所属的群体;第二获取模块,用于将所述样本输入到所述群体对应的点击率预估模型中,获得所述用户对所述应用程序的预估点击率;推送模块,用于根据应用程序的预估点击率,向所述用户推送内容。A content pushing device, comprising: a first obtaining module, configured to acquire a feature of a user exposing an application and a click behavior on the application; and a generating module, configured to: according to the feature of the user and the click behavior of the application, Generating a sample; a determining module, configured to input the sample into a grouping model, determining a group to which the sample belongs in the grouping model; and a second obtaining module, configured to input the sample to the group corresponding to the group In the click rate estimation model, the estimated click rate of the user to the application is obtained; and the pushing module is configured to push content to the user according to the estimated click rate of the application.
一种计算机设备,其包括:一个或多个处理器;存储器;一个或多个应用程序,其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个应用程序配置用于执行上述任一实施例所述的内容推送方法。A computer device comprising: one or more processors; a memory; one or more applications, wherein the one or more applications are stored in the memory and configured to be by the one or more Executing by the processor, the one or more applications are configured to perform the content push method described in any of the above embodiments.
本发明的一个实施例,提供的一种内容推送方法,根据用户的特征和对应用程序的点击行为生成样本,并将该样本输入到分群模型中确认样本属于分群模型中的群体。再根据该群体中的点击率预估模型获得用户对应用程序的预估点击率,然后根据应用程序的预估点击率给用户推送对应的 内容。该内容推送方法确定用户所属群体后,根据该群体的点击率预估模型获取预估点击率,提高预估点击率的准确率,准确地给用户推送相关内容。例如,相关内容可包括应用程序、相似应用程序、应用程序介绍以及与应用程序相关的广告及资讯等。此外,该内容推送方法根据不同的用户群进行分群模型训练,并对分群模型中不同的群体训练点击率预估模型,以使得不同的用户群对应不同的点击率预估模型,根据对应的点击率预估模型获得用户对应用程序更加准确的预估点击率,以准确地给用户推送相关内容。According to an embodiment of the present invention, a content pushing method is provided, which generates a sample according to a feature of a user and a click behavior of the application, and inputs the sample into the grouping model to confirm that the sample belongs to a group in the grouping model. Then, according to the click rate estimation model in the group, the estimated click rate of the user is obtained, and then the corresponding content is pushed to the user according to the estimated click rate of the application. After the content pushing method determines the group to which the user belongs, the estimated click rate is obtained according to the click rate estimation model of the group, the accuracy of the estimated click rate is improved, and the relevant content is accurately pushed to the user. For example, relevant content may include applications, similar applications, application introductions, and advertisements and information related to the applications. In addition, the content push method performs group model training according to different user groups, and trains a click rate prediction model for different groups in the group model, so that different user groups correspond to different click rate estimation models, according to corresponding clicks. The rate prediction model obtains a more accurate estimated click rate for the user to accurately push relevant content to the user.
附图说明DRAWINGS
本发明上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from
图1为本发明一实施例中的一种内容推送方法的方法流程图;1 is a flowchart of a method for pushing a content according to an embodiment of the present invention;
图2为本发明另一实施例中的一种内容推送方法的方法流程图;2 is a flowchart of a method for pushing a content according to another embodiment of the present invention;
图3为本发明一个实施例中的建立分群逻辑斯蒂回归点击率预估模型方法的方法流程图;3 is a flow chart of a method for establishing a clustered logistic regression click rate prediction model method according to an embodiment of the present invention;
图4为本发明一实施例中的一种内容推送方法的模型的使用场景的流程图;4 is a flowchart of a usage scenario of a model of a content pushing method according to an embodiment of the present invention;
图5为本发明又一实施例中的一种内容推送方法的方法流程图;FIG. 5 is a flowchart of a method for a content push method according to still another embodiment of the present invention; FIG.
图6为本发明再一实施例中的一种内容推送方法的方法流程图;6 is a flowchart of a method for a content push method according to still another embodiment of the present invention;
图7为本发明一实施例中的一种内容推送装置的结构示意图;FIG. 7 is a schematic structural diagram of a content pushing apparatus according to an embodiment of the present invention; FIG.
图8为本发明一实施例中的计算机设备结构示意图。FIG. 8 is a schematic structural diagram of a computer device according to an embodiment of the present invention.
具体实施方式Detailed ways
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本发明,而不能解释为对本发明的限制。The embodiments of the present invention are described in detail below, and the examples of the embodiments are illustrated in the drawings, wherein the same or similar reference numerals are used to refer to the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the drawings are intended to be illustrative of the invention and are not to be construed as limiting.
本领域技术人员应当理解,本发明所述的内容推送方法中的内容包括:应用程序、相似应用程序、应用程序介绍、与应用程序相关的广告以及资讯等。Those skilled in the art should understand that the content in the content pushing method of the present invention includes: an application, a similar application, an application introduction, an advertisement related to the application, information, and the like.
图1为本发明一实施例中的一种内容推送方法的方法流程图。如图1所示,本发明的一种内容推送方法,包括步骤:S10-S50。FIG. 1 is a flowchart of a method for a content push method according to an embodiment of the present invention. As shown in FIG. 1, a content pushing method of the present invention includes the steps of: S10-S50.
S10,获取曝光应用程序的用户的特征和对应用程序的点击行为。S10. Obtain the characteristics of the user of the exposure application and the click behavior of the application.
服务器检测到用户曝光应用程序时,获取曝光该应用程序的用户的特征,并获取用户对应用程序的点击行为。此处,用户对应用程序的点击行为包括用户点击了该应用程序,以及用户没有点击该应用程序。应用程序可以是应用商店里面的各类应用程序,包括游戏类应用程序、社交类应用程序、阅读类应用程序、系统类应用程序及购物类应用程序等。When the server detects the user exposing the application, it acquires the characteristics of the user who exposed the application and obtains the user's click behavior on the application. Here, the user's click behavior on the application includes the user clicking on the application and the user not clicking on the application. The application can be various applications in the app store, including game apps, social apps, reading apps, system apps, and shopping apps.
用户的特征包括用户的自然属性、用户的社会属性或用户的偏好属性。用户的自然属性如用户的年龄、用户的性别等。用户的社会属性如用户的文化水平、用户的职业及用户的地域等。用户的偏好属性如“军事迷”、“科技爱好者”、“足球迷”及“游戏迷”等。The characteristics of the user include the user's natural attributes, the user's social attributes, or the user's preferred attributes. The user's natural attributes such as the age of the user, the gender of the user, and so on. The social attributes of the user, such as the user's cultural level, the user's occupation, and the user's geographic area. User preferences such as "military fans", "technology enthusiasts", "football fans" and "game fans".
在一实施方式中,服务器获取曝光应用程序的用户的特征,并根据用户的特征建立刻画用户的数学模型。对用户的刻画,事实上就是把用户的特性找出来。获取用户特性的方法包括以下几种方法:In one embodiment, the server obtains features of the user exposing the application and builds a mathematical model that characterizes the user based on the characteristics of the user. The portrayal of the user is in fact to find out the characteristics of the user. The methods for obtaining user characteristics include the following methods:
方法1:通过用户的自然属性对用户的数学模型进行刻画。例如根据用户的年龄及用户的性别对用户的数学模型进行刻画。其中,对于连续的量要做离散化处理。如年龄离散化为“儿童”、“少年”、“青年”、“中年”及“老年”等。Method 1: Characterize the user's mathematical model through the user's natural attributes. For example, the user's mathematical model is characterized according to the age of the user and the gender of the user. Among them, discretization processing is performed for continuous quantities. For example, the age is discretized into “children”, “youth”, “youth”, “middle age” and “old age”.
方法2:通过用户的社会属性对用户的数学模型进行刻画。例如根据用户的文化水平、用户的职业及用户的地域对用户的数学模型进行刻画。Method 2: Characterize the user's mathematical model through the user's social attributes. For example, the user's mathematical model is characterized according to the user's cultural level, the user's occupation, and the user's geographic location.
方法3:通过用户的偏好属性对用户的数学模型进行刻画。例如根据用户对文章的偏好对用户的数学模型进行刻画,如用户喜欢军事的文章,将用户刻画“军事迷”。用户喜欢科技类的文章,将用户刻画为“科技爱好者”。用户喜欢看足球新闻,将用户刻画为“足球迷”。Method 3: Characterize the user's mathematical model through the user's preference attributes. For example, the user's mathematical model is characterized according to the user's preference for the article, such as the user's favorite military article, the user is portrayed as "military fans." Users like technology articles and portray users as "tech enthusiasts." Users like to watch football news and portray users as "football fans."
在一具体实施方式中,根据不同方法及不同维度刻画出的用户汇总起 来,就形成用户画像。把用户画像作为刻画用户的数学模型。例如:用户a的特征有“青年”、“大学生”、“常住广州”及“科技爱好者”,这4个特征就是用户a的数学刻画模型。In a specific embodiment, a user portrait is formed by summarizing the users depicted by different methods and different dimensions. The user image is used as a mathematical model for depicting the user. For example, the characteristics of user a are “youth”, “college students”, “residents living in Guangzhou” and “technical enthusiasts”. These four characteristics are the mathematical characterization model of user a.
S20,根据所述用户的特征及对应用程序的点击行为,生成样本。S20. Generate a sample according to the characteristics of the user and the click behavior of the application.
服务器获取到用户的特征和用户对应用程序的点击行为之后,根据用户的特征和对应用程序的点击行为,生成样本。具体地,以用户的特征作为输入变量,以用户对应用程序的点击行为作为目标变量,生成样本。After the server obtains the characteristics of the user and the user's click behavior on the application, the sample is generated according to the characteristics of the user and the click behavior of the application. Specifically, the user's feature is taken as an input variable, and the user's click behavior of the application is used as a target variable to generate a sample.
对于一个用户,曝光一个应用程序为一条样本。以用户的特征作为输入变量,以用户点击行为作为目标变量,用户点击了目标变量为1,用户没有点击目标变量为0。例如:某一用户具有特征(标签)为“青年”、“军事迷”,给用户曝光的应用程序是“微信”。用户没有点击“微信”应用程序,那么记录样本的形式为:x=(1,1,…),y=0。x表示其中的一个样本。x前面两个1分别表示用户具有“青年”、“军事迷”标签。x中不具备的标签用0表示。y=0表示负样本。For a user, expose an application as a sample. Taking the user's characteristics as the input variable and the user's click behavior as the target variable, the user clicks the target variable to be 1, and the user does not click the target variable to be 0. For example, a user has characteristics (labels) as “youth” and “military fans”, and the application exposed to the user is “WeChat”. If the user does not click on the "WeChat" application, the record sample is in the form: x = (1, 1, ...), y = 0. x represents one of the samples. The two fronts of x indicate that the user has the label "Youth" and "Military Fan". Labels that are not available in x are indicated by 0. y=0 indicates a negative sample.
S30,将所述样本输入到分群模型中,确定所述样本在所述分群模型中所属的群体。S30. Input the sample into a grouping model, and determine a group to which the sample belongs in the group model.
服务器根据用户的特征和用户对应用程序的点击行为生成样本后,将样本输入到分群模型中。同时确定样本在分群模型中所属的群体。具体地,分群模型中有多个群体,通过将样本输入到分群模型的多个群体中,获取样本在多个群体中的结果值,根据结果值可获得样本属于分群模型中所属的群体。After the server generates a sample based on the characteristics of the user and the user's click behavior on the application, the server inputs the sample into the cluster model. Also determine the population to which the sample belongs in the clustering model. Specifically, there are multiple groups in the clustering model, and the sample values are obtained in the plurality of groups of the group model, and the result values of the samples in the plurality of groups are obtained, and the sample belongs to the group belonging to the group model according to the result value.
在一实施方式中,分群模型可以通过收集若干用户的特征,以及对应用户对应用程序的点击行为,以生成样本。根据该样本对分群模型进行训练,从而获得准确的针对多个用户群的分群模型,进而再对不同用户群建立应用程序点击率预估模型。不同用户群建立的应用程序点击率预估模型可以是相同的点击率预估模型,也可以是不同的点击率预估模型。In an embodiment, the clustering model can generate samples by collecting features of several users and corresponding user click behaviors to the application. According to the sample, the clustering model is trained to obtain an accurate clustering model for multiple user groups, and then an application click rate prediction model is established for different user groups. The application click rate estimation model established by different user groups can be the same click rate estimation model or different click rate estimation models.
在其他的实施方式中,分群模型也可以是预知的分群模型。即针对不同的用户群已形成的分群模型。In other embodiments, the clustering model can also be a predictive clustering model. That is, a clustering model that has been formed for different user groups.
S40,将所述样本输入到所述群体对应的点击率预估模型中,获得所述 用户对所述应用程序的预估点击率。S40. Enter the sample into a click rate estimation model corresponding to the group, and obtain an estimated click rate of the user to the application.
服务器获取到样本属于分群模型中所属的群体后,将该样本输入到该群体对应的点击率预估模型中,从而获得用户对应用程序的预估点击率。分群模型中,每个群体对应的点击率预估模型均为逻辑斯蒂回归的标准公式获得的点击率预估模型。在其他的实施方式中,每个群体对应的点击率预估模型也可以是其他公式获得的点击率预估模型。此外,分群模型中的每个群体对应的点击率预估模型也可以不完全相同。After the server obtains the sample belonging to the group to which the cluster model belongs, the sample is input into the click rate estimation model corresponding to the group, thereby obtaining the estimated click rate of the user to the application. In the clustering model, the click rate prediction model corresponding to each group is the click rate estimation model obtained by the standard formula of logistic regression. In other embodiments, the click rate prediction model corresponding to each group may also be a click rate estimation model obtained by other formulas. In addition, the click rate prediction model corresponding to each group in the cluster model may not be identical.
S50,根据应用程序的预估点击率,向所述用户推送内容。S50: Push content to the user according to an estimated click rate of the application.
在本实施例中,根据步骤S10至步骤S40,服务器可获取到所述用户对应用商场中每个应用程序的预估点击率。根据所述用户对每个应用程序的预估点击率,可向用户推送相应的内容。In this embodiment, according to step S10 to step S40, the server may obtain the estimated click rate of the user for each application in the application mall. According to the estimated click rate of the user for each application, the corresponding content can be pushed to the user.
其中,此处的内容可包括应用程序本身以及与应用程序相关的内容。比如,相似的应用程序以及应用程序介绍等,还可包括与根据应用程序获得的相应的广告以及资讯内容等。Among them, the content here can include the application itself and the content related to the application. For example, similar applications and application introductions, etc., may also include corresponding advertisements and information content obtained according to the application.
在一实施方式中,根据步骤S10至步骤S40,服务器可获取到所述用户对应用商场中每个应用程序的预估点击率。根据所述用户对每个应用程序的预估点击率,可向用户推送该应用程序。In an embodiment, according to step S10 to step S40, the server may obtain the estimated click rate of the user for each application in the application mall. The application can be pushed to the user based on the estimated click rate of the user for each application.
上述内容推送方法,根据用户的特征和对应用程序的点击行为生成样本,并将该样本输入到分群模型中确认样本属于分群模型中的群体。再根据该群体中的点击率预估模型获得用户对应用程序的预估点击率。最后根据用户对应用程序的预估点击率,向该用户推送内容。该内容推送方法在确定用户所属群体后,根据该群体的点击率预估模型获取预估点击率,提高预估点击率的准确率,能够准确地根据应用程序的预估点击率给用户推送相应的内容。例如,能够准确地根据应用程序的预估点击率给用户推送该应用程序。The above content pushing method generates a sample according to the characteristics of the user and the click behavior of the application, and inputs the sample into the group model to confirm that the sample belongs to the group in the group model. The estimated click rate of the user to the application is obtained according to the click rate estimation model in the group. Finally, the content is pushed to the user based on the estimated click rate of the application to the user. After determining the group to which the user belongs, the content pushing method obtains the estimated click rate according to the click rate estimation model of the group, improves the accuracy of the estimated click rate, and can accurately push the corresponding user according to the estimated click rate of the application. Content. For example, the application can be pushed to the user accurately based on the estimated click rate of the application.
图2为本发明另一实施例中的一种内容推送方法的方法流程图。如图2所示,步骤S10之前,还包括:步骤S101-S105。FIG. 2 is a flowchart of a method for a content push method according to another embodiment of the present invention. As shown in FIG. 2, before step S10, the method further includes: steps S101-S105.
S101,获取曝光应用程序的历史用户的特征和对应用程序的点击行为。S101. Acquire a feature of a historical user of the exposure application and a click behavior to the application.
服务器获取曝光该应用程序的历史所有用户的特征,以及每个用户对 应用程序的点击行为。此处,用户对应用程序的点击行为包括用户点击了该应用程序,以及用户没有点击该应用程序。应用程序可以是应用商店里面的各类应用程序,包括游戏类应用程序、社交类应用程序、阅读类应用程序、系统类应用程序及购物类应用程序等。The server obtains the characteristics of all users exposing the history of the application, as well as the click behavior of each user to the application. Here, the user's click behavior on the application includes the user clicking on the application and the user not clicking on the application. The application can be various applications in the app store, including game apps, social apps, reading apps, system apps, and shopping apps.
用户的特征包括用户的自然属性、用户的社会属性或用户的偏好属性。用户的自然属性如用户的年龄、用户的性别等。用户的社会属性如用户的文化水平、用户的职业及用户的地域等。用户的偏好属性如“军事迷”、“科技爱好者”、“足球迷”及“游戏迷”等。The characteristics of the user include the user's natural attributes, the user's social attributes, or the user's preferred attributes. The user's natural attributes such as the age of the user, the gender of the user, and so on. The social attributes of the user, such as the user's cultural level, the user's occupation, and the user's geographic area. User preferences such as "military fans", "technology enthusiasts", "football fans" and "game fans".
在一实施方式中,服务器获取曝光应用程序的用户的特征,并根据用户的特征建立刻画用户的数学模型。对用户的刻画,事实上就是把用户的特性找出来。获取用户特性的方法包括以下几种方法:In one embodiment, the server obtains features of the user exposing the application and builds a mathematical model that characterizes the user based on the characteristics of the user. The portrayal of the user is in fact to find out the characteristics of the user. The methods for obtaining user characteristics include the following methods:
方法1:通过用户的自然属性对用户的数学模型进行刻画。例如根据用户的年龄及用户的性别对用户的数学模型进行刻画。其中,对于连续的量要做离散化处理。如年龄离散化为“儿童”、“少年”、“青年”、“中年”及“老年”等。Method 1: Characterize the user's mathematical model through the user's natural attributes. For example, the user's mathematical model is characterized according to the age of the user and the gender of the user. Among them, discretization processing is performed for continuous quantities. For example, the age is discretized into “children”, “youth”, “youth”, “middle age” and “old age”.
方法2:通过用户的社会属性对用户的数学模型进行刻画。例如根据用户的文化水平、用户的职业及用户的地域对用户的数学模型进行刻画。Method 2: Characterize the user's mathematical model through the user's social attributes. For example, the user's mathematical model is characterized according to the user's cultural level, the user's occupation, and the user's geographic location.
方法3:通过用户的偏好属性对用户的数学模型进行刻画。例如根据用户对文章的偏好对用户的数学模型进行刻画,如用户喜欢军事的文章,将用户刻画“军事迷”。用户喜欢科技类的文章,将用户刻画为“科技爱好者”。用户喜欢看足球新闻,将用户刻画为“足球迷”。Method 3: Characterize the user's mathematical model through the user's preference attributes. For example, the user's mathematical model is characterized according to the user's preference for the article, such as the user's favorite military article, the user is portrayed as "military fans." Users like technology articles and portray users as "tech enthusiasts." Users like to watch football news and portray users as "football fans."
在一具体实施方式中,根据不同方法及不同维度刻画出的用户汇总起来,就形成用户画像。把用户画像作为刻画用户的数学模型。例如:用户a的特征有“青年”、“大学生”、“常住广州”及“科技爱好者”,这4个特征就是用户a的数学刻画模型。In a specific embodiment, users drawn according to different methods and different dimensions are aggregated to form a user portrait. The user image is used as a mathematical model for depicting the user. For example, the characteristics of user a are “youth”, “college students”, “residents living in Guangzhou” and “technical enthusiasts”. These four characteristics are the mathematical characterization model of user a.
S103,根据所述历史用户的特征及对应用程序的点击行为,生成样本。S103. Generate a sample according to the characteristics of the historical user and the click behavior of the application.
服务器获取到历史所有用户的特征和每个用户对应用程序的点击行为之后,根据每个用户的特征和对应用程序的点击行为,生成样本。具体地,以用户的特征作为输入变量,以用户对应用程序的点击行为作为目标变量, 生成样本。After the server obtains the characteristics of all the users of the history and the click behavior of each user to the application, the sample is generated according to the characteristics of each user and the click behavior of the application. Specifically, the user's feature is taken as an input variable, and the user's click behavior of the application is used as a target variable to generate a sample.
对于一个用户,曝光一个应用程序为一条样本。以用户的特征作为输入变量,以用户点击行为作为目标变量,用户点击了目标变量为1,用户没有点击目标变量为0。例如:某一用户具有特征(标签)为“青年”、“军事迷”,给用户曝光的应用程序是“微信”。用户没有点击“微信”应用程序,那么记录样本的形式为:x=(1,1,…),y=0。x表示其中的一个样本。x前面两个1分别表示用户具有“青年”、“军事迷”标签。x中不具备的标签用0表示。y=0表示负样本。For a user, expose an application as a sample. Taking the user's characteristics as the input variable and the user's click behavior as the target variable, the user clicks the target variable to be 1, and the user does not click the target variable to be 0. For example, a user has characteristics (labels) as “youth” and “military fans”, and the application exposed to the user is “WeChat”. If the user does not click on the "WeChat" application, the record sample is in the form: x = (1, 1, ...), y = 0. x represents one of the samples. The two fronts of x indicate that the user has the label "Youth" and "Military Fan". Labels that are not available in x are indicated by 0. y=0 indicates a negative sample.
S105,使用所述样本训练包含若干群体的分群模型和群体对应的点击率预估模型。S105. Train the cluster model including several groups and the click rate prediction model corresponding to the group using the sample.
在本实施方式中,样本对应为根据历史所有用户的特征和对应用程序的点击行为生成的样本。根据该样本,可训练包含若干群体的分群模型。同时,根据该样本也可训练分群模型中每个群体对应的点击率预估模型。具体地,使用样本训练包含若干群体的分群模型,将样本按群体进行分类,使用群体对应分类的样本训练该群体的点击率预估模型。In the present embodiment, the sample corresponds to a sample generated based on the characteristics of all users of the history and the click behavior of the application. Based on this sample, a clustering model containing several populations can be trained. At the same time, according to the sample, the corresponding click rate estimation model of each group in the cluster model can also be trained. Specifically, the sample training model including several groups is used to classify the samples by group, and the group corresponding to the group is used to train the group's click rate estimation model.
其中,使用群体对应分类的样本训练该群体的点击率预估模型,包括:使用群体对应分类的样本,根据梯度下降法,利用逻辑斯蒂算法,训练点击率预估模型。点击率预估模型为根据逻辑斯蒂回归的标准公式获得的点击率预估模型,逻辑斯蒂回归的标准公式为:
Figure PCTCN2018105810-appb-000002
其中X表示输入变量,β表示不同特征的权重向量。
The sample rate prediction model of the group is trained by using the sample corresponding to the group, including: using the sample corresponding to the group, and using the logistic method to train the click rate estimation model according to the gradient descent method. The click rate prediction model is a click rate estimation model obtained from the standard formula of logistic regression. The standard formula of logistic regression is:
Figure PCTCN2018105810-appb-000002
Where X represents the input variable and β represents the weight vector of the different features.
上述采用逻辑斯蒂算法获取该群体的点击率预估模型仅仅为其中一种获取点击率预估模型的方法,除采用上述逻辑斯蒂算法外,还可以使用其他算法获得该群体的点击率预估模型。The above-mentioned statistic algorithm is used to obtain the click rate prediction model of the group, which is only one of the methods for obtaining the click rate estimation model. In addition to the above-mentioned logic algorithm, other algorithms can be used to obtain the click rate of the group. Estimate the model.
在本实施方式中,根据决策树算法使用样本训练分群模型。在其他实施方式中,也可根据朴素贝叶斯算法、SVM(Support Vector Machines,支持向量机)算法或者神经网络算法使用样本训练分群模型。也即是,使用样本训练分群模型的算法不作限定,本方案仅仅是提供其中的几种算法作为解释说明。In the present embodiment, the sample training clustering model is used according to a decision tree algorithm. In other embodiments, the sample training clustering model may also be used according to the Naive Bayes algorithm, the SVM (Support Vector Machines) algorithm, or the neural network algorithm. That is to say, the algorithm for using the sample training cluster model is not limited, and the present scheme merely provides several algorithms as explanations.
综上所述,在本实施方式中,通过收集历史用户的特征和对应用户对 应用程序的点击行为,生成样本。根据该样本对包含若干群体的分群模型进行训练,并且对分群模型中各群体对应的点击率预估模型进行训练,从而生成针对不同用户群的点击率预估模型,进而根据不同用户群的点击率预估模型获取对应用户对应用程序的预估点击率,根据该预估点击率能够准确地给用户推送相应的内容。例如,根据该预估点击率能够准确地给用户推送相应的应用程序。In summary, in the present embodiment, samples are generated by collecting the characteristics of the historical user and the corresponding user's click behavior on the application. According to the sample, the grouping model including several groups is trained, and the corresponding click rate estimation model of each group in the group model is trained to generate a click rate estimation model for different user groups, and then clicks according to different user groups. The rate estimation model obtains the estimated click rate of the corresponding user to the application, and according to the estimated click rate, the corresponding content can be accurately pushed to the user. For example, based on the estimated click rate, the corresponding application can be accurately pushed to the user.
以下提供一种建立分群模型的具体实施方式:The following provides a specific implementation of establishing a clustering model:
把最近一天的所有曝光事件按照上述方法生成训练样本的输入变量,对于每一个样本,在每一次迭代过程中每个样本都会归属到一个群。例如:分群数3,样本x是一个负样本,在某次迭代过程中利用“群1”的LR模型计算的值是0.3,利用“群2”的LR模型计算的值是0.2,利用“群3”的LR模型计算的值是0.1,那么“群3”的LR模型误差最小,样本x的类别是“群3”。利用决策树算法,建立分群模型。The input variables of the training samples are generated for all exposure events of the most recent day as described above. For each sample, each sample is assigned to a group during each iteration. For example: group number 3, sample x is a negative sample, the value calculated by the LR model of "group 1" is 0.3 in a certain iteration, and the value calculated by the LR model of "group 2" is 0.2, using "group The value calculated by the LR model of 3" is 0.1, then the error of the LR model of "group 3" is the smallest, and the category of the sample x is "group 3". A clustering model is established using a decision tree algorithm.
C(X)=Card(X)。C(X) = Card(X).
其中Card(X)为决策树算法,该方法的训练用业界通用的模型Card分类算法,这里不再赘述。Card(X) is a decision tree algorithm. The training of this method uses the industry-wide model Card classification algorithm, which will not be described here.
在一实施方式中,如图3所示,建立分群逻辑斯蒂回归点击率预估模型的方法包括:步骤S301-S311。In an embodiment, as shown in FIG. 3, the method for establishing a clustered logistic regression click rate prediction model includes: steps S301-S311.
S301,初始化:把样本随机划分n群,分群LR(Logistic Regression,逻辑回归)迭代次数i=0,最大迭代次数为m,迭代误差阈值为c。S301, initialization: randomly divide the samples into n groups, group LR (Logistic Regression, logistic regression), the number of iterations i=0, the maximum number of iterations is m, and the iteration error threshold is c.
在本实施方式中,将历史所有用户的样本随机划分为n个群。分群LR的迭代次数为i。初始化时,迭代次数i=0。此处,假设最大的迭代次数为m,迭代误差阈值为c。In the present embodiment, samples of all users in history are randomly divided into n groups. The number of iterations of the group LR is i. At initialization, the number of iterations i=0. Here, assume that the maximum number of iterations is m and the iteration error threshold is c.
S303,对每个群的样本训练一个LR模型(正样本为1,负样本为0)。S303. Train an LR model for each group of samples (the positive sample is 1 and the negative sample is 0).
在本实施方式中,根据每个群的样本训练一个对应的LR模型。其中,正样本为1,负样本为0。In the present embodiment, a corresponding LR model is trained based on samples of each group. Among them, the positive sample is 1 and the negative sample is 0.
S305,把每一条样本分别代入每一个LR模型,预测其点击概率,把该样本分到最接近该样本目标值的群(负样本时最接近0,正样本是最接近1)。S305, each sample is substituted into each LR model, and the click probability is predicted, and the sample is divided into the group closest to the target value of the sample (the negative sample is closest to 0, and the positive sample is closest to 1).
根据样本随机分群,并根据每个群的样本训练该群对应的LR模型之后, 把样本中每一条样本分别代入每个群的LR模型中,分别获得每一条样本对应每一个群中根据该群LR模型获得的预测点击率。将每条样本根据每个群获得的预测点击率进行比对,把该样本分到最接近该样本目标值的群中(其中,负样本时最接近0,正样本是最接近1)。According to the random grouping of the samples, and training the corresponding LR model of the group according to the samples of each group, each sample in the sample is substituted into the LR model of each group, and each sample is obtained corresponding to each group according to the group. The predicted click rate obtained by the LR model. Each sample is compared according to the predicted click rate obtained by each group, and the sample is divided into the group closest to the target value of the sample (where the negative sample is closest to 0 and the positive sample is closest to 1).
S307,计算所有样本预测的误差。误差为Msm(i)。Msm(i)=sum(|样本在该群LR模型的预测值—样本真实值|)。S307, calculating the error of all sample predictions. The error is Msm(i). Msm(i)=sum(|The predicted value of the sample in the LR model of the group - the true value of the sample|).
在本实施方式中,计算每个群中所有样本预测点击率与样本真是点击率的误差值。假设该误差值为Msm(i),则Msm(i)=sum(|样本在该群LR模型的预测值—样本真实值|)。In the present embodiment, the error value of the predicted click rate of all samples in each group and the true click rate of the sample is calculated. Assuming that the error value is Msm(i), then Msm(i)=sum(|the predicted value of the sample in the LR model of the group - the true value of the sample|).
S309,建立判断条件,即|Msm(i)—Msm(i-1)|<c Or i>=m。S309, a judgment condition is established, that is, |Msm(i) - Msm(i-1)|<c Or i>=m.
在本实施方式中,对于迭代过程建立判断条件。其中,判断条件为|Msm(i)—Msm(i-1)|<c或者i>=m。当条件不满足判断条件时,返回执行步骤S303。当条件满足判断条件时,执行以下步骤S311。In the present embodiment, a judgment condition is established for the iterative process. Wherein, the judgment condition is |Msm(i) - Msm(i-1)|<c or i>=m. When the condition does not satisfy the judgment condition, the process returns to step S303. When the condition satisfies the judgment condition, the following step S311 is performed.
S311,训练分群模型,用于预测未知样本属于哪个分群:把每个群看作一类,总共有n个类别,上述步骤中已经得到每个样本属于哪个群,以此作为目标变量,把每个样本特征作为分类的输入变量,代入分类算法,训练分群模型。S311, training a clustering model for predicting which group of unknown samples belongs to: treating each group as a class, there are a total of n categories, and in the above steps, which group each sample belongs to, as the target variable, each The sample features are used as input variables of the classification, substituted into the classification algorithm, and trained the cluster model.
在本实施方式中,当条件满足步骤309中的判断条件时,也即是将所有的样本分到归属的群体中。进一步地,训练分群模型,用于预测未知样本归属的群体。具体地,将每个群看作一类,假设总共有n个类别,上述步骤中已经得到每个样本属于哪个群,以此作为目标变量,把每个样本特征作为分类的输入变量,代入分类算法中,以训练分群模型。在本实施方式中,分类算法选择决策树算法。在其他实施方式中,也可以选择朴素贝叶斯算法、SVM算法或者神经网络算法。In the present embodiment, when the condition satisfies the judgment condition in step 309, that is, all the samples are classified into the belonging group. Further, a clustering model is trained to predict the population to which the unknown sample belongs. Specifically, each group is regarded as a class, assuming that there are a total of n categories, in which the group has obtained which group each sample belongs to, as the target variable, each sample feature is used as a classification input variable and substituted into the classification. In the algorithm, the clustering model is trained. In the present embodiment, the classification algorithm selects a decision tree algorithm. In other embodiments, a naive Bayesian algorithm, an SVM algorithm, or a neural network algorithm may also be selected.
图4提供了一种内容推送方法的模型的使用场景。此处的内容为样本。样本x可以是应用程序。如图4所示,包括以下步骤:S401-407。FIG. 4 provides a usage scenario of a model of a content push method. The content here is a sample. Sample x can be an application. As shown in FIG. 4, the following steps are included: S401-407.
S401,未知样本x的输入特征数据整理。S401, the input feature data of the unknown sample x is sorted.
S403,把未知样本x的特征数据特征代入分群模型,得到未知样本x属于第K群。S403, substituting the feature data feature of the unknown sample x into the grouping model, and obtaining the unknown sample x belonging to the Kth group.
S405,把未知样本x的特征数据代入第K群的LR模型得到未知样本x的预估点击率。S405. Substituting the feature data of the unknown sample x into the LR model of the Kth group to obtain an estimated click rate of the unknown sample x.
也即是,先获取未知样本x的输入特征数据。将未知样本x的特征数据代入到分群模型中,从而获得该未知样本x属于分群模型中群体(第K群)。再将未知样本x的特征数据代入到所属的群体的点击率预估模型(LR模型)中,以得到未知样本x的预估点击率。That is, the input feature data of the unknown sample x is acquired first. The feature data of the unknown sample x is substituted into the cluster model, thereby obtaining the unknown sample x belonging to the group in the cluster model (K-group). The feature data of the unknown sample x is then substituted into the click rate prediction model (LR model) of the group to obtain the estimated click rate of the unknown sample x.
S407,根据样本x的预估点击率,向用户推送样本x。S407, pushing the sample x to the user according to the estimated click rate of the sample x.
根据步骤S401至步骤S405,可获得用户对样本x的预估点击率。根据用户对样本x的预估点击率,向用户推送样本x。According to step S401 to step S405, the estimated click rate of the user on the sample x can be obtained. The sample x is pushed to the user based on the estimated click rate of the user on the sample x.
在一实施方式中,步骤S50包括步骤A:根据应用程序的预估点击率,向所述用户推送所述应用程序。具体地,如图5所示,步骤A包括:S501,确认应用程序的预估点击率大于阈值,向所述用户推送所述应用程序。In an embodiment, step S50 includes step A: pushing the application to the user according to an estimated click rate of the application. Specifically, as shown in FIG. 5, step A includes: S501, confirming that the estimated click rate of the application is greater than a threshold, and pushing the application to the user.
服务器可根据步骤S10至步骤S40获取到用户对应用商店里每个应用程序的预估点击率。当用户对应用商场里每个应用程序的预估点击率大于阈值(根据实际需求提前设置)时,服务器向用户推送该应用程序。因此,可给用户推送应用商场中用户感兴趣的应用程序。The server may obtain the estimated click rate of the user for each application in the application store according to step S10 to step S40. When the user's estimated click rate for each application in the application store is greater than a threshold (set in advance according to actual demand), the server pushes the application to the user. Therefore, the user can be pushed to the application of interest to the user in the application mall.
在一实施例中,如图6所示,步骤A包括步骤:In an embodiment, as shown in FIG. 6, step A includes the steps of:
S503,计算用户对应用商店中全部应用程序的预估点击率,按预估点击率从高到低对应用程序进行排序,向用户推送排在最前面的预置数量的应用程序。S503: Calculate the estimated click rate of the user for all applications in the application store, sort the application from high to low according to the estimated click rate, and push the preset number of applications to the user.
服务器根据步骤S10至步骤S40分别获取到用户对应用商店中多个应用程序的预估点击率。因此,服务器可计算用户对应用商店中,所有应用程序的预估点击率。并且,根据每个应用程序的预估点击率对应用程序进行从高到低排序,从而获取到排在最前面的预置数量的应用程序,向用户推送该预置数量的应用程序。例如,在应用商店中,需要向用户推送8个应用程序。服务器根据本发明的方案获取到应用商店所有的应用程序的预估点击率。根据每个应用程序的预估点击率对所有的应用程序进行从高到低排序。进一步,获取排在最前面的8个应用程序,并在应用商店中向用户推送这8个应用程序。The server respectively obtains the estimated click rate of the user to the plurality of applications in the application store according to step S10 to step S40. Therefore, the server can calculate the estimated click rate of the user for all applications in the app store. And, the application is sorted from high to low according to the estimated click rate of each application, thereby obtaining the top-preset number of applications, and pushing the preset number of applications to the user. For example, in an app store, you need to push 8 apps to your users. The server obtains an estimated click rate for all applications of the application store in accordance with the scheme of the present invention. Sort all applications from high to low based on the estimated clickthrough rate of each app. Further, get the top 8 apps and push the 8 apps to the user in the app store.
本实施方式提供的应用程序推送方法,可根据对应用程序点击率的预估,对手机应用商店的应用程序进行筛选,以向用户推送用户感兴趣的应用程序。The application pushing method provided by the embodiment may filter the application of the mobile application store according to the estimation of the click rate of the application, so as to push the application that the user is interested in to the user.
本发明还提供一种内容推送装置,如图7所示。该内容推送装置包括第一获取模块701、生成模块703、确定模块705、第二获取模块707以及推送模块709。The invention also provides a content pushing device, as shown in FIG. The content pushing device includes a first obtaining module 701, a generating module 703, a determining module 705, a second obtaining module 707, and a pushing module 709.
第一获取模块701用于获取曝光应用程序的用户的特征和对应用程序的点击行为。服务器检测到用户曝光应用程序时,获取曝光该应用程序的用户的特征,并获取用户对应用程序的点击行为。此处,用户对应用程序的点击行为包括用户点击了该应用程序,以及用户没有点击该应用程序。应用程序可以是应用商店里面的各类应用程序,包括游戏类应用程序、社交类应用程序、阅读类应用程序、系统类应用程序及购物类应用程序等。The first obtaining module 701 is configured to acquire a feature of a user of the exposure application and a click behavior to the application. When the server detects the user exposing the application, it acquires the characteristics of the user who exposed the application and obtains the user's click behavior on the application. Here, the user's click behavior on the application includes the user clicking on the application and the user not clicking on the application. The application can be various applications in the app store, including game apps, social apps, reading apps, system apps, and shopping apps.
用户的特征包括用户的自然属性、用户的社会属性或用户的偏好属性。用户的自然属性如用户的年龄、用户的性别等。用户的社会属性如用户的文化水平、用户的职业及用户的地域等。用户的偏好属性如“军事迷”、“科技爱好者”、“足球迷”及“游戏迷”等。The characteristics of the user include the user's natural attributes, the user's social attributes, or the user's preferred attributes. The user's natural attributes such as the age of the user, the gender of the user, and so on. The social attributes of the user, such as the user's cultural level, the user's occupation, and the user's geographic area. User preferences such as "military fans", "technology enthusiasts", "football fans" and "game fans".
在一实施方式中,服务器获取曝光应用程序的用户的特征,并根据用户的特征建立刻画用户的数学模型。对用户的刻画,事实上就是把用户的特性找出来。获取用户特性的方法包括以下几种方法:In one embodiment, the server obtains features of the user exposing the application and builds a mathematical model that characterizes the user based on the characteristics of the user. The portrayal of the user is in fact to find out the characteristics of the user. The methods for obtaining user characteristics include the following methods:
方法1:通过用户的自然属性对用户的数学模型进行刻画。例如根据用户的年龄及用户的性别对用户的数学模型进行刻画。其中,对于连续的量要做离散化处理。如年龄离散化为“儿童”、“少年”、“青年”、“中年”及“老年”等。Method 1: Characterize the user's mathematical model through the user's natural attributes. For example, the user's mathematical model is characterized according to the age of the user and the gender of the user. Among them, discretization processing is performed for continuous quantities. For example, the age is discretized into “children”, “youth”, “youth”, “middle age” and “old age”.
方法2:通过用户的社会属性对用户的数学模型进行刻画。例如根据用户的文化水平、用户的职业及用户的地域对用户的数学模型进行刻画。Method 2: Characterize the user's mathematical model through the user's social attributes. For example, the user's mathematical model is characterized according to the user's cultural level, the user's occupation, and the user's geographic location.
方法3:通过用户的偏好属性对用户的数学模型进行刻画。例如根据用户对文章的偏好对用户的数学模型进行刻画,如用户喜欢军事的文章,将用户刻画“军事迷”。用户喜欢科技类的文章,将用户刻画为“科技爱好者”。用户喜欢看足球新闻,将用户刻画为“足球迷”。Method 3: Characterize the user's mathematical model through the user's preference attributes. For example, the user's mathematical model is characterized according to the user's preference for the article, such as the user's favorite military article, the user is portrayed as "military fans." Users like technology articles and portray users as "tech enthusiasts." Users like to watch football news and portray users as "football fans."
在一具体实施方式中,根据不同方法及不同维度刻画出的用户汇总起来,就形成用户画像。把用户画像作为刻画用户的数学模型。例如:用户a的特征有“青年”、“大学生”、“常住广州”及“科技爱好者”,这4个特征就是用户a的数学刻画模型。In a specific embodiment, users drawn according to different methods and different dimensions are aggregated to form a user portrait. The user image is used as a mathematical model for depicting the user. For example, the characteristics of user a are “youth”, “college students”, “residents living in Guangzhou” and “technical enthusiasts”. These four characteristics are the mathematical characterization model of user a.
生成模块703用于根据所述用户的特征及对应用程序的点击行为,生成样本。服务器获取到用户的特征和用户对应用程序的点击行为之后,根据用户的特征和对应用程序的点击行为,生成样本。具体地,以用户的特征作为输入变量,以用户对应用程序的点击行为作为目标变量,生成样本。The generating module 703 is configured to generate a sample according to the characteristics of the user and the click behavior of the application. After the server obtains the characteristics of the user and the user's click behavior on the application, the sample is generated according to the characteristics of the user and the click behavior of the application. Specifically, the user's feature is taken as an input variable, and the user's click behavior of the application is used as a target variable to generate a sample.
对于一个用户,曝光一个应用程序为一条样本。以用户的特征作为输入变量,以用户点击行为作为目标变量,用户点击了目标变量为1,用户没有点击目标变量为0。例如:某一用户具有特征(标签)为“青年”、“军事迷”,给用户曝光的应用程序是“微信”。用户没有点击“微信”应用程序,那么记录样本的形式为:x=(1,1,…),y=0。x表示其中的一个样本。x前面两个1分别表示用户具有“青年”、“军事迷”标签。x中不具备的标签用0表示。y=0表示负样本。For a user, expose an application as a sample. Taking the user's characteristics as the input variable and the user's click behavior as the target variable, the user clicks the target variable to be 1, and the user does not click the target variable to be 0. For example, a user has characteristics (labels) as “youth” and “military fans”, and the application exposed to the user is “WeChat”. If the user does not click on the "WeChat" application, the record sample is in the form: x = (1, 1, ...), y = 0. x represents one of the samples. The two fronts of x indicate that the user has the label "Youth" and "Military Fan". Labels that are not available in x are indicated by 0. y=0 indicates a negative sample.
确定模块705用于将所述样本输入到分群模型中,确定所述样本在所述分群模型中所属的群体。服务器根据用户的特征和用户对应用程序的点击行为生成样本后,将样本输入到分群模型中。同时确定样本在分群模型中所属的群体。具体地,分群模型中有多个群体,通过将样本输入到分群模型的多个群体中,获取样本在多个群体中的结果值,根据结果值可获得样本属于分群模型中所属的群体。A determination module 705 is configured to input the sample into a cluster model to determine a population to which the sample belongs in the cluster model. After the server generates a sample based on the characteristics of the user and the user's click behavior on the application, the server inputs the sample into the cluster model. Also determine the population to which the sample belongs in the clustering model. Specifically, there are multiple groups in the clustering model, and the sample values are obtained in the plurality of groups of the group model, and the result values of the samples in the plurality of groups are obtained, and the sample belongs to the group belonging to the group model according to the result value.
在一实施方式中,分群模型可以通过收集若干用户的特征,以及对应用户对应用程序的点击行为,以生成样本。根据该样本对分群模型进行训练,从而获得准确的针对多个用户群的分群模型,进而再对不同用户群建立应用程序点击率预估模型。不同用户群建立的应用程序点击率预估模型可以是相同的点击率预估模型,也可以是不同的点击率预估模型。In an embodiment, the clustering model can generate samples by collecting features of several users and corresponding user click behaviors to the application. According to the sample, the clustering model is trained to obtain an accurate clustering model for multiple user groups, and then an application click rate prediction model is established for different user groups. The application click rate estimation model established by different user groups can be the same click rate estimation model or different click rate estimation models.
在其他的实施方式中,分群模型也可以是预知的分群模型。即针对不同的用户群已形成的分群模型。In other embodiments, the clustering model can also be a predictive clustering model. That is, a clustering model that has been formed for different user groups.
第二获取模块707用于将所述样本输入到所述群体对应的点击率预估 模型中,获得所述用户对所述应用程序的预估点击率。服务器获取到样本属于分群模型中所属的群体后,将该样本输入到该群体对应的点击率预估模型中,从而获得用户对应用程序的预估点击率。分群模型中,每个群体对应的点击率预估模型均为逻辑斯蒂回归的标准公式获得的点击率预估模型。在其他的实施方式中,每个群体对应的点击率预估模型也可以是其他公式获得的点击率预估模型。此外,分群模型中的每个群体对应的点击率预估模型也可以不完全相同。The second obtaining module 707 is configured to input the sample into a click rate estimation model corresponding to the group, and obtain an estimated click rate of the user for the application. After the server obtains the sample belonging to the group to which the cluster model belongs, the sample is input into the click rate estimation model corresponding to the group, thereby obtaining the estimated click rate of the user to the application. In the clustering model, the click rate prediction model corresponding to each group is the click rate estimation model obtained by the standard formula of logistic regression. In other embodiments, the click rate prediction model corresponding to each group may also be a click rate estimation model obtained by other formulas. In addition, the click rate prediction model corresponding to each group in the cluster model may not be identical.
推送模块709用于根据应用程序的预估点击率,向所述用户推送内容。在本实施例中,根据步骤S10至步骤S40,服务器可获取到所述用户对应用商场中每个应用程序的预估点击率。根据所述用户对每个应用程序的预估点击率,可向用户推送相应的内容。The push module 709 is configured to push content to the user based on the estimated click rate of the application. In this embodiment, according to step S10 to step S40, the server may obtain the estimated click rate of the user for each application in the application mall. According to the estimated click rate of the user for each application, the corresponding content can be pushed to the user.
其中,此处的内容可包括应用程序本身以及与应用程序相关的内容。比如,相似的应用程序以及应用程序介绍等,还可包括与根据应用程序获得的相应的广告以及资讯内容等。Among them, the content here can include the application itself and the content related to the application. For example, similar applications and application introductions, etc., may also include corresponding advertisements and information content obtained according to the application.
在一实施方式中,根据步骤S10至步骤S40,服务器可获取到所述用户对应用商场中每个应用程序的预估点击率。根据所述用户对每个应用程序的预估点击率,可向用户推送该应用程序。In an embodiment, according to step S10 to step S40, the server may obtain the estimated click rate of the user for each application in the application mall. The application can be pushed to the user based on the estimated click rate of the user for each application.
本发明还提供一种计算机设备。该计算机设备包括一个或多个处理器、存储器以及一个或多个应用程序。其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个应用程序配置用于执行上述任一实施例所述的内容推送方法。The invention also provides a computer device. The computer device includes one or more processors, memory, and one or more applications. Wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more applications configured to perform any of the embodiments described above Content push method.
图8为本发明一实施例中的计算机设备的结构示意图。例如服务器、个人计算机以及网络设备。如图8所示,设备包括处理器803、存储器805、输入单元807以及显示单元809等器件。本领域技术人员可以理解,图8示出的设备结构器件并不构成对所有设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件。存储器805可用于存储应用程序801以及各功能模块,处理器803运行存储在存储器805的应用程序801,从而执行设备的各种功能应用以及数据处理。存储器可以是内存储器或外存储器,或者包括内存储器和外存储器两者。内存储器可以包括只读存储器 (ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦写可编程ROM(EEPROM)、快闪存储器、或者随机存储器。外存储器可以包括硬盘、软盘、ZIP盘、U盘、磁带等。本发明所公开的存储器包括但不限于这些类型的存储器。本发明所公开的存储器只作为例子而非作为限定。FIG. 8 is a schematic structural diagram of a computer device according to an embodiment of the present invention. For example, servers, personal computers, and network devices. As shown in FIG. 8, the device includes a processor 803, a memory 805, an input unit 807, and a display unit 809. It will be understood by those skilled in the art that the device structure device illustrated in FIG. 8 does not constitute a limitation on all devices, and may include more or less components than those illustrated, or may combine certain components. The memory 805 can be used to store the application 801 and the various functional modules, and the processor 803 runs the application 801 stored in the memory 805 to perform various functional applications and data processing of the device. The memory can be internal or external, or both internal and external. The internal memory may include a read only memory (ROM), a programmable ROM (PROM), an electrically programmable ROM (EPROM), an electrically erasable programmable ROM (EEPROM), a flash memory, or a random access memory. The external storage may include a hard disk, a floppy disk, a ZIP disk, a USB disk, a magnetic tape, and the like. The memories disclosed herein include, but are not limited to, these types of memories. The memory disclosed herein is by way of example only, and not limitation.
输入单元807用于接收信号的输入,以及接收用户输入的关键字。输入单元807可包括触控面板以及其它输入设备。触控面板可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触控面板上或在触控面板附近的操作),并根据预先设定的程序驱动相应的连接装置;其它输入设备可以包括但不限于物理键盘、功能键(比如播放控制按键、开关按键等)、轨迹球、鼠标、操作杆等中的一种或多种。显示单元809可用于显示用户输入的信息或提供给用户的信息以及计算机设备的各种菜单。显示单元809可采用液晶显示器、有机发光二极管等形式。处理器803是计算机设备的控制中心,利用各种接口和线路连接整个电脑的各个部分,通过运行或执行存储在存储器803内的软件程序和/或模块,以及调用存储在存储器内的数据,执行各种功能和处理数据。The input unit 807 is for receiving an input of a signal and receiving a keyword input by the user. The input unit 807 can include a touch panel as well as other input devices. The touch panel can collect touch operations on or near the user (such as the user using any suitable object or accessory such as a finger or a stylus on the touch panel or near the touch panel), and according to a preset The program drives the corresponding connection device; other input devices may include, but are not limited to, one or more of a physical keyboard, function keys (such as play control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like. The display unit 809 can be used to display information input by the user or information provided to the user as well as various menus of the computer device. The display unit 809 can take the form of a liquid crystal display, an organic light emitting diode, or the like. The processor 803 is a control center of the computer device that connects various parts of the entire computer using various interfaces and lines, executes or executes software programs and/or modules stored in the memory 803, and calls data stored in the memory to execute Various functions and processing data.
在一实施方式中,计算机设备包括一个或多个处理器803,以及一个或多个存储器805,一个或多个应用程序801。其中所述一个或多个应用程序801被存储在存储器805中并被配置为由所述一个或多个处理器803执行,所述一个或多个应用程序801配置用于执行以上实施例所述的内容推送方法。In an embodiment, the computer device includes one or more processors 803, and one or more memories 805, one or more applications 801. Wherein the one or more applications 801 are stored in a memory 805 and configured to be executed by the one or more processors 803 configured to perform the above embodiments Content push method.
此外,在本发明各个实施例中的各功能单元可以集成在一个处理模块中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。所述集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing module, or each unit may exist physically separately, or two or more units may be integrated into one module. The above integrated modules can be implemented in the form of hardware or in the form of software functional modules. The integrated modules, if implemented in the form of software functional modules and sold or used as stand-alone products, may also be stored in a computer readable storage medium.
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,该程序可以存储于一计算机可读存储介质中,存储介质可以包括存储器、磁盘或光盘等。A person skilled in the art can understand that all or part of the steps of implementing the above embodiments may be completed by hardware, or may be executed by a program to execute related hardware. The program may be stored in a computer readable storage medium, and the storage medium may include Memory, disk or disc, etc.
以上所述仅是本发明的部分实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。The above is only a part of the embodiments of the present invention, and it should be noted that those skilled in the art can also make several improvements and retouchings without departing from the principles of the present invention. It should be considered as the scope of protection of the present invention.

Claims (14)

  1. 一种内容推送方法,其特征在于,包括以下步骤:A content pushing method, comprising the steps of:
    获取曝光应用程序的用户的特征和对应用程序的点击行为;Get the characteristics of the user who exposed the application and the click behavior on the application;
    根据所述用户的特征及对应用程序的点击行为,生成样本;Generating a sample according to the characteristics of the user and the click behavior of the application;
    将所述样本输入到分群模型中,确定所述样本在所述分群模型中所属的群体;Entering the sample into a grouping model to determine a population to which the sample belongs in the grouping model;
    将所述样本输入到所述群体对应的点击率预估模型中,获得所述用户对所述应用程序的预估点击率;And inputting the sample into a click rate estimation model corresponding to the group, and obtaining an estimated click rate of the user to the application;
    根据应用程序的预估点击率,向所述用户推送内容。Push content to the user based on the estimated clickthrough rate of the app.
  2. 根据权利要求1所述的内容推送方法,其特征在于,所述根据应用程序的预估点击率,向所述用户推送内容,包括:The content push method according to claim 1, wherein the pushing the content to the user according to the estimated click rate of the application comprises:
    根据应用程序的预估点击率,向所述用户推送所述应用程序。The application is pushed to the user based on the estimated click rate of the application.
  3. 根据权利要求1或2所述的内容推送方法,其特征在于,所述根据应用程序的预估点击率,向所述用户推送所述应用程序,包括:The content push method according to claim 1 or 2, wherein the pushing the application to the user according to an estimated click rate of the application comprises:
    确认应用程序的预估点击率大于阈值,向所述用户推送所述应用程序。Confirm that the estimated click rate of the application is greater than the threshold and push the application to the user.
  4. 根据权利要求1-3任意一项所述的内容推送方法,其特征在于,所述根据应用程序的预估点击率,向所述用户推送所述应用程序,包括:The content pushing method according to any one of claims 1 to 3, wherein the pushing the application to the user according to an estimated click rate of the application comprises:
    计算所述用户对应用商场中全部应用程序的预估点击率,按预估点击率从高到低对应用程序进行排序,向所述用户推送排在最前面的预置数量的应用程序。Calculating the estimated click rate of the user to all applications in the application mall, sorting the applications from high to low according to the estimated click rate, and pushing the preset number of applications to the user.
  5. 根据权利要求1-4任意一项所述的内容推送方法,其特征在于,所述获取曝光应用程序的用户的特征和对应用程序的点击行为之前,还包括:The content pushing method according to any one of claims 1 to 4, wherein before the acquiring the feature of the user of the exposure application and the click behavior on the application, the method further comprises:
    获取曝光应用程序的历史用户的特征和对应用程序的点击行为;Get the characteristics of the historical user of the exposure application and the click behavior on the application;
    根据所述历史用户的特征及对应用程序的点击行为,生成样本;Generating a sample according to the characteristics of the historical user and the click behavior of the application;
    使用所述样本训练包含若干群体的分群模型和群体对应的点击率预估模型。The sample is used to train a clustering model comprising a plurality of populations and a population corresponding to a click rate prediction model.
  6. 根据权利要求1-5任意一项所述的内容推送方法,其特征在于,所述使用所述样本训练包含若干群体的分群模型,包括:The content pushing method according to any one of claims 1 to 5, wherein the training the clustering model comprising a plurality of groups using the sample comprises:
    根据决策树算法使用所述样本训练分群模型。The sample training clustering model is used according to a decision tree algorithm.
  7. 根据权利要求1-6任意一项所述的内容推送方法,其特征在于,所述使用所述样本训练包含若干群体的分群模型和群体对应的点击率预估模型,包括:The content pushing method according to any one of claims 1 to 6, wherein the training the clustering model including a plurality of groups and the click rate prediction model corresponding to the group using the sample comprises:
    使用所述样本训练包含若干群体的分群模型,将所述样本按群体进行分类,使用群体对应分类的样本训练该群体的点击率预估模型。The sample is used to train a clustering model comprising several populations, the samples are grouped by group, and the population corresponding category is trained to train the population's click rate prediction model.
  8. 根据权利要求1-7任意一项所述的内容推送方法,其特征在于,所述使用群体对应分类的样本训练该群体的点击率预估模型,包括:The content pushing method according to any one of claims 1 to 7, wherein the using the sample corresponding to the group to train the click rate prediction model of the group comprises:
    使用群体对应分类的样本,利用逻辑斯蒂算法,训练所述点击率预估模型。The click rate prediction model is trained using a logistic algorithm using a sample corresponding to the population.
  9. 根据权利要求1-8任意一项所述的内容推送方法,其特征在于,所述点击率预估模型为根据逻辑斯蒂回归的公式获得的点击率预估模型;所述逻辑斯蒂回归的公式为:The content pushing method according to any one of claims 1-8, wherein the click rate prediction model is a click rate estimation model obtained according to a formula of logistic regression; The formula is:
    Figure PCTCN2018105810-appb-100001
    Figure PCTCN2018105810-appb-100001
    其中X表示输入变量,β表示不同特征的权重向量。Where X represents the input variable and β represents the weight vector of the different features.
  10. 根据权利要求1-9任意一项所述的内容推送方法,其特征在于,所述使用群体对应分类的样本,利用逻辑斯蒂算法,训练所述点击率预估模型,包括:The content pushing method according to any one of claims 1-9, wherein the using the group corresponding to the classification, using the logical algorithm to train the click rate prediction model, comprises:
    根据梯度下降法利用逻辑斯蒂算法对所述群体进行训练,训练对应群体中的点击率预估模型。The group is trained according to the gradient descent method using the logistic algorithm, and the click rate prediction model in the corresponding group is trained.
  11. 根据权利要求1-10任意一项所述的内容推送方法,其特征在于,所述用户的特性包括用户的自然属性、社会属性或偏好属性。The content pushing method according to any one of claims 1 to 10, characterized in that the characteristics of the user include a natural attribute, a social attribute or a preference attribute of the user.
  12. 根据权利要求1-11任意一项所述的内容推送方法,其特征在于,所述根据所述用户的特征及对应用程序的点击行为,生成样本,包括:The content push method according to any one of claims 1 to 11, wherein the generating a sample according to the characteristics of the user and the click behavior of the application comprises:
    以所述用户的特征作为输入变量,以所述点击行为作为目标变量,生成样本。A sample is generated by taking the characteristics of the user as an input variable and using the click behavior as a target variable.
  13. 一种内容推送装置,其特征在于,包括:A content pushing device, comprising:
    第一获取模块,用于获取曝光应用程序的用户的特征和对应用程序的点击行为;a first obtaining module, configured to acquire a feature of a user who exposes the application and a click behavior to the application;
    生成模块,用于根据所述用户的特征及对应用程序的点击行为,生成样本;a generating module, configured to generate a sample according to the characteristics of the user and the click behavior of the application;
    确定模块,用于将所述样本输入到分群模型中,确定所述样本在所述分群模型中所属的群体;a determining module, configured to input the sample into a grouping model, and determine a group to which the sample belongs in the grouping model;
    第二获取模块,用于将所述样本输入到所述群体对应的点击率预估模型中,获得所述用户对所述应用程序的预估点击率;a second obtaining module, configured to input the sample into a click rate estimation model corresponding to the group, and obtain an estimated click rate of the user to the application;
    推送模块,用于根据应用程序的预估点击率,向所述用户推送内容。A push module for pushing content to the user based on an estimated click rate of the application.
  14. 一种计算机设备,其特征在于,包括:A computer device, comprising:
    一个或多个处理器;One or more processors;
    存储器;Memory
    一个或多个应用程序,其中所述一个或多个应用程序被存储在所述存储器中并被配置为由所述一个或多个处理器执行,所述一个或多个应用程序配置用于执行根据权利要求1至12任一项所述的内容推送方法。One or more applications, wherein the one or more applications are stored in the memory and configured to be executed by the one or more processors, the one or more applications configured to execute A content pushing method according to any one of claims 1 to 12.
PCT/CN2018/105810 2017-10-20 2018-09-14 Content pushing method and apparatus, and computer device WO2019076173A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710985244.3A CN107613022B (en) 2017-10-20 2017-10-20 Content pushing method and device and computer equipment
CN201710985244.3 2017-10-20

Publications (1)

Publication Number Publication Date
WO2019076173A1 true WO2019076173A1 (en) 2019-04-25

Family

ID=61078736

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/105810 WO2019076173A1 (en) 2017-10-20 2018-09-14 Content pushing method and apparatus, and computer device

Country Status (2)

Country Link
CN (1) CN107613022B (en)
WO (1) WO2019076173A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111160961A (en) * 2019-12-20 2020-05-15 深圳前海微众银行股份有限公司 Information recommendation method and device
CN113743643A (en) * 2021-02-05 2021-12-03 北京京东振世信息技术有限公司 Method, device, equipment and medium for determining commodity data prediction accuracy
CN116821512A (en) * 2023-08-25 2023-09-29 深圳唯爱智云科技有限公司 Recommendation model training method and device, recommendation method and device

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107613022B (en) * 2017-10-20 2020-10-16 阿里巴巴(中国)有限公司 Content pushing method and device and computer equipment
CN107766580A (en) * 2017-11-20 2018-03-06 北京奇虎科技有限公司 The method for pushing and device of message
CN108427708B (en) * 2018-01-25 2021-06-25 腾讯科技(深圳)有限公司 Data processing method, data processing apparatus, storage medium, and electronic apparatus
CN108510352A (en) * 2018-02-09 2018-09-07 广州优视网络科技有限公司 Application issue recommends method, apparatus and computer equipment
CN110210884B (en) * 2018-05-29 2023-05-05 腾讯科技(深圳)有限公司 Method, device, computer equipment and storage medium for determining user characteristic data
CN109190791A (en) * 2018-07-25 2019-01-11 广州优视网络科技有限公司 Using the appraisal procedure of recommended models, device and electronic equipment
CN109495552A (en) * 2018-10-31 2019-03-19 北京字节跳动网络技术有限公司 Method and apparatus for updating clicking rate prediction model
CN109408723A (en) * 2018-11-06 2019-03-01 北京奇艺世纪科技有限公司 A kind of method for pushing and device
CN109359247B (en) * 2018-12-07 2021-07-06 广州市百果园信息技术有限公司 Content pushing method, storage medium and computer equipment
CN110008399B (en) * 2019-01-30 2023-04-07 创新先进技术有限公司 Recommendation model training method and device, and recommendation method and device
CN110033342A (en) * 2019-01-30 2019-07-19 阿里巴巴集团控股有限公司 A kind of training method and device, a kind of recommended method and device of recommended models
CN110046952B (en) * 2019-01-30 2021-12-10 创新先进技术有限公司 Recommendation model training method and device, and recommendation method and device
CN110264276A (en) * 2019-06-25 2019-09-20 广州视源电子科技股份有限公司 Determine method, apparatus, equipment and the storage medium of push user group
CN115004667B (en) * 2020-04-24 2024-04-02 深圳市欢太科技有限公司 Information pushing method, device, electronic equipment and computer readable medium
CN112669078A (en) * 2020-12-30 2021-04-16 上海众源网络有限公司 Behavior prediction model training method, device, equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110238496A1 (en) * 2010-02-23 2011-09-29 Vishal Gurbuxani Systems and Methods for Generating Data from Mobile Applications and Dynamically Delivering Advertising Based on Generated Data
CN104572962A (en) * 2014-12-31 2015-04-29 浙江大学 APP (Application) recommendation method and system
CN105678587A (en) * 2016-01-12 2016-06-15 腾讯科技(深圳)有限公司 Recommendation feature determination method and information recommendation method and device
CN105701191A (en) * 2016-01-08 2016-06-22 腾讯科技(深圳)有限公司 Push information click rate estimation method and device
US20170004524A1 (en) * 2015-06-30 2017-01-05 Yahoo! Inc. Systems and Methods For Mobile Campaign Optimization Without Knowing User Identity
CN107613022A (en) * 2017-10-20 2018-01-19 广州优视网络科技有限公司 Content delivery method, device and computer equipment

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102346899A (en) * 2011-10-08 2012-02-08 亿赞普(北京)科技有限公司 Method and device for predicting advertisement click rate based on user behaviors
CN102521767A (en) * 2011-12-13 2012-06-27 亿赞普(北京)科技有限公司 Method and system for publishing network advertising information
CN103345512A (en) * 2013-07-06 2013-10-09 北京品友互动信息技术有限公司 Online advertising click-through rate forecasting method and device based on user attribute
CN105809462A (en) * 2014-12-30 2016-07-27 Tcl集团股份有限公司 Estimation method and device for estimating advertisement click rate
CN106156878B (en) * 2015-04-21 2020-09-15 深圳市腾讯计算机系统有限公司 Advertisement click rate correction method and device
CN106570718B (en) * 2015-10-13 2020-06-19 深圳市腾讯计算机系统有限公司 Information delivery method and delivery system
CN105897847A (en) * 2015-12-15 2016-08-24 乐视网信息技术(北京)股份有限公司 Information push method and device
CN105631711A (en) * 2015-12-30 2016-06-01 合一网络技术(北京)有限公司 Advertisement putting method and apparatus
CN106096629B (en) * 2016-06-01 2019-07-16 南京邮电大学 A kind of ad click rate prediction technique based on similarity relation between user

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110238496A1 (en) * 2010-02-23 2011-09-29 Vishal Gurbuxani Systems and Methods for Generating Data from Mobile Applications and Dynamically Delivering Advertising Based on Generated Data
CN104572962A (en) * 2014-12-31 2015-04-29 浙江大学 APP (Application) recommendation method and system
US20170004524A1 (en) * 2015-06-30 2017-01-05 Yahoo! Inc. Systems and Methods For Mobile Campaign Optimization Without Knowing User Identity
CN105701191A (en) * 2016-01-08 2016-06-22 腾讯科技(深圳)有限公司 Push information click rate estimation method and device
CN105678587A (en) * 2016-01-12 2016-06-15 腾讯科技(深圳)有限公司 Recommendation feature determination method and information recommendation method and device
CN107613022A (en) * 2017-10-20 2018-01-19 广州优视网络科技有限公司 Content delivery method, device and computer equipment

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111160961A (en) * 2019-12-20 2020-05-15 深圳前海微众银行股份有限公司 Information recommendation method and device
CN111160961B (en) * 2019-12-20 2023-10-17 深圳前海微众银行股份有限公司 Information recommendation method and device
CN113743643A (en) * 2021-02-05 2021-12-03 北京京东振世信息技术有限公司 Method, device, equipment and medium for determining commodity data prediction accuracy
CN113743643B (en) * 2021-02-05 2023-11-03 北京京东振世信息技术有限公司 Method, device, equipment and medium for determining commodity data prediction accuracy
CN116821512A (en) * 2023-08-25 2023-09-29 深圳唯爱智云科技有限公司 Recommendation model training method and device, recommendation method and device
CN116821512B (en) * 2023-08-25 2024-02-20 深圳唯爱智云科技有限公司 Recommendation model training method and device, recommendation method and device

Also Published As

Publication number Publication date
CN107613022B (en) 2020-10-16
CN107613022A (en) 2018-01-19

Similar Documents

Publication Publication Date Title
WO2019076173A1 (en) Content pushing method and apparatus, and computer device
US10216558B1 (en) Predicting drive failures
US10348550B2 (en) Method and system for processing network media information
US20180349384A1 (en) Differentially private database queries involving rank statistics
JP6484730B2 (en) Collaborative filtering method, apparatus, server, and storage medium for fusing time factors
US10535106B2 (en) Selecting user posts related to trending topics on online social networks
US8756178B1 (en) Automatic event categorization for event ticket network systems
Agarwal et al. Statistical methods for recommender systems
US20110225192A1 (en) Auto-detection of historical search context
US20140314311A1 (en) System and method for classification with effective use of manual data input
US10452627B2 (en) Column weight calculation for data deduplication
CN109165975B (en) Label recommending method, device, computer equipment and storage medium
JP6911603B2 (en) How to generate predictive models for the categories of facilities visited by users, programs, server equipment, and processing equipment
US20140317078A1 (en) Method and system for retrieving information
WO2019061664A1 (en) Electronic device, user&#39;s internet surfing data-based product recommendation method, and storage medium
US9286379B2 (en) Document quality measurement
US10511681B2 (en) Establishing and utilizing behavioral data thresholds for deep learning and other models to identify users across digital space
WO2017075980A1 (en) Information pushing method and apparatus
WO2016107354A1 (en) Method and apparatus for providing user personalised resource message pushing
CN110472016B (en) Article recommendation method and device, electronic equipment and storage medium
US20160232548A1 (en) Adaptive pricing analytics
CN113076416A (en) Information heat evaluation method and device and electronic equipment
Liu et al. Question popularity analysis and prediction in community question answering services
US9652527B2 (en) Multi-term query subsumption for document classification
CN112070559A (en) State acquisition method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18867349

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18867349

Country of ref document: EP

Kind code of ref document: A1