CN109886729A

CN109886729A - Method, apparatus, electronic device, and readable storage medium for predicting click probability

Info

Publication number: CN109886729A
Application number: CN201910024668.2A
Authority: CN
Inventors: 谢雨; 吴鸿杰
Original assignee: Beijing Sankuai Online Technology Co Ltd
Current assignee: Beijing Sankuai Online Technology Co Ltd
Priority date: 2019-01-10
Filing date: 2019-01-10
Publication date: 2019-06-14

Abstract

Embodiments of the present application provide a method, an apparatus, an electronic device, and a readable storage medium for predicting click probability, in order to improve on the click probability prediction methods of the related art. The method comprises: obtaining parameter information of a displayed object, where the parameter information of the displayed object includes at least the type of the displayed object, and user operation information of the displayed object includes at least whether the displayed object was clicked by the user; and predicting the click probability of an object to be displayed according to the parameter information of the object to be displayed and a click probability prediction model. The click probability prediction model takes the expected value of the historical user operation information under the parameter information of each historically displayed object as its reward function, takes the parameter information of each historically displayed object as its state input, and takes the click probability for the parameter information of each historically displayed object as its action output.

Description

Method, apparatus, electronic device, and readable storage medium for predicting click probability

Technical field

Embodiments of the present application relate to the technical field of data processing, and in particular to a method, an apparatus, an electronic device, and a readable storage medium for predicting click probability.

Background art

With the spread of the Internet, more and more content providers (Content Provider, CP) and e-commerce platforms have appeared in people's lives and made them more convenient. For example, users can browse all kinds of information on a content provider's website, and can pick out the goods they need online through an e-commerce platform. Because content providers and e-commerce platforms attract enormous traffic, most merchants choose to place advertisements through them, in order to enlarge the audience of the advertisements and improve their publicity effect.

In general, content providers and e-commerce platforms push information to the display terminal used by a user in the form of an information feed, and insert merchants' advertisements into the feed so as to earn revenue from advertising. To improve the publicity effect of an advertisement, a merchant needs to consider the audience's click probability for the advertisement and show the advertisement to the audience members with a higher click probability.

In the related art, the click probability of an advertisement is predicted as follows. First, statistics are gathered on the user's historical behavior data, that is, on whether the user clicked the advertisements in the information feed. Then, a click probability prediction model is built from the user's historical behavior data. Finally, the click probability prediction model is used to predict the click probability of the advertisement.

It can be seen that, in the related art, accurate prediction of an advertisement's click probability presupposes that a click probability prediction model with high prediction accuracy has been built from abundant historical behavior data. If little historical behavior data is available (for example, the user has seldom seen an information feed containing advertisements, or the user seldom clicks the advertisements in the feed), a model with high prediction accuracy cannot be built, and the click probability of an advertisement cannot be predicted accurately either. In addition, the click probability prediction model of the related art is built from the historical behavior data of the majority of users and is therefore generic: it predicts the same click probability for the same advertisement regardless of the user. In other words, the model of the related art ignores the individual characteristics of users (for example, different users are usually interested in different advertisements). The technical solutions for predicting advertisement click probability in the related art therefore need to be improved.

Summary of the invention

Embodiments of the present application provide a method, an apparatus, an electronic device, and a readable storage medium for predicting click probability, in order to improve on the click probability prediction methods of the related art.

A first aspect of the embodiments of the present application provides a method for predicting click probability, the method comprising:

obtaining parameter information of a displayed object, where the parameter information of the displayed object includes at least the type of the displayed object, and user operation information of the displayed object includes at least whether the displayed object was clicked by the user;

predicting the click probability of an object to be displayed according to the parameter information of the object to be displayed and a click probability prediction model;

wherein the click probability prediction model takes the expected value of the historical user operation information under the parameter information of each historically displayed object as its reward function, takes the parameter information of each historically displayed object as its state input, and takes the click probability for the parameter information of each historically displayed object as its action output.

Optionally, the method further comprises:

determining the expected cumulative reward of each historically displayed object, where the expected cumulative reward of each historically displayed object is determined from the user operation information of multiple historically displayed objects including the currently displayed object;

when the expected cumulative reward of the currently displayed object is greater than the expected cumulative reward of any historically displayed object, training and updating the click probability prediction model with the parameter information and user operation information of the currently displayed object as a training sample.

Optionally, determining the expected cumulative reward of the currently displayed object comprises:

inputting the user operation information of each of multiple historically displayed objects, including the currently displayed object, into a preset reward function, to obtain the reward values of the multiple historically displayed objects including the currently displayed object;

determining the expected cumulative reward of the currently displayed object from the reward values of the multiple historically displayed objects including the currently displayed object.

Optionally, the user operation information of the displayed object further includes: the time for which the displayed object was continuously exposed in a preset display area;

inputting the user operation information of each of multiple historically displayed objects, including the currently displayed object, into the preset reward function, to obtain the reward values of the multiple historically displayed objects including the currently displayed object, comprises:

for each historically displayed object among the multiple historically displayed objects including the currently displayed object, determining the reward value of that historically displayed object according to the following formula:

R=R₁+aR₂

where R denotes the reward value of the historically displayed object, R₁ indicates whether the historically displayed object was clicked by the user, a denotes a preset weight, and R₂ denotes the time for which the historically displayed object was continuously exposed in the preset display area.
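The formula above can be illustrated with a minimal Python sketch. The function name `reward_value`, the encoding of R₁ as 1.0 for a click and 0.0 otherwise, the use of seconds for R₂, and the default weight of 0.1 are illustrative assumptions; the application does not fix any of them.

```python
def reward_value(clicked: bool, exposure_seconds: float, a: float = 0.1) -> float:
    """Compute R = R1 + a * R2 for one historically displayed object."""
    if exposure_seconds < 0:
        raise ValueError("exposure time cannot be negative")
    r1 = 1.0 if clicked else 0.0  # assumed encoding of "clicked or not"
    r2 = exposure_seconds         # assumed unit: seconds of continuous exposure
    return r1 + a * r2
```

Under these assumptions, an object that was not clicked but stayed in the preset display area for a long time still receives a positive reward, with the weight a controlling how much exposure time counts relative to a click.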

Optionally, the method further comprises:

predicting the click probabilities of multiple objects to be displayed, and sending to a display terminal the objects to be displayed whose predicted click probability is greater than a preset probability.

Optionally, sending to the display terminal the objects to be displayed whose predicted click probability is greater than the preset probability comprises:

sending the multiple objects to be displayed to the display terminal in descending order of click probability.

A second aspect of the embodiments of the present application provides an apparatus for predicting click probability, the apparatus comprising:

an obtaining module, configured to obtain parameter information of a displayed object, where the parameter information of the displayed object includes at least the type of the displayed object, and user operation information of the displayed object includes at least whether the displayed object was clicked by the user;

a prediction module, configured to predict the click probability of an object to be displayed according to the parameter information of the object to be displayed and a click probability prediction model.

Optionally, the apparatus further comprises:

a determining module, configured to determine the expected cumulative reward of each historically displayed object, where the expected cumulative reward of each historically displayed object is determined from the user operation information of multiple historically displayed objects including the currently displayed object;

a training module, configured to, when the expected cumulative reward of the currently displayed object is greater than the expected cumulative reward of any historically displayed object, train and update the click probability prediction model with the parameter information and user operation information of the currently displayed object as a training sample.

Optionally, the determining module comprises:

an obtaining submodule, configured to input the user operation information of each of multiple historically displayed objects, including the currently displayed object, into a preset reward function, to obtain the reward values of the multiple historically displayed objects including the currently displayed object;

a first determining submodule, configured to determine the expected cumulative reward of the currently displayed object from the reward values of the multiple historically displayed objects including the currently displayed object.

The obtaining submodule comprises:

a second determining submodule, configured to determine, for each historically displayed object among the multiple historically displayed objects including the currently displayed object, the reward value of that historically displayed object according to the following formula:

R=R₁+aR₂

Optionally, the apparatus further comprises:

a sending module, configured to predict the click probabilities of multiple objects to be displayed, and to send to a display terminal the objects to be displayed whose predicted click probability is greater than a preset probability.

Optionally, the sending module comprises:

a sending submodule, configured to send the multiple objects to be displayed to the display terminal in descending order of click probability.

A third aspect of the embodiments of the present application provides a computer-readable storage medium on which a computer program is stored, where the program, when executed by a processor, implements the steps of the method described in the first aspect of the present application.

A fourth aspect of the embodiments of the present application provides an electronic device comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor, when executing the program, implements the steps of the method described in the first aspect of the present application.

With the click probability prediction method provided by the embodiments of the present application, the click probability prediction model is obtained by taking the expected value of the historical user operation information under the parameter information of each historically displayed object as the reward function, the parameter information of each historically displayed object as the state input, and the click probability for the parameter information of each historically displayed object as the action output. The resulting model takes into account that the parameter information and user operation information of displayed objects reflect the user's interests, so the model matches the user's interests more closely and becomes a model customized for the individual user. Using this model to predict the user's click probability for an object to be displayed can therefore improve the accuracy of the prediction.

Brief description of the drawings

To describe the technical solutions of the embodiments of the present application more clearly, the drawings needed in the description of the embodiments are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present application, and a person of ordinary skill in the art can derive other drawings from them without creative effort.

Fig. 1 is a flowchart of a method for predicting click probability according to an embodiment of the present application;

Fig. 2 is a flowchart of a method for sending objects to be displayed according to an embodiment of the present application;

Fig. 3 is a flowchart of a method for predicting click probability according to another embodiment of the present application;

Fig. 4 is a flowchart of a method for predicting click probability according to another embodiment of the present application;

Fig. 5 is a schematic diagram of an apparatus for predicting click probability according to an embodiment of the present application;

Fig. 6 is a schematic diagram of an apparatus for sending objects to be displayed according to an embodiment of the present application;

Fig. 7 is a schematic diagram of an electronic device according to an embodiment of the present application.

Detailed description of the embodiments

The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the protection scope of the present application.

Referring to Fig. 1, Fig. 1 is a flowchart of a method for predicting click probability according to an embodiment of the present application. As shown in Fig. 1, the method includes the following steps:

Step S11: obtain parameter information of a displayed object, where the parameter information of the displayed object includes at least the type of the displayed object, and user operation information of the displayed object includes at least whether the displayed object was clicked by the user.

In this embodiment, display objects may include any objects shown on the display terminal used by the user, for example advertisements, news items, and links. Depending on whether they have been shown on the user's display terminal, display objects are divided into displayed objects and objects to be displayed. A displayed object is an object that has already been shown on the user's display terminal; an object to be displayed is an object that has not yet been shown there. Once an object to be displayed has been shown on the user's display terminal, it becomes a displayed object. Whether a display object is a displayed object or an object to be displayed, it can be classified by type. Taking advertisements as an example, the types may include food-related advertisements, travel-related advertisements, and hotel-related advertisements.

The parameter information of a displayed object includes at least the type of the displayed object. In one embodiment, the parameter information of a displayed object also includes, but is not limited to, the popularity of the displayed object and the click-through rate of the displayed object. The user operation information of a displayed object includes at least whether the displayed object was clicked by the user. In one embodiment, the user operation information may further include the time for which the displayed object was continuously exposed in a preset display area; for example, when the displayed object is shown on the screen of the user's display terminal, the time for which it stays in the middle of the screen. In another embodiment, the user operation information may further include whether the Wi-Fi switch of the display terminal was turned on by the user while the displayed object was shown on the screen of the display terminal. In another embodiment, the user operation information may further include when the displayed object was exposed in the preset display area; for example, when the displayed object is shown on the screen of the user's display terminal, the moment at which it appeared in the middle of the screen.
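The two kinds of information described above can be pictured as simple records. The following Python sketch is illustrative only: the class and field names (`ParameterInfo`, `UserOperationInfo`, `DisplayedObject`, `object_id`, and so on) are hypothetical, and the optional fields mirror the optional items listed in the embodiments.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ParameterInfo:
    object_type: str                     # required: e.g. "food", "travel", "hotel"
    popularity: Optional[float] = None   # optional: popularity of the object
    click_rate: Optional[float] = None   # optional: click-through rate of the object


@dataclass(frozen=True)
class UserOperationInfo:
    clicked: bool                             # required: was the object clicked?
    exposure_seconds: Optional[float] = None  # optional: continuous exposure time
    wifi_on: Optional[bool] = None            # optional: Wi-Fi switch state while shown
    exposed_at: Optional[str] = None          # optional: when the object was exposed


@dataclass(frozen=True)
class DisplayedObject:
    object_id: str                # hypothetical identifier, not named in the application
    params: ParameterInfo
    operation: UserOperationInfo
```

Only `object_type` and `clicked` are mandatory here, matching the "at least" wording of step S11.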

Step S12: predict the click probability of an object to be displayed according to the parameter information of the object to be displayed and the click probability prediction model.

In this embodiment, the click probability prediction model used to predict the click probability of a single display object may be a model obtained with the related art (when step S12 is executed for the first time) or an updated model (when step S12 is not being executed for the first time).

The updated model is obtained as follows.

This embodiment takes into account that the parameter information and user operation information of displayed objects actually characterize which display objects the user has recently been interested in and which the user has recently not been interested in. That is, for a display object recently shown on the user's display terminal, if the user is interested in it, the user will usually click it; if the user is not interested in it, the user will usually not click it.

In another embodiment, where the user operation information of a displayed object further includes the time for which the displayed object was continuously exposed in the preset display area, the reasoning is as follows. For a display object recently shown on the user's display terminal, if the user is interested in it, the user will usually click it or, even without clicking it, will usually leave it exposed in the preset display area for a longer time. If the user is not interested in it, the user will usually not click it, and the time for which it is continuously exposed in the preset display area will be shorter.

To improve the accuracy of probability prediction step by step, this embodiment proposes updating the click probability prediction model according to the parameter information and user operation information of displayed objects, so that the updated model matches the user's interests more closely. As a result, the updated model differs from user to user: it is a model customized for each individual user. By making full use of the parameter information and user operation information of displayed objects to update the model, the updated model is adjusted in time as the user's interests change, and using it for probability prediction improves prediction accuracy.

In one embodiment, step S12 includes the following step:

predicting the click probability of the object to be displayed according to the parameter information of the object to be displayed and the updated click probability prediction model.

In this embodiment, the parameter information of displayed objects is used in obtaining the updated click probability prediction model. In the updated model, the parameter information of a displayed object is therefore a model parameter with a corresponding weight value. When the click probability of an object to be displayed is predicted, the parameter information of that object is input into the updated model, which finally outputs the user's predicted click probability for the object.

This embodiment takes into account that the parameter information and user operation information of displayed objects reflect the user's interests, and therefore makes full use of this information to update the click probability prediction model. The updated model matches the user's interests more closely and becomes a model customized for the individual user, so using it to predict the user's click probability for an object to be displayed can improve prediction accuracy.

On the basis of the above embodiments, in another embodiment of the present application there are multiple objects to be displayed. The click probability of each of them is predicted, the click probabilities are then sorted, and one or more of the objects to be displayed are sent to the user's display terminal on the basis of the sorting result. Referring to Fig. 2, Fig. 2 is a flowchart of a method for sending objects to be displayed according to an embodiment of the present application. As shown in Fig. 2, in addition to steps S11 and S12, the method includes the following step:

Step S13: predict the click probabilities of multiple objects to be displayed, and send to the display terminal the objects to be displayed whose predicted click probability is greater than a preset probability.

In one embodiment, sending to the display terminal the objects to be displayed whose predicted click probability is greater than the preset probability includes:

sending the multiple objects to be displayed to the display terminal in descending order of click probability.

In this embodiment, when there are multiple objects to be displayed, steps S11 and S12 are executed for each of them to obtain its click probability. The click probability of each object to be displayed is then compared with the preset probability, and the objects whose click probability is greater than the preset probability are selected; in other words, the objects that match the user's interests more closely than a preset degree are selected. The selected objects are then sent to the user's display terminal. The preset probability is set in advance: the larger it is, the more closely the objects sent to the user's display terminal match the user's interests; the smaller it is, the less closely they match.
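The screening and ordering just described can be sketched in a few lines of Python. The function name `select_objects_to_send`, the use of string identifiers, and the dictionary of already predicted probabilities are illustrative assumptions; the strict "greater than" comparison and the descending order follow the text of this embodiment.

```python
from typing import Dict, List


def select_objects_to_send(click_probs: Dict[str, float],
                           preset_probability: float) -> List[str]:
    """Keep objects whose predicted click probability exceeds the preset
    probability, ordered from highest to lowest click probability."""
    kept = [(oid, p) for oid, p in click_probs.items() if p > preset_probability]
    kept.sort(key=lambda item: item[1], reverse=True)
    return [oid for oid, _ in kept]
```

For example, with predicted probabilities of 0.2, 0.9, and 0.6 for three hypothetical objects and a preset probability of 0.5, only the second and third objects would be sent, in that order. Raising the preset probability shrinks the list toward the objects that match the user's interests most closely.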

Because the objects to be displayed that are sent to the user's display terminal closely match the user's interests, they are more likely to be clicked by the user and therefore more likely to bring revenue to content providers and e-commerce platforms, which raises the probability of converting the traffic of content providers and e-commerce platforms into revenue. For the same reason, the user experience is also improved.

On the basis of the above embodiments, in another embodiment of the present application the displayed objects include the currently displayed object. For this case, another embodiment of the present application provides a method for predicting click probability. Referring to Fig. 3, Fig. 3 is a flowchart of a method for predicting click probability according to another embodiment of the present application. As shown in Fig. 3, the method includes step S11, step S12, and the following step:

Step S12': train the click probability prediction model with the parameter information and user operation information of the currently displayed object as a training sample, to obtain an updated click probability prediction model.

In this embodiment, to make the click probability prediction model a model customized for each individual user, the idea of reinforcement learning is applied to the process of updating the model. Specifically, because the parameter information and user operation information of the currently displayed object reflect the user's current interests in real time, they are input into the click probability prediction model as a training sample, and the model is trained on them; the trained model is the updated click probability prediction model. It can be understood that, as described above, the model being trained may be a model obtained with the related art (when steps S11 to S12' are executed for the first time) or the updated model obtained the last time steps S11 to S12' were executed (when they are not being executed for the first time).
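The application frames this update in reinforcement-learning terms but does not specify the form of the model or the training algorithm. As a stand-in, the Python sketch below swaps in a plain per-user logistic model with one weight per object type, updated by a single stochastic-gradient step on the log loss for each new sample. The class name `ClickModel`, its methods, and the learning rate are all hypothetical.

```python
import math
from typing import Dict


class ClickModel:
    """Hypothetical per-user click model: a bias plus one logistic weight
    per object type (an assumed model form, not the application's)."""

    def __init__(self, learning_rate: float = 0.5) -> None:
        self.learning_rate = learning_rate
        self.weights: Dict[str, float] = {}
        self.bias = 0.0

    def predict(self, object_type: str) -> float:
        """Predicted click probability for an object of the given type."""
        z = self.bias + self.weights.get(object_type, 0.0)
        return 1.0 / (1.0 + math.exp(-z))

    def update(self, object_type: str, clicked: bool) -> None:
        """One gradient step using the currently displayed object as the sample."""
        error = (1.0 if clicked else 0.0) - self.predict(object_type)
        step = self.learning_rate * error
        self.weights[object_type] = self.weights.get(object_type, 0.0) + step
        self.bias += step
```

Each call to `update` plays the role of one pass through step S12': the model that enters the call is either the initial model or the result of the previous update, and the model that leaves it is the updated click probability prediction model for that user.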

Through this training process, the updated click probability prediction model captures the individual characteristics of the user, and using the updated model for probability prediction improves prediction accuracy.

On the basis of the above embodiments, in another embodiment of the present application the displayed objects include the currently displayed object and, in addition, the previous displayed object, that is, the displayed object whose display order is immediately before that of the currently displayed object. For this case, another embodiment of the present application provides a method for predicting click probability. Referring to Fig. 4, Fig. 4 is a flowchart of a method for predicting click probability according to another embodiment of the present application. As shown in Fig. 4, the method includes step S11, step S12, and the following steps:

Step S121: determine the expected cumulative reward of each historically displayed object, where the expected cumulative reward of each historically displayed object is determined from the user operation information of multiple historically displayed objects including the currently displayed object;

Step S122: when the expected cumulative reward of the currently displayed object is greater than the expected cumulative reward of any historically displayed object, train and update the click probability prediction model with the parameter information and user operation information of the currently displayed object as a training sample.

In one embodiment, step S121 includes the following step:

determining the expected cumulative reward of the currently displayed object and the expected cumulative reward of the previous displayed object, where the expected cumulative reward of a displayed object is determined from the user operation information of multiple historically displayed objects including that displayed object.

In one embodiment, step S122 includes the following step:

when the expected cumulative reward of the currently displayed object is greater than the expected cumulative reward of the previous displayed object, training the click probability prediction model with the parameter information and user operation information of the currently displayed object as a training sample, to obtain an updated click probability prediction model.

In this embodiment, the expected cumulative reward of a displayed object corresponds to the expected probability that the multiple historically displayed objects, including that displayed object, bring revenue to content providers and e-commerce platforms. It also reflects how closely, in expectation, those historically displayed objects match the user's interests.

In view of can all consume processor resource to each update for clicking Probabilistic Prediction Model, increase the processing of processor Burden, in order to improve the utilization efficiency of processor resource, and reduces the processing load of processor, in the present embodiment, every time Execute step S11 after, first determine whether this to click Probabilistic Prediction Model update whether it is necessary to if it is necessary, then This update is executed, conversely, can then continue to continue to use last updated click Probabilistic Prediction Model.

Specifically, every time after execution step S11, it is first determined the cumulative award desired value of current display object, and really The reward desired value that object has been displayed in upper one adjacent with the current display display order of object of display order is determined, then by two A reward desired value compares, if currently the cumulative award desired value of display object is larger, this is to click probabilistic forecasting The update of model is necessary, thus executes step S122 (or step S12 ')；If the currently cumulative award phase of display object Prestige value is smaller, then can continue to continue to use last updated click Probabilistic Prediction Model.

It is understood that if currently object has been displayed in cumulative award desired value greatly thereon one of display object Desired value is rewarded, then illustrates that rewarding desired value is also not up to maximum, thus it is necessary to pre- to last updated click probability It surveys model to continue to update, to increase the subsequent display object for being sent to the display terminal that user uses as content supplier The probability of income is brought with electric business platform, and improves the subsequent display object for being sent to the display terminal that user uses and user Point of interest compactness.
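The update decision described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation of this application: the class name `GatedUpdater`, the `train_fn` callback, and the choice to skip the update for the very first object (which has no previous object to compare against) are all hypothetical.

```python
from typing import Callable, Optional, Tuple

# (parameter information, user operation information) of one displayed object
Sample = Tuple[dict, dict]


class GatedUpdater:
    """Train the model only when the cumulative reward expectation has risen."""

    def __init__(self, train_fn: Callable[[Sample], None]) -> None:
        self.train_fn = train_fn  # hypothetical training hook
        self.previous_expectation: Optional[float] = None

    def observe(self, sample: Sample, current_expectation: float) -> bool:
        """Compare with the previous displayed object; return True if an update ran."""
        previous = self.previous_expectation
        self.previous_expectation = current_expectation
        # Assumption: with no previous object there is nothing to compare, so skip.
        if previous is None or current_expectation <= previous:
            return False  # keep using the most recently updated model
        self.train_fn(sample)  # corresponds to step S122
        return True
```

Passing the training routine in as a callback keeps the comparison logic independent of whatever model is actually used.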

In one embodiment, determining the cumulative reward expectation of the currently displayed object includes:

In one embodiment, the user operation information of the displayed object further includes: the time the displayed object was continuously exposed in a preset display area. Inputting the user operation information of multiple historical displayed objects, including the currently displayed object, into a preset reward function respectively, so as to obtain the reward values of those historical displayed objects, includes:

R=R₁+aR₂

For each displayed object, the cumulative reward expectation is determined as follows. First, the reward value of the displayed object is determined, along with the respective reward values of the multiple historical displayed objects whose display order precedes it. Then, the expectation of these reward values is calculated. The reward value of a displayed object can be determined in ways that include, but are not limited to, the following two implementations:

First implementation: the user operation information of the displayed object is input into a preset reward function to obtain the reward value of the displayed object, where the user operation information of the displayed object includes at least whether the displayed object was clicked by the user.

Second implementation: in addition to whether the displayed object was clicked by the user, the user operation information of the displayed object further includes the time the displayed object was continuously exposed in the preset display area. In this case, the reward value of the displayed object is determined according to the following formula:

R=R₁+aR₂

Where R denotes the reward value of the displayed object, R₁ denotes whether the displayed object was clicked by the user, a denotes a preset weight, and R₂ denotes the time the displayed object was continuously exposed in the preset display area.
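The second implementation and the subsequent expectation step can be sketched as below. This is only an illustration: the text does not fix how R₁ is encoded, what unit R₂ uses, or how the expectation is estimated, so the 1/0 click encoding, seconds as the time unit, and the arithmetic mean are assumptions, as are the function names.

```python
from typing import Sequence


def reward(clicked: bool, exposure_seconds: float, a: float) -> float:
    """R = R1 + a * R2 for one displayed object."""
    r1 = 1.0 if clicked else 0.0  # assumption: click encoded as 1, no click as 0
    r2 = exposure_seconds         # assumption: exposure time measured in seconds
    return r1 + a * r2


def cumulative_reward_expectation(rewards: Sequence[float]) -> float:
    """Assumed estimator: arithmetic mean over the object and its predecessors."""
    if not rewards:
        raise ValueError("at least one reward value is required")
    return sum(rewards) / len(rewards)
```

The list passed to `cumulative_reward_expectation` would hold the reward of the displayed object itself plus the rewards of the historical displayed objects shown before it.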

In this embodiment, it is considered that, for a given displayed object, whether the user clicked it does not by itself accurately reflect how interested the user is in it. One possible situation is that the user did not click the displayed object, but the object stayed continuously exposed in the preset display area for a long time; the user is then actually fairly interested in the displayed object and simply did not click it. Another possible situation is that the user clicked the displayed object, but the object stayed continuously exposed in the preset display area only briefly; the click may then have been a misoperation, and the user is actually not interested in the displayed object. Therefore, whether the user clicked the displayed object can be combined with the time the displayed object was continuously exposed in the preset display area to capture the user's degree of interest in it. Specifically, the user's degree of interest in the displayed object is determined, within a reasonable range of time lengths, from whether the user clicked the displayed object and from how long it was continuously exposed in the preset display area.

In one embodiment, the user's degree of interest in a displayed object can be characterized by the reward value of that displayed object. The reward value of the displayed object is obtained according to the following formula:

R=R₁+aR₂

In a specific implementation, a is preset. It quantifies how much weight the time the displayed object was continuously exposed in the preset display area carries when the reward value of the displayed object is determined.
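One way to see what the weight a controls is to compare the two situations described earlier: an unclicked object with a long exposure and a clicked object with a short exposure. Assuming R₁ is 1 for a click and 0 otherwise (the text does not fix this encoding), the two reward values are equal when a * t_long = 1 + a * t_short, that is, when a = 1 / (t_long - t_short). The helper below is a hypothetical sketch of that calculation; its name and parameters are not from this application.

```python
def break_even_weight(t_long: float, t_short: float) -> float:
    """Weight a at which an unclicked object exposed for t_long scores the same
    as a clicked object exposed for t_short, under R = R1 + a * R2.

    Assumes R1 is 1 for a click and 0 otherwise.
    """
    if t_long <= t_short:
        raise ValueError("t_long must exceed t_short")
    # 0 + a * t_long == 1 + a * t_short  =>  a == 1 / (t_long - t_short)
    return 1.0 / (t_long - t_short)
```

For weights above this value the long, unclicked exposure earns the larger reward value; below it, the click dominates.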

In this embodiment, the time a displayed object was continuously exposed in the preset display area is used as a parameter in calculating its reward value, that is, as one factor in capturing the user's points of interest. The resulting reward value therefore reflects the user's points of interest more accurately. The updated click probability prediction model is obtained on the basis of these reward values, and using the updated click probability prediction model for probability prediction improves the accuracy of the prediction.

Based on the same inventive concept, an embodiment of the present application provides a device for predicting click probability. Referring to Fig. 5, Fig. 5 is a schematic diagram of the device for predicting click probability provided by an embodiment of the present application. As shown in Fig. 5, the device includes:

Acquisition module 501, configured to obtain the parameter information of a displayed object, where the parameter information of the displayed object includes at least the type of the displayed object, and the user operation information of the displayed object includes at least whether the displayed object was clicked by the user;

Prediction module 502, configured to predict the click probability of an object to be displayed according to the parameter information of the object to be displayed and the click probability prediction model;

Optionally, the device further includes:

Optionally, the determining module includes:

The acquisition submodule includes:

R=R₁+aR₂

In combination with the above embodiments, an embodiment of the present application provides a device for predicting click probability. Referring to Fig. 6, Fig. 6 is a schematic diagram of the device for sending objects to be displayed provided by an embodiment of the present application. As shown in Fig. 6, in addition to the acquisition module 501 and the prediction module 502, the device further includes:

Sending module 503, configured to predict the click probabilities of multiple objects to be displayed, and to send to the display terminal the objects to be displayed whose predicted click probability is greater than a preset probability.
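The behaviour of the sending module can be sketched as a simple threshold filter. This is a minimal illustration under stated assumptions: `select_for_display` and its parameters are hypothetical names, and `predict` is a stand-in callable for the click probability prediction model, whose actual interface the text does not specify.

```python
from typing import Callable, Iterable, List, Tuple, TypeVar

T = TypeVar("T")


def select_for_display(
    candidates: Iterable[T],
    predict: Callable[[T], float],
    preset_probability: float,
) -> List[Tuple[T, float]]:
    """Return (object, predicted click probability) pairs above the threshold."""
    selected = []
    for obj in candidates:
        p = predict(obj)  # hypothetical stand-in for the prediction model
        if p > preset_probability:  # strictly greater, as the text states
            selected.append((obj, p))
    return selected
```

The comparison is strict, so an object whose predicted click probability exactly equals the preset probability is not sent.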

Optionally, the sending module includes:

Based on the same inventive concept, another embodiment of the present application provides a computer-readable storage medium on which a computer program is stored; when executed by a processor, the program implements the steps of the method described in any of the above embodiments of the present application.

Based on the same inventive concept, an embodiment of the present application provides an electronic device. Referring to Fig. 7, Fig. 7 is a schematic diagram of the electronic device proposed by an embodiment of the present application. As shown in Fig. 7, the electronic device 100 includes a memory 110 and a processor 120, which are communicatively connected through a bus. The memory 110 stores a computer program that can run on the processor 120 and thereby implement the steps of the method described in any of the above embodiments of the present application.

Because the device embodiment is basically similar to the method embodiment, it is described relatively briefly; for relevant details, refer to the description of the method embodiment.

The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments, and the same or similar parts of the embodiments can be referred to one another.

Those skilled in the art should understand that the embodiments of the present application may be provided as a method, a device, or a computer program product. Therefore, the embodiments of the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the embodiments of the present application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, and the like) that contain computer-usable program code.

The embodiments of the present application are described with reference to flowcharts and/or block diagrams of the method, terminal device (system), and computer program product according to the embodiments of the present application. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and any combination of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing terminal device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing terminal device produce a means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing terminal device to operate in a specific manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including an instruction means, where the instruction means implements the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

These computer program instructions may also be loaded onto a computer or other programmable data processing terminal device, so that a series of operation steps is executed on the computer or other programmable terminal device to produce computer-implemented processing. The instructions executed on the computer or other programmable terminal device thus provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.

Although preferred embodiments of the present application have been described, those skilled in the art can make additional changes and modifications to these embodiments once they learn of the basic inventive concept. The appended claims are therefore intended to be interpreted as including the preferred embodiments and all changes and modifications that fall within the scope of the embodiments of the present application.

Finally, it should be noted that, in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply any actual relationship or order between these entities or operations. Moreover, the terms "include", "comprise", or any other variant thereof are intended to cover a non-exclusive inclusion, so that a process, method, article, or terminal device that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or terminal device. Without further limitation, an element qualified by the phrase "including a ..." does not exclude the presence of other identical elements in the process, method, article, or terminal device that includes the element.

The method, device, storage medium, and electronic device for predicting click probability provided by the present application have been described in detail above. Specific examples are used herein to explain the principles and implementations of the present application, and the description of the above embodiments is intended only to help in understanding the method of the present application and its core idea. At the same time, those of ordinary skill in the art may, following the idea of the present application, make changes to the specific implementations and the scope of application. In summary, the content of this specification should not be construed as limiting the present application.

Claims

1. A method for predicting click probability, characterized in that the method comprises:

obtaining parameter information of a displayed object, where the parameter information of the displayed object includes at least the type of the displayed object, and the user operation information of the displayed object includes at least whether the displayed object was clicked by the user;

predicting the click probability of an object to be displayed according to the parameter information of the object to be displayed and a click probability prediction model;

wherein the click probability prediction model takes the expected value of the historical user operation information under the parameter information of each historical displayed object as the reward function, takes the parameter information of each historical displayed object as the state input, and takes the click probability for the parameter information of each historical displayed object as the action output.

2. The method according to claim 1, characterized in that the method further comprises:

determining the cumulative reward expectation of each historical displayed object respectively, where the cumulative reward expectation of each historical displayed object is determined from the user operation information of multiple historical displayed objects, including the currently displayed object;

when the cumulative reward expectation of the currently displayed object is greater than the cumulative reward expectation of any historical displayed object, using the parameter information and the user operation information of the currently displayed object as a training sample to train and update the click probability prediction model.

3. The method according to claim 2, characterized in that determining the cumulative reward expectation of the currently displayed object comprises:

inputting the user operation information of multiple historical displayed objects, including the currently displayed object, into a preset reward function respectively, so as to obtain the reward values of the multiple historical displayed objects, including the currently displayed object;

determining the cumulative reward expectation of the currently displayed object according to the reward values of the multiple historical displayed objects, including the currently displayed object.

4. The method according to claim 3, characterized in that the user operation information of the displayed object further includes: the time the displayed object was continuously exposed in a preset display area;

inputting the user operation information of multiple historical displayed objects, including the currently displayed object, into a preset reward function respectively, so as to obtain the reward values of the multiple historical displayed objects, including the currently displayed object, comprises:

for each historical displayed object among the multiple historical displayed objects, including the currently displayed object, determining the reward value of that historical displayed object according to the following formula:

R=R₁+aR₂

where R denotes the reward value of the historical displayed object, R₁ denotes whether the historical displayed object was clicked by the user, a denotes a preset weight, and R₂ denotes the time the historical displayed object was continuously exposed in the preset display area.

5. The method according to any one of claims 1 to 4, characterized in that the method further comprises:

predicting the click probabilities of multiple objects to be displayed, and sending to a display terminal the objects to be displayed whose predicted click probability is greater than a preset probability.

6. The method according to claim 5, characterized in that sending to the display terminal the objects to be displayed whose predicted click probability is greater than the preset probability comprises:

7. A device for predicting click probability, characterized in that the device comprises:

an acquisition module, configured to obtain parameter information of a displayed object, where the parameter information of the displayed object includes at least the type of the displayed object, and the user operation information of the displayed object includes at least whether the displayed object was clicked by the user;

a prediction module, configured to predict the click probability of an object to be displayed according to the parameter information of the object to be displayed and a click probability prediction model;

8. The device according to claim 7, characterized in that the device further comprises:

a determining module, configured to determine the cumulative reward expectation of each historical displayed object respectively, where the cumulative reward expectation of each historical displayed object is determined from the user operation information of multiple historical displayed objects, including the currently displayed object;

a training module, configured to, when the cumulative reward expectation of the currently displayed object is greater than the cumulative reward expectation of any historical displayed object, use the parameter information and the user operation information of the currently displayed object as a training sample to train and update the click probability prediction model.

9. A computer-readable storage medium on which a computer program is stored, characterized in that, when executed by a processor, the program implements the steps of the method according to any one of claims 1 to 6.

10. An electronic device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized in that the processor, when executing the program, implements the steps of the method according to any one of claims 1 to 6.