An ad click-through rate prediction method based on similarity relations between users
Technical field
The present invention relates to an ad click-through rate prediction method based on similarity relations between users, and belongs to the technical field of online advertisement delivery.
Background
Advertising is both a means of conveying information to the public and an important source of revenue for many companies. With the continued growth of Internet advertising and the large profits at stake, how to increase the return on ad delivery has become a research focus. Predicting the click-through rate of an advertisement makes it possible to estimate how likely a given user is to click a given advertisement, so that advertisements can be targeted at that user and the return on the delivered advertisements improved. At present, ad delivery generally falls into two kinds: content-based delivery and targeting techniques.
Content-based delivery follows a content-matching strategy: delivery centers on the search terms the user enters or on the content of the web page the user is browsing, and the advertisements whose content matches those search terms or that page content are delivered. This mode matches advertisement content only and does not consider personalized recommendation for different users. Different users who search for the same term or browse the same page may see the same advertisements, even though those advertisements are not necessarily of interest to them, so this mode performs poorly.
Targeting is a popular ad delivery technique. It uses historical data to characterize each user and then delivers advertisements matched to those characteristics, which improves the user experience. For this reason, most ad delivery today uses targeting.
In practice, however, the number of advertisements is huge, and many users have no click records or have clicked very few advertisements, so the historical data contains very few click records for them. Delivering advertisements to such users directly from historical data cannot accurately identify their interests, and delivery effectiveness drops sharply. Ad delivery in the prior art therefore often fails to predict user interest well and cannot accurately deliver the advertisements a user is interested in.
Summary of the invention
The technical problem to be solved by the present invention is to provide an ad click-through rate prediction method based on similarity relations between users which, by analyzing those similarity relations, accurately predicts a user's click-through rate on an advertisement and thereby enables accurate ad delivery.
To solve the above technical problem, the present invention adopts the following technical solution: an ad click-through rate prediction method based on similarity relations between users, comprising the following steps:
Step 001. According to the advertisement click logs on the server, obtain for each user all of the user's search keywords within a preset screening period and the user's click-through rate, within the preset screening period, on each advertisement shown to that user; then go to step 002;
Step 002. For all users, obtain the similarity value on search keywords within the preset screening period between every pair of users; select the pairs of users whose similarity value is greater than a preset similarity threshold, each such pair forming a group of two users with a direct similarity relation; obtain the dependency between the two users of each group, and determine the directly similar users of each user from these dependencies; then go to step 003;
Step 003. Build a Bayesian network model from the groups of two users with a direct similarity relation and the dependency between the two users of each group, where each user is represented by a user node and the dependency between the two users of each group is represented by a directed arrow between the corresponding user nodes; then go to step 004;
Step 004. For each user node in the Bayesian network model: if the user node has user parent nodes, obtain the posterior probability that the user node is in the clicked-advertisement state under each combination of the clicked and not-clicked states of its user parent nodes; that is, taking the user parent nodes as the directly similar users of the user node, obtain the posterior probability that the user node clicks the advertisement under each combination of the clicked and not-clicked states of those directly similar users. If the user node has no user parent node, obtain the probabilities of the clicked-advertisement state and the not-clicked-advertisement state of the user node. Then go to step 005;
Step 005. According to the structure of the Bayesian network model and the probabilities of the clicked and not-clicked states of each user node that has no user parent node, obtain, for each user node in the Bayesian network model, the posterior probability that the user node is in the clicked-advertisement state given that each other user node indirectly associated with it is in the clicked-advertisement state; select the pairs of indirectly associated user nodes whose posterior probability is greater than a preset probability threshold, each such pair forming a group of two users with an indirect similarity relation; then go to step 006;
Step 006. Obtain the directly similar users and the indirectly similar users of the target user to be predicted. Further obtain the posterior probability that the target user is in the clicked-advertisement state under each combination of the clicked and not-clicked states of its directly similar users, and the posterior probability that the target user is in the clicked-advertisement state given that each indirectly similar user is in the clicked-advertisement state. That is, referring to the directly and indirectly similar users of the target user collectively as similar users, obtain, for each similar user, the corresponding posterior probability that the target user clicks the advertisement. Then go to step 007;
Step 007. Using each user's click-through rates, within the preset screening period, on the advertisements shown to that user, multiply, for each similar user of the target user, the target user's posterior probability of clicking with respect to that similar user by that similar user's click-through rate on the target advertisement; sum these products and multiply the sum by a normalization factor. The resulting value is the predicted click-through rate of the target user on the target advertisement.
As a preferred technical solution of the present invention, step 001 specifically comprises the following steps:
Step 001-1. According to the advertisement click logs on the server, obtain for each user: all of the user's search keywords within the preset screening period, the number of times each advertisement was shown to the user within the preset screening period, and the number of times the user clicked each advertisement shown to it within the preset screening period; then go to step 001-2;
Step 001-2. For each user, obtain the user's click-through rate on each advertisement shown to it within the preset screening period from the number of times each advertisement was shown to the user within that period and the number of times the user clicked each such advertisement within that period; then go to step 002.
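As an illustration only, steps 001-1 and 001-2 might be sketched in Python as follows. The sketch assumes that the click-through rate is the number of clicks divided by the number of times the advertisement was shown, which is the usual definition but is not spelled out in the steps above. The function name click_through_rates, the tuple layout of the log rows and the sample data are hypothetical.

```python
from collections import defaultdict


def click_through_rates(log_rows):
    """Aggregate per-user, per-ad counts and return clicks / impressions.

    log_rows is an iterable of (user_id, ad_id, impressions, clicks) tuples;
    this layout is an assumption made for the example.
    """
    shown = defaultdict(int)
    clicked = defaultdict(int)
    for user_id, ad_id, impressions, clicks in log_rows:
        shown[(user_id, ad_id)] += impressions
        clicked[(user_id, ad_id)] += clicks
    # Skip pairs with zero impressions to avoid dividing by zero.
    return {key: clicked[key] / shown[key] for key in shown if shown[key] > 0}


# Hypothetical log rows within one screening period.
rows = [
    ("u1", "ad1", 10, 2),
    ("u1", "ad1", 10, 3),
    ("u2", "ad1", 4, 1),
]
ctr = click_through_rates(rows)
```

Keying the result by (user, advertisement) pairs keeps the later steps simple, since they look up one similar user's rate on one target advertisement at a time.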
As a preferred technical solution of the present invention, step 002 specifically comprises the following steps:
Step 002-1. For all users, form every pair of users. For each pair, according to each user's search keywords within the preset screening period, take the ratio of the number of search keywords common to the two users within the period to the number of all search keywords of the two users within the period as the similarity value on search keywords between the two users. The similarity value between every pair of users is obtained in this way; then go to step 002-2;
Step 002-2. Select the pairs of users whose similarity value is greater than the preset similarity threshold, each such pair forming a group of two users with a direct similarity relation; then go to step 002-3;
Step 002-3. For the two users A and B of each group with a direct similarity relation, make the following determination over the preset screening period; then go to step 003;
If the ratio of the number of search keywords common to A and B to the number of all search keywords of user A is greater than the ratio of that number to the number of all search keywords of user B, the dependency between A and B points from user A to user B: user A is the user parent node of user B, user B is the user child node of user A, and user A is a directly similar user of user B;
If the ratio of the number of search keywords common to A and B to the number of all search keywords of user B is greater than the ratio of that number to the number of all search keywords of user A, the dependency between A and B points from user B to user A: user B is the user parent node of user A, user A is the user child node of user B, and user B is a directly similar user of user A;
If the two ratios are equal, check whether a dependency already exists between A and B; if so, do nothing further for A and B; otherwise, set the direction of the dependency between A and B at random.
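A minimal Python sketch of steps 002-1 to 002-3 is given here for illustration. It assumes that "the number of all search keywords of the two users" means the size of the union of their keyword sets, which makes the similarity a Jaccard index; the text could also be read as the sum of the two counts. The names similarity and build_parents, the keyword data and the threshold are hypothetical.

```python
import random


def similarity(kw_a, kw_b):
    """Shared keywords over all keywords of both users (union assumed)."""
    union = kw_a | kw_b
    return len(kw_a & kw_b) / len(union) if union else 0.0


def build_parents(keywords, threshold, rng=None):
    """Return {user: set of parent users} for pairs above a non-negative threshold."""
    rng = rng or random.Random(0)
    users = sorted(keywords)
    parents = {u: set() for u in users}
    for i, a in enumerate(users):
        for b in users[i + 1:]:
            ka, kb = keywords[a], keywords[b]
            if similarity(ka, kb) <= threshold:
                continue  # no direct similarity relation
            common = len(ka & kb)
            ratio_a = common / len(ka)  # share of A's keywords that are common
            ratio_b = common / len(kb)  # share of B's keywords that are common
            if ratio_a > ratio_b:
                parents[b].add(a)       # A -> B: A is the parent of B
            elif ratio_b > ratio_a:
                parents[a].add(b)       # B -> A: B is the parent of A
            elif a not in parents[b] and b not in parents[a]:
                # Tie with no existing dependency: pick a direction at random.
                if rng.random() < 0.5:
                    parents[b].add(a)
                else:
                    parents[a].add(b)
    return parents


# Hypothetical keyword sets for three users.
keywords = {
    "A": {"shoes", "bags"},
    "B": {"shoes", "bags", "hats"},
    "C": {"cars"},
}
parents = build_parents(keywords, threshold=0.5)
```

Because the two ratios share the same numerator, the comparison amounts to pointing the arrow from the user with fewer keywords to the user with more. The resulting parent map is one possible representation of the directed structure used in step 003.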
As a preferred technical solution of the present invention, step 004 specifically comprises the following steps:
For each user node in the Bayesian network model, take the user node as the current user node and perform the following steps:
Step 004-1. Determine whether the current user node has a user parent node; if so, go to step 004-2; otherwise, go to step 004-5;
Step 004-2. Obtain the number N of user parent nodes of the current user node. Each user node has two states, clicked advertisement and not clicked advertisement; define the clicked state as 1 and the not-clicked state as 0. Combining the states of the current user node with the states of all its user parent nodes yields 2^(N+1) states; then go to step 004-3;
Step 004-3. For each state formed by combining the current user node and all its user parent nodes, and for each user node in that state: if the state of the user node is 1, take all search keywords of that user node within the preset screening period as its temporary keywords; if the state of the user node is 0, take the set difference between the search keywords of all user nodes in the Bayesian network model and the search keywords of that user node, within the preset screening period, as its temporary keywords; then go to step 004-4;
Step 004-4. For each such state, take the ratio of the number of keywords in the intersection of the temporary keywords of all user nodes in the state to the number of keywords in the intersection of the temporary keywords of the user parent nodes of the current user node in the state as the posterior probability of the current user node in that state. This gives the posterior probability that the current user node is in the clicked-advertisement state under each combination of the clicked and not-clicked states of its user parent nodes; that is, with the user parent nodes taken as its directly similar users, the posterior probability that the current user node clicks the advertisement under each combination of the clicked and not-clicked states of those directly similar users;
Step 004-5. For the clicked-advertisement state of the current user node, take the ratio of the number of search keywords of the current user node to the number of search keywords of all user nodes in the Bayesian network model, within the preset screening period, as the probability of the clicked state. For the not-clicked state, take the ratio of the number of keywords in the set difference between the search keywords of all user nodes in the Bayesian network model and the search keywords of the current user node to the number of search keywords of all user nodes in the model, within the preset screening period, as the probability of the not-clicked state. This gives the probabilities of the clicked and not-clicked states of the current user node.
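The keyword-set arithmetic of steps 004-3 to 004-5 can be illustrated with a short Python sketch. The function names, the keyword data and the choice to return 0.0 when the parents' intersection is empty are assumptions made for this example. Only the clicked state of the current node is tabulated; whenever the parents' intersection is non-empty, the not-clicked value is its complement, because the two numerators partition that intersection.

```python
from itertools import product


def temp_keywords(user, state, keywords, all_keywords):
    """State 1: the user's own keywords; state 0: everyone's keywords minus the user's."""
    return keywords[user] if state == 1 else all_keywords - keywords[user]


def root_click_probability(user, keywords):
    """Step 004-5: share of all keywords in the network that this user searched."""
    all_keywords = set().union(*keywords.values())
    return len(keywords[user]) / len(all_keywords)


def conditional_table(user, parent_list, keywords):
    """Steps 004-3 and 004-4 for the clicked state; parent_list must be non-empty.

    Maps each tuple of parent states to P(user clicks | parent states).
    """
    all_keywords = set().union(*keywords.values())
    table = {}
    for states in product((0, 1), repeat=len(parent_list)):
        parent_sets = [
            temp_keywords(p, s, keywords, all_keywords)
            for p, s in zip(parent_list, states)
        ]
        denominator = set.intersection(*parent_sets)
        numerator = denominator & keywords[user]
        # Empty parent intersection: probability undefined, 0.0 assumed here.
        table[states] = len(numerator) / len(denominator) if denominator else 0.0
    return table


# Hypothetical keyword sets; C has parents A and B.
keywords = {
    "A": {"k1", "k2"},
    "B": {"k2", "k3"},
    "C": {"k2", "k4"},
}
table = conditional_table("C", ["A", "B"], keywords)
```

In this toy data the table for C has four entries, one per combination of the states of A and B, matching the 2^N parent combinations described in step 004-2.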
As a preferred technical solution of the present invention, in step 005 the Gibbs sampling method is used: according to the structure of the Bayesian network model and the probabilities of the clicked and not-clicked states of each user node that has no user parent node, the posterior probability that each user node in the Bayesian network model is in the clicked-advertisement state, given that each other user node indirectly associated with it is in the clicked-advertisement state, is obtained.
As a preferred technical solution of the present invention, in step 005 the Gibbs sampling method is applied to each user node in the Bayesian network model in turn, taking the user node as the current user node, to obtain the posterior probability that the current user node is in the clicked-advertisement state given that each other user node indirectly associated with it is in the clicked-advertisement state. This specifically comprises the following steps:
Step 005-1. Among the other user nodes indirectly associated with the current user node, take the user node in the clicked-advertisement state as the evidence variable e, take the current user node as the target variable t, and take the user nodes in the Bayesian network model other than the evidence variable and the target variable as the non-evidence variables q. For each user node in the Bayesian network model, take its user parent nodes, its user child nodes and the user parent nodes of its user child nodes as the Markov blanket of that user node; then go to step 005-2;
Step 005-2. Initialize the states of all user nodes as the first sample: assign state 1 to the evidence variable and assign state 0 or 1 at random to each non-evidence variable; then go to step 005-3;
Step 005-3. Loop over the non-evidence variables. For each non-evidence variable q, use the posterior probability calculation of step 004 to compute the posterior probabilities that q is 0 and that q is 1, conditioned on the current states of the user nodes in its Markov blanket; then go to step 005-4;
Step 005-4. Generate a random number between 0 and the sum of the conditional probabilities that the current non-evidence variable q is 0 and is 1. If the random number is less than or equal to the conditional probability that q is 0, set the state of q to 0; if the random number is greater than the conditional probability that q is 0 and less than the sum of the two conditional probabilities, set the state of q to 1. The states of the non-evidence variables updated in this way form a new sample; then go to step 005-5;
Step 005-5. Repeat steps 005-2 to 005-4 to keep generating new samples, count the number n of samples in which the state of the target variable is 1, and compute the ratio of n to the number of samples s. This ratio is the posterior probability that the current user node is also in state 1 given that the state of the given user node is 1.
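The sampling loop of steps 005-1 to 005-5 might be sketched in Python as follows, purely as an illustration. The function name gibbs_posterior, the dictionary layout of the parents and conditional probability tables, the burn-in length and the three-node chain A -> B -> C with its probabilities are all assumptions made for this example. The sketch also departs from the wording of the steps in two labeled ways: it resamples the target variable together with the other unobserved nodes, and it initializes the sample once and then loops only over the resampling steps, as is usual for Gibbs sampling.

```python
import random


def gibbs_posterior(parents, cpt, evidence, target,
                    n_samples=20000, burn_in=500, seed=0):
    """Estimate P(target = 1 | evidence) by Gibbs sampling.

    parents:  {node: tuple of parent nodes}
    cpt:      {node: {tuple of parent states: P(node = 1 | parents)}}
    evidence: {node: fixed state}, e.g. {"C": 1}
    """
    rng = random.Random(seed)
    nodes = list(parents)
    children = {v: [c for c in nodes if v in parents[c]] for v in nodes}
    free = [v for v in nodes if v not in evidence]  # includes the target (assumption)
    # Step 005-2: evidence fixed, every other node initialized at random.
    state = {v: evidence.get(v, rng.randint(0, 1)) for v in nodes}

    def local(v):
        """P(v = state[v] | current states of v's parents)."""
        p1 = cpt[v][tuple(state[p] for p in parents[v])]
        return p1 if state[v] == 1 else 1.0 - p1

    hits = 0
    for step in range(burn_in + n_samples):
        for v in free:
            old = state[v]
            weights = []
            for value in (0, 1):
                state[v] = value
                # Step 005-3: this product depends only on v's Markov blanket.
                w = local(v)
                for c in children[v]:
                    w *= local(c)
                weights.append(w)
            total = weights[0] + weights[1]
            if total == 0.0:
                state[v] = old  # both values impossible here; keep the old state
                continue
            # Step 005-4: draw in [0, total]; at or below the weight of 0 gives state 0.
            state[v] = 0 if rng.uniform(0.0, total) <= weights[0] else 1
        if step >= burn_in:
            hits += state[target]
    # Step 005-5: fraction of samples in which the target is in state 1.
    return hits / n_samples


# Hypothetical chain A -> B -> C; A and C are only indirectly associated.
parents = {"A": (), "B": ("A",), "C": ("B",)}
cpt = {
    "A": {(): 0.3},
    "B": {(0,): 0.2, (1,): 0.9},
    "C": {(0,): 0.1, (1,): 0.8},
}
estimate = gibbs_posterior(parents, cpt, evidence={"C": 1}, target="A")
```

For this small chain the exact value of P(A = 1 | C = 1) can be worked out by hand as 0.219 / 0.387, roughly 0.57, which gives a convenient check on the sampler.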
As a preferred technical solution of the present invention, the normalization factor in step 007 is obtained as follows:
First, within the preset screening period, obtain the sum of the click-through rates of the similar users of the target user on the advertisements shown to them; then take the ratio of 1 to that sum as the normalization factor.
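As a rough illustration, the weighted sum of step 007 and the normalization factor described above might be combined in Python as follows. The sketch assumes that the normalization sum runs over every advertisement shown to every similar user of the target user, which is one reading of the text. The function name predict_ctr and all numbers are hypothetical.

```python
def predict_ctr(posteriors, ctr, target_ad):
    """Predicted click-through rate of the target user on target_ad.

    posteriors: {similar_user: P(target user clicks | that similar user)}
    ctr:        {(user, ad): click-through rate within the screening period}
    """
    # Normalization factor: 1 / (sum of the similar users' click-through rates).
    total = sum(rate for (user, _ad), rate in ctr.items() if user in posteriors)
    if total == 0:
        return 0.0  # no click history among similar users (assumed fallback)
    # Sum of posterior x similar user's click-through rate on the target ad.
    weighted = sum(
        p * ctr.get((user, target_ad), 0.0) for user, p in posteriors.items()
    )
    return weighted / total


# Hypothetical posteriors from step 006 and click-through rates from step 001.
posteriors = {"u1": 0.8, "u2": 0.5}
ctr = {("u1", "ad1"): 0.2, ("u1", "ad2"): 0.1, ("u2", "ad1"): 0.1}
prediction = predict_ctr(posteriors, ctr, "ad1")
```

With these numbers the weighted sum is 0.8 x 0.2 + 0.5 x 0.1 = 0.21 and the sum of click-through rates is 0.4, so the prediction is 0.21 / 0.4 = 0.525.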
Compared with the prior art, the ad click-through rate prediction method based on similarity relations between users of the present invention, by adopting the above technical solution, has the following technical effects. The method builds the structure and parameters of the Bayesian network model from data extracted from the advertisement click logs, analyzes the similarity relations between users, predicts users' click-through rates on advertisements from that analysis, and finally achieves accurate ad delivery. The Bayesian network model is built with high accuracy, so that the results are well grounded, and redundant edges are removed when the model is created, which improves the reliability and validity of the model. In addition, inference over the Bayesian network can be carried out by several methods to obtain indirectly similar users, which gives the method flexibility and choice. While matching advertisement content against search content, the method thus also takes full account of users' interests and concerns; it combines the strengths of the two ad delivery modes, avoids the one-sidedness of a single delivery mode, and achieves a good prediction effect.
Brief description of the drawings
Fig. 1 is an architecture diagram of the ad click-through rate prediction method based on similarity relations between users designed by the present invention;
Fig. 2 is a flowchart of the construction of the Bayesian network model in the ad click-through rate prediction method based on similarity relations between users designed by the present invention;
Fig. 3 is a flowchart of constructing the indirect similar-user relations using the Gibbs sampling method in the ad click-through rate prediction method based on similarity relations between users designed by the present invention.
Detailed description of the embodiments
Specific embodiments of the present invention are described in further detail below with reference to the accompanying drawings.
The problem addressed by the present invention is that, in the prior art, when a user has no click records or only a few, the advertisements of interest to the user cannot be delivered accurately and ad delivery is not efficient enough. The ad click-through rate prediction method based on similarity relations between users uses a Bayesian network to establish direct similarity dependencies between users from the similarity of their search behavior, and then uses Bayesian network inference over those direct similarity relations to infer indirect similarity dependencies between users, so that the click-through rate of a given user on a given advertisement can be predicted. For a keyword searched by a user, all candidate advertisements can be matched and then ranked by their predicted click-through rates, and advertisements are targeted at the user according to that prediction. This improves both the revenue of the ad delivery provider and the delivery effectiveness, and addresses the inefficiency of current ad delivery.
As shown in Fig. 1, in practical application the ad click-through rate prediction method based on similarity relations between users designed by the present invention specifically comprises the following steps:
Step 001. According to the advertisement click logs on the server, obtain for each user all of the user's search keywords within a preset screening period and the user's click-through rate, within the preset screening period, on each advertisement shown to that user; then go to step 002.
Step 001 specifically comprises the following steps:
Step 001-1. From the advertisement click logs on the server, extract five fields: the user identifier, the advertisement identifier, the description of the user's search keywords, the number of times the advertisement was shown and the number of times it was clicked. From these, obtain for each user: all of the user's search keywords within the preset screening period, the number of times each advertisement was shown to the user within the preset screening period, and the number of times the user clicked each advertisement shown to it within the preset screening period; then go to step 001-2.
Step 001-2. For each user, obtain the user's click-through rate on each advertisement shown to it within the preset screening period from the number of times each advertisement was shown to the user within that period and the number of times the user clicked each such advertisement within that period; then go to step 002.
Step 002. For all users, obtain the similarity value on search keywords within the preset screening period between every pair of users; select the pairs of users whose similarity value is greater than a preset similarity threshold, each such pair forming a group of two users with a direct similarity relation; obtain the dependency between the two users of each group, determine the directly similar users of each user from these dependencies, and save them; then go to step 003.
As shown in Fig. 2, step 002 specifically comprises the following steps:
Step 002-1. For all users, form every pair of users. For each pair, according to each user's search keywords within the preset screening period, take the ratio of the number of search keywords common to the two users within the period to the number of all search keywords of the two users within the period as the similarity value on search keywords between the two users. The similarity value between every pair of users is obtained in this way; then go to step 002-2.
Step 002-2. Select the pairs of users whose similarity value is greater than the preset similarity threshold, each such pair forming a group of two users with a direct similarity relation; then go to step 002-3.
Step 002-3. For the two users A and B of each group with a direct similarity relation, make the following determination over the preset screening period; then go to step 003.
If the ratio of the number of search keywords common to A and B to the number of all search keywords of user A is greater than the ratio of that number to the number of all search keywords of user B, the dependency between A and B points from user A to user B: user A is the user parent node of user B, user B is the user child node of user A, and user A is a directly similar user of user B.
If the ratio of the number of search keywords common to A and B to the number of all search keywords of user B is greater than the ratio of that number to the number of all search keywords of user A, the dependency between A and B points from user B to user A: user B is the user parent node of user A, user A is the user child node of user B, and user B is a directly similar user of user A.
If the two ratios are equal, check whether a dependency already exists between A and B; if so, do nothing further for A and B; otherwise, set the direction of the dependency between A and B at random.
Step 003. Build a Bayesian network model from the groups of two users with a direct similarity relation and the dependency between the two users of each group, where each user is represented by a user node and the dependency between the two users of each group is represented by a directed arrow between the corresponding user nodes; then go to step 004.
Step 004. For each user node in the Bayesian network model: if the user node has user parent nodes, obtain the posterior probability that the user node is in the clicked-advertisement state under each combination of the clicked and not-clicked states of its user parent nodes; that is, taking the user parent nodes as the directly similar users of the user node, obtain the posterior probability that the user node clicks the advertisement under each combination of the clicked and not-clicked states of those directly similar users, and save it. If the user node has no user parent node, obtain the probabilities of the clicked-advertisement state and the not-clicked-advertisement state of the user node, and save them. Then go to step 005.
Wherein, above-mentioned steps 004 specifically comprise the following steps:
It is pressed respectively for each user node in Bayesian network model using user node as active user's node
Following steps are operated:
Step 004-1. judges that active user's node is to enter step 004-2 with the presence or absence of user's father node;Otherwise into
Enter step 004-5.
Step 004-2. obtains the number N of user's father node corresponding to active user's node, according to each user node point
Dui Ying advertisement not clicked and not click the two states of advertisement, will click on advertisement state and be defined as 1, not click the definition of advertisement state
It is 0, and then obtains and combine constituted 2 between active user's node different conditions and its all user's father node different conditions
N+1 power state, each state is formed by binary N+1 0 or 1, subsequently into step 004-3.
Step 004-3. be directed between active user's node and its all user's father node respectively combine constituted it is each
State further takes the user node pre- for each user node in state if user node state is 1 respectively
If screening interim keyword of its all search key as the user node in the period;If user node state is 0,
Within the default screening period, all search keys of all user nodes and the user node in Bayesian network model are taken
The difference set of all search keys, as the interim keyword of the user node, subsequently into step 004-4.
Step 004-4. be directed between active user's node and its all user's father node respectively combine constituted it is each
State, active user's node in the keyword number and state of the intersection of all interim keywords of user node in acquisition state
The ratio of the keyword number of the intersection of all interim keywords of user's father node, in this state as active user's node
Posterior probability;Advertisement is clicked in its each user's father node thus to obtain active user's node and does not click advertisement two states
Under various combination, it is each with its to obtain active user's node for the corresponding posterior probability for clicking advertisement state of active user's node
User's father node clicks advertisement in each direct similar users and does not click two kinds of advertisement respectively as each direct similar users
Under the various combination of state, the corresponding posterior probability for clicking advertisement state of active user's node, and save.
Step 004-5. For the click state of the current user node, take the ratio of the number of all search keywords of the current user node to the number of all search keywords of all user nodes in the Bayesian network model, both within the preset screening period, as the probability of the click state of the current user node. For the no-click state of the current user node, take the ratio of the number of search keywords in the difference set between all search keywords of all user nodes in the Bayesian network model and all search keywords of the current user node to the number of all search keywords of all user nodes in the Bayesian network model, again within the preset screening period, as the probability of the no-click state of the current user node. The probabilities of the click state and the no-click state of the current user node are thus obtained and saved.
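The two ratios of step 004-5 can be illustrated with a short Python sketch. The function name and data layout are hypothetical, and the sketch assumes that "number of search keywords" means the number of distinct keywords, which the source does not state explicitly.

```python
def prior_probabilities(user, keywords):
    """Return (P(click), P(no click)) for one user as in step 004-5.

    keywords maps each user to the set of keywords searched in the period.
    Assumption: keywords are counted as distinct items.
    """
    all_keywords = set().union(*keywords.values())
    if not all_keywords:
        raise ValueError("no search keywords recorded in the screening period")
    own = set(keywords[user])
    p_click = len(own) / len(all_keywords)
    p_no_click = len(all_keywords - own) / len(all_keywords)
    return p_click, p_no_click


# Hypothetical search logs: four distinct keywords overall, three used by u1.
keywords = {
    "u1": {"shoes", "phone", "laptop"},
    "u2": {"phone", "laptop", "camera"},
}
print(prior_probabilities("u1", keywords))  # (0.75, 0.25)
```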
Step 005. According to the structure of the Bayesian network model and the probabilities of the click state and the no-click state of each user node that has no parent user node, obtain, for each user node in the Bayesian network model, the posterior probability that the user node clicks the advertisement given that each other user node indirectly associated with it is in the click state. Select the posterior probabilities greater than a preset probability threshold; the two indirectly associated user nodes corresponding to each selected posterior probability constitute a group of two users with an indirect similarity relation. Save these groups, then proceed to step 006.
As shown in Fig. 3, step 005 uses Gibbs sampling: each user node in the Bayesian network model is taken in turn as the current user node, and the posterior probability that the current user node clicks the advertisement, given that each other user node indirectly associated with it is in the click state, is obtained and saved. This specifically comprises the following steps:
Step 005-1. Among the other user nodes indirectly associated with the current user node, take the user node that is in the click state as the evidence variable e, take the current user node as the target variable t, and take the remaining user nodes in the Bayesian network model, other than the evidence variable and the target variable, as the non-evidence variables q. For each user node in the Bayesian network model, take its parent user nodes, its child user nodes, and the parent user nodes of its child user nodes as the Markov blanket of that user node. Then proceed to step 005-2.
Step 005-2. Initialize the states of all user nodes as the first sample: the state of the evidence variable is set to 1, and each non-evidence variable is randomly assigned state 0 or 1. Then proceed to step 005-3.
Step 005-3. Loop over the non-evidence variables. For each non-evidence variable q, use the posterior probabilities obtained in step 004 to calculate the posterior probabilities that q is 0 and that q is 1, conditioned on the current states of the user nodes in its Markov blanket. Then proceed to step 005-4.
Step 005-4. Generate a random number between 0 and the sum of the conditional probabilities that the current non-evidence variable q is 0 and that it is 1. If the random number is less than or equal to the conditional probability that q is 0, set the state of the current non-evidence variable to 0; if the random number is greater than the conditional probability that q is 0 and less than the sum of the conditional probabilities that q is 0 and that it is 1, set the state of the current non-evidence variable to 1. The states of the non-evidence variables updated in this way form a new sample. Then proceed to step 005-5.
Step 005-5. Repeat steps 005-2 to 005-4 to keep sampling and generating new samples, and count the number n of samples, among all samples, in which the state of the target variable is 1. The ratio of n to the number of sampling iterations s is the posterior probability that the current user node is itself in state 1 given that a certain user node is known to be in state 1.
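The sampling loop of steps 005-1 to 005-5 can be sketched in Python as below. This is a minimal sketch under stated assumptions, not the patented implementation: the function name, the `parents` and `cpt` data structures, and the toy network are hypothetical; the target variable is resampled together with the other non-evidence nodes; and the network is initialized once rather than on every repetition.

```python
import random


def gibbs_posterior(parents, cpt, target, evidence, n_samples=20000, seed=0):
    """Estimate P(target = 1 | evidence = 1) by Gibbs sampling.

    parents: node -> tuple of its parent nodes (every node must be a key).
    cpt: node -> {tuple of parent states: P(node = 1 | those states)}.
    """
    rng = random.Random(seed)
    children = {n: [c for c, ps in parents.items() if n in ps] for n in parents}
    # Step 005-2: evidence fixed to 1, every other node starts at random.
    state = {n: 1 if n == evidence else rng.randint(0, 1) for n in parents}
    free = [n for n in parents if n != evidence]

    def local(node, value):
        # P(node = value | current states of its parents).
        p1 = cpt[node][tuple(state[p] for p in parents[node])]
        return p1 if value == 1 else 1.0 - p1

    def weight(node, value):
        # Step 005-3: unnormalised probability of node = value given its
        # Markov blanket (parents, children, and the children's parents).
        old = state[node]
        state[node] = value
        w = local(node, value)
        for child in children[node]:
            w *= local(child, state[child])
        state[node] = old
        return w

    hits = 0
    for _ in range(n_samples):
        for q in free:
            w0, w1 = weight(q, 0), weight(q, 1)
            if w0 + w1 == 0.0:
                continue  # assumption: keep the old state if both weights vanish
            # Step 005-4: draw r in [0, w0 + w1] and compare it with w0.
            r = rng.uniform(0.0, w0 + w1)
            state[q] = 0 if r <= w0 else 1
        # Step 005-5: count the samples in which the target is 1.
        hits += state[target]
    return hits / n_samples


# Hypothetical chain A -> B -> C; A and C are only indirectly associated.
parents = {"A": (), "B": ("A",), "C": ("B",)}
cpt = {
    "A": {(): 0.5},
    "B": {(0,): 0.1, (1,): 0.9},
    "C": {(0,): 0.2, (1,): 0.8},
}
estimate = gibbs_posterior(parents, cpt, target="A", evidence="C")
print(estimate)  # an approximation; the exact posterior for this toy chain is 0.74
```

For this toy chain the exact value follows by enumeration: P(C=1 | A=1) = 0.9 × 0.8 + 0.1 × 0.2 = 0.74 and P(C=1 | A=0) = 0.26, so with a prior of 0.5 on A the posterior P(A=1 | C=1) is 0.74. The sampled estimate should approach this value as the number of samples grows.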
Step 006. Obtain the direct similar users and the indirect similar users of the target prediction user. Further obtain the posterior probability that the target prediction user clicks the advertisement under each combination of click and no-click states of its direct similar users, and the posterior probability that the target prediction user clicks the advertisement given that each of its indirect similar users is in the click state. That is, the direct similar users and the indirect similar users of the target prediction user are collectively referred to as similar users, and the posterior probability that the target prediction user clicks the advertisement relative to each similar user is obtained and saved. Then proceed to step 007.
Step 007. According to each user's click rate for each advertisement shown to that user within the preset screening period, compute, for each similar user of the target prediction user, the product of the posterior probability that the target prediction user clicks the advertisement relative to that similar user and that similar user's click rate for the target advertisement. Sum the products and multiply the sum by a normalization factor; the resulting value is the predicted click rate of the target prediction user for the target advertisement. The normalization factor is obtained as follows: first, within the preset screening period, sum the click rates of the similar users of the target prediction user for each advertisement shown to them; then take the ratio of 1 to this sum as the normalization factor.
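The weighted sum and normalization of step 007 can be illustrated with the following Python sketch. The function name, the dictionary layout, and the sample numbers are hypothetical, and the sketch assumes that a similar user who was never shown the target advertisement contributes a click rate of 0 for it.

```python
def predict_click_rate(posteriors, click_rates, target_ad):
    """Predicted click rate of the target user for target_ad (step 007).

    posteriors: similar user -> P(target user clicks | that similar user).
    click_rates: similar user -> {advertisement id: click rate in the period}.
    """
    # Normalization factor: 1 divided by the sum of all click rates of all
    # similar users over the advertisements shown to them.
    total = sum(sum(click_rates[u].values()) for u in posteriors)
    if total == 0:
        return 0.0  # assumption: no recorded clicks means a prediction of 0
    weighted = sum(
        p * click_rates[u].get(target_ad, 0.0)  # assumption: unseen ad counts as 0
        for u, p in posteriors.items()
    )
    return weighted / total


# Hypothetical posteriors and per-advertisement click rates.
posteriors = {"u2": 0.8, "u3": 0.5}
click_rates = {
    "u2": {"ad1": 0.2, "ad2": 0.1},
    "u3": {"ad1": 0.4, "ad3": 0.3},
}
predicted = predict_click_rate(posteriors, click_rates, "ad1")
print(round(predicted, 4))  # 0.36
```

Here the products are 0.8 × 0.2 = 0.16 and 0.5 × 0.4 = 0.20, the click rates sum to 1.0, and the prediction is therefore 0.36.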
Based on the above technical solution, advertisements matching a user's interests can be delivered accurately to that user according to the obtained predicted click rate of the user for an advertisement or according to the user's search keywords, thereby improving the advertisement delivery effect.
The ad click rate prediction method based on the similarity relation between users designed in the above technical solution extracts data from the advertisement click logs, constructs the structure and parameters of the Bayesian network model, analyzes the similarity relations between users, and from these predicts a user's click rate for an advertisement, ultimately enabling accurate advertisement delivery. The Bayesian network model is built on a sound basis and has high accuracy, so its results are not unfounded; redundant edges are removed when the Bayesian network model is created, which improves its reliability and validity. Moreover, in the process of establishing the Bayesian network model, inference over the Bayesian network can be carried out by a variety of methods to obtain the indirect similar users, which gives the method high flexibility and selectivity. As a result, the ad click rate prediction method based on the similarity relation between users designed in the present invention matches advertisement content with search content while fully considering the user's interests and points of attention. It combines the advantages of the two advertisement delivery approaches, avoids the one-sidedness of a single delivery approach, and achieves a good prediction effect.
The embodiments of the present invention have been explained in detail above with reference to the accompanying drawings, but the present invention is not limited to the above embodiments; within the scope of knowledge possessed by a person of ordinary skill in the art, various changes can also be made without departing from the purpose of the present invention.