CN103870452A - Method and method for recommending data - Google Patents

Method and method for recommending data Download PDF

Info

Publication number
CN103870452A
CN103870452A CN201210525892.8A CN201210525892A CN103870452A CN 103870452 A CN103870452 A CN 103870452A CN 201210525892 A CN201210525892 A CN 201210525892A CN 103870452 A CN103870452 A CN 103870452A
Authority
CN
China
Prior art keywords
data
recommended
user
feature
recommendation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210525892.8A
Other languages
Chinese (zh)
Inventor
纪达麒
陈运文
刘作涛
辛颖伟
王文广
姚璐
邹溢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shengle Information Technolpogy Shanghai Co Ltd
Original Assignee
Shengle Information Technolpogy Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shengle Information Technolpogy Shanghai Co Ltd filed Critical Shengle Information Technolpogy Shanghai Co Ltd
Priority to CN201210525892.8A priority Critical patent/CN103870452A/en
Publication of CN103870452A publication Critical patent/CN103870452A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention relates to a method and system for recommending data, wherein the method comprises the steps of training weight factors of all users by taking the characteristics of all the users and the serial numbers of pieces of first to-be-recommended data as characteristic factors, obtaining the pieces of second to-be-recommended data from the pieces of first to-be-recommended data and obtaining predicted click rates of the pieces of second to-be-recommended data corresponding to a user requesting recommendation according to the weight factors, the characteristic of the user requesting recommendation and the serial numbers of the pieces of second to-be-recommended data, ranking the predicted click rates from high to low, obtaining first K pieces of the second to-be-recommended data highest in the predicted click rate and recommending the obtained first K pieces of the second to-be-recommended data to the user requesting recommendation, wherein K is a positive integer. The method and the system are capable of realizing personalized recommendation to a new user by thoroughly utilizing the characteristics of the user and the characteristic factors carried in the to-be-recommended data, and the characteristic factors are quite easy to extend, and therefore, new weight factors can be trained quickly according to new characteristic factors.

Description

Data recommendation method and system
Technical field
The present invention relates to a kind of data recommendation method and system.
Background technology
The personalized recommendation of data as the personalized recommendation of video website often according to user and data as the interbehavior of video, recommend the interested data of user as video to user.
But, for new user in the case of without any click, watch or mutual-action behavior, how these new users being carried out to personalized recommendation is a very large problem.For these new users, prior art is only from database, to choose hot data to recommend to it, but this recommend method is not realized the object of personalized recommendation.Therefore, need at present a kind of personalized recommendation method for new user and system badly.
Summary of the invention
The object of the present invention is to provide a kind of data recommendation method and system, feature and the entrained characterization factor of data to be recommended that can make full use of user carry out personalized recommendation to new user, and characterization factor ratio is easier to expansion, the weight factor that can make new advances according to new characterization factor Fast Training.
For addressing the above problem, the invention provides a kind of data recommendation method, comprising:
The weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor;
From the first data to be recommended, obtain the second data to be recommended, and obtain the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation according to user's the feature of described weight factor, request recommendation and the numbering of the second data to be recommended;
Described prediction clicking rate is sorted from big to small, obtain front K the user that the second data recommendation to be recommended is recommended to described request of prediction clicking rate maximum, wherein K is positive integer.
Further, in said method, the quantity of the quantity × data to be recommended of all users' of quantity=1+ of described weight factor feature.
Further, in said method, train all users' weight factor according to following formula:
Z = W 0 + Σ M × N ( W M × N × F M × N )
Wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
Further, in said method, obtain the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation according to following formula:
P = 1 1 + e - ( W 0 + Σ M × N ( W M × N × F M × N )
Wherein, P is prediction clicking rate, e=2.71828, W 0and W m × Nrepresent described weight factor, F m × nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
Further, in said method, user's feature comprises one or the combination in any in browser type, display resolution, types of network equipment, access websites time, location, user website incoming road and user's landing page.
Further, in said method, described the first data to be recommended are the high-quality data of regularly obtaining.
Further, in said method, to described high-quality data sorting, obtain front Q high-quality data as the second recommending data according to one or more users' feature, wherein Q is positive integer.
According to another side of the present invention, a kind of data recommendation system is provided, comprising:
Model module, for the weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor;
Recommended engine module, for obtaining the second data to be recommended from the first data to be recommended, and obtain the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation according to user's the feature of described weight factor, request recommendation and the numbering of the second data to be recommended, and described prediction clicking rate is sorted from big to small, front K the user that the second data recommendation to be recommended is recommended to described request who obtains prediction clicking rate maximum, wherein K is positive integer.
Further, in said system, train all users' weight factor according to following formula:
Z = W 0 + Σ M × N ( W M × N × F M × N )
Wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
Further, in said system, obtain the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation according to following formula:
P = 1 1 + e - ( W 0 + Σ M × N ( W M × N × F M × N )
Wherein, P is prediction clicking rate, e=2.71828, W 0and W m × Nrepresent described weight factor, F m × nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
Compared with prior art, the present invention is by the weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor, from the first data to be recommended, obtain the second data to be recommended, and according to described weight factor, user's the feature that request is recommended and the numbering of the second data to be recommended are obtained the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation, described prediction clicking rate is sorted from big to small, obtain front K the user that the second data recommendation to be recommended is recommended to described request of prediction clicking rate maximum, feature and the entrained characterization factor of data to be recommended that can make full use of user carry out new user to carry out personalized recommendation.
In addition, according to formula
Figure BDA00002545030300042
train all users' weight factor, wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, and according to formula
Figure BDA00002545030300051
obtain the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation, wherein, P is prediction clicking rate, e=2.71828, W 0and W m × Nrepresent described weight factor, F m × Nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, makes characterization factor ratio be easier to expansion, the weight factor that can make new advances according to new characterization factor Fast Training.By above-mentioned intelligent recommendation scheme, can greatly improve the efficiency of user's fast searching content of interest, reduce the invalid browsing time of user, especially use in the environment that network traffics expense is higher at mobile device, save a large amount of communication flowss, for user saves campus network, improve user's reading satisfaction.
Accompanying drawing explanation
Fig. 1 is the process flow diagram of the data recommendation method of the embodiment of the present invention one;
Fig. 2 is the personalized recommendation result schematic diagram of the embodiment of the present invention one;
Fig. 3 is the data recommendation method flow diagram of the embodiment of the present invention two;
Fig. 4 is the module diagram of the data recommendation system of the embodiment of the present invention three;
Fig. 5 is the structural drawing of the data recommendation system of the embodiment of the present invention three.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent more, below in conjunction with the drawings and specific embodiments, the present invention is further detailed explanation.
Embodiment mono-
As shown in Figure 1, the invention provides a kind of data recommendation method, comprising:
Step S11, the weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor.Wherein, described data can be videos, and the numbering of described the first data to be recommended can be video ID, have represented the difference hobby for the treatment of recommending data without user's feature.
Preferably, described user's feature can be obtained from a User action log, described user's feature comprises one or the combination in any in browser type, display resolution, types of network equipment, access websites time, location, user website incoming road (referer) and user's landing page (landing page), described user's feature comprises one or more characterization factors, and certain described user's feature also can comprise the data that user is demonstrated, the data that user clicked or watched.
Concrete, each user's feature specifically can comprise:
1) browser type can comprise: 360, IE7, IE6, IE8, Sogou, Chome, FireFox or other;
2) display resolution can comprise: 1440*900,1024*768,1280*800,800*680 or other;
3) types of network equipment can comprise: pc, movement or other;
4) the access websites time can comprise various hourages;
5) location can be accurate to province;
6) various user website incoming roads, user website incoming road (referrer) is that visitor enters any approach in website, HTTP referer is a part of header, in the time that browser sends request to web server, generally can bring referer to tell server user to connect and come from that page, server can obtain some information term processing by this
7) various user's landing page, user's landing page (Landing Page is sometimes called as and primarily catches user page) is the webpage that is shown to user after potential user clicks advertisement or utilizes search engine search.General this page can show and the advertisement of clicking or Search Results link relevant expansion content, and this page should be to do search engine optimization for certain key word (or phrase).
Preferably, the quantity N of the quantity M × data to be recommended of all users' of quantity=1+ of described weight factor feature.
Preferably, can train according to following formula all users' weight factor:
Z = W 0 + Σ M × N ( W M × N × F M × N )
Wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, can carry out LR modeling to user's characterization factor by this formula, and the weight generation factor.
The feature of supposing the user who only uses 1 types of network equipment comprises pc, mobile and other, and the numbering that has 2 data comprises video1 and video2 as video ID, and such one has 6 characterization factors comprises: F pc × video1, F mobile × video1, F other × video1, F pc × viedo2, F mobile × video2and F other × video2, need to train altogether 1+3*2=7 weight factor to comprise: W 0, W pc × video1, W mobile × video1, W other × video1, W pc × viedo2, W mobile × video2and W other × video2.
So, formula Z = W 0 + Σ M × N ( W M × N × F M × N ) Be converted into:
Z=W 0+W pc×video1×F pc×video1+W mobile×video1×F mobile×video1+W other×video1×F other× video1+W pc×viedo2×F pc×viedo2+W mobile×video2×F mobile×video2+W other×video2×F other×video2
If certain user sees to it and shown video Video1 by PC, but do not click Video1, Z is 0, owing to only having F pc × video1be 1, remaining F mobile × video1, F other × video1, F pc × viedo2, F mobile × video2and F other × video2be 0, above-mentioned formula is further converted to:
0=W 0+W pc×video1
Can train above-mentioned 7 weight factors according to above-mentioned same mode.
Step S12, from the first data to be recommended, obtain the second data to be recommended, and obtain the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation according to user's the feature of described weight factor, request recommendation and the numbering of the second data to be recommended.
Preferably, can obtain the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation according to following formula:
P = 1 1 + e - ( W 0 + Σ M × N ( W M × N × F M × N )
Wherein, P is prediction clicking rate, and e is natural logarithm, e=2.71828, and, W 0and W m × Nrepresent described weight factor, F m × Nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is that the quantity of the first data to be recommended can be carried out ctr to the second data to be recommended by this formula and estimated to obtain prediction clicking rate.
Step S13, described prediction clicking rate is sorted from big to small, obtain front K the user that the second data recommendation to be recommended is recommended to described request of prediction clicking rate maximum, wherein K is positive integer, concrete, for without logging in, history information as click, watch or mutual-action behavior etc. without or little new user, in the time of its request personalized recommendation, still can recommend out data accurately, Fig. 2 is the personalized recommendation result that the present embodiment is shown to new user.
To sum up, the present embodiment can make characterization factor ratio be easier to expansion, the weight factor that can make new advances according to new characterization factor Fast Training, thus the feature and the entrained characterization factor of data to be recommended that make full use of user carry out personalized recommendation to new user.By above-mentioned intelligent recommendation scheme, can greatly improve the efficiency of user's fast searching content of interest, reduce the invalid browsing time of user, especially use in the environment that network traffics expense is higher at mobile device, save a large amount of communication flowss, for user saves campus network, improve user's reading satisfaction.
Embodiment bis-
As shown in Figure 2, the invention provides another kind of data recommendation method, the difference of the present embodiment and embodiment is that described the first data to be recommended are the high-quality data of regularly obtaining, in addition according to one or more users' feature to described high-quality data sorting, obtain front Q high-quality data as the second recommending data, thereby make recommendation results more accurate, described method comprises:
Step S21, the high-quality data of regularly obtaining are as the first data to be recommended; Concrete, described data can be video, for example can regularly from the whole network video library, obtain high-quality video and upgrade and store in a high-quality video storehouse, because the video of the whole network video library may have several necessarily even several hundred million, data volume is too large, train all users' the weight factor workload can be very large to all videos of the whole network video library, also there is no need, cross first to screen the video in the whole network video library and obtain high-quality video as the first data to be recommended, specifically can screen according to dimensions such as the broadcasting number of video, comment number, video qualities.
Step S22, the weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor.Wherein, described data can be videos, and the numbering of described the first data to be recommended can be video ID.
Preferably, described user's feature can be obtained from a User action log, described user's feature comprises one or the combination in any in browser type, display resolution, types of network equipment, access websites time, location, user website incoming road (referer) and user's landing page (landing page), described user's feature comprises one or more characterization factors, and certain described user's feature also can comprise the data that user is demonstrated, the data that user clicked or watched.
Concrete, each user's feature specifically can comprise:
1) browser type can comprise: 360, IE7, IE6, IE8, Sogou, Chome, FireFox or other;
2) display resolution can comprise: 1440*900,1024*768,1280*800,800*680 or other;
3) types of network equipment can comprise: pc, movement or other;
4) the access websites time can comprise various hourages;
5) location can be accurate to province;
6) various user website incoming roads, user website incoming road (referrer) is that visitor enters any approach in website, HTTP referer is a part of header, in the time that browser sends request to web server, generally can bring referer to tell server user to connect and come from that page, server can obtain some information term processing by this
7) various user's landing page, user's landing page (Landing Page is sometimes called as and primarily catches user page) is the webpage that is shown to user after potential user clicks advertisement or utilizes search engine search.General this page can show and the advertisement of clicking or Search Results link relevant expansion content, and this page should be to do search engine optimization for certain key word (or phrase).
Preferably, the quantity N of the quantity M × data to be recommended of all users' of quantity=1+ of described weight factor feature.
Preferably, can train according to following formula all users' weight factor:
Z = W 0 + Σ M × N ( W M × N × F M × N )
Wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, can carry out LR modeling to user's characterization factor by this formula, and the weight generation factor.
The feature of supposing the user who only uses 1 types of network equipment comprises pc, mobile and other, and the numbering that has 2 data comprises video1 and video2 as video ID, and such one has 6 characterization factors comprises: F pc × video1, F mobile × video1, F other × video1, F pc × viedo2, F mobile × video2and F other × video2, need to train altogether 1+3*2=7 weight factor to comprise: W 0, W pc × video1, W mobile × video1, W other × video1, W pc × viedo2, W mobile × video2and W other × video2.
So, formula Z = W 0 + Σ M × N ( W M × N × F M × N ) Be converted into:
Z=W 0+W pc×video1×F pc×video1+W mobile×video1×F mobile×video1+W other×video1×F other× video1+W pc×viedo2×F pc×viedo2+W mobile×video2×F mobile×video2+W other×video2×F other×video2
If certain user sees to it and shown video Video1 by PC, but do not click Video1, Z is 0, owing to only having F pc × video1be 1, remaining F mobile × video1, F other × video1, F pc × viedo2, F mobile × video2and F other × video2be 0, above-mentioned formula is further converted to:
0=W 0+W pc×video1
Can train above-mentioned 7 weight factors according to above-mentioned same mode.
Step S23, according to one or more users' feature to described high-quality data sorting, obtain front Q high-quality data as the second recommending data, wherein Q is positive integer, concrete, in the time that data are video, can pass through browser type, display resolution, types of network equipment, access websites time, location, user website incoming road (referer) and user's landing page (landing page) described high-quality video is sorted, obtain a front Q high-quality video as 1024 more the data of high-quality as the second recommending data.
Step S24, from the first data to be recommended, obtain the second data to be recommended, and obtain the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation according to user's the feature of described weight factor, request recommendation and the numbering of the second data to be recommended.
Preferably, can obtain the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation according to following formula:
P = 1 1 + e - ( W 0 + Σ M × N ( W M × N × F M × N )
Wherein, P is prediction clicking rate, and e is natural logarithm, e=2.71828, and, W 0and W m × Nrepresent described weight factor, F m × Nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is that the quantity of the first data to be recommended can be carried out ctr to the second data to be recommended by this formula and estimated to obtain prediction clicking rate.
Step S25, described prediction clicking rate is sorted from big to small, obtain front K the user that the second data recommendation to be recommended is recommended to described request of prediction clicking rate maximum, wherein K is positive integer, concrete, for without logging in, history information as click, watch or mutual-action behavior etc. without or little new user, in the time of its request personalized recommendation, still can recommend out data accurately.
To sum up, the present embodiment can make characterization factor ratio be easier to expansion, the weight factor that can make new advances according to new characterization factor Fast Training, thereby the feature and the entrained characterization factor of data to be recommended that make full use of user carry out personalized recommendation to new user, and be the high-quality data of regularly obtaining by described the first data to be recommended, and according to one or more users' feature to described high-quality data sorting, obtain front Q high-quality data as the second recommending data, thereby make recommendation results more accurate.By above-mentioned intelligent recommendation scheme, can greatly improve the efficiency of user's fast searching content of interest, reduce the invalid browsing time of user, especially use in the environment that network traffics expense is higher at mobile device, save a large amount of communication flowss, for user saves campus network, improve user's reading satisfaction.
Embodiment tri-
As shown in Figure 4, the present invention also provides a kind of data recommendation system, comprises model module 1 and recommended engine module 2.Wherein, described data can be videos, and the numbering of described the first data to be recommended can be video ID.
The weight factor of model module 1 for the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor.
Preferably, the quantity of the quantity × data to be recommended of all users' of quantity=1+ of described weight factor feature.
Preferably, described user's feature can be obtained from a User action log, described user's feature comprises one or the combination in any in browser type, display resolution, types of network equipment, access websites time, location, user website incoming road (referer) and user's landing page (landing page), described user's feature comprises one or more characterization factors, and certain described user's feature also can comprise the data that user is demonstrated, the data that user clicked or watched.
Concrete, each user's feature specifically can comprise:
1) browser type can comprise: 360, IE7, IE6, IE8, Sogou, Chome, FireFox or other;
2) display resolution can comprise: 1440*900,1024*768,1280*800,800*680 or other;
3) types of network equipment can comprise: pc, movement or other;
4) the access websites time can comprise various hourages;
5) location can be accurate to province;
6) various user website incoming roads, user website incoming road (referrer) is that visitor enters any approach in website, HTTP referer is a part of header, in the time that browser sends request to web server, generally can bring referer to tell server user to connect and come from that page, server can obtain some information term processing by this
7) various user's landing page, user's landing page (Landing Page is sometimes called as and primarily catches user page) is the webpage that is shown to user after potential user clicks advertisement or utilizes search engine search.General this page can show and the advertisement of clicking or Search Results link relevant expansion content, and this page should be to do search engine optimization for certain key word (or phrase).
Preferably, can train according to following formula all users' weight factor:
Z = W 0 + Σ M × N ( W M × N × F M × N )
Wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, can carry out LR modeling to user's characterization factor by this formula, and the weight generation factor.
The feature of supposing the user who only uses 1 types of network equipment comprises pc, mobile and other, and the numbering that has 2 data comprises video1 and video2 as video ID, and such one has 6 characterization factors comprises: F pc × video1, F mobile × video1, F other × video1, F pc × viedo2, F mobile × video2and F other × video2, need to train altogether 1+3*2=7 weight factor to comprise: W 0, W pc × video1, W mobile × video1, W other × video1, W pc × viedo2, W mobile × video2and W other × video2.Formula
Figure BDA00002545030300161
be converted into:
Z=W 0+W pc×video1×F pc×video1+W mobile×video1×F mobile×video1+W other×video1×F other× video1+W pc×viedo2×F pc×viedo2+W mobile×video2×F mobile×video2+W other×video2×F other×video2
If certain user sees to it and shown video Video1 by PC, but do not click Video1, Z is 0, owing to only having F pc × video1be 1, remaining F mobile × video1, F other × video1, F pc × viedo2, F mobile × video2and F other × video2be 0, above-mentioned formula is further converted to:
0=W 0+W pc×video1
Can train above-mentioned 7 weight factors according to above-mentioned same mode.
Preferably, described the first data to be recommended are the high-quality data of regularly obtaining.Concrete, described data can be video, for example can regularly from the whole network video library, obtain high-quality video and upgrade and store in a high-quality video storehouse, because the video of the whole network video library may have several necessarily even several hundred million, data volume is too large, train all users' the weight factor workload can be very large to all videos of the whole network video library, also there is no need, cross first to screen the video in the whole network video library and obtain high-quality video as the first data to be recommended, specifically can screen according to dimensions such as the broadcasting number of video, comment number, video qualities.
Recommended engine module 2 is for obtaining the second data to be recommended from the first data to be recommended, and obtain the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation according to user's the feature of described weight factor, request recommendation and the numbering of the second data to be recommended, and described prediction clicking rate is sorted from big to small, front K the user that the second data recommendation to be recommended is recommended to described request who obtains prediction clicking rate maximum, wherein K is positive integer.Concrete, in the time that data are video, can pass through browser type, display resolution, types of network equipment, access websites time, location, user website incoming road (referer) and user's landing page (landing page) sorts to described high-quality video, obtain a front Q high-quality video as 1024 more high-quality video as the second recommending data, for without logging in, history information as click, watch or mutual-action behavior etc. without or little new user, in the time of its request personalized recommendation, still can recommend out data accurately.
Preferably, can obtain the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation according to following formula:
P = 1 1 + e - ( W 0 + Σ M × N ( W M × N × F M × N )
Wherein, P is prediction clicking rate, and e is natural logarithm, e=2.71828, W 0and W m × Nrepresent described weight factor, F m × Nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, can carry out ctr to the second data to be recommended estimate to obtain prediction clicking rate by this formula.
Preferably, can be according to one or more users' feature to described high-quality data sorting, obtain front Q high-quality data as the second recommending data, wherein Q is positive integer.
As shown in Figure 5, in the time that data are video, can from the whole network video library 3, filter out high-quality video by a VideoSelector module 4 and deposit a high-quality video storehouse 5 in, model module 1 is according to the feature of the user in user's row daily record 6 and the high-quality video weight generation factor 8, then recommended engine 2 sends to front end according to the partial video of the further screening in high-quality video storehouse 5 and weight factor 8 generating recommendations results, realizes to new user and carries out personalized video recommendation.
To sum up, the present embodiment can make characterization factor ratio be easier to expansion, the weight factor that can make new advances according to new characterization factor Fast Training, thereby the feature and the entrained characterization factor of data to be recommended that make full use of user carry out personalized recommendation to new user, and be the high-quality data of regularly obtaining by described the first data to be recommended, and according to one or more users' feature to described high-quality data sorting, obtain front Q high-quality data as the second recommending data, thereby make recommendation results more accurate.
Compared with prior art, the present invention is by the weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor, from the first data to be recommended, obtain the second data to be recommended, and according to described weight factor, user's the feature that request is recommended and the numbering of the second data to be recommended are obtained the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation, described prediction clicking rate is sorted from big to small, obtain front K the user that the second data recommendation to be recommended is recommended to described request of prediction clicking rate maximum, feature and the entrained characterization factor of data to be recommended that can make full use of user carry out new user to carry out personalized recommendation.
In addition, according to formula train all users' weight factor, wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, and according to formula
Figure BDA00002545030300191
obtain the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation, wherein, P is prediction clicking rate, and e is natural logarithm, e=2.71828, W 0and W m × Nrepresent described weight factor, F m × Nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended, makes characterization factor ratio be easier to expansion, the weight factor that can make new advances according to new characterization factor Fast Training.By above-mentioned intelligent recommendation scheme, can greatly improve the efficiency of user's fast searching content of interest, reduce the invalid browsing time of user, especially use in the environment that network traffics expense is higher at mobile device, save a large amount of communication flowss, for user saves campus network, improve user's reading satisfaction.
In this instructions, each embodiment adopts the mode of going forward one by one to describe, and what each embodiment stressed is and the difference of other embodiment, between each embodiment identical similar part mutually referring to.For the disclosed system of embodiment, owing to corresponding to the method disclosed in Example, so description is fairly simple, relevant part illustrates referring to method part.
Professional can also further recognize, unit and the algorithm steps of each example of describing in conjunction with embodiment disclosed herein, can realize with electronic hardware, computer software or the combination of the two, for the interchangeability of hardware and software is clearly described, composition and the step of each example described according to function in the above description in general manner.These functions are carried out with hardware or software mode actually, depend on application-specific and the design constraint of technical scheme.Professional and technical personnel can realize described function with distinct methods to each specifically should being used for, but this realization should not thought and exceeds scope of the present invention.
Obviously, those skilled in the art can carry out various changes and modification and not depart from the spirit and scope of the present invention invention.Like this, if within of the present invention these are revised and modification belongs to the scope of the claims in the present invention and equivalent technologies thereof, the present invention is also intended to including these changes and modification.

Claims (10)

1. a data recommendation method, is characterized in that, comprising:
The weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor;
From the first data to be recommended, obtain the second data to be recommended, and obtain the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation according to user's the feature of described weight factor, request recommendation and the numbering of the second data to be recommended;
Described prediction clicking rate is sorted from big to small, obtain front K the user that the second data recommendation to be recommended is recommended to described request of prediction clicking rate maximum, wherein K is positive integer.
2. data recommendation method as claimed in claim 1, is characterized in that, trains all users' weight factor according to following formula:
Z = W 0 + Σ M × N ( W M × N × F M × N )
Wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
3. data recommendation method as claimed in claim 2, is characterized in that, obtains the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation according to following formula:
P = 1 1 + e - ( W 0 + Σ M × N ( W M × N × F M × N )
Wherein, P is prediction clicking rate, e=2.71828, W 0and W m × Nrepresent described weight factor, F m × nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
4. data recommendation method as claimed in claim 1, is characterized in that, the quantity of the quantity × data to be recommended of all users' of quantity=1+ of described weight factor feature.
5. data recommendation method as claimed in claim 1, it is characterized in that, user's feature comprises one or the combination in any in browser type, display resolution, types of network equipment, access websites time, location, user website incoming road and user's landing page.
6. data recommendation method as claimed in claim 1, is characterized in that, described the first data to be recommended are the high-quality data of regularly obtaining.
7. data recommendation method as claimed in claim 6, is characterized in that, according to one or more users' feature, described high-quality data is sorted, and obtains front Q high-quality data as the second recommending data, and wherein Q is positive integer.
8. a data recommendation system, is characterized in that, comprising:
Model module, for the weight factor that the numbering of all users' feature and the first data to be recommended is trained to all users as characterization factor;
Recommended engine module, for obtaining the second data to be recommended from the first data to be recommended, and obtain the prediction clicking rate of described the second data to be recommended with respect to the user of request recommendation according to user's the feature of described weight factor, request recommendation and the numbering of the second data to be recommended, and described prediction clicking rate is sorted from big to small, front K the user that the second data recommendation to be recommended is recommended to described request who obtains prediction clicking rate maximum, wherein K is positive integer.
9. data recommendation system as claimed in claim 8, is characterized in that, trains all users' weight factor according to following formula:
Z = W 0 + Σ M × N ( W M × N × F M × N )
Wherein, the click situation of Z representative of consumer to a certain the first data to be recommended, when click, Z is 1, while click, Z is 0, W 0and W m × Nrepresent described weight factor, F m × Nthe representative feature factor is the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
10. data recommendation system as claimed in claim 8, is characterized in that, obtains the prediction clicking rate of the second data to be recommended with respect to the user of request recommendation according to following formula:
P = 1 1 + e - ( W 0 + Σ M × N ( W M × N × F M × N )
Wherein, P is prediction clicking rate, e=2.71828, W 0and W m × Nrepresent described weight factor, F m × nrepresent the displaying situation of a certain the first data to be recommended under a certain user's feature, F when displaying m × Nbe 1, F while displaying m × Nbe that 0, M is the quantity of user's feature, N is the quantity of the first data to be recommended.
CN201210525892.8A 2012-12-07 2012-12-07 Method and method for recommending data Pending CN103870452A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210525892.8A CN103870452A (en) 2012-12-07 2012-12-07 Method and method for recommending data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210525892.8A CN103870452A (en) 2012-12-07 2012-12-07 Method and method for recommending data

Publications (1)

Publication Number Publication Date
CN103870452A true CN103870452A (en) 2014-06-18

Family

ID=50908996

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210525892.8A Pending CN103870452A (en) 2012-12-07 2012-12-07 Method and method for recommending data

Country Status (1)

Country Link
CN (1) CN103870452A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106168980A (en) * 2016-07-26 2016-11-30 合网络技术(北京)有限公司 Multimedia resource recommends sort method and device
CN108537568A (en) * 2018-03-07 2018-09-14 阿里巴巴集团控股有限公司 A kind of information recommendation method and device
CN109862432A (en) * 2019-01-31 2019-06-07 厦门美图之家科技有限公司 Clicking rate prediction technique and device
WO2019127845A1 (en) * 2017-12-28 2019-07-04 平安科技(深圳)有限公司 Recording recommendation method, device, apparatus, and computer readable storage medium
CN110415063A (en) * 2018-07-31 2019-11-05 北京京东尚科信息技术有限公司 Method of Commodity Recommendation, device, electronic equipment and readable medium
CN111314790A (en) * 2020-03-26 2020-06-19 北京奇艺世纪科技有限公司 Video playing record sequencing method and device and electronic equipment
CN113870641A (en) * 2021-09-29 2021-12-31 上海乐项信息技术有限公司 Simulation training method and system for live broadcast of tape goods

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106168980A (en) * 2016-07-26 2016-11-30 合网络技术(北京)有限公司 Multimedia resource recommends sort method and device
CN106168980B (en) * 2016-07-26 2020-07-28 阿里巴巴(中国)有限公司 Multimedia resource recommendation sequencing method and device
WO2019127845A1 (en) * 2017-12-28 2019-07-04 平安科技(深圳)有限公司 Recording recommendation method, device, apparatus, and computer readable storage medium
CN108537568A (en) * 2018-03-07 2018-09-14 阿里巴巴集团控股有限公司 A kind of information recommendation method and device
CN108537568B (en) * 2018-03-07 2021-12-21 创新先进技术有限公司 Information recommendation method and device
CN110415063A (en) * 2018-07-31 2019-11-05 北京京东尚科信息技术有限公司 Method of Commodity Recommendation, device, electronic equipment and readable medium
CN109862432A (en) * 2019-01-31 2019-06-07 厦门美图之家科技有限公司 Clicking rate prediction technique and device
CN111314790A (en) * 2020-03-26 2020-06-19 北京奇艺世纪科技有限公司 Video playing record sequencing method and device and electronic equipment
CN113870641A (en) * 2021-09-29 2021-12-31 上海乐项信息技术有限公司 Simulation training method and system for live broadcast of tape goods

Similar Documents

Publication Publication Date Title
CN103870452A (en) Method and method for recommending data
US8332775B2 (en) Adaptive user feedback window
CN101957834B (en) Content recommending method and device based on user characteristics
CN102426610B (en) Microblog rank searching method and microblog searching engine
CN102855309B (en) A kind of information recommendation method based on user behavior association analysis and device
CN107784066A (en) Information recommendation method, device, server and storage medium
CN106528693A (en) Individualized learning-oriented educational resource recommendation method and system
CN103544212A (en) Content recommending method and system
CN104199874A (en) Webpage recommendation method based on user browsing behaviors
CN105989074A (en) Method and device for recommending cold start through mobile equipment information
CN109325179A (en) A kind of method and device that content is promoted
CN105589971A (en) Method and device for training recommendation model, and recommendation system
CN105045864B (en) A kind of digitalization resource personalized recommendation method
CN103186595A (en) Method and system for recommending audios/videos
US9558175B2 (en) Generating an individualized web page template with entities of a web page publisher organization associated with areas of the template
CN104077415A (en) Searching method and device
CN102831114A (en) Method and device for realizing statistical analysis on user access condition of Internet
CN103713894A (en) Method and equipment for determining access demand information of user
CN104951518A (en) Context recommending method based on dynamic incremental updating
CN102957949A (en) Device and method for recommending video to user
CN104503988A (en) Searching method and device
CN104281619A (en) System and method for ordering search results
CN105653550B (en) Webpage filtering method and device
Nasraoui et al. Performance of recommendation systems in dynamic streaming environments
CN102637182B (en) Method for analyzing interactive evolution of core user information of Web social network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140618