CN105160008A - Method and device for locating suggested users - Google Patents

Method and device for locating suggested users Download PDF

Info

Publication number
CN105160008A
CN105160008A CN201510604634.2A CN201510604634A CN105160008A CN 105160008 A CN105160008 A CN 105160008A CN 201510604634 A CN201510604634 A CN 201510604634A CN 105160008 A CN105160008 A CN 105160008A
Authority
CN
China
Prior art keywords
user
channel
specific characteristic
data
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510604634.2A
Other languages
Chinese (zh)
Other versions
CN105160008B (en
Inventor
李添
王晓龙
姚键
潘柏宇
王冀
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Youku Network Technology Beijing Co Ltd
Original Assignee
1Verge Internet Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 1Verge Internet Technology Beijing Co Ltd filed Critical 1Verge Internet Technology Beijing Co Ltd
Priority to CN201510604634.2A priority Critical patent/CN105160008B/en
Publication of CN105160008A publication Critical patent/CN105160008A/en
Application granted granted Critical
Publication of CN105160008B publication Critical patent/CN105160008B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/7867Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention discloses a method and a device for locating suggested users. The method comprises the following steps: collecting user transaction data and user video viewing data from a video system; extracting an appointed characteristic data set of training users and an appointed characteristic data set of test users from the collected user transaction data and user video viewing data; training various appointed characteristics in the appointed characteristic data set of the training users according to a training algorithm, so as to obtain weighted values of various appointed characteristics; determining paying probability prediction data of various test users according to the appointed characteristic data set of the test users and the trained weighted values of the appointed characteristics; and locating the suggested users according to the paying probability prediction data of the test users. According to the method and the device, the users with the paying tendency can be relatively accurately picked out from the users in a test set, so that the suggested users can be relatively accurately located.

Description

A kind of method and device of locating recommendation user
Technical field
The present invention relates to video technique field, in particular, the present invention relates to a kind of method and device of locating recommendation user.
Background technology
Video (Video) makes a general reference the various technology that a series of static image is caught in the mode of electric signal, note down, processes, stores, transmits and reappeared.Continuous print image change per second more than more than 24 frames (frame) picture time, according to persistence of vision principle, because human eye cannot distinguish the tableaux of single width, therefore, continuous print picture looks it is level and smooth continuous print visual effect when playing, and continuous print picture is called video like this.
It is exactly make to impel customer consumption that current video website improves one of important way of income, such as to user's advertisement etc., but current commending system ubiquity is extensively casted net, the feature of low conversion ratio, how navigating to the higher user of tendency that pays more accurately is industry problem in the urgent need to address.
Summary of the invention
In view of the above problems, this application provides a kind of method and device of locating recommendation user, recommend user to navigate to more accurately.
In order to solve the problems of the technologies described above, the application adopts following technical scheme:
Locate a method of recommending user, it comprises:
Gather customer transaction data and user video viewing data in video system;
From the customer transaction data of described collection and the user video viewing extracting data training specific characteristic data set of user and the specific characteristic data set of test subscriber;
The weighted value obtaining each specific characteristic is trained according to training algorithm each specific characteristic to the specific characteristic data centralization of training user;
Weighted value according to the specific characteristic data set of test subscriber and each specific characteristic of training acquisition determines the probabilistic forecasting data that each test subscriber pays;
User is recommended according to the probabilistic forecasting data location that each test subscriber pays.
Wherein, collection video system customer transaction data and user video viewing data comprise:
The daily record obtaining member's tran list obtains video system customer transaction data;
Obtain user video viewing daily record and obtain video system customer transaction data.
Wherein, the daily record of described member's tran list and described user video viewing daily record comprise personal computer and daily record corresponding to wireless side.
Wherein, determine that the probabilistic forecasting data that each test subscriber pays comprise according to the weighted value of the specific characteristic data set of test subscriber and each specific characteristic of training acquisition:
The weighted value obtaining each specific characteristic is sorted;
Determine to specify the crucial specific characteristic in sequencing horizon according to ranking results;
The probabilistic forecasting data that each test subscriber pays are determined according to the weight of crucial specific characteristic and the specific characteristic data set of test subscriber.
Wherein, the probabilistic forecasting data location of paying according to each test subscriber recommends user to comprise:
Determine according to recommendation accuracy the first threshold curve recommending user according to the probabilistic forecasting data of paying according to each test subscriber in section in positioning time;
The probabilistic forecasting data of paying according to each test subscriber in positioning time section are according to the Second Threshold curve recommending efficiency to determine to recommend user;
The threshold value of recommending user is determined according to described first threshold curve and Second Threshold curve;
User is recommended according to the threshold value location of the described recommendation user determined.
Wherein, the specific characteristic data set of described training user comprises positive sample data and negative sample data, described positive sample data is the specific characteristic data set of the user that fixed time point paying is bought, and described negative sample data are put the specific characteristic data set of the user bought that do not pay the fixed time.
Wherein, described fixed time point is that positive sample and negative sample gather the same day.
Wherein, described negative sample data bulk is three times of positive sample data quantity.
Wherein, the described user bought that pays is the user buying member.
Wherein, described training algorithm is L2 canonical logistic regression training algorithm.
Wherein, specific characteristic is one or more in following characteristics:
Movie channel, series channel, automobile channel, to make laughs channel, animation channel, XATV-6, fashion channel, parent-offspring's channel, GameChannel, original channel, advertisement channel, music channel, Info channel, sports channel, life channel, tourism channel, science and technology channel, education channel, entertainment channel, documentary film channel, other channel, android equipment, iphone equipment, ipad equipment, ipod equipment, miscellaneous equipment, member, non-member, paid video, free video, complete viewing and trying.
The application also provides a kind of and locates the device recommending user, and it comprises:
Acquisition module, for gathering customer transaction data and user video viewing data in video system;
Extraction module, for from the customer transaction data of described collection and the user video viewing extracting data training specific characteristic data set of user and the specific characteristic data set of test subscriber;
Training module, for training according to training algorithm each specific characteristic to the specific characteristic data centralization of training user the weighted value obtaining each specific characteristic;
Determination module, the weighted value for each specific characteristic obtained according to specific characteristic data set and the training of test subscriber determines the probabilistic forecasting data that each test subscriber pays;
Locating module, recommends user for the probabilistic forecasting data location of paying according to each test subscriber.
Wherein, acquisition module comprises:
First obtains submodule, obtains video system customer transaction data for obtaining the daily record of member's tran list;
Second obtains submodule, obtains video system customer transaction data for obtaining user video viewing daily record.
Wherein, the daily record of described member's tran list and described user video viewing daily record comprise personal computer and daily record corresponding to wireless side.
Wherein, determination module comprises:
Sorting sub-module, for sorting to the weighted value obtaining each specific characteristic;
Crucial specific characteristic determination submodule, specifies the crucial specific characteristic in sequencing horizon for determining according to ranking results;
Probabilistic forecasting data determination submodule, for determining according to the weight of crucial specific characteristic and the specific characteristic data set of test subscriber the probabilistic forecasting data that each test subscriber pays.
Wherein, locating module comprises:
First threshold curve determination submodule, for determining according to recommendation accuracy the first threshold curve recommending user according to the probabilistic forecasting data of paying according to each test subscriber in positioning time in section;
Second Threshold curve determination submodule, the probabilistic forecasting data of paying according to each test subscriber in positioning time section are according to the Second Threshold curve recommending efficiency to determine to recommend user;
Threshold value determination submodule, determines the threshold value of recommending user according to described first threshold curve and Second Threshold curve;
Recommend user's locator module, recommend user for the threshold value location according to the described recommendation user determined.
Wherein, the specific characteristic data set of described training user comprises positive sample data and negative sample data, described positive sample data is the specific characteristic data set of the user that fixed time point paying is bought, and described negative sample data are put the specific characteristic data set of the user bought that do not pay the fixed time.
Wherein, described fixed time point is that positive sample and negative sample gather the same day.
Wherein, described negative sample data bulk is three times of positive sample data quantity.
Wherein, the described user bought that pays is the user buying member.
Wherein, described training algorithm is L2 canonical logistic regression training algorithm.
Wherein, specific characteristic is one or more in following characteristics:
Movie channel, series channel, automobile channel, to make laughs channel, animation channel, XATV-6, fashion channel, parent-offspring's channel, GameChannel, original channel, advertisement channel, music channel, Info channel, sports channel, life channel, tourism channel, science and technology channel, education channel, entertainment channel, documentary film channel, other channel, android equipment, iphone equipment, ipad equipment, ipod equipment, miscellaneous equipment, member, non-member, paid video, free video, complete viewing and trying.
Locate the method and device of recommending user according to a kind of of the application, it gathers customer transaction data and user video viewing data in video system; From the customer transaction data of described collection and the user video viewing extracting data training specific characteristic data set of user and the specific characteristic data set of test subscriber; The weighted value obtaining each specific characteristic is trained according to training algorithm each specific characteristic to the specific characteristic data centralization of training user; Weighted value according to the specific characteristic data set of test subscriber and each specific characteristic of training acquisition determines the probabilistic forecasting data that each test subscriber pays; User is recommended according to the probabilistic forecasting data location that each test subscriber pays, wherein specific characteristic data set is the set of the data of user's usage behavior frequency on specific characteristic, the weighted value of each specific characteristic accurately can be obtained by the specific characteristic data set of training algorithm to training user, according to the weighted value of each specific characteristic described and then determine test set user, from test set user, the user with tendency of paying can be picked out more accurately, can navigate to more accurately and recommend user.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, the accompanying drawing that the following describes is only some embodiments recorded in the present invention, for those of ordinary skill in the art, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 is according to an a kind of specific embodiment process flow diagram of locating the method for recommending user of the application.;
Fig. 2 is according to a kind of entirety composition schematic diagram of locating the device recommending user of the application;
Fig. 3 is a kind of specific embodiment composition schematic diagram according to acquisition module in Fig. 2;
Fig. 4 is a kind of specific embodiment composition schematic diagram according to determination module in Fig. 2;
Fig. 5 is a kind of specific embodiment composition schematic diagram according to locating module in Fig. 2.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, the every other embodiment that those of ordinary skill in the art obtain, all belongs to the scope of protection of the invention.
Refer to Fig. 1, it is an a kind of specific embodiment process flow diagram of locating the method for recommending user of the present invention.In the present embodiment, location recommends the method for user mainly to comprise the steps:
Step S101, gathers customer transaction data and user video viewing data in video system;
During specific implementation, collection video system customer transaction data and user video viewing data realize by various mode, as a preferred embodiment, such as, can adopt following manner:
The daily record obtaining member's tran list obtains video system customer transaction data; And acquisition user video viewing daily record obtains video system customer transaction data.
Need to illustrate, the daily record of described member's tran list and described user video viewing daily record can comprise personal computer and daily record corresponding to wireless side, and the analysis that the data of acquisition like this watch behavior for user is more accurate.
Step S102, from the customer transaction data of described collection and the user video viewing extracting data training specific characteristic data set of user and the specific characteristic data set of test subscriber;
For the user's (such as buying the user of member) bought that pays, in its usage behavior feature in video website, some feature is bought with paying generally stronger associating, namely specific characteristic described in the present embodiment is the feature associated with user charges buying behavior, the specific characteristic data set of user is the set of the data of the user's usage behavior frequency on specific characteristic, for example, paying customer can relate to viewing movie channel, namely movie channel is a specific characteristic, namely user's usage behavior frequency on specific characteristic is the frequency that in timing statistics section, user watches movie channel, and namely the specific characteristic data set extracted needs the frequency data of the usage behavior of user on specific characteristic in specific characteristic and timing statistics section, such as, timing statistics section is 60 days, the characteristic data set of the movie channel extracted needs to preserve preserves according to the classification of movie channel the number of times that user watches movie channel in 60 days, as a specific embodiment, such as, it is one or more that the specific characteristic associated with user charges buying behavior can comprise in following characteristics:
Movie channel, series channel, automobile channel, to make laughs channel, animation channel, XATV-6, fashion channel, parent-offspring's channel, GameChannel, original channel, advertisement channel, music channel, Info channel, sports channel, life channel, tourism channel, science and technology channel, education channel, entertainment channel, documentary film channel, other channel, android equipment, iphone equipment, ipad equipment, ipod equipment, miscellaneous equipment, member, non-member, paid video, free video, complete viewing and trying, increasing along with the increase of the increase of video-see channel or the mobile device of use and Association Identity and business in reality, more specific characteristic can also be increased, here only illustrate, instead of be specifically confined to above-mentioned feature.
In addition, the specific characteristic data set of the user extracted in this step is divided into the training specific characteristic data set of user and the specific characteristic data set of test subscriber, the specific characteristic data set of user is wherein trained to comprise positive sample data and negative sample data, described negative sample data are put the specific characteristic data set of the user bought that do not pay the fixed time, during specific implementation, ratio between positive sample and negative sample can be adjusted according to actual conditions, such as, described negative sample data bulk is three times of positive sample data quantity, in addition, as specific embodiment, such as described positive sample data is the specific characteristic data set of the user that fixed time point paying is bought, illustrate, predict that next day user buys the probability of member, then the fixed time puts is then the same day, can guarantee like this to train and probabilistic forecasting data have identical ageing, wherein the quantity of positive sample bought the number of member the same day, quantity is 1.5 ten thousand people, simultaneously owing to not buying the number of member every day far away more than the number buying member, therefore the number of negative sample more than positive sample, should can have chosen 50,000 people as negative sample, and namely negative sample quantity is about 3 times of positive sample number.
Step S103, trains according to training algorithm each specific characteristic to the specific characteristic data centralization of training user the weighted value obtaining each specific characteristic;
Various existing training algorithm can be adopted in the application to train, here be not construed as limiting, only illustrate, such as, training algorithm can adopt L2 canonical logistic regression training algorithm, L2 canonical logistic regression training algorithm is also referred to as L2 regularization logistic algorithm, it is widely used in statistics, in the present embodiment, above-mentioned steps S102 is extracted and obtain training the specific characteristic data set of user to carry out training the weighted value that can obtain each specific characteristic, such as, using the positive sample data of above-mentioned 1.5 ten thousand people and the negative sample data of 50,000 people as input data, through training algorithm, the weighted value of each specific characteristic such as can be obtained after the training of L2 canonical logistic regression training algorithm, such as, if weight total score is 100, in above-mentioned specific characteristic, the weight of movie channel is 8, the weight of series channel is 10, the weight of other specific characteristics can obtain equally, here only illustrate, repeat no more.
Step S104, the weighted value according to the specific characteristic data set of test subscriber and each specific characteristic of training acquisition determines the probabilistic forecasting data that each test subscriber pays;
During specific implementation, the weighted value of each specific characteristic that above-mentioned steps S103 obtains varies in size, the weighted value of some specific characteristic may be less, prediction can be not used in, namely can use the weighted value of whole specific characteristic to predict in this step, also the weight of the larger specific characteristic of fractional weight can be adopted to predict, such as, a kind of mode is:
The weighted value obtaining each specific characteristic is sorted;
Determine to specify the crucial specific characteristic in sequencing horizon according to ranking results, such as using the specific characteristic of sequence in the sequencing horizon of front ten as crucial specific characteristic;
The probabilistic forecasting data that each test subscriber pays are determined according to the weight of crucial specific characteristic and the specific characteristic data set of test subscriber.
Step S105, recommends user according to the probabilistic forecasting data location that each test subscriber pays.
During specific implementation, user can be recommended according to the size location of probabilistic forecasting data according to the probabilistic forecasting data that each test subscriber pays, but need to reach to realize high conversion ratio under the condition of lower coverage rate, for this reason, suppose accuracy=predict correct number/actual purchase number, namely the ratio of correct paying number and the same day actual paying number is predicted, efficiency=predict correct number/prediction purchase number, namely predict correct paying number and dope the number buying member, the probabilistic forecasting data location of paying according to each test subscriber in the present embodiment recommends user to locate in the following manner, that is:
Determine according to recommendation accuracy the first threshold curve recommending user according to the probabilistic forecasting data of paying according to each test subscriber in section in positioning time;
The probabilistic forecasting data of paying according to each test subscriber in positioning time section are according to the Second Threshold curve recommending efficiency to determine to recommend user;
The threshold value of recommending user is determined according to described first threshold curve and Second Threshold curve;
User is recommended according to the threshold value location of the described recommendation user determined.
According to the above embodiments, the input less crowd being carried out to video can be realized, reach the feature of raising the efficiency, and also check prediction effect by reality input test further, by adding up the data of every day, can verify further or adjust, repeat no more here.
Refer to Fig. 2, it is according to a kind of specific embodiment composition schematic diagram of locating the device recommending user of the present invention, mainly comprises:
Acquisition module 1, in the present embodiment, acquisition module 1 is mainly used in gathering customer transaction data and user video viewing data in video system, during specific implementation, collection video system customer transaction data and user video viewing data realize by various mode, as a specific embodiment, with reference to figure 3, acquisition module can comprise:
First obtains submodule 11, obtains video system customer transaction data for obtaining the daily record of member's tran list;
Second obtains submodule 12, obtains video system customer transaction data for obtaining user video viewing daily record.
As aforementioned, the daily record of described member's tran list and described user video viewing daily record comprise personal computer and daily record corresponding to wireless side, and the analysis that the data of acquisition like this watch behavior for user is more accurate.
Extraction module 2, in the present embodiment, extraction module 2 is mainly used in from the customer transaction data of described collection and the user video viewing extracting data training specific characteristic data set of user and the specific characteristic data set of test subscriber, during specific implementation, in the present embodiment, specific characteristic can be one or more in following characteristics:
Movie channel, series channel, automobile channel, to make laughs channel, animation channel, XATV-6, fashion channel, parent-offspring's channel, GameChannel, original channel, advertisement channel, music channel, Info channel, sports channel, life channel, tourism channel, science and technology channel, education channel, entertainment channel, documentary film channel, other channel, android equipment, iphone equipment, ipad equipment, ipod equipment, miscellaneous equipment, member, non-member, paid video, free video, complete viewing and trying.
In addition, needs illustrate, extract main according to user ID, according to specific characteristic classification, and then gather the user behavior frequency data of each specific characteristic corresponding, such as, determine that user ID is user100, the data of the various viewing behaviors of collecting user100 can be watched daily record from user video, such as, watch the frequency of movie channel, the frequency etc. of viewing series channel, and then train user and test subscriber to form corresponding specific data collection respectively.
In addition, in order to realize training, the specific characteristic data set of described training user can comprise positive sample data and negative sample data, described positive sample data can be the specific characteristic data set of the user that fixed time point paying is bought, described negative sample data are put the specific characteristic data set of the user bought that do not pay the fixed time, general negative sample data bulk is greater than positive sample data quantity, such as, described negative sample data bulk is three times or other ratios of positive sample data quantity, here concrete restriction is not done, in addition, concrete restriction is not done to putting at fixed time in the present embodiment yet, such as, fixed time point can be the same day, the described user paying purchase such as can buy the user etc. of member, also can be other situations in reality, here only illustrate, and be not specifically limited.
Training module 3, in the present embodiment, training module 3 is mainly used in the weighted value training to obtain each specific characteristic according to training algorithm each specific characteristic to the specific characteristic data centralization of training user, as aforementioned, various existing training algorithm can be adopted in the application to train, here be not construed as limiting, only illustrate, such as, training algorithm can adopt L2 canonical logistic regression training algorithm.
Determination module 4, in the present embodiment, determination module 4 is mainly used in determining according to the weighted value of the specific characteristic data set of test subscriber and each specific characteristic of training acquisition the probabilistic forecasting data that each test subscriber pays; During specific implementation, as a specific embodiment, such as, with reference to figure 4, determination module can comprise:
Sorting sub-module 41, for sorting to the weighted value obtaining each specific characteristic;
Crucial specific characteristic determination submodule 42, specifies the crucial specific characteristic in sequencing horizon for determining according to ranking results;
Probabilistic forecasting data determination submodule 43, for determining according to the weight of crucial specific characteristic and the specific characteristic data set of test subscriber the probabilistic forecasting data that each test subscriber pays.
Locating module 5, user is recommended in the probabilistic forecasting data location that in the present embodiment, locating module 5 is mainly used in paying according to each test subscriber, and during specific implementation, as a specific embodiment, such as, with reference to figure 5, locating module can comprise:
First threshold curve determination submodule 51, for determining according to recommendation accuracy the first threshold curve recommending user according to the probabilistic forecasting data of paying according to each test subscriber in positioning time in section;
Second Threshold curve determination submodule 52, the probabilistic forecasting data of paying according to each test subscriber in positioning time section are according to the Second Threshold curve recommending efficiency to determine to recommend user;
Threshold value determination submodule 53, determines the threshold value of recommending user according to described first threshold curve and Second Threshold curve;
Recommend user's locator module 54, recommend user for the threshold value location according to the described recommendation user determined.
In above-mentioned provided instructions, describe a large amount of detail.But can understand, embodiments of the invention can be put into practice when not having these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand in each inventive aspect one or more, in the description above to exemplary embodiment of the present invention, each feature of the present invention is grouped together in single embodiment, figure or the description to it sometimes.But, the method for the disclosure should be construed to the following intention of reflection: namely the present invention for required protection requires feature more more than the feature clearly recorded in each claim.Or rather, as claims below reflect, all features of disclosed single embodiment before inventive aspect is to be less than.Therefore, the claims following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
The present invention will be described instead of limit the invention to it should be noted above-described embodiment, and those skilled in the art can design alternative embodiment when not departing from the scope of claims.

Claims (22)

1. locate a method of recommending user, it is characterized in that, comprising:
Gather customer transaction data and user video viewing data in video system;
From the customer transaction data of described collection and the user video viewing extracting data training specific characteristic data set of user and the specific characteristic data set of test subscriber;
The weighted value obtaining each specific characteristic is trained according to training algorithm each specific characteristic to the specific characteristic data centralization of training user;
Weighted value according to the specific characteristic data set of test subscriber and each specific characteristic of training acquisition determines the probabilistic forecasting data that each test subscriber pays;
User is recommended according to the probabilistic forecasting data location that each test subscriber pays.
2. method according to claim 1, is characterized in that, collection video system customer transaction data and user video viewing data comprise:
The daily record obtaining member's tran list obtains video system customer transaction data;
Obtain user video viewing daily record and obtain video system customer transaction data.
3. method according to claim 2, is characterized in that, the daily record of described member's tran list and described user video viewing daily record comprise personal computer and daily record corresponding to wireless side.
4. method according to claim 1, is characterized in that, the weighted value according to the specific characteristic data set of test subscriber and each specific characteristic of training acquisition determines that the probabilistic forecasting data that each test subscriber pays comprise:
The weighted value obtaining each specific characteristic is sorted;
Determine to specify the crucial specific characteristic in sequencing horizon according to ranking results;
The probabilistic forecasting data that each test subscriber pays are determined according to the weight of crucial specific characteristic and the specific characteristic data set of test subscriber.
5. method according to claim 1, is characterized in that, recommends user to comprise according to the probabilistic forecasting data location that each test subscriber pays:
Determine according to recommendation accuracy the first threshold curve recommending user according to the probabilistic forecasting data of paying according to each test subscriber in section in positioning time;
The probabilistic forecasting data of paying according to each test subscriber in positioning time section are according to the Second Threshold curve recommending efficiency to determine to recommend user;
The threshold value of recommending user is determined according to described first threshold curve and Second Threshold curve;
User is recommended according to the threshold value location of the described recommendation user determined.
6. method according to claim 1, it is characterized in that, the specific characteristic data set of described training user comprises positive sample data and negative sample data, described positive sample data is the specific characteristic data set of the user that fixed time point paying is bought, and described negative sample data are put the specific characteristic data set of the user bought that do not pay the fixed time.
7. method according to claim 6, is characterized in that, described fixed time point is that positive sample and negative sample gather the same day.
8. method according to claim 6, is characterized in that, described negative sample data bulk is three times of positive sample data quantity.
9. method according to claim 6, is characterized in that, the described user bought that pays is the user buying member.
10. the method according to any one of claim 1-10, is characterized in that, described training algorithm is L2 canonical logistic regression training algorithm.
11. methods according to any one of claim 1-10, is characterized in that, specific characteristic is one or more in following characteristics:
Movie channel, series channel, automobile channel, to make laughs channel, animation channel, XATV-6, fashion channel, parent-offspring's channel, GameChannel, original channel, advertisement channel, music channel, Info channel, sports channel, life channel, tourism channel, science and technology channel, education channel, entertainment channel, documentary film channel, other channel, android equipment, iphone equipment, ipad equipment, ipod equipment, miscellaneous equipment, member, non-member, paid video, free video, complete viewing and trying.
Locate the device recommending user, it is characterized in that, comprising for 12. 1 kinds:
Acquisition module, for gathering customer transaction data and user video viewing data in video system;
Extraction module, for from the customer transaction data of described collection and the user video viewing extracting data training specific characteristic data set of user and the specific characteristic data set of test subscriber;
Training module, for training according to training algorithm each specific characteristic to the specific characteristic data centralization of training user the weighted value obtaining each specific characteristic;
Determination module, the weighted value for each specific characteristic obtained according to specific characteristic data set and the training of test subscriber determines the probabilistic forecasting data that each test subscriber pays;
Locating module, recommends user for the probabilistic forecasting data location of paying according to each test subscriber.
13. devices according to claim 12, it is characterized in that, acquisition module comprises:
First obtains submodule, obtains video system customer transaction data for obtaining the daily record of member's tran list;
Second obtains submodule, obtains video system customer transaction data for obtaining user video viewing daily record.
14. devices according to claim 13, is characterized in that, the daily record of described member's tran list and described user video viewing daily record comprise personal computer and daily record corresponding to wireless side.
15. devices according to claim 12, it is characterized in that, determination module comprises:
Sorting sub-module, for sorting to the weighted value obtaining each specific characteristic;
Crucial specific characteristic determination submodule, specifies the crucial specific characteristic in sequencing horizon for determining according to ranking results;
Probabilistic forecasting data determination submodule, for determining according to the weight of crucial specific characteristic and the specific characteristic data set of test subscriber the probabilistic forecasting data that each test subscriber pays.
16. devices according to claim 12, it is characterized in that, locating module comprises:
First threshold curve determination submodule, for determining according to recommendation accuracy the first threshold curve recommending user according to the probabilistic forecasting data of paying according to each test subscriber in positioning time in section;
Second Threshold curve determination submodule, the probabilistic forecasting data of paying according to each test subscriber in positioning time section are according to the Second Threshold curve recommending efficiency to determine to recommend user;
Threshold value determination submodule, determines the threshold value of recommending user according to described first threshold curve and Second Threshold curve;
Recommend user's locator module, recommend user for the threshold value location according to the described recommendation user determined.
17. devices according to claim 12, it is characterized in that, the specific characteristic data set of described training user comprises positive sample data and negative sample data, described positive sample data is the specific characteristic data set of the user that fixed time point paying is bought, and described negative sample data are put the specific characteristic data set of the user bought that do not pay the fixed time.
18. devices according to claim 17, is characterized in that, described fixed time point is that positive sample and negative sample gather the same day.
19. devices according to claim 17, is characterized in that, described negative sample data bulk is three times of positive sample data quantity.
20. devices according to claim 17, is characterized in that, the described user bought that pays is the user buying member.
21. devices according to any one of claim 12-20, it is characterized in that, described training algorithm is L2 canonical logistic regression training algorithm.
22. devices according to any one of claim 12-20, is characterized in that, specific characteristic is one or more in following characteristics:
Movie channel, series channel, automobile channel, to make laughs channel, animation channel, XATV-6, fashion channel, parent-offspring's channel, GameChannel, original channel, advertisement channel, music channel, Info channel, sports channel, life channel, tourism channel, science and technology channel, education channel, entertainment channel, documentary film channel, other channel, android equipment, iphone equipment, ipad equipment, ipod equipment, miscellaneous equipment, member, non-member, paid video, free video, complete viewing and trying.
CN201510604634.2A 2015-09-21 2015-09-21 Method and device for positioning recommended user Active CN105160008B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510604634.2A CN105160008B (en) 2015-09-21 2015-09-21 Method and device for positioning recommended user

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510604634.2A CN105160008B (en) 2015-09-21 2015-09-21 Method and device for positioning recommended user

Publications (2)

Publication Number Publication Date
CN105160008A true CN105160008A (en) 2015-12-16
CN105160008B CN105160008B (en) 2020-03-31

Family

ID=54800864

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510604634.2A Active CN105160008B (en) 2015-09-21 2015-09-21 Method and device for positioning recommended user

Country Status (1)

Country Link
CN (1) CN105160008B (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105719164A (en) * 2016-01-21 2016-06-29 海信集团有限公司 Paid multimedia resource recommending method and paid multimedia resource recommending device
CN106168980A (en) * 2016-07-26 2016-11-30 合网络技术(北京)有限公司 Multimedia resource recommends sort method and device
CN106204106A (en) * 2016-06-28 2016-12-07 武汉斗鱼网络科技有限公司 A kind of specific user's recognition methods and system
CN107016569A (en) * 2017-03-21 2017-08-04 聚好看科技股份有限公司 The targeted customer's account acquisition methods and device of a kind of networking products
CN107679920A (en) * 2017-10-20 2018-02-09 北京奇艺世纪科技有限公司 The put-on method and device of a kind of advertisement
CN108416649A (en) * 2018-02-05 2018-08-17 北京三快在线科技有限公司 Search result ordering method, device, electronic equipment and storage medium
CN108509459A (en) * 2017-02-28 2018-09-07 腾讯科技(北京)有限公司 Data processing method and device
CN108683949A (en) * 2018-05-18 2018-10-19 北京奇艺世纪科技有限公司 A kind of extracting method and device of live streaming platform potential user
CN108965938A (en) * 2018-08-03 2018-12-07 山东大学 Potential paying customer prediction technique and system in smart television
CN109121006A (en) * 2018-08-02 2019-01-01 中国联合网络通信集团有限公司 A kind of marketing method and platform based on live streaming viewing user
CN110033294A (en) * 2018-01-12 2019-07-19 腾讯科技(深圳)有限公司 A kind of determination method of business score value, business score value determining device and medium
CN110555747A (en) * 2018-06-01 2019-12-10 北京京东尚科信息技术有限公司 method and device for determining target user
CN111309939A (en) * 2020-02-03 2020-06-19 天津智融创新科技发展有限公司 Video recommendation sorting method and device
WO2020135193A1 (en) * 2018-12-27 2020-07-02 深圳Tcl新技术有限公司 Deep neural network-based video recommendation method and system, and storage medium
CN112131479A (en) * 2020-09-30 2020-12-25 深圳前海微众银行股份有限公司 Data processing method, device, equipment and storage medium
CN112866760A (en) * 2021-01-18 2021-05-28 青岛聚看云科技有限公司 Content display method, display equipment and server
CN113139856A (en) * 2021-05-14 2021-07-20 青岛聚看云科技有限公司 Movie and television member package recommendation method and device
CN114513700A (en) * 2020-11-16 2022-05-17 中国移动通信集团有限公司 Method, device and equipment for recommending video and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080201201A1 (en) * 2006-09-25 2008-08-21 Sms.Ac Methods and systems for finding, tagging, rating and suggesting content provided by networked application pods
US20080288375A1 (en) * 2007-04-21 2008-11-20 Carpe Media Media Distribution Reporting System, Apparatus, Method and Software
CN104090888A (en) * 2013-12-10 2014-10-08 深圳市腾讯计算机系统有限公司 Method and device for analyzing user behavior data
CN104598521A (en) * 2014-12-12 2015-05-06 北京京东尚科信息技术有限公司 Method and device for processing user behavior data
CN104778173A (en) * 2014-01-10 2015-07-15 腾讯科技(深圳)有限公司 Determination method, device and equipment of objective user

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080201201A1 (en) * 2006-09-25 2008-08-21 Sms.Ac Methods and systems for finding, tagging, rating and suggesting content provided by networked application pods
US20080288375A1 (en) * 2007-04-21 2008-11-20 Carpe Media Media Distribution Reporting System, Apparatus, Method and Software
CN104090888A (en) * 2013-12-10 2014-10-08 深圳市腾讯计算机系统有限公司 Method and device for analyzing user behavior data
CN104778173A (en) * 2014-01-10 2015-07-15 腾讯科技(深圳)有限公司 Determination method, device and equipment of objective user
CN104598521A (en) * 2014-12-12 2015-05-06 北京京东尚科信息技术有限公司 Method and device for processing user behavior data

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105719164A (en) * 2016-01-21 2016-06-29 海信集团有限公司 Paid multimedia resource recommending method and paid multimedia resource recommending device
CN106204106A (en) * 2016-06-28 2016-12-07 武汉斗鱼网络科技有限公司 A kind of specific user's recognition methods and system
CN106168980A (en) * 2016-07-26 2016-11-30 合网络技术(北京)有限公司 Multimedia resource recommends sort method and device
CN106168980B (en) * 2016-07-26 2020-07-28 阿里巴巴(中国)有限公司 Multimedia resource recommendation sequencing method and device
CN108509459B (en) * 2017-02-28 2022-05-20 腾讯科技(北京)有限公司 Data processing method and device
CN108509459A (en) * 2017-02-28 2018-09-07 腾讯科技(北京)有限公司 Data processing method and device
CN107016569A (en) * 2017-03-21 2017-08-04 聚好看科技股份有限公司 The targeted customer's account acquisition methods and device of a kind of networking products
CN107679920A (en) * 2017-10-20 2018-02-09 北京奇艺世纪科技有限公司 The put-on method and device of a kind of advertisement
CN110033294A (en) * 2018-01-12 2019-07-19 腾讯科技(深圳)有限公司 A kind of determination method of business score value, business score value determining device and medium
CN108416649A (en) * 2018-02-05 2018-08-17 北京三快在线科技有限公司 Search result ordering method, device, electronic equipment and storage medium
CN108683949A (en) * 2018-05-18 2018-10-19 北京奇艺世纪科技有限公司 A kind of extracting method and device of live streaming platform potential user
CN110555747A (en) * 2018-06-01 2019-12-10 北京京东尚科信息技术有限公司 method and device for determining target user
CN109121006A (en) * 2018-08-02 2019-01-01 中国联合网络通信集团有限公司 A kind of marketing method and platform based on live streaming viewing user
CN109121006B (en) * 2018-08-02 2021-02-02 中国联合网络通信集团有限公司 Marketing method and platform based on live broadcast watching user
CN108965938B (en) * 2018-08-03 2020-03-20 山东大学 Method and system for predicting potential pay users in smart television
CN108965938A (en) * 2018-08-03 2018-12-07 山东大学 Potential paying customer prediction technique and system in smart television
WO2020135193A1 (en) * 2018-12-27 2020-07-02 深圳Tcl新技术有限公司 Deep neural network-based video recommendation method and system, and storage medium
CN111309939A (en) * 2020-02-03 2020-06-19 天津智融创新科技发展有限公司 Video recommendation sorting method and device
CN111309939B (en) * 2020-02-03 2023-02-28 天津智融创新科技发展有限公司 Video recommendation sorting method and device
CN112131479A (en) * 2020-09-30 2020-12-25 深圳前海微众银行股份有限公司 Data processing method, device, equipment and storage medium
CN114513700A (en) * 2020-11-16 2022-05-17 中国移动通信集团有限公司 Method, device and equipment for recommending video and storage medium
CN112866760A (en) * 2021-01-18 2021-05-28 青岛聚看云科技有限公司 Content display method, display equipment and server
CN113139856A (en) * 2021-05-14 2021-07-20 青岛聚看云科技有限公司 Movie and television member package recommendation method and device

Also Published As

Publication number Publication date
CN105160008B (en) 2020-03-31

Similar Documents

Publication Publication Date Title
CN105160008A (en) Method and device for locating suggested users
CN103489117B (en) Method and system for information releasing
CN107798412B (en) Route recommendation method and device
CN103729785B (en) Video user gender classification method and device for method
US9779143B2 (en) Information pushing method and apparatus
CN105930425A (en) Personalized video recommendation method and apparatus
CN103763585B (en) A kind of user's characteristic information acquisition methods, device and terminal device
CN110309434B (en) Track data processing method and device and related equipment
CN106294830A (en) The recommendation method and device of multimedia resource
US20150205580A1 (en) Method and System for Sorting Online Videos of a Search
CN103559206A (en) Information recommending method and system
CN103870454A (en) Method and method for recommending data
CN108965938B (en) Method and system for predicting potential pay users in smart television
CN109509039A (en) Method for building up and system, the Method of Commodity Recommendation and system of price expectation model
US20140030681A1 (en) Activity-oriented Studying Method in an Online-to-offline Manner
CN109543132A (en) Content recommendation method, device, electronic equipment and storage medium
CN104967690B (en) A kind of information-pushing method and device
CN103383702A (en) Method and system for recommending personalized news based on ranking of votes of users
CN103970861A (en) Information presenting method and device
US9064018B2 (en) Information providing system that provides information based on strenghth of desire and current situation
CN105260905A (en) Method and device for evaluating and predicting influence of media program
US20160150289A1 (en) Method of Providing Advertising Services and Electronic Device Adapted to the Same
CN102957949A (en) Device and method for recommending video to user
CN103152618A (en) Content recommendation method and device for digital television value added services
CN105893530A (en) Multimedia recommendation method and device based on user relationship

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address

Address after: 100080 Beijing Haidian District city Haidian street A Sinosteel International Plaza No. 8 block 5 layer A, C

Patentee after: Youku network technology (Beijing) Co.,Ltd.

Address before: 100080, A, 5, block 8, Sinosteel International Plaza, Haidian Avenue, Haidian District, Beijing

Patentee before: 1VERGE INTERNET TECHNOLOGY (BEIJING) Co.,Ltd.

CP03 Change of name, title or address
TR01 Transfer of patent right

Effective date of registration: 20200619

Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Patentee after: Alibaba (China) Co.,Ltd.

Address before: 100080 Beijing Haidian District city Haidian street A Sinosteel International Plaza No. 8 block 5 layer A, C

Patentee before: Youku network technology (Beijing) Co.,Ltd.

TR01 Transfer of patent right