CN104331433A - Tobacco information recommending method based on mobile terminal user logs - Google Patents

Tobacco information recommending method based on mobile terminal user logs Download PDF

Info

Publication number
CN104331433A
CN104331433A CN201410568065.6A CN201410568065A CN104331433A CN 104331433 A CN104331433 A CN 104331433A CN 201410568065 A CN201410568065 A CN 201410568065A CN 104331433 A CN104331433 A CN 104331433A
Authority
CN
China
Prior art keywords
user
tobacco
users
similarity
record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410568065.6A
Other languages
Chinese (zh)
Other versions
CN104331433B (en
Inventor
陆海良
姜学峰
郁钢
高扬华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Tobacco Zhejiang Industrial Co Ltd
Original Assignee
China Tobacco Zhejiang Industrial Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Tobacco Zhejiang Industrial Co Ltd filed Critical China Tobacco Zhejiang Industrial Co Ltd
Priority to CN201410568065.6A priority Critical patent/CN104331433B/en
Publication of CN104331433A publication Critical patent/CN104331433A/en
Application granted granted Critical
Publication of CN104331433B publication Critical patent/CN104331433B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]

Abstract

The invention relates to a tobacco information recommending method based on mobile terminal user logs. The method comprises three stages including user category determination, inter-user similarity model building and tobacco information recommending. The method concretely comprises the following steps of 1, user category determination, 2, inter-user similarity model building and 3, tobacco information recommending. The tobacco information recommending method has the advantages that 1, registration information of users in the tobacco information logs is analyzed, and the classification of different users is completed; 2, by aiming at the users, a user-user similarity model is built for meeting the specific tobacco information demands through combining the historically browsed tobacco information of the users in the recommending process, and the recommending method based on the historically browsed behaviors of the users is realized; 3, the method belongs to a completely automatic recommending method, and the users do not need any manual intervention in the information browsing process.

Description

A kind of Tobacco Reference recommend method based on mobile phone users daily record
Technical field
The present invention relates to technical field of information recommendation, particularly relate to the Tobacco Reference recommend method based on mobile phone users daily record.
Background technology
Internet has become the important channel of people's obtaining information.Because the information content on internet is huge, find its interested information fast for ease of user, information recommendation technology is achieving fast development in recent years.Information recommendation technology generally can be divided into the information recommendation based on temperature and the large class of the information recommendation based on user behavior two.Based on the number of times comformed information temperature that the information recommendation of temperature is accessed according to information, by information recommendation high for temperature to user, because the calculating of heatrate and particular user have nothing to do, so these class methods cannot realize personalized recommendation.Information recommendation based on user behavior can realize personalized recommendation, and it can be further divided into again content-based information recommendation and the large class of information recommendation two based on collaborative filtering.Content-based information filtering is according to the similarity determination recommendation information with user browsing information, but these class methods are difficult to the new interest finding user.Information recommendation based on collaborative filtering is pinpointed the problems by introducing the similar users calculating new interest solved in recommendation process, but all there is new customer problem in itself and content-based information filtering, cannot provide effective information recommendation when user just uses a certain website for it.
Along with the development of mobile Internet, increasing user brings into use obtaining information in mobile terminal.The log content of mobile terminal is enriched (behavior etc. that such as user uses mobile terminal), and there is clear and definite corresponding relation between log content and particular user, provides possibility for solving based on the new customer problem in the information recommendation of collaborative filtering.It is one of Core Feature of tobacco website that Tobacco Reference is recommended, and the user accessing this type of website mainly can be divided into smoker, smoker family members and tobacco practitioner three class, need consider the classification of user when carrying out information recommendation.
Summary of the invention
The problem to be solved in the present invention is how by mobile terminal Web log mining (the mobile terminal log recording of user is on Mobile solution middleware server), for user recommends its interested Tobacco Reference.First the present invention determines that mobile phone users is smoker, smoker family members or tobacco practitioner, the user of all mobile terminals is divided into three colonies (smoker colony, smoker family members colony and tobacco practitioner colony); Then analyze the historical viewings information of user in three types of populations, set up the Similarity Model between fellow users; Finally in conjunction with the Tobacco Reference browsing histories of collaborative filtering thought and similarity user, generating recommendations is to the Tobacco Reference of targeted customer.
In order to realize above-mentioned object, present invention employs following technical scheme:
Tobacco Reference recommend method the method based on mobile phone users daily record is divided into be determined class of subscriber, sets up Similarity Model between user, recommend a Tobacco Reference three phases, specifically comprises the following steps:
One, class of subscriber is determined:
Step 1.1, extracts the user type information that all users register when browsing tobacco web first;
Step 1.2, determines the classification of all users according to step 1;
Step 1.3, according to step 2, is divided into smoker colony, smoker family members colony and tobacco practitioner colony three part by all users;
Two, Similarity Model between user is set up:
Step 2.1, extracts the message reference record of all users;
Step 2.2, adds up the number of times of all users to message reference;
Step 2.3, according to step 2.1-2.2, sets up user-user Similarity Model;
Step 2.4, according to step 2.3, sets up user's similarity matrix.
Three, Tobacco Reference is recommended:
Step 3.1, extracts user ID and classification, finds the customer group at user place;
Step 3.2, according to the application that step 3.1 and user are selected, finds the user of the same app of application with its use to gather;
Step 3.3, according to step 3.2 and set up user's similarity matrix, this user set in find top n similarity user;
Step 3.4, carries out collaborative filtering calculating to the historical information of similarity user;
Step 3.5, the score calculated according to step 3.4 sorts, and selects top m information recommendation to user.
As preferably, the historical information Visitor Logs of described step 2.1 user u is expressed as: u=<ID_user, I>, and wherein ID_user is the unique identifying number of user, I is the message reference record that user ID _ user stays, I=(I 1, I 2..., I n),
As preferably, the vector that described step 2.2 couple user sets up is: u=<ID_user, (I 1, t 1; I 2, t 2; I n, t n) >.
As preferably, in the model described in described step 2.3, user u pwith user u qbetween the computing formula of similarity degree as (1-3):
Sim ( u p , u q ) = &Sigma; i = 1 n I pi &times; I qi | | u p | | &CenterDot; | | u q | | - - - ( 1 )
| | u p | | = &Sigma; i = 1 n I pi 2 - - - ( 2 )
| | u q | | = &Sigma; i = 1 n I qi 2 - - - ( 3 )
In this formula, represent I piuser p browses the number of times of record i, and n represents that user browses the number of record.
As preferably, described step 2.4 user-user similarity matrix is M uu, M uu=[S ij], S in this matrix ijrepresent the similarity degree value of i-th user and a jth user.
As preferably, described step 3.3 compositional similarity user gathers U '; Step 3.4 is calculated by formula (4-7) for every bar information;
Score ( I i ) = R p &OverBar; + k &Sigma; u q &Element; U &prime; Sim ( u p , u q ) &CenterDot; ( r q i - R q &OverBar; ) - - - ( 4 )
k = 1 | U &prime; | &Sigma; u q &Element; U &prime; Sim ( u p , u q ) - - - ( 5 )
R p &OverBar; = 1 | S ( R p ) | &Sigma; j &Element; S ( R p ) r p j - - - ( 6 )
R q &OverBar; = 1 | S ( R q ) | &Sigma; j &Element; S ( R q ) r q j - - - ( 7 )
In formula, R puser u phistorical information record R p=<r p0, r p1..., r pn>, r piuser u pto the access times of i bar record; S (R p) be user u pthe subset of historical information record, user u pthe mean value of historical information record.
Tobacco user browse history analysis is one of the important method that understanding tobacco user pays close attention to angle, user interest, travel log have recorded the historical information that user browses tobacco in detail, the present invention is directed to the problem that suitable Tobacco Reference can not be recommended suitable user by traditional information recommendation engine, propose the Tobacco Reference recommend method based on mobile phone users daily record, the classification browsing user is determined according to user's registration information, excavate the historical information browsing tobacco of similar users, in conjunction with collaborative filtering thought, final generating recommendations gives the Tobacco Reference being applicable to user.Advantage of the present invention comprises:
1) log-on message of user in Tobacco Reference daily record is analyzed, complete the classification of different user;
2) for user for meeting specific Tobacco Reference demand, in recommendation process, combine user browsed Tobacco Reference in the past, establish the Similarity Model of user-user, achieve the recommend method based on the behavior of user's historical viewings tobacco;
3) a full automatic recommend method, user in browsing information process without the need to any manual intervention.
Accompanying drawing explanation
Fig. 1: a kind of Tobacco Reference recommend method process flow diagram based on mobile phone users daily record.
Fig. 2: determine class of subscriber process flow diagram.
Fig. 3: set up Similarity Model process flow diagram between user.
Fig. 4: recommend Tobacco Reference phase flow figure.
Embodiment
The present invention proposes a kind of Tobacco Reference recommend method based on mobile phone users daily record, process flow diagram as shown in Figure 1.The method is divided into be determined class of subscriber, sets up Similarity Model between user and recommend Tobacco Reference three phases.
The flow process determining the class of subscriber stage as shown in Figure 2, mainly comprises the following steps:
1) user's registration information is extracted.User is when browsing tobacco web first, and system requirements registered user selects type belonging to it (i.e. smoker, smoker family members and tobacco practitioner), extracts the classification that these information determine each user;
2) according to the classification information of the user's registration determined in step 1, all users are divided into smoker colony, smoker family members colony and tobacco practitioner colony;
3) Output rusults.
The flow process setting up the Similarity Model stage between user as shown in Figure 3, mainly comprises the following steps:
1) the message reference record of each user in Mobile solution middleware is extracted.The historical information Visitor Logs of user u is expressed as: u=<ID_user, I>, and wherein ID_user is the unique identifying number of user, and I is the message reference record that user ID _ user stays, I=(I 1, I 2..., I n);
2) counting user is to the number of times of every bar message reference, sets up a vector to each user.According to step 1, to the vector that user sets up be: u=<ID_user, (I 1, t 1; I 2, t 2; I n, t n) >;
3) according to step 1-2, user-user Similarity Model is set up.In the model, user u pwith user u qbetween the computing formula of similarity degree as (1-3):
Sim ( u p , u q ) = &Sigma; i = 1 n I pi &times; I qi | | u p | | &CenterDot; | | u q | | - - - ( 1 )
| | u p | | = &Sigma; i = 1 n I pi 2 - - - ( 2 )
| | u q | | = &Sigma; i = 1 n I qi 2 - - - ( 3 )
In this formula, represent I piuser p browses the number of times of record i, and n represents that user browses the number of record;
4) according to step 1-3, user-user similarity matrix M is set up uu.M uu=[S ij], S in this matrix ijrepresent the similarity degree value of i-th user and a jth user.
User's Tobacco Reference recommends the flow process in stage as shown in Figure 4, mainly comprises the following steps:
1) according to user ID and classification, find the customer group at this user place, namely this user belongs to smoker colony, smoker family members colony or tobacco practitioner colony;
2) according to the application app that user selects, the user of the same application with its use is found to gather;
3) according to the user-user similarity matrix set up, find top n the user similar to this user, compositional similarity user gathers U ';
4) utilize the historical information of collaborative filtering to similarity user to calculate, every bar information is calculated by formula (4-7), in formula, R puser u phistorical information record R p=<r p0, r p1..., r pn>, r piuser u pto the access times of i bar record.S (R p) be user u pthe subset of historical information record, user u pthe mean value of historical information record.Such as, user u pand u qhistorical record be respectively R p=<2,3,2,0,3,1,2,0,1> and R q=<2,3,0,3,3,4,4,2,0>, then S (R p)=<2,3,2,3,1,2,1>, S (R q)=<2,3,3,3,4,4,2>, =2, =3;
Score ( I i ) = R p &OverBar; + k &Sigma; u q &Element; U &prime; Sim ( u p , u q ) &CenterDot; ( r q i - R q &OverBar; ) - - - ( 4 )
k = 1 | U &prime; | &Sigma; u q &Element; U &prime; Sim ( u p , u q ) - - - ( 5 )
R p &OverBar; = 1 | S ( R p ) | &Sigma; j &Element; S ( R p ) r p j - - - ( 6 )
R q &OverBar; = 1 | S ( R q ) | &Sigma; j &Element; S ( R q ) r q j - - - ( 7 )
5) score calculated according to step 4 sorts, and selects top m information recommendation to user.

Claims (6)

1., based on a Tobacco Reference recommend method for mobile phone users daily record, it is characterized in that the method is divided into and determine class of subscriber, set up Similarity Model between user, recommend Tobacco Reference three phases, specifically comprise the following steps:
One, class of subscriber is determined:
Step 1.1, extracts the user type information that all users register when browsing tobacco web first;
Step 1.2, determines the classification of all users according to step 1;
Step 1.3, according to step 2, is divided into smoker colony, smoker family members colony and tobacco practitioner colony three part by all users;
Two, Similarity Model between user is set up:
Step 2.1, extracts the message reference record of all users;
Step 2.2, adds up the number of times of all users to message reference;
Step 2.3, according to step 2.1-2.2, sets up user-user Similarity Model;
Step 2.4, according to step 2.3, sets up user's similarity matrix.
Three, Tobacco Reference is recommended:
Step 3.1, extracts user ID and classification, finds the customer group at user place;
Step 3.2, according to the application that step 3.1 and user are selected, finds the user of the same app of application with its use to gather;
Step 3.3, according to step 3.2 and set up user's similarity matrix, this user set in find top n similarity user;
Step 3.4, carries out collaborative filtering calculating to the historical information of similarity user;
Step 3.5, the score calculated according to step 3.4 sorts, and selects top m information recommendation to user.
2. a kind of Tobacco Reference recommend method based on mobile phone users daily record according to claim 1, it is characterized in that: the historical information Visitor Logs of step 2.1 user u is expressed as: u=<ID_user, I>, wherein ID_user is the unique identifying number of user, I is the message reference record that user ID _ user stays, I=(I 1, I 2..., I n).
3. a kind of Tobacco Reference recommend method based on mobile phone users daily record according to claim 2, is characterized in that: the vector that step 2.2 couple user sets up is: u=<ID_user, (I 1, t 1; I 2, t 2; I n, t n) >.
4. a kind of Tobacco Reference recommend method based on mobile phone users daily record according to claim 3, is characterized in that: in the model described in step 2.3, user u pwith user u qbetween the computing formula of similarity degree as (1-3):
In this formula, represent I piuser p browses the number of times of record i, and n represents that user browses the number of record.
5. a kind of Tobacco Reference recommend method based on mobile phone users daily record according to claim 4, is characterized in that: step 2.4 user-user similarity matrix is M uu, M uu=[S ij], S in this matrix ijrepresent the similarity degree value of i-th user and a jth user.
6. a kind of Tobacco Reference recommend method based on mobile phone users daily record according to Claims 1 to 5 any one claim, is characterized in that: step 3.3 compositional similarity user gathers U '; Step 3.4 is calculated by formula (4-7) for every bar information;
In formula, R puser u phistorical information record R p=<r p0, r p1..., r pn>, r piuser u pto the access times of i bar record; S (R p) be user u pthe subset of historical information record, user u pthe mean value of historical information record.
CN201410568065.6A 2014-10-22 2014-10-22 A kind of Tobacco Reference based on mobile terminal user's daily record recommends method Active CN104331433B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410568065.6A CN104331433B (en) 2014-10-22 2014-10-22 A kind of Tobacco Reference based on mobile terminal user's daily record recommends method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410568065.6A CN104331433B (en) 2014-10-22 2014-10-22 A kind of Tobacco Reference based on mobile terminal user's daily record recommends method

Publications (2)

Publication Number Publication Date
CN104331433A true CN104331433A (en) 2015-02-04
CN104331433B CN104331433B (en) 2017-12-29

Family

ID=52406160

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410568065.6A Active CN104331433B (en) 2014-10-22 2014-10-22 A kind of Tobacco Reference based on mobile terminal user's daily record recommends method

Country Status (1)

Country Link
CN (1) CN104331433B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105100269A (en) * 2015-08-27 2015-11-25 努比亚技术有限公司 Mobile terminal and content recommending method based on different users
CN105528716A (en) * 2015-12-03 2016-04-27 山东烟草研究院有限公司 Tobacco brand remote intelligent recommendation method facing retailer individual need

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003157375A (en) * 2001-11-21 2003-05-30 Tetsukazu Yamaguchi Recommendation sales method for diversified commodities and recommendation data base therefor
CN102479366A (en) * 2010-11-25 2012-05-30 阿里巴巴集团控股有限公司 Commodity recommending method and system
CN103678652A (en) * 2013-12-23 2014-03-26 山东大学 Information individualized recommendation method based on Web log data

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003157375A (en) * 2001-11-21 2003-05-30 Tetsukazu Yamaguchi Recommendation sales method for diversified commodities and recommendation data base therefor
CN102479366A (en) * 2010-11-25 2012-05-30 阿里巴巴集团控股有限公司 Commodity recommending method and system
CN103678652A (en) * 2013-12-23 2014-03-26 山东大学 Information individualized recommendation method based on Web log data

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105100269A (en) * 2015-08-27 2015-11-25 努比亚技术有限公司 Mobile terminal and content recommending method based on different users
CN105528716A (en) * 2015-12-03 2016-04-27 山东烟草研究院有限公司 Tobacco brand remote intelligent recommendation method facing retailer individual need

Also Published As

Publication number Publication date
CN104331433B (en) 2017-12-29

Similar Documents

Publication Publication Date Title
CN102033883B (en) A kind of method, Apparatus and system improving data transmission speed of website
CN103106259B (en) A kind of mobile webpage content recommendation method based on situation
CN102929928B (en) Multidimensional-similarity-based personalized news recommendation method
CN103106285B (en) Recommendation algorithm based on information security professional social network platform
US9858308B2 (en) Real-time content recommendation system
CN103778260A (en) Individualized microblog information recommending system and method
US20150046249A1 (en) Method and system for measuring web advertising effect based on multiple-contact attribution model
CN101493832A (en) Website content combine recommendation system and method
CN102509233A (en) User online action information-based recommendation method
CN103020845A (en) Mobile application pushing method and system
CN103729359A (en) Method and system for recommending search terms
CN104463630A (en) Product recommendation method and system based on characteristics of online shopping insurance products
CN104899273A (en) Personalized webpage recommendation method based on topic and relative entropy
CN103678618A (en) Web service recommendation method based on socializing network platform
CN102254265A (en) Rich media internet advertisement content matching and effect evaluation method
CN103186565A (en) Method and device for judging user preference according to web browsing behavior of user
CN104408210B (en) Based on the video recommendation method of leader of opinion
CN102622396A (en) Web service clustering method based on labels
CN105095211A (en) Acquisition method and device for multimedia data
CN107153908A (en) Mobile news App influence power ranking methods
CN107797894A (en) APP user behavior analysis method and apparatus
CN103440328B (en) A kind of user classification method based on mouse behavior
CN105975609A (en) Industrial design product intelligent recommendation method and system
CN105653640A (en) Collaborative filtering recommendation method based on trust mechanism
CN101706926A (en) Method for investigating and processing cigarette consumption information

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant