CN103700018A - Method for dividing users in mobile social network - Google Patents
Method for dividing users in mobile social network Download PDFInfo
- Publication number
- CN103700018A CN103700018A CN201310686372.XA CN201310686372A CN103700018A CN 103700018 A CN103700018 A CN 103700018A CN 201310686372 A CN201310686372 A CN 201310686372A CN 103700018 A CN103700018 A CN 103700018A
- Authority
- CN
- China
- Prior art keywords
- user
- matrix
- similarity
- users
- social networks
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a method for dividing users in a mobile social network. The method comprises the following steps of a, importing user geographical track journal files, and calculating disperse geographical track similarity si,j between any two users; b, establishing a User-User similarity matrix G, a Locate-User matrix L and a User-Locate matrix U based on the disperse geographical track similarity between any two users; c, importing user social activity loggings, establishing a social network model according to mobile phone calling historical records between users, and calculating a social network adjacent matrix F; d, calculating a block matrix and dividing the users. According to the method, the users in a mobile phone network or in a position-based on-line social network are accurately divided, and the method can be applied to a personalized content recommendation system and pushes personalized content to target users.
Description
Technical field
The present invention relates to data mining and user behavior analysis technology, be specifically related to the crowd's division methods in a kind of mobile community network.By the user in cell phone network or the online social networks of position-based, precisely divide, can be applicable to individualized content commending system.
Background technology
Accurately the corporations in division crowd or cluster are significant for individualized content commending systems such as optimizing location-based Internet service, according to crowd's division result, can to targeted customer, push individualized content exactly.
Existing division crowd's method generally can be divided into based on community network Detecting Community method and the clustering method based on data characteristics.
On the one hand, the method of the Detecting Community based on community network is divided traditional corporations according to the tightness degree of the social networks between individuality, the social networks inside and outside corporations can be reflected very intuitively in the corporations that obtain due to this method, therefore be easy to for analyzing and explain some universal phenomena, can be applied to very easily that pure social networking service is recommended or Study on Problems that other and social networks are closely related in, but crowd might not only have social attribute, other attributes are geographical attribute for example, social propertys etc. have influence on individual behavior and even the division of corporations equally, and then affect its application surface.
On the other hand, the data clustering method based on common trait has had a lot of application at Data Mining.The similarity of the attribute of this method based on Different Individual dimension obtains the cluster of the common trait between individuality, thereby divides dissimilar crowd.This method has been widely applied in personalized recommendation fields such as ecommerce, content of multimedia propelling movement, but this method specific aim is relatively strong, only can analyze and the behavior of detecting user in particular community dimension.
Summary of the invention
For the deficiencies in the prior art, the object of the invention is to propose the crowd's division methods in a kind of mobile community network, the method, based on discrete geographical track similarity, simultaneously in conjunction with individual geographical attribute and social attribute, is divided more accurate.
For realizing above goal of the invention, the present invention by the following technical solutions:
Crowd's division methods in mobile community network, comprises the following steps:
A, the geographical trace logs file of importing user, calculate the discrete geographical track similarity s between any two users
i,j, wherein, given total number of users N, User Activity region is covered completely by M base station, and the geographical trace logs of described user is comprised of the base station sequence number list of user ID and User Activity;
B, the discrete geographical track similarity based on any two users, set up User-User similarity matrix G, Locate-User matrix L and User-Locate matrix U, wherein, and the matrix element PV of N*M dimension User-Locate matrix U
ijrepresent that user i is at the probability of occurrence of position j; The matrix element LV of M*N dimension Locate-User matrix L
ijrepresent the probability that i upper user j in position occurs; N*N dimension User-User similarity matrix G is:
Wherein, the right geographical similarity of user of correspondence position in each matrix element respective user list;
C, importing user social contact activity log, set up social networks model according to user's handset call historical record each other, calculates social networks adjacency matrix F, and wherein, social networks adjacency matrix F is N*N dimension matrix, matrix element V
ijrepresent user i, the weight of the social networks of j;
D, calculate partitioned matrix and divide crowd.
Compared with prior art, the present invention has following technique effect: owing to coming cluster to have the individuality of similar behavior in conjunction with multidimensional data, divide more accurate.
Accompanying drawing explanation
Explanation with reference to below, by reference to the accompanying drawings, can have best understanding to the present invention.In the accompanying drawings, identical part can be represented by identical label.
Fig. 1 is crowd's division methods general frame schematic diagram that the present invention proposes;
Fig. 2 is social networks adjacency matrix calculation flow chart;
Fig. 3 is the process flow diagram of crowd's partiting step.
Embodiment
In order to make object of the present invention, technical scheme and advantage clearer, below in conjunction with accompanying drawing and exemplary embodiment, the present invention is further elaborated.Should be appreciated that exemplary embodiment described herein is only in order to explain the present invention, the scope of application being not intended to limit the present invention.
As shown in Figure 1, the crowd's division methods in the mobile community network of the embodiment of the present invention comprises the following steps:
A, the geographical trace logs file of importing user, calculate the discrete geographical track similarity between any two users.
Wherein, the geographical trace logs of user is comprised of the base station sequence number list of user ID and User Activity.Given total number of users N, User Activity region is covered completely by M base station, any user i, the discrete geographical track similarity between j is expressed as the cosine similarity indices sum of both all common location:
P wherein
i,lthe probability that user occurs at position l, || p
i,l|| be p
i,lnorm.In the present embodiment, this norm is 2 norms.
Step a is specially: first import any two users' historical base station sequence number list, contrast both common base station numbers and account for ratio separately, calculate the probability of occurrence p of each comfortable all total base station location l
i,l, then calculate corresponding cosine similarity indices, finally summation obtains any two users' discrete geographical track similarity.
B, the discrete geographical track similarity based on any two users, set up User-User similarity matrix G, Locate-User matrix L and User-Locate matrix U.
Step b specifically comprises: according to the discrete geographical track similarity s between individuality
i,jset up User-User similarity matrix G, set up User-Locate matrix U and Locate-User matrix L simultaneously, utilize symmetrical Algorithms of Non-Negative Matrix Factorization (SNMF) to carry out matrix decomposition to User-User similarity matrix G, obtain the Preliminary division of individual geographic track, i.e. the degree of adhering to separately of affiliated each partitioned matrix.
Wherein, the matrix element PV of N*M dimension User-Locate matrix U
ijrepresent that user i is at the probability of occurrence of position j; The matrix element LV of M*N dimension Locate-User matrix L
ijrepresent the probability that i upper user j in position occurs; N*N dimension User-User similarity matrix G is:
Wherein, the right geographical similarity of user of correspondence position in each matrix element respective user list.
C, importing user social contact activity log, set up social networks model according to user's handset call historical record each other, calculates social networks adjacency matrix F.
Wherein, social networks adjacency matrix F is N*N dimension matrix, matrix element V
ijrepresent user i, the weight of the social networks of j.In the present embodiment, described weight can be user i, the social interaction frequency of j, i.e. user i, the number of communications between j and the duration of call.
Step c is specially: import N user's doings daily record, initialization N*N social networks adjacency matrix F=0, targeted customer corresponding in each user's doings daily record is read in circulation, according to the weight of social networks between individuality, upgrade corresponding matrix element in social networks adjacency matrix, thereby set up the social networks adjacency matrix of colony, as shown in Figure 2.
D, calculate partitioned matrix and divide crowd.
Steps d is specially: import the social networks adjacency matrix F in User-User similarity matrix G, User-Locate matrix U, Locate-User matrix L and the step c in step b, the initial value X of target setting partitioned matrix carries out to User-User similarity matrix G degree of adhering to separately the matrix that matrix decomposition obtains in step b, adopt associating Algorithms of Non-Negative Matrix Factorization (JNMF) to do matrix decomposition, iterate until target partitioned matrix
before and after meeting, the mould of the difference of twice iteration is less than predetermined value, exports each node affiliated each partitioned matrix and degree of adhering to separately thereof, obtains the dividing condition of colony, as shown in Figure 3.
According to such scheme, input above-mentioned four matrix G, U, L, F and do associating Non-negative Matrix Factorization, being calculated as follows of its target partitioned matrix (N*K matrix):
In the present embodiment, preset value α=β=0.5, k=1,2 ..., K, K is default geographical corporations number, scalar 1 is the unit matrix of corresponding dimension (the diagonal element of 2 β L all deducts 1).After final the stablizing that iterate, obtain required target partitioned matrix
wherein the capable representative of consumer i of i is for the degree of membership of each geographical corporations.
Steps d obtains after crowd's division result, and individualized content commending system can push individualized content to targeted customer according to the crowd who divides.
The foregoing is only preferred embodiment of the present invention, not in order to limit the present invention, all any modifications of doing within the spirit and principles in the present invention, be equal to and replace and improvement etc., within all should being included in protection scope of the present invention.
Claims (7)
1. the crowd's division methods in mobile community network, comprises the following steps:
A, the geographical trace logs file of importing user, calculate the discrete geographical track similarity s between any two users
i,j, wherein, given total number of users N, User Activity region is covered completely by M base station, and the geographical trace logs of described user is comprised of the base station sequence number list of user ID and User Activity;
B, the discrete geographical track similarity based on any two users, set up User-User similarity matrix G, Locate-User matrix L and User-Locate matrix U, wherein, and the matrix element PV of N*M dimension User-Locate matrix U
ijrepresent that user i is at the probability of occurrence of position j; The matrix element LV of M*N dimension Locate-User matrix L
ijrepresent the probability that i upper user j in position occurs; N*N dimension User-User similarity matrix G is:
Wherein, the right geographical similarity of user of correspondence position in each matrix element respective user list;
C, importing user social contact activity log, set up social networks model according to user's handset call historical record each other, calculates social networks adjacency matrix F, and wherein, social networks adjacency matrix F is N*N dimension matrix, matrix element V
ijrepresent user i, the weight of the social networks of j;
D, calculate partitioned matrix and divide crowd.
2. division methods according to claim 1, wherein, user i arbitrarily, the discrete geographical track similarity between j is expressed as the cosine similarity indices sum of both all common location:
P wherein
i,lthe probability that user occurs at position l, || p
i,l|| be p
i,lnorm, be preferably 2 norms.
3. division methods according to claim 2, wherein, step a is specially:
First import any two users' historical base station sequence number list, contrast both common base station numbers and account for ratio separately, calculate the probability of occurrence p of each comfortable all total base station location l
i,l, then calculate corresponding cosine similarity indices, finally summation obtains any two users' discrete geographical track similarity.
4. division methods according to claim 1, wherein, step b also comprises:
Utilize symmetrical Algorithms of Non-Negative Matrix Factorization (SNMF) to carry out matrix decomposition to User-User similarity matrix G, obtain the Preliminary division of individual geographic track, be i.e. the degree of adhering to separately of affiliated each partitioned matrix.
5. division methods according to claim 1, wherein, described user i, the weight of the social networks of j is specially user i, the social interaction frequency of j, i.e. user i, the number of communications between j and the duration of call.
6. division methods according to claim 1, wherein, step c is specially: import N user's doings daily record, initialization N*N social networks adjacency matrix F=0, targeted customer corresponding in each user's doings daily record is read in circulation, according to the weight of social networks between individuality, upgrade corresponding matrix element in social networks adjacency matrix, thereby set up the social networks adjacency matrix of colony.
7. division methods according to claim 1, wherein, steps d is specially: import the social networks adjacency matrix F in User-User similarity matrix G, User-Locate matrix U, Locate-User matrix L and the step c in step b, the initial value X of target setting partitioned matrix carries out to User-User similarity matrix G degree of adhering to separately the matrix that matrix decomposition obtains in step b, adopt associating Algorithms of Non-Negative Matrix Factorization (JNMF) to do matrix decomposition, iterate until target partitioned matrix
before and after meeting, the mould of the difference of twice iteration is less than predetermined value, exports each node affiliated each partitioned matrix and degree of adhering to separately thereof, obtains the dividing condition of colony.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310686372.XA CN103700018B (en) | 2013-12-16 | 2013-12-16 | A kind of crowd division methods in mobile community network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310686372.XA CN103700018B (en) | 2013-12-16 | 2013-12-16 | A kind of crowd division methods in mobile community network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103700018A true CN103700018A (en) | 2014-04-02 |
CN103700018B CN103700018B (en) | 2018-04-24 |
Family
ID=50361537
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310686372.XA Expired - Fee Related CN103700018B (en) | 2013-12-16 | 2013-12-16 | A kind of crowd division methods in mobile community network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103700018B (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104035978A (en) * | 2014-05-26 | 2014-09-10 | 南京泰锐斯通信科技有限公司 | Association discovering method and system |
CN104518930A (en) * | 2015-01-09 | 2015-04-15 | 哈尔滨工程大学 | Method for synchronously detecting abnormal users and abnormal messages for microblog |
CN104820905A (en) * | 2015-05-19 | 2015-08-05 | 威海北洋电气集团股份有限公司 | Space trajectory big data analysis-based person management and control method and system |
CN104881459A (en) * | 2015-05-22 | 2015-09-02 | 电子科技大学 | Friend recommendation method of mobile social network |
WO2016169192A1 (en) * | 2015-04-24 | 2016-10-27 | 百度在线网络技术(北京)有限公司 | Method and apparatus for determining user similarity |
CN106506705A (en) * | 2016-12-29 | 2017-03-15 | 平安科技(深圳)有限公司 | Listener clustering method and device based on location-based service |
CN106570764A (en) * | 2016-11-09 | 2017-04-19 | 广州杰赛科技股份有限公司 | User relationship predicting method and device |
CN106600053A (en) * | 2016-12-12 | 2017-04-26 | 西安交通大学 | Spatial-temporal trajectory and social network user attribute prediction system |
CN106846228A (en) * | 2017-02-13 | 2017-06-13 | 浪潮通信信息系统有限公司 | A kind of public safety prewarning method and system based on positional information and social networks |
CN107316044A (en) * | 2016-04-27 | 2017-11-03 | 中国电信股份有限公司 | Similar users recognition methods and device |
CN107613520A (en) * | 2017-08-29 | 2018-01-19 | 重庆邮电大学 | A kind of telecommunication user similarity based on LDA topic models finds method |
CN110750730A (en) * | 2019-09-10 | 2020-02-04 | 合肥工业大学 | Group detection method and system based on space-time constraint |
CN112417076A (en) * | 2020-11-24 | 2021-02-26 | 杭州东信北邮信息技术有限公司 | Building personnel affiliation identification method based on big data mining technology |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101259604B1 (en) * | 2011-11-15 | 2013-05-14 | 경기대학교 산학협력단 | Contents recommendation system and method based on social network |
CN102880719A (en) * | 2012-10-16 | 2013-01-16 | 四川大学 | User trajectory similarity mining method for location-based social network |
CN103049496B (en) * | 2012-12-07 | 2016-08-17 | 北京百度网讯科技有限公司 | A kind of multiple users are carried out the method for customer group division, device and equipment |
-
2013
- 2013-12-16 CN CN201310686372.XA patent/CN103700018B/en not_active Expired - Fee Related
Cited By (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104035978A (en) * | 2014-05-26 | 2014-09-10 | 南京泰锐斯通信科技有限公司 | Association discovering method and system |
CN104035978B (en) * | 2014-05-26 | 2017-06-30 | 南京泰锐斯通信科技有限公司 | Combo discovering method and system |
CN104518930A (en) * | 2015-01-09 | 2015-04-15 | 哈尔滨工程大学 | Method for synchronously detecting abnormal users and abnormal messages for microblog |
CN104518930B (en) * | 2015-01-09 | 2017-11-21 | 哈尔滨工程大学 | A kind of abnormal user and message Simultaneous Detection towards microblogging |
WO2016169192A1 (en) * | 2015-04-24 | 2016-10-27 | 百度在线网络技术(北京)有限公司 | Method and apparatus for determining user similarity |
CN104820905B (en) * | 2015-05-19 | 2018-11-20 | 威海北洋电气集团股份有限公司 | Personnel's management-control method and system based on space tracking big data analysis |
CN104820905A (en) * | 2015-05-19 | 2015-08-05 | 威海北洋电气集团股份有限公司 | Space trajectory big data analysis-based person management and control method and system |
CN104881459A (en) * | 2015-05-22 | 2015-09-02 | 电子科技大学 | Friend recommendation method of mobile social network |
CN107316044A (en) * | 2016-04-27 | 2017-11-03 | 中国电信股份有限公司 | Similar users recognition methods and device |
CN106570764A (en) * | 2016-11-09 | 2017-04-19 | 广州杰赛科技股份有限公司 | User relationship predicting method and device |
CN106600053A (en) * | 2016-12-12 | 2017-04-26 | 西安交通大学 | Spatial-temporal trajectory and social network user attribute prediction system |
CN106600053B (en) * | 2016-12-12 | 2020-04-10 | 西安交通大学 | User attribute prediction system based on space-time trajectory and social network |
CN106506705A (en) * | 2016-12-29 | 2017-03-15 | 平安科技(深圳)有限公司 | Listener clustering method and device based on location-based service |
WO2018120424A1 (en) * | 2016-12-29 | 2018-07-05 | 平安科技(深圳)有限公司 | Location service-based method, device, equipment for crowd classification, and storage medium |
CN106506705B (en) * | 2016-12-29 | 2020-07-28 | 平安科技(深圳)有限公司 | Crowd classification method and device based on location service |
CN106846228A (en) * | 2017-02-13 | 2017-06-13 | 浪潮通信信息系统有限公司 | A kind of public safety prewarning method and system based on positional information and social networks |
CN107613520A (en) * | 2017-08-29 | 2018-01-19 | 重庆邮电大学 | A kind of telecommunication user similarity based on LDA topic models finds method |
CN107613520B (en) * | 2017-08-29 | 2020-08-04 | 重庆邮电大学 | Telecommunication user similarity discovery method based on L DA topic model |
CN110750730A (en) * | 2019-09-10 | 2020-02-04 | 合肥工业大学 | Group detection method and system based on space-time constraint |
CN112417076A (en) * | 2020-11-24 | 2021-02-26 | 杭州东信北邮信息技术有限公司 | Building personnel affiliation identification method based on big data mining technology |
Also Published As
Publication number | Publication date |
---|---|
CN103700018B (en) | 2018-04-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103700018A (en) | Method for dividing users in mobile social network | |
CN103488678B (en) | Friend recommendation system based on user sign-in similarity | |
CN103617547B (en) | A kind of business recommended method and system | |
CN103295145A (en) | Mobile phone advertising method based on user consumption feature vector | |
CN103678647A (en) | Method and system for recommending information | |
CN105335400A (en) | Method and apparatus for obtaining answer information for questioning intention of user | |
CN103533390A (en) | Method and system for recommending television programs based on social network information | |
CN105760449B (en) | A kind of cloud method for pushing towards multi-source heterogeneous data | |
CN108154425A (en) | Method is recommended by the Xian Xia trade companies of a kind of combination community network and position | |
CN103795613A (en) | Method for predicting friend relationships in online social network | |
CN104581749B (en) | A kind of prediction technique and device of mobile network data service traffic | |
CN106528812B (en) | A kind of cloud recommended method based on USDR model | |
CN110489507A (en) | Determine the method, apparatus, computer equipment and storage medium of point of interest similarity | |
CN111160847B (en) | Method and device for processing flow information | |
CN104424247A (en) | Product information filtering recommendation method and device | |
Zhu et al. | Learning geographical and mobility factors for mobile application recommendation | |
CN105653535A (en) | Media resource recommendation method | |
CN106846082A (en) | Tourism cold start-up consumer products commending system and method based on hardware information | |
CN110188268A (en) | A kind of personalized recommendation method based on label and temporal information | |
CN108038734B (en) | Urban commercial facility spatial distribution detection method and system based on comment data | |
EP3503004A1 (en) | Method and system for evaluating user portrait data | |
Chen et al. | User intent-oriented video QoE with emotion detection networking | |
CN103744929A (en) | Target user object determination method | |
CN109508862A (en) | A kind of allocator and device | |
CN101925044A (en) | Method for judging wireless user attributes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20180424 Termination date: 20181216 |
|
CF01 | Termination of patent right due to non-payment of annual fee |