CN103700018A - Method for dividing users in mobile social network - Google Patents

Publication number
CN103700018A
Authority
CN
China
Prior art keywords
user
matrix
similarity
users
social networks
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310686372.XA
Other languages
Chinese (zh)
Other versions
CN103700018B (en)
Inventor
陈述
涂来
黄本雄
马雪琴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong University of Science and Technology
Original Assignee
Huazhong University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong University of Science and Technology
Priority to CN201310686372.XA
Publication of CN103700018A
Application granted
Publication of CN103700018B
Expired - Fee Related
Anticipated expiration

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a method for dividing users in a mobile social network. The method comprises the following steps: a, importing the users' geographic trajectory log files and calculating the discrete geographic trajectory similarity $s_{i,j}$ between any two users; b, establishing a User-User similarity matrix G, a Locate-User matrix L, and a User-Locate matrix U based on the discrete geographic trajectory similarity between any two users; c, importing the users' social activity logs, establishing a social network model from the mobile phone call history between users, and calculating a social network adjacency matrix F; d, calculating a partition matrix and dividing the users. The method accurately divides the users of a mobile phone network or a location-based online social network, and it can be applied to a personalized content recommendation system to push personalized content to target users.

Description

A crowd division method in a mobile social network
Technical Field
The present invention relates to data mining and user behavior analysis technology, and specifically to a crowd division method in a mobile social network. By accurately dividing the users of a mobile phone network or a location-based online social network, the method can be applied to personalized content recommendation systems.
Background
Accurately dividing the communities or clusters within a crowd is significant for optimizing personalized content recommendation systems such as location-based Internet services: based on the crowd division result, personalized content can be pushed accurately to target users.
Existing crowd division methods can generally be divided into community detection methods based on social networks and clustering methods based on data features.
On the one hand, community detection methods based on social networks divide traditional communities according to the closeness of the social relationships between individuals. Because the communities obtained this way intuitively reflect the social relationships inside and outside each community, they are easy to use for analyzing and explaining general phenomena, and they apply readily to recommendation in pure social networking services or to research problems closely tied to social relationships. However, a crowd does not have only social attributes. Other attributes, such as geographic and societal attributes, also influence individual behavior and even the division of communities, which limits the range of application of these methods.
On the other hand, data clustering methods based on common features are already widely applied in data mining. Such methods cluster individuals by the similarity of their attributes along different dimensions, thereby dividing the crowd into different types. They have been widely applied in personalized recommendation fields such as e-commerce and multimedia content push, but they are relatively narrow in focus: they can only analyze and detect user behavior along a particular attribute dimension.
Summary of the Invention
In view of the deficiencies of the prior art, the object of the present invention is to propose a crowd division method in a mobile social network. The method is based on discrete geographic trajectory similarity and combines the geographic and social attributes of individuals, so that the division is more accurate.
To achieve the above object, the present invention adopts the following technical solution:
A crowd division method in a mobile social network, comprising the following steps:
a. Import the users' geographic trajectory log files and calculate the discrete geographic trajectory similarity $s_{i,j}$ between any two users, wherein, given N users in total whose activity region is completely covered by M base stations, each user's geographic trajectory log consists of the user ID and a list of the sequence numbers of the base stations where the user was active;
b. Based on the discrete geographic trajectory similarity between any two users, establish the User-User similarity matrix G, the Locate-User matrix L, and the User-Locate matrix U, wherein the element $PV_{ij}$ of the N×M User-Locate matrix U represents the probability that user i appears at location j; the element $LV_{ij}$ of the M×N Locate-User matrix L represents the probability that user j appears at location i; and the N×N User-User similarity matrix G is:
$$G = \begin{pmatrix} s_{1,1} & \cdots & s_{1,N} \\ \vdots & \ddots & \vdots \\ s_{N,1} & \cdots & s_{N,N} \end{pmatrix}$$
wherein each matrix element is the geographic similarity of the user pair at the corresponding positions in the user list;
c. Import the users' social activity logs, establish a social network model from the mobile phone call history between users, and calculate the social network adjacency matrix F, wherein F is an N×N matrix whose element $V_{ij}$ represents the weight of the social relationship between users i and j;
d. Calculate the partition matrix and divide the crowd.
Compared with the prior art, the present invention has the following technical effect: because it combines multidimensional data to cluster individuals with similar behavior, the division is more accurate.
Brief Description of the Drawings
The present invention is best understood from the following description read together with the accompanying drawings, in which identical parts may be denoted by identical reference numerals.
Fig. 1 is a schematic diagram of the overall framework of the crowd division method proposed by the present invention;
Fig. 2 is a flowchart of the calculation of the social network adjacency matrix;
Fig. 3 is a flowchart of the crowd division step.
Detailed Description
To make the object, technical solution, and advantages of the present invention clearer, the present invention is further described below with reference to the accompanying drawings and exemplary embodiments. It should be understood that the exemplary embodiments described here only explain the present invention and are not intended to limit its scope of application.
As shown in Fig. 1, the crowd division method in a mobile social network according to an embodiment of the present invention comprises the following steps:
a. Import the users' geographic trajectory log files and calculate the discrete geographic trajectory similarity between any two users.
Each user's geographic trajectory log consists of the user ID and a list of the sequence numbers of the base stations where the user was active. Given N users in total whose activity region is completely covered by M base stations, the discrete geographic trajectory similarity between any two users i and j is expressed as the sum of the cosine similarity indices over all of their common locations:
$$s_{i,j} = \sum_{l \in L,\; i,j \in N} \frac{p_{i,l}\, p_{j,l}}{\|p_{i,l}\| \times \|p_{j,l}\|}$$
where $p_{i,l}$ is the probability that user i appears at location l, and $\|p_{i,l}\|$ is the norm of $p_{i,l}$. In this embodiment, the norm is the 2-norm.
Step a is specifically: first import the historical base station sequence number lists of any two users and compare the proportion that their common base stations account for in each list, calculating each user's occurrence probability $p_{i,l}$ at every shared base station location l; then calculate the corresponding cosine similarity indices; finally, sum them to obtain the discrete geographic trajectory similarity of the two users.
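Step a can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names and the sample logs are hypothetical, the occurrence probability is taken as the fraction of a user's log entries at each base station, and the 2-norm is read as the norm of the user's whole location-probability vector, which turns the sum into an ordinary cosine similarity. The source does not confirm these readings.

```python
from collections import Counter
import math

def location_probabilities(stations):
    """Map each base station number to the fraction of log entries at it."""
    counts = Counter(stations)
    total = sum(counts.values())
    return {station: n / total for station, n in counts.items()}

def trajectory_similarity(stations_i, stations_j):
    """Cosine-style similarity summed over the base stations both users share."""
    p_i = location_probabilities(stations_i)
    p_j = location_probabilities(stations_j)
    # Assumption: the 2-norm is taken over each user's full probability vector.
    norm_i = math.sqrt(sum(v * v for v in p_i.values()))
    norm_j = math.sqrt(sum(v * v for v in p_j.values()))
    if norm_i == 0 or norm_j == 0:
        return 0.0  # a user with an empty log is similar to nobody
    common = p_i.keys() & p_j.keys()
    return sum(p_i[l] * p_j[l] for l in common) / (norm_i * norm_j)

# Hypothetical logs: user ID -> list of base station sequence numbers.
logs = {"u1": [3, 3, 7, 9], "u2": [3, 7, 7, 9], "u3": [12, 12, 15]}
for a in logs:
    for b in logs:
        if a < b:
            print(a, b, round(trajectory_similarity(logs[a], logs[b]), 4))
```

Users who never share a base station get similarity 0, and a user compared with an identical log gets similarity 1, which matches the intuition behind the formula.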
b. Based on the discrete geographic trajectory similarity between any two users, establish the User-User similarity matrix G, the Locate-User matrix L, and the User-Locate matrix U.
Step b specifically comprises: establish the User-User similarity matrix G from the discrete geographic trajectory similarity $s_{i,j}$ between individuals, and at the same time establish the User-Locate matrix U and the Locate-User matrix L; then apply the symmetric non-negative matrix factorization (SNMF) algorithm to G to obtain a preliminary division of the individuals' geographic trajectories, that is, each individual's degree of membership in each partition.
The element $PV_{ij}$ of the N×M User-Locate matrix U represents the probability that user i appears at location j; the element $LV_{ij}$ of the M×N Locate-User matrix L represents the probability that user j appears at location i; and the N×N User-User similarity matrix G is:
$$G = \begin{pmatrix} s_{1,1} & \cdots & s_{1,N} \\ \vdots & \ddots & \vdots \\ s_{N,1} & \cdots & s_{N,N} \end{pmatrix}$$
wherein each matrix element is the geographic similarity of the user pair at the corresponding positions in the user list.
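The matrices of step b and the preliminary SNMF division can be sketched with NumPy as below. Several points are assumptions rather than details given in the text: the input is a hypothetical N×M table of visit counts, L is read as the share of each location's visits that belongs to each user, G is computed as the cosine similarity of the rows of U, and the SNMF update is a commonly used damped multiplicative rule, not necessarily the one the inventors applied.

```python
import numpy as np

def build_matrices(visits):
    """visits[i, j]: number of log entries of user i at base station j (N x M)."""
    visits = np.asarray(visits, dtype=float)
    row_sums = visits.sum(axis=1, keepdims=True)
    col_sums = visits.sum(axis=0, keepdims=True)
    # U[i, j]: probability that user i appears at location j (rows sum to 1).
    U = np.divide(visits, row_sums, out=np.zeros_like(visits), where=row_sums > 0)
    # L[i, j]: assumed reading, share of the visits to location i made by user j.
    L = np.divide(visits, col_sums, out=np.zeros_like(visits), where=col_sums > 0).T
    # G[i, j]: cosine similarity between the location profiles of users i and j.
    norms = np.linalg.norm(U, axis=1, keepdims=True)
    unit = np.divide(U, norms, out=np.zeros_like(U), where=norms > 0)
    G = unit @ unit.T
    return G, U, L

def snmf(G, k, iters=200, seed=0):
    """Approximate G by X @ X.T with a non-negative N x k factor X."""
    rng = np.random.default_rng(seed)
    X = rng.random((G.shape[0], k)) + 1e-3
    for _ in range(iters):
        numer = G @ X
        denom = X @ (X.T @ X) + 1e-12  # small constant avoids division by zero
        X *= 0.5 + 0.5 * numer / denom  # damped multiplicative update
    return X

# Hypothetical visit counts for 4 users and 3 base stations.
G, U, L = build_matrices([[5, 5, 0], [4, 6, 0], [0, 0, 7], [0, 0, 9]])
X0 = snmf(G, k=2)
print(X0.argmax(axis=1))  # preliminary community index per user
```

Each row of `X0` plays the role of a user's preliminary membership degrees, which step d later uses as the starting point of the joint factorization.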
c. Import the users' social activity logs, establish a social network model from the mobile phone call history between users, and calculate the social network adjacency matrix F.
The social network adjacency matrix F is an N×N matrix whose element $V_{ij}$ represents the weight of the social relationship between users i and j. In this embodiment, the weight may be the social interaction frequency of users i and j, that is, the number of communications between them and their call duration.
Step c is specifically: import the social activity logs of the N users and initialize the N×N social network adjacency matrix F = 0; loop over each user's social activity log, reading the corresponding target user, and update the corresponding element of the adjacency matrix according to the weight of the social relationship between the individuals, thereby establishing the social network adjacency matrix of the group, as shown in Fig. 2.
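The loop of step c can be sketched as follows. The record layout `(caller, callee, seconds)`, the function name, and the way call count and call duration are combined into one weight (one unit per call plus `duration_weight` per second) are assumptions for illustration; the text only says that both quantities contribute. The sketch also treats the relationship as symmetric.

```python
import numpy as np

def build_social_adjacency(user_ids, call_records, duration_weight=0.0):
    """Accumulate call records into an N x N weighted adjacency matrix F."""
    index = {uid: n for n, uid in enumerate(user_ids)}
    F = np.zeros((len(user_ids), len(user_ids)))  # initialize F = 0
    for caller, callee, seconds in call_records:
        # Skip calls to users outside the studied group and self-calls.
        if caller not in index or callee not in index or caller == callee:
            continue
        i, j = index[caller], index[callee]
        weight = 1.0 + duration_weight * seconds  # assumed combination rule
        F[i, j] += weight
        F[j, i] += weight  # assumption: the social tie is undirected
    return F

# Hypothetical call history: (caller, callee, call duration in seconds).
users = ["a", "b", "c"]
records = [("a", "b", 60), ("b", "a", 30), ("a", "c", 10), ("a", "x", 5)]
print(build_social_adjacency(users, records, duration_weight=0.01))
```

With `duration_weight=0.0` the matrix reduces to a plain count of communications between each pair of users.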
d. Calculate the partition matrix and divide the crowd.
Step d is specifically: import the User-User similarity matrix G, the User-Locate matrix U, and the Locate-User matrix L from step b and the social network adjacency matrix F from step c; set the initial value X of the target partition matrix to the membership matrix obtained in step b by factorizing the User-User similarity matrix G; perform matrix factorization with the joint non-negative matrix factorization (JNMF) algorithm, iterating until the norm of the difference between the target partition matrix in two successive iterations is less than a predetermined value; then output the partition to which each node belongs and its degree of membership, giving the division of the group, as shown in Fig. 3.
According to the above scheme, the four matrices G, U, L, and F are input to the joint non-negative matrix factorization, and the target partition matrix $\hat{X}$ (an N×K matrix) is computed as follows:
$$\hat{X}_{ik} \leftarrow \hat{X}_{ik} \left( \frac{[\,U + 2\alpha F X + X\hat{L}\,]_{ik}}{2(\alpha+\beta)\,[\,X X^{T} X\,]_{ik}} \right)^{1/4}, \qquad \text{where } \hat{L} = 2\beta L - 1$$
In this embodiment, the preset values are α = β = 0.5; k = 1, 2, ..., K, where K is the preset number of geographic communities; and the scalar 1 denotes the identity matrix of the corresponding dimension (that is, 1 is subtracted from each diagonal element of 2βL). After the iteration finally stabilizes, the required target partition matrix $\hat{X}$ is obtained, in which row i represents user i's degree of membership in each geographic community.
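The iteration and stopping criterion of step d, together with the reading of the final membership rows, can be sketched as below. The sketch does not reproduce the JNMF update rule given above. As a plainly different stand-in it applies a damped symmetric-NMF update to the combined matrix G + αF, so it only illustrates the control flow: start from an initial X, iterate, and stop once the norm of the difference between two successive iterates falls below a threshold. All function names, the tolerance, and the sample matrices are hypothetical, and the random starting point replaces the SNMF result of step b only to keep the example self-contained.

```python
import numpy as np

def iterate_until_stable(X0, update, tol=1e-6, max_iter=1000):
    """Apply `update` until successive iterates differ by less than `tol`."""
    X = np.asarray(X0, dtype=float)
    for _ in range(max_iter):
        X_next = update(X)
        if np.linalg.norm(X_next - X) < tol:  # Frobenius norm of the difference
            return X_next
        X = X_next
    return X

def stand_in_update(G, F, alpha=0.5):
    """Stand-in rule (not the patent's JNMF update): SNMF on G + alpha * F."""
    A = G + alpha * F
    def update(X):
        return X * (0.5 + 0.5 * (A @ X) / (X @ (X.T @ X) + 1e-12))
    return update

def memberships(X):
    """Normalize each row so that it reads as membership degrees summing to 1."""
    row_sums = X.sum(axis=1, keepdims=True)
    return np.divide(X, row_sums, out=np.zeros_like(X), where=row_sums > 0)

# Hypothetical inputs for 4 users and K = 2 communities.
G = np.array([[1.0, 0.9, 0.0, 0.0], [0.9, 1.0, 0.0, 0.0],
              [0.0, 0.0, 1.0, 0.8], [0.0, 0.0, 0.8, 1.0]])
F = np.array([[0, 2, 0, 0], [2, 0, 0, 0], [0, 0, 0, 1], [0, 0, 1, 0]], dtype=float)
X0 = np.random.default_rng(0).random((4, 2)) + 1e-3
X_hat = iterate_until_stable(X0, stand_in_update(G, F))
print(memberships(X_hat).round(2))  # row i: membership degrees of user i
print(X_hat.argmax(axis=1))         # community with the highest degree per user
```

Passing the update as a function keeps the stopping logic independent of the factorization rule, so a faithful JNMF update could be dropped in without changing the loop.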
After step d produces the crowd division result, a personalized content recommendation system can push personalized content to target users according to the divided crowd.
The above is only a preferred embodiment of the present invention and is not intended to limit the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the protection scope of the present invention.

Claims (7)

1. A crowd division method in a mobile social network, comprising the following steps:
a. importing the users' geographic trajectory log files and calculating the discrete geographic trajectory similarity $s_{i,j}$ between any two users, wherein, given N users in total whose activity region is completely covered by M base stations, each user's geographic trajectory log consists of the user ID and a list of the sequence numbers of the base stations where the user was active;
b. based on the discrete geographic trajectory similarity between any two users, establishing the User-User similarity matrix G, the Locate-User matrix L, and the User-Locate matrix U, wherein the element $PV_{ij}$ of the N×M User-Locate matrix U represents the probability that user i appears at location j; the element $LV_{ij}$ of the M×N Locate-User matrix L represents the probability that user j appears at location i; and the N×N User-User similarity matrix G is:
$$G = \begin{pmatrix} s_{1,1} & \cdots & s_{1,N} \\ \vdots & \ddots & \vdots \\ s_{N,1} & \cdots & s_{N,N} \end{pmatrix}$$
wherein each matrix element is the geographic similarity of the user pair at the corresponding positions in the user list;
c. importing the users' social activity logs, establishing a social network model from the mobile phone call history between users, and calculating the social network adjacency matrix F, wherein F is an N×N matrix whose element $V_{ij}$ represents the weight of the social relationship between users i and j;
d. calculating the partition matrix and dividing the crowd.
2. The division method according to claim 1, wherein the discrete geographic trajectory similarity between any two users i and j is expressed as the sum of the cosine similarity indices over all of their common locations:
$$s_{i,j} = \sum_{l \in L,\; i,j \in N} \frac{p_{i,l}\, p_{j,l}}{\|p_{i,l}\| \times \|p_{j,l}\|}$$
where $p_{i,l}$ is the probability that user i appears at location l, and $\|p_{i,l}\|$ is the norm of $p_{i,l}$, preferably the 2-norm.
3. The division method according to claim 2, wherein step a is specifically:
first importing the historical base station sequence number lists of any two users and comparing the proportion that their common base stations account for in each list, calculating each user's occurrence probability $p_{i,l}$ at every shared base station location l; then calculating the corresponding cosine similarity indices; and finally summing them to obtain the discrete geographic trajectory similarity of the two users.
4. The division method according to claim 1, wherein step b further comprises:
applying the symmetric non-negative matrix factorization (SNMF) algorithm to the User-User similarity matrix G to obtain a preliminary division of the individuals' geographic trajectories, that is, each individual's degree of membership in each partition.
5. The division method according to claim 1, wherein the weight of the social relationship between users i and j is specifically their social interaction frequency, that is, the number of communications between users i and j and their call duration.
6. The division method according to claim 1, wherein step c is specifically: importing the social activity logs of the N users and initializing the N×N social network adjacency matrix F = 0; looping over each user's social activity log, reading the corresponding target user, and updating the corresponding element of the adjacency matrix according to the weight of the social relationship between the individuals, thereby establishing the social network adjacency matrix of the group.
7. The division method according to claim 1, wherein step d is specifically: importing the User-User similarity matrix G, the User-Locate matrix U, and the Locate-User matrix L from step b and the social network adjacency matrix F from step c; setting the initial value X of the target partition matrix to the membership matrix obtained in step b by factorizing the User-User similarity matrix G; performing matrix factorization with the joint non-negative matrix factorization (JNMF) algorithm, iterating until the norm of the difference between the target partition matrix $\hat{X}$ in two successive iterations is less than a predetermined value; and outputting the partition to which each node belongs and its degree of membership, giving the division of the group.
CN201310686372.XA 2013-12-16 2013-12-16 A crowd division method in a mobile social network Expired - Fee Related CN103700018B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310686372.XA CN103700018B (en) 2013-12-16 2013-12-16 A crowd division method in a mobile social network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310686372.XA CN103700018B (en) 2013-12-16 2013-12-16 A crowd division method in a mobile social network

Publications (2)

Publication Number Publication Date
CN103700018A true CN103700018A (en) 2014-04-02
CN103700018B CN103700018B (en) 2018-04-24

Family

ID=50361537

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310686372.XA Expired - Fee Related CN103700018B (en) 2013-12-16 2013-12-16 A crowd division method in a mobile social network

Country Status (1)

Country Link
CN (1) CN103700018B (en)

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101259604B1 (en) * 2011-11-15 2013-05-14 경기대학교 산학협력단 Contents recommendation system and method based on social network
CN102880719A (en) * 2012-10-16 2013-01-16 四川大学 User trajectory similarity mining method for location-based social network
CN103049496B (en) * 2012-12-07 2016-08-17 北京百度网讯科技有限公司 A kind of multiple users are carried out the method for customer group division, device and equipment

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104035978A (en) * 2014-05-26 2014-09-10 南京泰锐斯通信科技有限公司 Association discovering method and system
CN104035978B (en) * 2014-05-26 2017-06-30 南京泰锐斯通信科技有限公司 Combo discovering method and system
CN104518930A (en) * 2015-01-09 2015-04-15 哈尔滨工程大学 Method for synchronously detecting abnormal users and abnormal messages for microblog
CN104518930B (en) * 2015-01-09 2017-11-21 哈尔滨工程大学 A kind of abnormal user and message Simultaneous Detection towards microblogging
WO2016169192A1 (en) * 2015-04-24 2016-10-27 百度在线网络技术(北京)有限公司 Method and apparatus for determining user similarity
CN104820905B (en) * 2015-05-19 2018-11-20 威海北洋电气集团股份有限公司 Personnel's management-control method and system based on space tracking big data analysis
CN104820905A (en) * 2015-05-19 2015-08-05 威海北洋电气集团股份有限公司 Space trajectory big data analysis-based person management and control method and system
CN104881459A (en) * 2015-05-22 2015-09-02 电子科技大学 Friend recommendation method of mobile social network
CN107316044A (en) * 2016-04-27 2017-11-03 中国电信股份有限公司 Similar users recognition methods and device
CN106570764A (en) * 2016-11-09 2017-04-19 广州杰赛科技股份有限公司 User relationship predicting method and device
CN106600053A (en) * 2016-12-12 2017-04-26 西安交通大学 Spatial-temporal trajectory and social network user attribute prediction system
CN106600053B (en) * 2016-12-12 2020-04-10 西安交通大学 User attribute prediction system based on space-time trajectory and social network
CN106506705A (en) * 2016-12-29 2017-03-15 平安科技(深圳)有限公司 Listener clustering method and device based on location-based service
WO2018120424A1 (en) * 2016-12-29 2018-07-05 平安科技(深圳)有限公司 Location service-based method, device, equipment for crowd classification, and storage medium
CN106506705B (en) * 2016-12-29 2020-07-28 平安科技(深圳)有限公司 Crowd classification method and device based on location service
CN106846228A (en) * 2017-02-13 2017-06-13 浪潮通信信息系统有限公司 A kind of public safety prewarning method and system based on positional information and social networks
CN107613520A (en) * 2017-08-29 2018-01-19 重庆邮电大学 A kind of telecommunication user similarity based on LDA topic models finds method
CN107613520B (en) * 2017-08-29 2020-08-04 重庆邮电大学 Telecommunication user similarity discovery method based on LDA topic model
CN110750730A (en) * 2019-09-10 2020-02-04 合肥工业大学 Group detection method and system based on space-time constraint
CN112417076A (en) * 2020-11-24 2021-02-26 杭州东信北邮信息技术有限公司 Building personnel affiliation identification method based on big data mining technology

Also Published As

Publication number Publication date
CN103700018B (en) 2018-04-24

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180424

Termination date: 20181216
