CN104239327B - A kind of mobile Internet user behavior analysis method and device based on positional information - Google Patents

A kind of mobile Internet user behavior analysis method and device based on positional information Download PDF

Info

Publication number
CN104239327B
CN104239327B CN201310239737.4A CN201310239737A CN104239327B CN 104239327 B CN104239327 B CN 104239327B CN 201310239737 A CN201310239737 A CN 201310239737A CN 104239327 B CN104239327 B CN 104239327B
Authority
CN
China
Prior art keywords
theme
information
place
application service
historical
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310239737.4A
Other languages
Chinese (zh)
Other versions
CN104239327A (en
Inventor
张媛
陈小军
黄哲学
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Institute of Advanced Technology of CAS
Original Assignee
Shenzhen Institute of Advanced Technology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Institute of Advanced Technology of CAS filed Critical Shenzhen Institute of Advanced Technology of CAS
Priority to CN201310239737.4A priority Critical patent/CN104239327B/en
Publication of CN104239327A publication Critical patent/CN104239327A/en
Application granted granted Critical
Publication of CN104239327B publication Critical patent/CN104239327B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Abstract

The present invention relates to a kind of mobile Internet user behavior analysis method and device based on positional information, including step:S1, obtain and store the IMEI code of mobile interchange network users, IMSI codes;S2, the historical position information and historical usage information on services for obtaining and storing each identity mobile interchange network users;S3, according to the historical position information initialize place theme, according to the historical usage information on services initialize application service theme;S4, get parms matrix Φ and parameter matrix Β;S5, according to parameter acquiring parameter matrix Θ;S6, undated parameter matrix Φ, Β, Θ;S7, judge whether Φ, Β, Θ restrain, undated parameter matrix Φ, Β, Θ value is until convergence;S8, according to convergency value Βf、Θf、ΦfThe apply service element maximum with the current location information degree of association is obtained, and is pushed to the user.Application service is pushed present invention utilizes the geographical location information of mobile interchange network users, the accuracy of marketing is improved.

Description

A kind of mobile Internet user behavior analysis method and device based on positional information
【Technical field】
The present invention relates to Data Mining, more particularly to a kind of mobile Internet user behavior point based on positional information Analyse method and device.
【Background technology】
Nowadays the content type in network service is increasingly enriched covers all trades and professions there is provided the webpage of various information, can Various application services to operate in mobile-terminal platform are powerful and species is various, and user's choosing is enriched in these application services While selecting, while also bringing the challenge that information content explodes to user.To enable users to from the application service of numerous and complicated Selection meets the service of self-demand, and operator wishes to provide personalized service for different users, to realize accurately Marketing strategy, so that maximum revenue.Therefore need accurately to analyze the behavior pattern of user, know different user group The behavioral characteristic and its use habit of body.
The method to Internet user's behavioural analysis mainly has behavioural analysis and base based on conventional internet user at present In the major class of behavioural analysis two of wireless interconnected network users.
Behavioural analysis presence based on conventional internet user can not effectively identify sole user and obtain its personal attribute And behavioral data, can only to single behavior in itself carry out analyze prediction shortcoming.In addition, some use such as neutral net at present The user behavior analysis scheme of tagsort algorithm needs to predefine the class of service that user uses, and setting network user makes Corresponding relation between class of business and networks congestion control classification, this way needs in advance to classify to business, The automatic identification of class of business can not be accomplished, cost of labor is higher, and increase with user and class of business is constantly soaring, The scalability of the program is not strong.
Behavioural analysis based on wireless interconnected network users also has many in terms of subscriber segmentation, user characteristics extraction can be with The space of lifting, is also merely resting on the stage that point heap is carried out according to the statistical indicator of user to the subdivision of client at present, to The excavation that family pent-up demand is carried out is not careful enough and gos deep into, and seldom combines and uses in particularly current user behavior analysis scheme The geographical location information at family.Because the behavior of Internet user with the geographical position where the user has great correlation, because The geographical location information of this user has great significance for user behavior analysis.
【The content of the invention】
Present invention seek to address that above-mentioned problems of the prior art, propose a kind of mobile interchange based on positional information Network users behavior analysis method and device.
One aspect of the present invention proposes a kind of mobile Internet user behavior analysis method based on positional information, including step Suddenly:S1, obtain and store the IMEI code of mobile interchange network users, IMSI codes, mobile Internet user identity is identified; S2, the historical position information and historical usage information on services for obtaining and storing each identity mobile interchange network users, it is described to go through History positional information includes some place elements and its frequency, the place that the place element representation mobile interchange network users pass through, The historical usage information on services includes some apply service elements and its frequency, and the apply service element represents mobile interchange The used application service of network users;S3, according to the historical position information initialize place theme Li(i=1,2 ...), according to The historical usage information on services initialization application service theme Aj(j=1,2 ...), wherein each place theme LiRepresent The set of place element described in identical type, each application service theme AjRepresent apply service element described in identical type Set;S4, the historical usage information on services is sampled, connected applications service theme AjGet parms matrix Φ, and The historical position information is sampled, with reference to place theme LiGet parms matrix Β, and wherein Φ represents to apply at each The probability of each apply service element is produced under service theme, Β represents to produce each place element under each place theme Probability;S5, using Gibbs samplings, according to application service theme Aj, place theme Li, parameter matrix Φ, Β get parms square Battle array Θ, Θ represents to produce the probability of each place theme under each application service theme;S6, using Gibbs samplings, and base In Maximum-likelihood estimation criterion, undated parameter matrix Φ, Β, Θ;S7, judge whether parameter matrix Φ, Β, Θ value restrain, if Otherwise repeat step S4 to S6, undated parameter matrix Φ, Β, Θ value are until convergence;S8, the IMEI for obtaining mobile interchange network users Code, IMSI codes and its current location information, according to the parameter matrix Β for belonging to the userf、Θf、ΦfObtain and current location The maximum apply service element of information relevance, and pushed to the user, wherein, Φf、Βf、ΘfRespectively Φ, Β, Θ are received Optimal value after holding back.
Another aspect of the present invention proposes a kind of mobile Internet user behavior analysis device based on positional information, including deposits Module, customer attribute information acquisition module, historical information acquisition module, data processing module, application service pushing module are stored up, its In, the customer attribute information acquisition module obtains the IMEI code of mobile interchange network users, IMSI codes, the memory module storage The IMEI code, IMSI codes;The historical information acquisition module obtains the history bit confidence of each identity mobile interchange network users Breath and historical usage information on services, the historical position information include some place elements and its frequency, the place element The place that mobile interchange network users pass through is represented, the historical usage information on services includes some apply service elements and its frequency Secondary, the apply service element represents the used application service of mobile interchange network users, is gone through described in the memory module storage History positional information and historical usage information on services;The data processing module initializes place according to the historical position information Theme Li(i=1,2 ...), and application service theme A is initialized according to the historical usage information on servicesj(j=1,2 ...), wherein Each place theme LiThe set of place element described in identical type is represented, each application service theme AjRepresent phase With the set of apply service element described in species;The data processing module is sampled to the historical usage information on services, Connected applications service theme AjGot parms matrix Φ, and the historical position information is sampled, with reference to place theme LiObtain Parameter matrix Β, wherein Φ is taken to represent to produce the probability of each apply service element under each application service theme, Β is represented The probability of each place element is produced under each place theme;According to application service theme Aj, place theme Li, parameter matrix Φ, Β, based on Gibbs samplings, the data processing module is got parms matrix Θ, and Θ is represented in each application service theme The lower probability for producing each place theme;Based on Gibbs samplings and Maximum-likelihood estimation criterion, the data processing module is more New parameter matrix Φ, Β, Θ value is until convergence, obtains Φ optimal value Φf, Β optimal value ΒfAnd Θ optimal value Θf; The customer attribute information acquisition module obtains IMEI code, IMSI codes and its current location information of mobile interchange network users, institute Data processing module is stated according to the parameter matrix Β for belonging to the userf、Θf、ΦfObtain with the current location information degree of association most Big application service;The application service pushing module pushes the application maximum with the current location information degree of association to the user Service.
Mobile Internet user behavior analysis method and device proposed by the present invention based on positional information make use of movement The geographical location information of Internet user, and based on mobile interchange network users geographical location information and its in the geographical position institute Its application service paid high attention to is pushed to user using the high correlation of application service species, the accuracy of marketing is improved; The present invention program extracts theme using improved probability topic model is automatic from the ground point set of user and behavior set simultaneously, And the class of business of user need not be classified in advance, the cost of labor that manual classification is caused is reduced, business is realized The automatic identification of species, with stronger scalability.
【Brief description of the drawings】
Fig. 1 is the mobile Internet user behavior analysis method flow diagram based on positional information of one embodiment of the invention;
Fig. 2 is the acquisition of one embodiment of the invention application service maximum with the mobile Internet current location information degree of association The method flow diagram of element;
Fig. 3 is the mobile Internet user behavior analysis structure drawing of device based on positional information of one embodiment of the invention.
【Embodiment】
In order that the objects, technical solutions and advantages of the present invention become apparent from, it is right below in conjunction with specific embodiment and accompanying drawing The present invention is described in further detail.It should be appreciated that the specific embodiment described in text is only to explain skill of the invention Art scheme, and it is not construed as limitation of the present invention.
One aspect of the present invention provides a kind of mobile Internet user behavior analysis method based on positional information, such as Fig. 1 institutes Show, this method comprises the following steps:S1, obtain and store the IMEI code of mobile interchange network users, IMSI codes, to mobile Internet User identity is identified;S2, the historical position information and history for obtaining and storing each identity mobile interchange network users should Use information on services;S3, according to the historical position information initialize place theme Li(i=1,2 ...), according to the historical usage Information on services initialization application service theme Aj(j=1,2,…);S4, the historical usage information on services is sampled, with reference to Application service theme AjGot parms matrix Φ, and the historical position information is sampled, and combines place theme LiObtain Parameter matrix Β;S5, using Gibbs samplings, according to application service theme Aj, place theme Li, parameter matrix Φ, Β obtain Parameter matrix Θ;S6, using Gibbs samplings, and based on Maximum-likelihood estimation criterion, undated parameter matrix Φ, Β, Θ;S7、 Judge whether parameter matrix Φ, Β, Θ value restrain, if otherwise repeat step S4 to S6, undated parameter matrix Φ, Β, Θ value are straight To convergence;S8, IMEI code, IMSI codes and its current location information for obtaining mobile interchange network users, according to belonging to the user Parameter matrix Βf、Θf、ΦfThe apply service element maximum with the current location information degree of association is obtained, and is pushed away to the user Send, wherein, Φf、Βf、ΘfOptimal value respectively after Φ, Β, Θ convergence.
The technical scheme to the above-mentioned mobile Internet user behavior analysis method based on positional information is made into one below Step is discussed in detail.
In step sl, the IMEI code of mobile interchange network users is obtained and stored by mobile operator data servers (International Mobile Equipment Identity, International Mobile Equipment Identity code), IMSI codes (International Mobile Subscriber Identity, international mobile subscriber identity), due to being likely to occur one The situation of card or a card multimachine more than machine, in the present embodiment, by IMEI code and IMSI codes collectively as mobile interchange network users Identity.
In step s 2, obtained by mobile operator data servers and store each identity mobile interchange network users Historical position information and historical usage information on services, the historical position information include some place elements and its frequency, institute Place or place that place element representation mobile interchange network users pass through in the past period T are stated, for example " first people cure Institute ", " Suning market ", " Xinhua Bookstore " etc., the frequency herein be the Internet user in the past period T by upper State the number of times of each place element;The historical usage information on services includes some apply service elements and its frequency, described to answer The used application service in the past period T of mobile interchange network users, such as " QQ ", " Google are represented with service element Map ", " masses' comment " etc., the frequency herein is that the Internet user is answered in the past period T using above-mentioned each With the number of times of service.Preferably, the data sample included for the guarantee historical position information and historical usage information on services It is sufficiently large, to the value of the time T is with some moons or longer is advisable.
In step s3, place theme L is initialized according to the historical position information obtained in step s 2i(i=1, 2 ...), wherein each place theme LiThe set of place element described in identical type is represented, mobile Internet is used as described The places such as " First People's Hospital ", " the second the People's Hospital ", " healthcare hospital for women & children " were once gone at family in a period of time T, then place Some above-mentioned place elements such as theme " hospital " representative " First People's Hospital ", " the second the People's Hospital ", " healthcare hospital for women & children " Set;Meanwhile, in step s3, application service theme A is initialized according to the historical usage information on servicesj(j=1,2 ...), Wherein each application service theme AjThe set of apply service element described in identical type is represented, as described mobile Internet User once used the application services such as " QQ ", " wechat ", " Skype " in a period of time T, then application service theme " chat class " Represent the set of some above-mentioned apply service elements such as " QQ ", " wechat ", " Skype ".
Preferably, the application service theme Aj(j=1,2 ...) obeys Multinomial (π) distributions, and π is obeyed Dirichlet (α) is distributed, and α is Dirichlet distributed constants.
In step s 4, the historical usage information on services is sampled, with reference to the application service theme AjObtain Parameter matrix Φ;The historical position information is sampled simultaneously, with reference to place theme LiGet parms matrix Β, wherein joining Matrix number Φ represented to produce the probability of each apply service element under each application service theme, and parameter matrix Β is represented each The probability of each place element is produced under individual place theme.
In step s 5, using Gibbs samplings, according to the application service theme Aj(j=1,2 ...), place theme Li (i=1,2 ...), parameter matrix Φ, Β get parms matrix Θ, and Θ represents to produce each place under each application service theme The probability of theme.
In step s 6, using Gibbs samplings, and based on Maximum-likelihood estimation criterion, undated parameter matrix Φ, Β, Θ;In the step s 7, judge whether parameter matrix Φ, Β, Θ value restrain, if otherwise repeat step S4 to S6, undated parameter square Battle array Φ, Β, Θ value is until convergence.Gibbs samplings are to calculate LDA(Latent Dirichlet Allocation)Theme mould A kind of mathematics implementation method of shape parameter, this method passes through DSMC(Monte Carlo method), accumulated using class Divide method, by substantial amounts of circulation random sampling, by the use of last result of calculation as prior probability, calculate posteriority again afterwards general Rate, according to Bayes and statistics correlation theory, when cycle-index is enough by result of calculation approaching to reality value.Gibbs takes out Sample method is cyclically updated number of times K and can preset, and K values are bigger, parameter matrix optimal value Φf、Βf、ΘfIt is more accurate, in this implementation In example, defining K value can be set with specific reference to the operational capability of data processor.
In step s 8, IMEI code, IMSI codes and its current location information of mobile interchange network users are obtained, according to belonging to The parameter matrix Β of the userf、Θf、ΦfObtain the apply service element maximum with the current location information degree of association, and to institute User's push is stated, wherein, Φf、Βf、ΘfOptimal value respectively after Φ, Β, Θ convergence.Obtain mobile interchange network users IMEI code, IMSI codes are used for the identity for determining the mobile interchange network users, mutual due to having stored the movement in step sl The IMEI code of on-line customer, IMSI codes are identified to its identity, therefore in step s 8, according to belonging to the mobile Internet The parameter matrix Β of userf、Θf、ΦfObtain the application maximum with the mobile Internet user current location information relevance Service element, and push the application service to the mobile interchange network users.
Preferably, as shown in Fig. 2 step S8 specifically includes following steps:S81, according to the current location information and Parameter matrix ΒfObtain the place theme L maximum with the current location information degree of associationc;S82, according to parameter matrix ΘfObtain Take and LcThe maximum application service theme A of the degree of associationc;S83, according to parameter matrix ΦfObtain and AcThe maximum application clothes of the degree of association Be engaged in element Ec, the apply service element EcIt is i.e. maximum with the mobile Internet user current location information relevance.Specifically Ground, in step S81, place element and parameter matrix Β in the current location informationfObtain and the place The maximum place theme L of elements correlation degreec, wherein parameter matrix ΒfRow represent place element, row represent place theme, according to The place element, from parameter matrix ΒfIt is middle to extract the corresponding column vector of place element, the wherein element representation in column vector The place element belongs to the probability distribution situation of different location theme, and column vector is sorted, and obtains general belonging to the place element The maximum place theme L of ratec;In step S82, according to parameter matrix ΘfObtain and LcThe maximum application service theme of the degree of association Ac, wherein parameter matrix ΘfRow represent application service theme, row represent place theme, according to the place obtained in step S81 Theme Lc, from parameter matrix ΘfMiddle extraction place theme LcElement representation in corresponding row vector, wherein row vector is represented Place theme LcThe probability distribution situation of correspondence different application service theme, row vector is sorted, place theme L is obtainedcIt is corresponding The application service theme A of maximum probabilityc;In step S83, according to parameter matrix ΦfObtain and AcThe maximum application clothes of the degree of association Be engaged in element Ec, the apply service element EcMaximum, the wherein parameter matrix Φ with the user current location information relevancef's Row represents apply service element, and row represent application service theme, according to the application service theme obtained in step S82, from parameter Matrix ΦfMiddle extraction application service theme AcElement representation application service theme A in corresponding row vector, wherein row vectorc Comprising different application service element probability distribution situation, row vector is sorted, the bigger apply service element table of probability Show in application service theme AcLower user is higher using the possibility of the apply service element, chooses the application service of maximum probability Element EcAs the application service maximum with the mobile Internet user current location information relevance, and it is mutual to the movement On-line customer pushes.
Another aspect of the present invention provides a kind of mobile Internet user behavior analysis device based on positional information, such as Fig. 3 Shown, described device includes:Memory module 100, customer attribute information acquisition module 200, historical information acquisition module 300, number According to processing module 400, application service pushing module 500.
Below by the company between the mobile Internet user behavior analysis device modules based on positional information Connect relation and operation principle is described in further detail.
The customer attribute information acquisition module 200 is obtained by mobile operator data servers and stores mobile interchange The IMEI code of network users(International Mobile Equipment Identity, International Mobile Equipment Identity code)、 IMSI codes(International Mobile Subscriber Identity, international mobile subscriber identity), due to possible There is the situation of one-telephone multi-card or a card multimachine, in the present embodiment, by IMEI code and IMSI codes collectively as mobile Internet The identity of user.The memory module 100 stores the IMEI code, IMSI codes, as to the mobile interchange network users Identity is identified.
The historical information acquisition module 300 is obtained by mobile operator data servers and stores the movement of each identity The historical position information and historical usage information on services of Internet user, the historical position information includes some place elements And its frequency, place or place that the place element representation mobile interchange network users pass through in the past period T, for example " First People's Hospital ", " Suning market ", " Xinhua Bookstore " etc., the frequency herein were the Internet user at one section of past Between in T by each above-mentioned place element number of times;The historical usage information on services include some apply service elements and its The frequency, the apply service element represents the used application service in the past period T of mobile interchange network users, for example " QQ ", " Google Maps ", " masses' comment " etc., the frequency herein is that the Internet user uses in the past period T The number of times of each above-mentioned application service.Preferably, it is that the guarantee historical position information and historical usage information on services are included Data sample it is sufficiently large, to the value of the time T is with some moons or longer is advisable.The memory module 100 stores described Historical position information and historical usage information on services.
The data processing module 400 initializes place theme L according to the historical position informationi(i=1,2 ...), its In each place theme LiThe set of place element described in identical type is represented, mobile interchange network users are at one section as described Once the places such as " First People's Hospital ", " the second the People's Hospital ", " healthcare hospital for women & children " were gone in time T, then place theme " doctor Institute " represents the set of some above-mentioned place elements such as " First People's Hospital ", " the second the People's Hospital ", " healthcare hospital for women & children ";Together When, the data processing module 400 initializes application service theme A according to the historical usage information on servicesj(j=1,2 ...), Wherein each application service theme AjThe set of apply service element described in identical type is represented, as described mobile Internet User once used the application services such as " QQ ", " wechat ", " Skype " in a period of time T, then application service theme " chat class " Represent the set of some above-mentioned apply service elements such as " QQ ", " wechat ", " Skype ".
Preferably, the application service theme Aj(j=1,2 ...) obeys Multinomial (π) distributions, and π is obeyed Dirichlet (α) is distributed, and α is Dirichlet distributed constants.
400 pairs of the data processing module historical usage information on services is sampled, with reference to the application service master Inscribe AjGet parms matrix Φ;The historical position information is sampled simultaneously, with reference to place theme LiGet parms matrix Β, wherein parameter matrix Φ represent to produce the probability of each apply service element, parameter matrix under each application service theme Β represents to produce the probability of each place element under each place theme.
According to the application service theme Aj(j=1,2 ...), place theme Li(i=1,2 ...), parameter matrix Φ, Β, and Based on Gibbs samplings, the data processing module 400 is got parms matrix Θ, and Θ is represented under each application service theme Produce the probability of each place theme.
Based on Gibbs samplings and Maximum-likelihood estimation criterion, the data processing module 400 updates the parameter matrix Φ, Β, Θ value are until convergence, obtains Φ optimal value Φf, Β optimal value ΒfAnd Θ optimal value Θf.Gibbs samples Method is to calculate LDA(Latent Dirichlet Allocation)A kind of mathematics implementation method of topic model parameter, this method Pass through DSMC(Monte Carlo method), using class integration method, pass through substantial amounts of circulation random sampling, profit With last result of calculation as prior probability, posterior probability is calculated again afterwards, according to Bayes and statistics correlation theory, By result of calculation approaching to reality value when cycle-index is enough.Gibbs samplings are cyclically updated number of times K and can preset, K values It is bigger, parameter matrix optimal value Φf、Βf、ΘfIt is more accurate, in the present embodiment, can be with specific reference to the data processing module 400 operational capability sets defining K value.
The customer attribute information acquisition module 200 obtains the IMEI codes of mobile interchange network users, IMSI codes and its currently Positional information, the data processing module 400 is according to the parameter matrix Β for belonging to the userf、Θf、ΦfObtain and present bit The maximum apply service element of information relevance is put, the application service pushing module 500 pushes the application to the user and taken Business.The IMEI code of mobile interchange network users, IMSI codes are used for the identity for determining the mobile interchange network users, due to the storage Module 100, which has stored the IMEI code of the mobile interchange network users, IMSI codes, to be used to be identified user identity, therefore basis Belong to the parameter matrix Β of the mobile interchange network usersf、Θf、ΦfObtain and believe with the mobile Internet user current location The maximum apply service element of the degree of association is ceased, the application service pushing module 500 is pushed to the mobile interchange network users should Application service.
Preferably, the data processing module 400 is according to the current location information and parameter matrix ΒfObtain and institute State the maximum place theme L of the current location information degree of associationc, according to parameter matrix ΘfObtain and LcThe maximum application clothes of the degree of association Be engaged in theme Ac, according to parameter matrix ΦfObtain and AcThe maximum apply service element E of the degree of associationc, the apply service element EcI.e. For the application service maximum with the current location information degree of association.
Specifically, place element and parameter square of the data processing module 400 in the current location information Battle array ΒfObtain the place theme L maximum with the place elements correlation degreec, wherein parameter matrix ΒfRow represent place element, Row represent place theme, according to the place element, from parameter matrix ΒfMiddle extraction corresponding column vector of place element, its The element representation place element in middle column vector belongs to the probability distribution situation of different location theme, and column vector is sorted, obtained To the place theme L of the maximum probability belonging to the place elementc;The data processing module 400 is according to parameter matrix ΘfObtain With LcThe maximum application service theme A of the degree of associationc, wherein parameter matrix ΘfRow represent application service theme, row represent place Theme, according to the place theme L obtained in step S71c, from parameter matrix ΘfMiddle extraction place theme LcCorresponding row to Amount, wherein the element representation in row vector represents place theme LcThe probability distribution situation of correspondence different application service theme, will Row vector sorts, and obtains place theme LcThe application service theme A of corresponding maximum probabilityc;The data processing module 400 According to parameter matrix ΦfObtain and AcThe maximum apply service element E of the degree of associationc, the apply service element EcWork as with the user Front position information relevance is maximum, wherein parameter matrix ΦfRow represent apply service element, row represent application service theme, According to application service theme Ac, from parameter matrix ΦfMiddle extraction application service theme AcCorresponding row vector, wherein row vector In element representation application service theme AcComprising different application service element probability distribution situation, row vector is arranged Sequence, the bigger apply service element of probability is represented in application service theme AcLower user uses the possibility of the apply service element It is higher, choose the apply service element E of maximum probabilitycAs with the mobile Internet user current location information relevance most Big application service, the application service pushing module 500 pushes the application service to the mobile interchange network users.
Mobile Internet user behavior analysis method and device proposed by the present invention based on positional information make use of movement The geographical location information of Internet user, and based on mobile interchange network users geographical location information and its in the geographical position institute Its application service paid high attention to is pushed to user using the high correlation of application service species, the accuracy of marketing is improved; The present invention program extracts theme using improved probability topic model is automatic from the ground point set of user and behavior set simultaneously, And the class of business of user need not be classified in advance, the cost of labor that manual classification is caused is reduced, business is realized The automatic identification of species, with stronger scalability.
Although the present invention is described with reference to current better embodiment, those skilled in the art should be able to manage Solution, above-mentioned better embodiment is only used for explanation and illustration technical scheme, and is not used for limiting the guarantor of the present invention Shield scope, any within the scope of the spirit and principles in the present invention, any modification, equivalence replacement, deformation, improvement for being done etc., Within the claims that should be included in the present invention.

Claims (6)

1. a kind of mobile Internet user behavior analysis method based on positional information, comprises the following steps:
S1, obtain and store the IMEI code of mobile interchange network users, IMSI codes, mobile Internet user identity is identified;
S2, the historical position information and historical usage information on services for obtaining and storing each identity mobile interchange network users, institute Stating historical position information includes some place elements and its frequency, the ground that the place element representation mobile interchange network users pass through Point, the historical usage information on services includes some apply service elements and its frequency, and the apply service element represents mobile The used application service of Internet user;
S3, according to the historical position information initialize place theme Li(i=1,2 ...), according to the historical usage information on services Initialize application service theme Aj(j=1,2 ...), wherein each place theme LiRepresent place element described in identical type Set, each application service theme AjRepresent the set of apply service element described in identical type;
S4, the historical usage information on services is sampled, connected applications service theme AjGet parms matrix Φ, and to institute State historical position information to be sampled, with reference to place theme LiGot parms matrix Β, and wherein Φ is represented in each application service The probability of each apply service element is produced under theme, Β represents to produce the general of each place element under each place theme Rate;
S5, using Gibbs samplings, according to application service theme Aj, place theme Li, parameter matrix Φ, Β get parms matrix Θ, Θ represent to produce the probability of each place theme under each application service theme;
S6, using Gibbs samplings, and based on Maximum-likelihood estimation criterion, undated parameter matrix Φ, Β, Θ;
S7, judge whether parameter matrix Φ, Β, Θ value restrain, if otherwise repeat step S4 to S6, undated parameter matrix Φ, Β, Θ values are until convergence;
S8, IMEI code, IMSI codes and its current location information for obtaining mobile interchange network users, according to the ginseng for belonging to the user Matrix number Βf、Θf、ΦfThe apply service element maximum with the current location information degree of association is obtained, and is pushed to the user, Wherein, Φf、Βf、ΘfOptimal value respectively after Φ, Β, Θ convergence.
2. the mobile Internet user behavior analysis method according to claim 1 based on positional information, it is characterised in that Application service theme AjMultinomial (π) distributions are obeyed, π obeys Dirichlet (α) distributions, and α is that Dirichlet is distributed ginseng Number.
3. the mobile Internet user behavior analysis method according to claim 1 based on positional information, step S8 includes Following steps:
S81, according to the current location information and parameter matrix ΒfObtain and current location information degree of association maximum Place theme Lc
S82, according to parameter matrix ΘfObtain and LcThe maximum application service theme A of the degree of associationc
S83, according to parameter matrix ΦfObtain and AcThe maximum apply service element E of the degree of associationc, the apply service element EcWith The user current location information relevance is maximum.
4. a kind of mobile Internet user behavior analysis device based on positional information, including:Memory module, customer attribute information Acquisition module, historical information acquisition module, data processing module, application service pushing module, wherein,
The customer attribute information acquisition module obtains the IMEI code of mobile interchange network users, IMSI codes, and the memory module is deposited Store up the IMEI code, IMSI codes;
The historical information acquisition module obtains the historical position information and historical usage of each identity mobile interchange network users Information on services, the historical position information includes some place elements and its frequency, the place element representation mobile Internet The place that user passes through, the historical usage information on services includes some apply service elements and its frequency, the application service The used application service of element representation mobile interchange network users, the memory module stores the historical position information and gone through History application service information;
The data processing module initializes place theme L according to the historical position informationi(i=1,2 ...), and according to described Historical usage information on services initialization application service theme Aj(j=1,2 ...), wherein each place theme LiRepresent identical The set of place element described in species, each application service theme AjRepresent the collection of apply service element described in identical type Close;
The data processing module is sampled to the historical usage information on services, connected applications service theme AjGet parms Matrix Φ, and the historical position information is sampled, with reference to place theme LiGot parms matrix Β, and wherein Φ is represented The probability of each apply service element is produced under each application service theme, Β represents to produce each ground under each place theme The probability of point element;
According to application service theme Aj, place theme Li, parameter matrix Φ, Β, based on Gibbs samplings, the data processing mould Block gets parms matrix Θ, and Θ represents to produce the probability of each place theme under each application service theme;
Based on Gibbs samplings and Maximum-likelihood estimation criterion, data processing module undated parameter matrix Φ, Β, Θ value Until convergence, obtains Φ optimal value Φf, Β optimal value ΒfAnd Θ optimal value Θf
The customer attribute information acquisition module obtains IMEI code, IMSI codes and its present bit confidence of mobile interchange network users Breath, the data processing module is according to the parameter matrix Β for belonging to the userf、Θf、ΦfAcquisition is associated with current location information Spend maximum application service;
Application service pushing module pushes the application service maximum with the current location information degree of association to the user.
5. the mobile Internet user behavior analysis device according to claim 4 based on positional information, it is characterised in that Application service theme AjMultinomial (π) distributions are obeyed, π obeys Dirichlet (α) distributions, and α is that Dirichlet is distributed ginseng Number.
6. the mobile Internet user behavior analysis device according to claim 4 based on positional information, it is characterised in that The data processing module is according to the current location information and parameter matrix ΒfAcquisition is associated with the current location information Spend maximum place theme Lc, according to parameter matrix ΘfObtain and LcThe maximum application service theme A of the degree of associationc, according to parameter Matrix ΦfObtain and AcThe maximum apply service element E of the degree of associationc, the apply service element EcAs and current location information The maximum application service of the degree of association.
CN201310239737.4A 2013-06-17 2013-06-17 A kind of mobile Internet user behavior analysis method and device based on positional information Active CN104239327B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310239737.4A CN104239327B (en) 2013-06-17 2013-06-17 A kind of mobile Internet user behavior analysis method and device based on positional information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310239737.4A CN104239327B (en) 2013-06-17 2013-06-17 A kind of mobile Internet user behavior analysis method and device based on positional information

Publications (2)

Publication Number Publication Date
CN104239327A CN104239327A (en) 2014-12-24
CN104239327B true CN104239327B (en) 2017-11-07

Family

ID=52227416

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310239737.4A Active CN104239327B (en) 2013-06-17 2013-06-17 A kind of mobile Internet user behavior analysis method and device based on positional information

Country Status (1)

Country Link
CN (1) CN104239327B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106488493B (en) * 2015-08-24 2020-06-02 阿里巴巴集团控股有限公司 Method and device for identifying network hotspot type of user and electronic equipment
CN107071181B (en) * 2017-04-27 2020-01-10 长沙军鸽软件有限公司 Method for automatically matching communication contact persons
CN107704586B (en) * 2017-10-09 2020-02-07 陈包容 User portrait method, device and system based on user activity address
CN107613084B (en) * 2017-10-09 2020-02-07 陈包容 Method, device and system for automatically grouping contacts in address list

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101986298A (en) * 2010-10-28 2011-03-16 浙江大学 Information real-time recommendation method for online forum
CN102355623A (en) * 2011-06-29 2012-02-15 青岛百灵信息科技有限公司 System and method for changing applicable desktop theme of mobile terminal according to position of mobile terminal
CN102591917A (en) * 2011-12-16 2012-07-18 华为技术有限公司 Data processing method and system and related device
CN102902700A (en) * 2012-04-05 2013-01-30 中国人民解放军国防科学技术大学 Online-increment evolution topic model based automatic software classifying method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009059480A1 (en) * 2007-11-08 2009-05-14 Shanghai Hewlett-Packard Co., Ltd Url and anchor text analysis for focused crawling

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101986298A (en) * 2010-10-28 2011-03-16 浙江大学 Information real-time recommendation method for online forum
CN102355623A (en) * 2011-06-29 2012-02-15 青岛百灵信息科技有限公司 System and method for changing applicable desktop theme of mobile terminal according to position of mobile terminal
CN102591917A (en) * 2011-12-16 2012-07-18 华为技术有限公司 Data processing method and system and related device
CN102902700A (en) * 2012-04-05 2013-01-30 中国人民解放军国防科学技术大学 Online-increment evolution topic model based automatic software classifying method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Categorizing software applications for maintenance;MeMillan,C etc;《International Conference on Software Maintenance》;20110930;343-352 *
基于地理位置的社交网络潜在用户和位置推荐模型研究;朱荣鑫;《中国优秀硕士学位论文全文数据库 信息科技辑》;20130615;I139-251 *

Also Published As

Publication number Publication date
CN104239327A (en) 2014-12-24

Similar Documents

Publication Publication Date Title
Yin et al. Joint modeling of user check-in behaviors for real-time point-of-interest recommendation
Sobolevsky et al. General optimization technique for high-quality community detection in complex networks
CN104239327B (en) A kind of mobile Internet user behavior analysis method and device based on positional information
CN104598557B (en) The method and apparatus of data rasterizing, user behavior analysis
CN103795613B (en) Method for predicting friend relationships in online social network
CN104731917B (en) A kind of recommendation method and device
CN105045916A (en) Mobile game recommendation system and recommendation method thereof
CN110191416A (en) For analyzing the devices, systems, and methods of the movement of target entity
CN106506705A (en) Listener clustering method and device based on location-based service
CN107798027B (en) Information popularity prediction method, information recommendation method and device
CN105138647A (en) Travel network cell division method based on Simhash algorithm
CN104915354A (en) Multimedia file pushing method and device
CN106101222A (en) The method for pushing of information and device
CN105069077A (en) Search method and device
Chen et al. Effective and efficient user account linkage across location based social networks
EP3358474A1 (en) Route search method, device and apparatus, and non-volatile computer storage medium
CN108052639A (en) Industry user based on carrier data recommends method and device
Xin et al. A location-context awareness mobile services collaborative recommendation algorithm based on user behavior prediction
CN106470435B (en) The method and system of identification WiFi groups
CN110298687B (en) Regional attraction assessment method and device
CN105589683B (en) Sample extraction method and device
Guidotti et al. Unveiling mobility complexity through complex network analysis
CN105282011A (en) Social group finding method based on cluster fusion algorithm
CN107301192A (en) A kind of terminal identification method and identification server
KR102135477B1 (en) Method and system for image automatic classification

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant