CN106095776B - The method that a kind of couple of user carries out Topics Crawling and application is recommended - Google Patents

The method that a kind of couple of user carries out Topics Crawling and application is recommended Download PDF

Info

Publication number
CN106095776B
CN106095776B CN201610357421.9A CN201610357421A CN106095776B CN 106095776 B CN106095776 B CN 106095776B CN 201610357421 A CN201610357421 A CN 201610357421A CN 106095776 B CN106095776 B CN 106095776B
Authority
CN
China
Prior art keywords
user
application
distribution
theme
indicate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610357421.9A
Other languages
Chinese (zh)
Other versions
CN106095776A (en
Inventor
胡浩
柏杨
刘冶
印鉴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Zhanyi Information Technology Co.,Ltd.
Original Assignee
GUANGZHOU INFINITE WISDOM ASPECT INFORMATION TECHNOLOGY Co Ltd
Guangzhou Zhongda Nansha Technology Innovation Industrial Park Co Ltd
National Sun Yat Sen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GUANGZHOU INFINITE WISDOM ASPECT INFORMATION TECHNOLOGY Co Ltd, Guangzhou Zhongda Nansha Technology Innovation Industrial Park Co Ltd, National Sun Yat Sen University filed Critical GUANGZHOU INFINITE WISDOM ASPECT INFORMATION TECHNOLOGY Co Ltd
Priority to CN201610357421.9A priority Critical patent/CN106095776B/en
Publication of CN106095776A publication Critical patent/CN106095776A/en
Application granted granted Critical
Publication of CN106095776B publication Critical patent/CN106095776B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides the method that a kind of couple of user carries out Topics Crawling and application is recommended, this method, which first quantitatively calculates, measures the weight that user's theme is contributed in some application that user is installed, the model that user selects application installation is established according to weight, then calculate again established model parameter can analog subscriber selection application process and recommend the application software of its intention to user, the function of excavating the potential feature of user and application according to the installation list of application of user is realized, and recommends user the function of its more interested application.

Description

The method that a kind of couple of user carries out Topics Crawling and application is recommended
Technical field
The present invention relates to the field of data mining, carry out Topics Crawling more particularly, to a kind of couple of user and application is recommended Method.
Background technique
On a mobile platform, the theme for excavating user is often carried out according to some apparent labels of user, such as Age, active degree, geographical location etc..In fact, the installation list of application of user is also able to reflect some subject matter preferences of user. But since the size distribution of the amount of user installation application is extremely uneven, and the user in country variant or area answers same Preference difference is larger, thus this partial information is more difficult is used appropriately.Using traditional Topics Crawling algorithm, such as LDA algorithm, the theme excavated can not reflect that the theme distribution of user and geographical location influence.
This theme distribution that the installation list of application and geographical location information of user is included can not only be used to carry out Customer analysis may be also used in and carry out user using in the work recommended.Traditional proposed algorithm, can be according to the spy of user The union feature of sign, the feature of application and user and application carrys out training pattern, to complete to carry out user using recommendation Task.But the installation list of application of user can not directly be brought as feature because each user installation application amount difference compared with Greatly.It, can be by if the potential feature of user and application can be excavated according to the installation list of application of numerous users This calculates a similarity between user and application, and the important feature of an application whether is liked as judge user.
Summary of the invention
The present invention provides the method that a kind of couple of user carries out Topics Crawling and application is recommended, and this method can be according to the peace of user List of application is filled to excavate the potential feature of user and application, and recommends the application of its intention to user.
In order to reach above-mentioned technical effect, technical scheme is as follows:
The method that a kind of couple of user carries out Topics Crawling and application is recommended, comprising the following steps:
S1: the weight that user's theme is contributed in application is calculated;
S2: the probability graph model of user's selection course when installing application is established;
S3: seeking probability graph model parameter, and completes the recommendation of excavation and the application of theme;
Further, the weighted value that user's theme is contributed in application is calculated in the step S1:
Wherein, user indicates that a user, app indicate the application that user is installed, LuserIndicate user user institute Number, L are applied in installationaverageWhat the average user of expression was installed applies number, | U | indicate the total amount of user in data set, nappIndicate the sum for being mounted with the user of the app.By rounding up, weight is an integer greater than 0.
Further, the process of the step S2 is as follows:
S21: according to the preference distribution of userGenerate the preferences variable x of useru,n, whereinI.e. Preferences variable xu,nObey withFor a bi-distribution of parameter;
S22: if x obtained in S21u,nValue be 0, then it represents that when selecting application, consider is personalized preference to user, Distribution first according to user to themeUser sample out for the theme z of application to be installedu,n, whereinI.e. the application theme obey withFor the multinomial distribution of parameter;
S23: according to distribution of the theme z in each applicationCorresponding application is generated, whereinI.e. Institute's application to be installed is by a parameterMultinomial distribution sample generate;
S24: if x obtained in S21u,nValue be 1, then it represents that in selection, consider at once is locating geographical position to user Factor is set, geographical location locating for user u is l,Indicate the position to the preference distribution of application, whereinI.e. institute's application to be installed is by a parameterMultinomial distribution sample generate;
Wherein, xu,nIndicate selection preference of u-th of user in n-th of installation application, xu,n∈ { 0,1 }, works as xu,nValue When being 0, indicates that user u is to select to install what this was applied according to the hobby feature of oneself, work as xu,nValue be 1 when, indicate user It is according to geographical location locating for the user u to select that the application is installed;Indicate the preference distribution of user u selection application, zu,nIndicate the theme of n-th of application of u-th of user installation,Indicate user u to the preference distribution of theme,Indicate theme z A distribution in each application,Indicate preference distribution of the country l to application, appuIndicate what user u was installed Some application;It indicatesPrior distribution parameter,It indicatesPrior distribution parameter,It indicatesPrior distribution Parameter,It indicatesPrior distribution parameter;For convenient for solving model parameter, prior distribution takes corresponding conjugation here Distribution, the conjugation of bi-distribution are distributed as beta (Beta) distribution, and the conjugation of multinomial distribution is distributed as Di Li Cray (Dirichlet) it is distributed;
Herein, when the weight using app of user user installation is weight, it is believed that user in total carries out app The process of weight selection installation.
Further, the process for the parameter for seeking model in the step S3 is as follows:
It is iterated by formula of sampling as follows:
WhereinIndicate exclude current application when time select after, xu,nValue be 0 application selection install number;AndIndicate exclude current application when time select after, xu,nValue be x' application selection install number;Correspondingly,Table Show exclude current application after time selection, the selection installation number of application that theme is z;It indicates to exclude current application When time selection after, using app remaining weight-1 selection install in, xu,nValue be 0 number;It is other that the rest may be inferred;
After completing Λ iteration, according to the following various parameter for acquiring model:
Each theme is applying upper distributionThe theme exactly excavated utilizes following formula:
P (app | u, l)=p (x=0 | u) ∑zP (z | u) p (app | z)+p (x=1 | u) p (app | l) calculate user couple Degree value is liked in a certain application, and the highest several applications of degree value are liked in right rear line recommendation.
Preferably, the number of iterations Λ is not small by 300.
Compared with prior art, the beneficial effect of technical solution of the present invention is:
The present invention, which first quantitatively calculates, measures the weight that user's theme is contributed in some application that user is installed, in conjunction with power Establish the model of process that user selects application to be installed again, the parameter for then calculating established model again can mould Quasi- user selects the process of application and recommends the application software of its intention to user, realizes the installation list of application according to user The function of the potential feature of user and application is excavated, and recommends user the function of more interested application.
Detailed description of the invention
Fig. 1 is the flow chart that model parameter is sought in the present invention;
Fig. 2 is the explanatory diagram of probability graph model.
Specific embodiment
The attached figures are only used for illustrative purposes and cannot be understood as limitating the patent;
In order to better illustrate this embodiment, certain components have omission, zoom in or out in attached drawing, do not represent practical production The size of product;
To those skilled in the art, it is to be understood that certain known features and its explanation, which may be omitted, in attached drawing 's.
The following further describes the technical solution of the present invention with reference to the accompanying drawings and examples.
Embodiment 1
The method that a kind of couple of user carries out Topics Crawling and application is recommended, comprising the following steps:
S1: the weight that user's theme is contributed in application is calculated;
S2: the probability graph model of user's selection course when installing application is established;
S3: seeking the parameter of probability graph model, and completes the recommendation of theme excavated and applied.
Further, the weighted value that user's theme is contributed in application is calculated in the step S1:
Wherein, user indicates that a user, app indicate the application that user is installed, LuserIndicate user user institute Number, L are applied in installationaverageWhat the average user of expression was installed applies number, | U | indicate the total amount of user in data set, nappIndicate the sum for being mounted with the user of the app.By rounding up, weight is an integer greater than 0.
As shown in Fig. 2, the process of step S2 is as follows:
S21: according to the preference distribution of userGenerate the preferences variable x of useru,n, whereinI.e. Preferences variable xu,nObey withFor a bi-distribution of parameter;
S22: if x obtained in S21u,nValue is 0, then it represents that user is personalized preference select to consider when application, first The first distribution according to user to themeUser sample out for the theme z of application to be installedu,n, wherein I.e. the application theme obey withFor the multinomial distribution of parameter;
S23: according to distribution of the theme z in each applicationCorresponding application is generated, whereinI.e. Institute's application to be installed is by a parameterMultinomial distribution sample generate;
S24: if x obtained in S21u,nValue is 1, then it represents that user is locating geographical position select to consider when application Factor is set, geographical location locating for user u is l,Indicate the position to the preference distribution of application, whereinI.e. institute's application to be installed is by a parameterMultinomial distribution sample generate;
Wherein, xu,nIndicate selection preference of u-th of user in n-th of installation application, xu,n∈ { 0,1 }, works as xu,nValue When being 0, indicates that user u is to select to install what this was applied according to the hobby feature of oneself, work as xu,nValue be 1 when indicate user It is according to geographical location locating for the user u to select that the application is installed;Indicate the preference distribution of user u selection application, zu,nIndicate the theme of n-th of application of u-th of user installation,Indicate user u to the preference distribution of theme,Indicate theme z A distribution in each application,Indicate preference distribution of the country l to application, appuIndicate what user u was installed Some application;It indicatesPrior distribution parameter,It indicatesPrior distribution parameter,It indicatesPrior distribution Parameter,It indicatesPrior distribution parameter;For convenient for solving parameter, prior distribution takes corresponding conjugation point here Cloth, the conjugation of bi-distribution are distributed as beta (Beta) distribution, and the conjugation of multinomial distribution is distributed as Di Li Cray (Dirichlet) Distribution.Study firstWithThe result of model parameter is influenced and little, for convenient for calculating, if its per it is one-dimensional It is identical, and its value takes a value less than 1;In Fig. 2, L indicates the number in geographical location, and K indicates the number of theme, and U is indicated The number of user, N indicate the number of the installed application of user.
Herein, according to patent requirements 2, when the weight using app of user user installation is weight, it is believed that user The process of weight selection installation has been carried out to app in total.
It is as follows that model parameter process is sought in step S3:
It is iterated by formula of sampling as follows:
WhereinIndicate exclude current application when time select after, xU, nValue be 0 application selection install number;AndIndicate exclude current application when time select after, xu,nValue be x' application selection install number;Correspondingly,Table Show exclude current application after time selection, the selection installation number of application that theme is z;It indicates to exclude current application When time selection after, using app remaining weight-1 selection install in, xu,nValue be 0 number;It is other that the rest may be inferred;
After completing Λ iteration, according to the following various parameter for acquiring model:
Each theme is applying upper distributionThe theme exactly excavated utilizes following formula:
P (app | u, l)=p (x=0 | u) ΣzP (z | u) p (app | z)+p (x=1 | u) p (app | l) calculate user couple Degree value is liked in a certain application, and the highest several applications of degree value are liked in right rear line recommendation.
This method, which first quantitatively calculates, measures the weight that user's theme is contributed in some application that user is installed, in conjunction with power Establish the model of process that user selects application to be installed again, the parameter for then calculating established model again can mould Quasi- user selects the process of application and recommends the application software of its intention to user, realizes the installation list of application according to user The function of the potential feature of user and application is excavated, and recommends user the function of more interested application.
The same or similar label correspond to the same or similar components;
Described in attached drawing positional relationship for only for illustration, should not be understood as the limitation to this patent;
Obviously, the above embodiment of the present invention be only to clearly illustrate example of the present invention, and not be pair The restriction of embodiments of the present invention.For those of ordinary skill in the art, may be used also on the basis of the above description To make other variations or changes in different ways.There is no necessity and possibility to exhaust all the enbodiments.It is all this Made any modifications, equivalent replacements, and improvements etc., should be included in the claims in the present invention within the spirit and principle of invention Protection scope within.

Claims (3)

  1. The method that 1. a kind of couple of user carries out Topics Crawling and application is recommended, which comprises the following steps:
    S1: the weight that user's theme is contributed in application is calculated;
    S2: the probability graph model of user's selection course when installing application is established;
    S3: seeking the parameter of probability graph model, and completes the recommendation of excavation and the application of theme;
    The weighted value that user's theme is contributed in application is calculated in step S1:
    Wherein, user indicates that a user, app indicate the application that user is installed, LuserIndicate what user user was installed Using number, LaverageWhat the average user of expression was installed applies number, | U | indicate the total amount of user in data set, nappIt indicates It is mounted with the sum of the user of the app, by rounding up, weight is an integer greater than 0;
    The process of step S2 is as follows:
    S21: according to the preference distribution of userGenerate the preferences variable x of useru,n, whereinThat is preference Variable xu,nObey withFor a bi-distribution of parameter;
    S22: if x obtained in S21u,nValue is 0, then it represents that user is personalized preference select to consider when application, first root Distribution according to user to themeUser sample out for the theme z of application to be installedu,n, whereinI.e. should The theme of application obey withFor the multinomial distribution of parameter;
    S23: according to distribution of the theme z in each applicationCorresponding application is generated, whereinWanted The application of installation by a parameter isMultinomial distribution sample generate;
    S24: if x obtained in S21u,nValue be 1, then it represents that user selection consider at once be locating geographical location because Element, geographical location locating for user u are l,Indicate the position to the preference distribution of application, whereinI.e. Institute's application to be installed is by a parameterMultinomial distribution sample generate;
    Wherein, xu,nIndicate selection preference of u-th of user in n-th of installation application, xu,n∈ { 0,1 }, works as xu,nValue be 0 When, it indicates that user u is to select to install what this was applied according to the hobby feature of oneself, works as xu,nValue be 1 when, indicate user be According to geographical location locating for the user u select that the application is installed;Indicate the preference distribution of user u selection application, zu,n Indicate the theme of n-th of application of u-th of user installation,Indicate user u to the preference distribution of theme,Indicate that theme z exists A respectively upper distribution of application,Indicate preference distribution of the country l to application, appuIndicate user u installed certain A application;It indicatesPrior distribution parameter,It indicatesPrior distribution parameter,It indicatesPrior distribution Parameter,It indicatesPrior distribution parameter;
    When the weight using app of user user installation is weight, user has carried out weight selection peace to app in total The process of dress.
  2. 2. the method according to claim 1 for carrying out Topics Crawling to user and application is recommended, which is characterized in that the step The process for the parameter for seeking model in rapid S3 is as follows:
    It is iterated by formula of sampling as follows:
    WhereinIndicate exclude current application when time select after, xu,nValue be 0 application selection install number;And Indicate exclude current application when time select after, xu,nValue be x' application selection install number,It indicates to exclude current Application after time selection, the selection installation number of application that theme is z;It indicates to exclude selecting when secondary for current application Afterwards, using app remaining weight-1 selection installation in, xu,nValue be 0 number;
    After completing Λ iteration, according to the following various parameter for acquiring model:
    Distribution of each theme in each applicationThe theme exactly excavated, utilizes following formula: and p (app | u, l)=p (x=0 | u) ∑zP (z | u) p (app | z)+p (x=1 | u) p (app | l) calculates user and likes degree value to a certain application, The highest several applications of degree value are liked in right rear line recommendation.
  3. 3. the method according to claim 2 for carrying out Topics Crawling to user and application is recommended, which is characterized in that described to change Generation number Λ is not small by 300.
CN201610357421.9A 2016-05-25 2016-05-25 The method that a kind of couple of user carries out Topics Crawling and application is recommended Active CN106095776B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610357421.9A CN106095776B (en) 2016-05-25 2016-05-25 The method that a kind of couple of user carries out Topics Crawling and application is recommended

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610357421.9A CN106095776B (en) 2016-05-25 2016-05-25 The method that a kind of couple of user carries out Topics Crawling and application is recommended

Publications (2)

Publication Number Publication Date
CN106095776A CN106095776A (en) 2016-11-09
CN106095776B true CN106095776B (en) 2019-10-01

Family

ID=57230934

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610357421.9A Active CN106095776B (en) 2016-05-25 2016-05-25 The method that a kind of couple of user carries out Topics Crawling and application is recommended

Country Status (1)

Country Link
CN (1) CN106095776B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106547922B (en) * 2016-12-07 2020-08-25 阿里巴巴(中国)有限公司 Application program sorting method and device and server
CN108717467B (en) * 2018-06-07 2021-09-28 南京大学 Open source software recommendation method based on topic incidence relation analysis

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105426514A (en) * 2015-11-30 2016-03-23 扬州大学 Personalized mobile APP recommendation method
CN105512347A (en) * 2016-01-27 2016-04-20 北京航空航天大学 Information processing method based on geographic topic model

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20120079925A (en) * 2011-01-06 2012-07-16 삼성전자주식회사 Method and apparatus for providing help of a portable terminal

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105426514A (en) * 2015-11-30 2016-03-23 扬州大学 Personalized mobile APP recommendation method
CN105512347A (en) * 2016-01-27 2016-04-20 北京航空航天大学 Information processing method based on geographic topic model

Also Published As

Publication number Publication date
CN106095776A (en) 2016-11-09

Similar Documents

Publication Publication Date Title
Lee et al. A time-based approach to effective recommender systems using implicit feedback
Cui et al. DDSE: A novel evolutionary algorithm based on degree-descending search strategy for influence maximization in social networks
Dai et al. Deep coevolutionary network: Embedding user and item features for recommendation
Liu et al. Unified point-of-interest recommendation with temporal interval assessment
Grohs et al. Nonsmooth trust region algorithms for locally Lipschitz functions on Riemannian manifolds
Mavrotas et al. Robustness analysis methodology for multi-objective combinatorial optimization problems and application to project selection
Si et al. CTF-ARA: An adaptive method for POI recommendation based on check-in and temporal features
CN105930425A (en) Personalized video recommendation method and apparatus
Schramski et al. Network environ theory, simulation, and EcoNet® 2.0
CN110503531A (en) The dynamic social activity scene recommended method of timing perception
CN108205766A (en) Information-pushing method, apparatus and system
CN106708978B (en) A kind of service recommendation method and device based on bipartite graph
CN110008973B (en) Model training method, method and device for determining target user based on model
Song et al. Multifractal analysis of weighted networks by a modified sandbox algorithm
Liu et al. A self-adaptive point-of-interest recommendation algorithm based on a multi-order Markov model
CN106484777A (en) A kind of multimedia data processing method and device
CN104331411A (en) Item recommendation method and item recommendation device
Khan et al. Design of momentum fractional stochastic gradient descent for recommender systems
CN106095776B (en) The method that a kind of couple of user carries out Topics Crawling and application is recommended
CN105023178A (en) Main body-based electronic commercere commendation method
Zhang et al. Machine learning approach for sequence clustering with applications to ground-motion selection
CN106452809A (en) Data processing method and device
CN109377284B (en) Method and electronic equipment for pushing information
CN106095960A (en) A kind of network service recommends method and device
Papadimitriou et al. Geo-social recommendations

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20211203

Address after: 510260 block B, Zhongda Science Park, building 705, Zhongda Park West, No. 135, Xingang West Road, Xingang street, Haizhu District, Guangzhou, Guangdong

Patentee after: Guangzhou Zhanyi Information Technology Co.,Ltd.

Address before: 510275 No. 135 West Xingang Road, Guangzhou, Guangdong, Haizhuqu District

Patentee before: SUN YAT-SEN University

Patentee before: GUANGZHOU ZHONGDA NANSHA TECHNOLOGY INNOVATION INDUSTRIAL PARK Co.,Ltd.

Patentee before: GUANGZHOU ZHIHAI ZONGHENG INFORMATION TECHNOLOGY CO.,LTD.