CN104239399A - Method for recommending potential friends in social network - Google Patents

Method for recommending potential friends in social network Download PDF

Info

Publication number
CN104239399A
CN104239399A CN201410333744.5A CN201410333744A CN104239399A CN 104239399 A CN104239399 A CN 104239399A CN 201410333744 A CN201410333744 A CN 201410333744A CN 104239399 A CN104239399 A CN 104239399A
Authority
CN
China
Prior art keywords
user
theme
article
similarity
rough
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410333744.5A
Other languages
Chinese (zh)
Other versions
CN104239399B (en
Inventor
陈秀真
李建华
李生红
史辰烨
周泉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Jiaotong University
Original Assignee
Shanghai Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Jiaotong University filed Critical Shanghai Jiaotong University
Priority to CN201410333744.5A priority Critical patent/CN104239399B/en
Publication of CN104239399A publication Critical patent/CN104239399A/en
Application granted granted Critical
Publication of CN104239399B publication Critical patent/CN104239399B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention relates to a method for recommending potential friends in a social network and realizes the recommendation for the potential friends through two layers of moulds. The method comprises the flowing steps of in the first layer, firstly carrying out subject classification on the articles of all users, constructing a user attention vector, and calculating the cosine similarity of the users to obtain a rough potential-friend recommending list; in the second layer, taking the interest changes with time of the users into account, taking change factors into the calculation of the similarity, and then further detailing the friend recommendation on the basis of the rough recommending list in the first layer. Because the characteristic that the interests of the users are reflected by the articles is utilized, by starting from the interest similarity and bypassing the limitation of an existing social network on the basis of the original friend relationship of the users, the friend recommending method based on the interest similarity is realized, the friend recommending range is expanded, and the recommendation of the friends is further more accurately and effectively realized within a large range.

Description

Potential friend recommendation method in social networks
Technical field
The present invention relates to computer program field, particularly relate to a kind of potential friend recommendation method in social networks.
Background technology
The rise of Web2.0 allows increasing network user participate in social networks, and they hanker after carrying out resource sharing and information exchange, is got more and more by the mutual interchange of online social networks.The research producing content for user not only can help businessman to understand the hobby trend of all kinds of different user group to commodity, also can improve multiple network service, has great importance for lifting Consumer's Experience.
Because the network user has been not content with the social circle of the circle of friends formation in reality gradually, in social networks, the strange user having common interest hobby is found to become the demand of numerous user.In social networks, the friend-making circle extended one's service is one of major way of social networks development, effectively finds interested contact person can give user and makes friends or professional help.But the friend recommendation function that existing system provides is not very good.Therefore, propose a kind of effective friend recommendation mechanism and there is very strong realistic meaning.
Find by literature search, Chen W and Fong S at article " Social network collaborating filtering framework and online trust factors:A case study on Facebook " (" by research Facebook data, social networks collaborative filtering filtering model in conjunction with trust factor ") (2010fifth international conference on ICDIM, the recommend method that trust factor combines with collaborative filtering model is proposed IEEE2010:266-273), be specially: first, the method of data mining (decision tree and correlation rule) is utilized to analyze user data, show that in doings, each key element is on the impact of users to trust degree.Then these degree of belief influence factors quantized, the part as collaborative filtering model inputs.Finally, collaborative filtering recommending is carried out in conjunction with user characteristics.But there is certain difficulty when quantizing the weight of each key element of degree of belief in the method, well can not determine the weight of each key element, this certainly will bring certain impact to final recommendation results.
Find by literature search, the people such as Nitai B.Silva and Ing-Ren Tsang propose to utilize network topology structure to carry out commending friends at article " A graph-based friend recommendation system using genetic algorithm " (" utilizing the friend recommendation system based on graph structure of genetic algorithm ") (2010IEEE congress on Evolutionary Computation, 2010:1-7).Be specially: first, be partitioned into the local subgraph in active user's certain distance, then utilize the network structure feature of the good friend of Analysis of Genetic Algorithms user and the good friend of good friend, filter out and become the larger nodes recommendations of good friend's probability to user.But this commending system is a kind of acquaintance's recommendation mechanisms, not very remarkable to the contribution of user development friend-making circle.
Summary of the invention
The technical problem to be solved in the present invention how to expand the scope of friend recommendation, so on a large scale in realize the recommendation of good friend more accurately and effectively.
In order to solve this technical matters, the invention provides the potential friend recommendation method in two kinds of similar social networks, wherein a kind of method comprises the steps:
S01: obtain from social networking service device all articles that every user delivered in a year, and carry out stored record;
S02: add up the article sum of every user under each theme;
After completing steps S02, carry out step S03 and S04 respectively, after completing steps S04, implementation step S05, wherein:
S03: utilize the cosine law according to the rough similarity between following formulae discovery user:
similarity ( i , j ) = Σ t = 1 k n ( i , t ) × n ( j , t ) n ( i , 1 ) 2 + n ( i , 2 ) 2 + . . . + n ( i , k ) 2 n ( j , 1 ) 2 + n ( j , 2 ) 2 + . . . + n ( j , k ) 2
Wherein, similarity (i, j)for the rough similarity of user i and user j, the quantity that k is the theme, n i, trepresent the article number that user i delivers under theme t, n j, trepresent the article number that user j delivers under theme t;
For a specific user, to other users according to similarity similarity rough between itself and this specific user (i, j)size arrange, select the potential user rough recommendation tables of the highest x of a similarity user as this specific user;
S04: the article number delivered under k theme every month according to each user, according to the difference dif between the following formulae discovery m month and the m+1 month i[m]:
dif i [ m ] = 1 - Σ t = 1 k n ( i , m + 1 , t ) × n ( j , m , t ) n ( i , m + 1,1 ) 2 + n ( i , m + 1,2 ) 2 + . . . + n ( i , m + 1 , k ) 2 × n ( i , m , 1 ) 2 + n ( i , m , 2 ) 2 + . . . + n ( i , m , k ) 2
And then according to the interests change susceptibility λ of each user in 1 year in a following formulae discovery x user i:
λ i = Σ i = 1 11 dif i [ m ] 11
S05: that adds up article delivers the time, by the similarity sim of two users in following formulae discovery theme t t(i, j):
sim t ( i , j ) = Σ b = 1 | n ( i , t ) | Σ a = 1 | n ( j , t ) | e - λ ( time ( a ) - time ( b ) ) | n ( i , t ) | × | n ( j , t ) |
Wherein, n (i, t)with n (j, t)be respectively user i in theme t, article that user j has separately sum, λ is sensitive factor, and described sensitive factor λ is obtained by following formulae discovery:
λ = ( λ i + λ j ) 2
Finally, refining similar degree sim (i, j) between two users is obtained by following formula
sim ( i , j ) = Σ t = 1 k - 1 sim t ( i , j ) * ( n ( i , t ) + n ( j , t ) ) Σ t = 1 k - 1 ( n ( i , t ) + n ( j , t ) )
After completing S03 and S05, implement following steps:
S06: for this specific user, by x user in its rough recommendation list according to the refining similar degree sim (i between itself and this specific user, j) size arranges, and obtains final recommendation list, thus to the recommendation list that this specific user provides this final.
Potential friend recommendation method in another kind of social networks, comprises the steps:
S01: obtain from social networking service device all articles that every user delivered in a year, and carry out stored record;
S02: add up the article sum of every user under each theme;
S03: utilize the cosine law according to the rough similarity between following formulae discovery user:
similarity ( i , j ) = Σ t = 1 k n ( i , t ) × n ( j , t ) n ( i , 1 ) 2 + n ( i , 2 ) 2 + . . . + n ( i , k ) 2 n ( j , 1 ) 2 + n ( j , 2 ) 2 + . . . + n ( j , k ) 2
Wherein, similarity (i, j)for the rough similarity of user i and user j, the quantity that k is the theme, n i, trepresent the article number that user i delivers under theme t, n j, trepresent the article number that user j delivers under theme t;
For a specific user, to other users according to similarity similarity rough between itself and this specific user (i, j)size arrange, select the potential user rough recommendation tables of the highest x of a similarity user as this specific user;
S04: the article number delivered under k theme every month according to each user in this specific user and the rough recommendation tables of potential user, according to the difference dif between the following formulae discovery m month and the m+1 month i[m]:
dif i [ m ] = 1 - Σ t = 1 k n ( i , m + 1 , t ) × n ( j , m , t ) n ( i , m + 1,1 ) 2 + n ( i , m + 1,2 ) 2 + . . . + n ( i , m + 1 , k ) 2 × n ( i , m , 1 ) 2 + n ( i , m , 2 ) 2 + . . . + n ( i , m , k ) 2
And then according to the wherein interests change susceptibility λ of each user in 1 year in a following formulae discovery x user i:
λ i = Σ i = 1 11 dif i [ m ] 11
S05: that adds up each user's article in this specific user and the rough recommendation tables of potential user delivers the time, by the similarity sim of two users in following formulae discovery theme t t(i, j):
sim t ( i , j ) = Σ b = 1 | n ( i , t ) | Σ a = 1 | n ( j , t ) | e - λ ( time ( a ) - time ( b ) ) | n ( i , t ) | × | n ( j , t ) |
Wherein, n (i, t)with n (j, t)be respectively user i in theme t, article that user j has separately sum, λ is sensitive factor, and described sensitive factor λ is obtained by following formulae discovery:
λ = ( λ i + λ j ) 2
Finally, refining similar degree sim (i, j) between two users is obtained by following formula
sim ( i , j ) = Σ t = 1 k - 1 sim t ( i , j ) * ( n ( i , t ) + n ( j , t ) ) Σ t = 1 k - 1 ( n ( i , t ) + n ( j , t ) )
S06: for this specific user, by x user in its rough recommendation list according to the refining similar degree sim (i between itself and this specific user, j) size arranges, and obtains final recommendation list, thus to the recommendation list that this specific user provides this final.
In described step S01, all articles that every user delivered in a year store in the following manner:
Blog(user i)={b 1,b 2,......,b q}
Wherein, Blog (user i) represent all article set of user i, b qrefer to Blog (user i) in q section article.
Also comprise the steps: between described step S01 and S02
For the article of every user, based on the tag along sort preset, utilize many labels subject classification model to classify, after classification, every section of article obtains three labels, the corresponding theme of each label.
Many labels subject classification model comprises ground floor model and second layer model;
Described ground floor model, for being that every section of article carries out word segmentation processing by participle instrument, filtered the effective word obtaining every section of article with the filtration of stop-word subsequently by part of speech, use term frequency-inverse document frequency to calculate the weight of each word afterwards, thus obtain the keyword of every section of article;
Described second layer model, for the keyword obtained in described ground floor model and label being carried out semantic matches according to semantic knowledge-base, afterwards, the label obtained the keyword of every section of article uses the method for multi-source evidence fusion to calculate the final label of every section of article.
The method of described multi-source evidence fusion have employed Dempster composition rule.
Described participle instrument is ICTCLAS participle instrument.
Described semantic knowledge-base is " knowing net " and " Chinese thesaurus expansion version ".
In described step S03, first according to the article sum under each theme of user, obtain the article theme attention rate vector of user:
user i=((topic 1,n i,1),(topic 2,n i,2),......,(topic k,n i,k))
Wherein, user irepresent user i, topic trepresent t theme, the quantity that k is the theme;
And then carry out rough similarity similarity according to this theme concern vector (i, j).
In described step S04, first according to the article sum under user's every month each theme, to obtain in 1 year theme in each month and pay close attention to distribution table:
user i[m]={(topic 1,n i,m,1),(topic 2,n i,m,2),......,(topic k,n i,m,k)}
Wherein, m represents month, n i, m, trepresent the article number that user i delivers under the m month, theme t, the quantity that k is the theme.
The present invention realizes the recommendation to potential good friend by two-layer model.First ground floor is carry out subject classification to the article of each user, constructs user's attention rate vector, calculates cosine (cosine) similarity of user, obtain rough potential friend recommendation list.User's interests change is in time counted consideration by the second layer, is considered by changing factor in Similarity Measure, then based on the further refinement friend recommendation of alligatoring recommendation tables of ground floor.Owing to make use of this feature of article reflection user interest, from similar interests degree, walk around the restriction of existing social networks based on the original friend relation of user, achieve the friend recommendation method based on Interest Similarity, expanded the scope of friend recommendation, so on a large scale in realize the recommendation of good friend more accurately and effectively.
Accompanying drawing explanation
Fig. 1 is potential friend recommendation illustraton of model in the embodiment of the present invention 1;
Fig. 2 is keyword semantic matches and blog classification process figure in the embodiment of the present invention 1;
Fig. 3 is the rough recommendation figure based on total Interest Similarity in the embodiment of the present invention 1;
Fig. 4 is the refinement recommendation figure counting the article time in the embodiment of the present invention 1.
Embodiment
Below with reference to Fig. 1 to Fig. 4, by two embodiments, potential friend recommendation method in social networks provided by the invention is described in detail, it is optional two embodiments of the present invention, can think, those skilled in the art, in the scope not changing the present invention's spirit and content, can modify to it and polish.
Embodiment 1
Please refer to Fig. 1, present embodiments provide a kind of potential friend recommendation method in social networks, the present embodiment utilizes the open data platform of Renren Network to obtain data set, and process according to flow process shown in Fig. 1 of the present invention these data, the method comprises the steps:
S01: obtain all articles that every user delivered in a year from social networking service device, and carry out stored record, in the present embodiment, namely first 5314 good friends are obtained from Renren Network api interface, share article and 138901 sections of original articles for 17956 sections that excavate them altogether, article wherein, also can be described as blog article; In described step S01, all articles that every user delivered in a year store in the following manner:
Blog(user i)={b 1,b 2,......,b q}
Wherein, Blog (user i) represent all article set of user i, b qrefer to Blog (user i) in q section article.
Theme belonging to article can to refer in social networks article just selected at the beginning of writing belonging to theme, so in the case, theme is known, be stored in the fix information in social networking service device, this method just transfers use; Certainly, affiliated theme also can be determined as described in the embodiment, and in order to determine the theme of article, the present embodiment, after described step S01, before S02, also comprises the steps:
For the article of every user, based on the tag along sort preset, utilize many labels subject classification model to classify, after classification, every section of article obtains three labels, the corresponding theme of each label.
Specifically, in the present embodiment, first, for the article of every user, based on Sina's 41 tag along sorts (see table 1), many labels subject classification model is utilized to classify.After classification, every section of article obtains three labels:
b q={label (q,1),label (q,2),label (q,3)}
Described many labels subject classification model, refers to the theme variation based on blog content and the feature such as dispersed, have employed the blog subject classification model of multi-class, many labelings.This model is mainly divided into two-layer:
Ground floor model adopts vector space model (Vector space model, VSM) to carry out text representation.Based on the concrete feature of Chinese, it is that every section of article carries out word segmentation processing that model have employed ICTCLAS participle instrument, the effective word obtaining every section of article with the filtration of stop-word is filtered subsequently by part of speech, TF-IDF (Term frequency – Inverse document frequency, term frequency-inverse document frequency) is used to calculate the weight of each word and obtain the keyword of every section of article afterwards.
Furthermore, utilize vector space model (Vector space model, VSM) to carry out text representation for the article of every user to be:
B(q)={(KeyWord 1,w 1),(KeyWord 2,w 2),......,(KeyWord i,w i)}
Wherein, B (q) represents effective entry set of article q, w irepresent KeyWord iweighted value.
Second layer model uses two kinds of semantic knowledge-bases: " knowing net " and " Chinese thesaurus expansion version ", and the tag along sort of determining of the keyword obtained in ground floor model described in the and sina blog is carried out semantic matches.Afterwards, the tag along sort obtained the keyword of every section of article uses the method for multi-source evidence fusion: Dempster composition rule, calculates the class label that every section of article is final.Semantic matches and labeling process flow diagram are as shown in Fig. 2.
The theme of article can carry out subsequent step after determining.In the present embodiment, please refer to Fig. 1, after article theme is determined, carry out step S03 and S04 respectively, there is no precedence relationship between step 03 and step 04, after completing steps S04, implementation step S05, as long as implement S06 again after completing S03 and S05 respectively.S04 in the present embodiment calculates the interests change susceptibility of each user, and then carries out the calculating of refining similar degree.
About step S03: the rough process flow diagram recommended as shown in Figure 3.
Wherein, utilize the cosine law according to the rough similarity between following formulae discovery user:
similarity ( i , j ) = Σ t = 1 k n ( i , t ) × n ( j , t ) n ( i , 1 ) 2 + n ( i , 2 ) 2 + . . . + n ( i , k ) 2 n ( j , 1 ) 2 + n ( j , 2 ) 2 + . . . + n ( j , k ) 2
Wherein, similarity (i, j)for the rough similarity of user i and user j, the quantity that k is the theme, n i, trepresent the article number that user i delivers under theme t, n j, trepresent the article number that user j delivers under theme t;
For a specific user, to other users according to similarity similarity rough between itself and this specific user (i, j)size arrange, select the potential user rough recommendation tables of the highest x of a similarity user as this specific user;
Specifically, in described step S03, first according to the article sum under each theme of user, obtain the article theme attention rate vector of user:
user i=((topic 1,n i,1),(topic 2,n i,2),......,(topic k,n i,k))
Wherein, user irepresent user i, topic trepresent t theme, the quantity that k is the theme;
And then carry out rough similarity similarity according to this theme concern vector (i, j).
About step S04: the article number delivered under k theme every month according to each user, according to the difference dif between the following formulae discovery m month and the m+1 month i[m]:
dif i [ m ] = 1 - Σ t = 1 k n ( i , m + 1 , t ) × n ( j , m , t ) n ( i , m + 1,1 ) 2 + n ( i , m + 1,2 ) 2 + . . . + n ( i , m + 1 , k ) 2 × n ( i , m , 1 ) 2 + n ( i , m , 2 ) 2 + . . . + n ( i , m , k ) 2
And then according to the interests change susceptibility λ of each user in 1 year in a following formulae discovery x user i, namely obtain the interests change susceptibility λ of user in 1 year by the mean value of 11 differences again i:
λ i = Σ i = 1 11 dif i [ m ] 11
In described step S04, article sum first under each theme foundation user's every month, to obtain in 1 year theme in each month and pay close attention to distribution table, specifically, it is using the moon as node, the interest distribution of counting user every month, builds the theme of user in 1 year in each month m and pays close attention to distribution table:
user i[m]={(topic 1,n i,m,1),(topic 2,n i,m,2),......,(topic k,n i,m,k)}
Wherein, m represents month, n i, m, trepresent the article number that user i delivers under the m month, theme t, the quantity that k is the theme.
Connection with step S05: that adds up article delivers the time, by the similarity sim of two users in following formulae discovery theme t t(i, j):
sim t ( i , j ) = Σ b = 1 | n ( i , t ) | Σ a = 1 | n ( j , t ) | e - λ ( time ( a ) - time ( b ) ) | n ( i , t ) | × | n ( j , t ) |
Wherein, n (i, t)with n (j, t)be respectively user i in theme t, article that user j has separately sum, λ is sensitive factor, and described sensitive factor λ is obtained by following formulae discovery:
λ = ( λ i + λ j ) 2
Finally, refining similar degree sim (i, j) between two users is obtained by following formula
sim ( i , j ) = Σ t = 1 k - 1 sim t ( i , j ) * ( n ( i , t ) + n ( j , t ) ) Σ t = 1 k - 1 ( n ( i , t ) + n ( j , t ) )
About step S05: obtaining the interests change susceptibility λ of every user self iafterwards, get back in each theme class further, the joining day factor calculates.
To any theme j, find each self-corresponding article collection of user i and user j under this theme: Blog (user i, topic t) and Blog (user j, topic t).That traces back to every section of article in theme delivers time time (b).
Counting user article deliver the time, by the similarity sim of two users in following formulae discovery theme t t(i, j):
sim t ( i , j ) = Σ b = 1 | n ( i , t ) | Σ a = 1 | n ( j , t ) | e - λ ( time ( a ) - time ( b ) ) | n ( i , t ) | × | n ( j , t ) |
Wherein, n (i, t)with n (j, t)be respectively user i in theme t, article that user j has separately sum, λ is sensitive factor, and λ is sensitive factor, if change λ → 0, then similarity does not affect by lead time substantially, if but λ → 1, then similarity can in time gap increase and greatly weaken.Interests change susceptibility due to user self truly reflects the susceptibility of user for time difference, and this model employs the interests change susceptibility λ of user i and j self iwith λ jcalculate sensitive factor.Described sensitive factor λ is obtained by following formulae discovery:
λ = ( λ i + λ j ) 2
Finally, refining similar degree sim (i, j) between two users is obtained by following formula
sim ( i , j ) = Σ t = 1 k - 1 sim t ( i , j ) * ( n ( i , t ) + n ( j , t ) ) Σ t = 1 k - 1 ( n ( i , t ) + n ( j , t ) )
After completing S03 and S05, implement following steps:
S06: for this specific user, by x user in its rough recommendation list according to the refining similar degree sim (i between itself and this specific user, j) size arranges, and obtains final recommendation list, thus to the recommendation list that this specific user provides this final.Its process can see Fig. 4.
The list of table 1 theme label
Embodiment 2
The present embodiment 1 is only with the difference of embodiment 1, in the present embodiment, S3, S4, S5 has sequencing, namely implement successively, so, the interests change susceptibility of x user in rough list and specific that user is only calculated in S4, the refining similar degree of x user in that user specific and rough list is only calculated in S5, and unlike in embodiment 1, calculate interests change susceptibility and the refining similar degree of all users, information needed can be transferred in all results in embodiment 1, only need calculate once, just can meet the friend recommendation of all users, embodiment 2 carries out the calculating of S4 and S5 respectively, namely need to calculate respectively.Except above difference, embodiment concrete in each step is all similar to embodiment 1.
Specifically, this enforcement provides a kind of potential friend recommendation method in social networks, comprises the steps:
S01: obtain from social networking service device all articles that every user delivered in a year, and carry out stored record;
S02: add up the article sum of every user under each theme;
S03: utilize the cosine law according to the rough similarity between following formulae discovery user:
similarity ( i , j ) = Σ t = 1 k n ( i , t ) × n ( j , t ) n ( i , 1 ) 2 + n ( i , 2 ) 2 + . . . + n ( i , k ) 2 n ( j , 1 ) 2 + n ( j , 2 ) 2 + . . . + n ( j , k ) 2
Wherein, similarity (i, j)for the rough similarity of user i and user j, the quantity that k is the theme, n i, trepresent the article number that user i delivers under theme t, n j, trepresent the article number that user j delivers under theme t;
For a specific user, to other users according to similarity similarity rough between itself and this specific user (i, j)size arrange, select the potential user rough recommendation tables of the highest x of a similarity user as this specific user;
S04: the article number delivered under k theme every month according to each user in this specific user and the rough recommendation tables of potential user, according to the difference dif between the following formulae discovery m month and the m+1 month i[m]:
dif i [ m ] = 1 - Σ t = 1 k n ( i , m + 1 , t ) × n ( j , m , t ) n ( i , m + 1,1 ) 2 + n ( i , m + 1,2 ) 2 + . . . + n ( i , m + 1 , k ) 2 × n ( i , m , 1 ) 2 + n ( i , m , 2 ) 2 + . . . + n ( i , m , k ) 2
And then according to the wherein interests change susceptibility λ of each user in 1 year in a following formulae discovery x user i:
λ i = Σ i = 1 11 dif i [ m ] 11
S05: that adds up each user's article in this specific user and the rough recommendation tables of potential user delivers the time, by the similarity sim of two users in following formulae discovery theme t t(i, j):
sim t ( i , j ) = Σ b = 1 | n ( i , t ) | Σ a = 1 | n ( j , t ) | e - λ ( time ( a ) - time ( b ) ) | n ( i , t ) | × | n ( j , t ) |
Wherein, n (i, t)with n (j, t)be respectively user i in theme t, article that user j has separately sum, λ is sensitive factor, and described sensitive factor λ is obtained by following formulae discovery:
λ = ( λ i + λ j ) 2
Finally, refining similar degree sim (i, j) between two users is obtained by following formula
sim ( i , j ) = Σ t = 1 k - 1 sim t ( i , j ) * ( n ( i , t ) + n ( j , t ) ) Σ t = 1 k - 1 ( n ( i , t ) + n ( j , t ) )
S06: for this specific user, by x user in its rough recommendation list according to the refining similar degree sim (i between itself and this specific user, j) size arranges, and obtains final recommendation list, thus to the recommendation list that this specific user provides this final.
In the present embodiment, use everybody network data, the calculating of method flow of the present invention and user's similarity is illustrated, the results show, when not considering sequence, the rough recommendation list that this model provides can obtain the effect up to 67%, and thinning process is by the rank precision improvement of recommendation list 17%.Thus demonstrate validity and the accuracy of method.
In sum, the present invention is by the recommendation of two-layer model realization to potential good friend.First ground floor is carry out subject classification to the article of each user, constructs user's attention rate vector, calculates cosine (cosine) similarity of user, obtain rough potential friend recommendation list.User's interests change is in time counted consideration by the second layer, is considered by changing factor in Similarity Measure, then based on the further refinement friend recommendation of alligatoring recommendation tables of ground floor.Owing to make use of this feature of article reflection user interest, from similar interests degree, walk around the restriction of existing social networks based on the original friend relation of user, achieve the friend recommendation method based on Interest Similarity, expanded the scope of friend recommendation, so on a large scale in realize the recommendation of good friend more accurately and effectively.

Claims (10)

1. the potential friend recommendation method in social networks, comprises the steps:
S01: obtain from social networking service device all articles that every user delivered in a year, and carry out stored record;
S02: add up the article sum of every user under each theme;
After completing steps S02, carry out step S03 and S04 respectively, after completing steps S04, implementation step S05, wherein:
S03: utilize the cosine law according to the rough similarity between following formulae discovery user:
similarity ( i , j ) = Σ t = 1 k n ( i , t ) × n ( j , t ) n ( i , 1 ) 2 + n ( i , 2 ) 2 + . . . + n ( i , k ) 2 n ( j , 1 ) 2 + n ( j , 2 ) 2 + . . . + n ( j , k ) 2
Wherein, similarity (i, j)for the rough similarity of user i and user j, the quantity that k is the theme, n i, trepresent the article number that user i delivers under theme t, n j, trepresent the article number that user j delivers under theme t;
For a specific user, to other users according to similarity similarity rough between itself and this specific user (i, j)size arrange, select the potential user rough recommendation tables of the highest x of a similarity user as this specific user;
S04: the article number delivered under k theme every month according to each user, according to the difference dif between the following formulae discovery m month and the m+1 month i[m]:
dif i [ m ] = 1 - Σ t = 1 k n ( i , m + 1 , t ) × n ( j , m , t ) n ( i , m + 1,1 ) 2 + n ( i , m + 1,2 ) 2 + . . . + n ( i , m + 1 , k ) 2 × n ( i , m , 1 ) 2 + n ( i , m , 2 ) 2 + . . . + n ( i , m , k ) 2
And then according to the interests change susceptibility λ of each user in 1 year in a following formulae discovery x user i:
λ i = Σ i = 1 11 dif i [ m ] 11
S05: that adds up article delivers the time, by the similarity sim of two users in following formulae discovery theme t t(i, j):
sim t ( i , j ) = Σ b = 1 | n ( i , t ) | Σ a = 1 | n ( j , t ) | e - λ ( time ( a ) - time ( b ) ) | n ( i , t ) | × | n ( j , t ) |
Wherein, n (i, t)with n (j, t)be respectively user i in theme t, article that user j has separately sum, λ is sensitive factor, and described sensitive factor λ is obtained by following formulae discovery:
λ = ( λ i + λ j ) 2
Finally, refining similar degree sim (i, j) between two users is obtained by following formula
sim ( i , j ) = Σ t = 1 k - 1 sim t ( i , j ) * ( n ( i , t ) + n ( j , t ) ) Σ t = 1 k - 1 ( n ( i , t ) + n ( j , t ) )
After completing S03 and S05, implement following steps:
S06: for this specific user, by x user in its rough recommendation list according to the refining similar degree sim (i between itself and this specific user, j) size arranges, and obtains final recommendation list, thus to the recommendation list that this specific user provides this final.
2. the potential friend recommendation method in social networks, comprises the steps:
S01: obtain from social networking service device all articles that every user delivered in a year, and carry out stored record;
S02: add up the article sum of every user under each theme;
S03: utilize the cosine law according to the rough similarity between following formulae discovery user:
similarity ( i , j ) = Σ t = 1 k n ( i , t ) × n ( j , t ) n ( i , 1 ) 2 + n ( i , 2 ) 2 + . . . + n ( i , k ) 2 n ( j , 1 ) 2 + n ( j , 2 ) 2 + . . . + n ( j , k ) 2
Wherein, similarity (i, j)for the rough similarity of user i and user j, the quantity that k is the theme, n i, trepresent the article number that user i delivers under theme t, n j, trepresent the article number that user j delivers under theme t;
For a specific user, to other users according to similarity similarity rough between itself and this specific user (i, j)size arrange, select the potential user rough recommendation tables of the highest x of a similarity user as this specific user;
S04: the article number delivered under k theme every month according to each user in this specific user and the rough recommendation tables of potential user, according to the difference dif between the following formulae discovery m month and the m+1 month i[m]:
dif i [ m ] = 1 - Σ t = 1 k n ( i , m + 1 , t ) × n ( j , m , t ) n ( i , m + 1,1 ) 2 + n ( i , m + 1,2 ) 2 + . . . + n ( i , m + 1 , k ) 2 × n ( i , m , 1 ) 2 + n ( i , m , 2 ) 2 + . . . + n ( i , m , k ) 2
And then according to the wherein interests change susceptibility λ of each user in 1 year in a following formulae discovery x user i:
λ i = Σ i = 1 11 dif i [ m ] 11
S05: that adds up each user's article in this specific user and the rough recommendation tables of potential user delivers the time, by the similarity sim of two users in following formulae discovery theme t t(i, j):
sim t ( i , j ) = Σ b = 1 | n ( i , t ) | Σ a = 1 | n ( j , t ) | e - λ ( time ( a ) - time ( b ) ) | n ( i , t ) | × | n ( j , t ) |
Wherein, n (i, t)with n (j, t)be respectively user i in theme t, article that user j has separately sum, λ is sensitive factor, and described sensitive factor λ is obtained by following formulae discovery:
λ = ( λ i + λ j ) 2
Finally, refining similar degree sim (i, j) between two users is obtained by following formula
sim ( i , j ) = Σ t = 1 k - 1 sim t ( i , j ) * ( n ( i , t ) + n ( j , t ) ) Σ t = 1 k - 1 ( n ( i , t ) + n ( j , t ) )
S06: for this specific user, by x user in its rough recommendation list according to the refining similar degree sim (i between itself and this specific user, j) size arranges, and obtains final recommendation list, thus to the recommendation list that this specific user provides this final.
3. the potential friend recommendation method in social networks as claimed in claim 1 or 2, is characterized in that: in described step S01, and all articles that every user delivered in a year store in the following manner:
Blog(user i)={b 1,b 2,......,b q}
Wherein, Blog (user i) represent all article set of user i, b qrefer to Blog (user i) in q section article.
4. the potential friend recommendation method in social networks as claimed in claim 1 or 2, is characterized in that: also comprise the steps: between described step S01 and S02
For the article of every user, based on the tag along sort preset, utilize many labels subject classification model to classify, after classification, every section of article obtains three labels, the corresponding theme of each label.
5. the potential friend recommendation method in social networks as claimed in claim 4, is characterized in that: described many labels subject classification model comprises ground floor model and second layer model;
Described ground floor model, for being that every section of article carries out word segmentation processing by participle instrument, filtered the effective word obtaining every section of article with the filtration of stop-word subsequently by part of speech, use term frequency-inverse document frequency to calculate the weight of each word afterwards, thus obtain the keyword of every section of article;
Described second layer model, for the keyword obtained in described ground floor model and label being carried out semantic matches according to semantic knowledge-base, afterwards, the label obtained the keyword of every section of article uses the method for multi-source evidence fusion to calculate the final label of every section of article.
6. the potential friend recommendation method in social networks as claimed in claim 5, is characterized in that: the method for described multi-source evidence fusion have employed Dempster composition rule.
7. the potential friend recommendation method in social networks as claimed in claim 5, is characterized in that: described participle instrument is ICTCLAS participle instrument.
8. the potential friend recommendation method in social networks as claimed in claim 5, is characterized in that: described semantic knowledge-base is " knowing net " and " Chinese thesaurus expansion version ".
9. the potential friend recommendation method in social networks as claimed in claim 1 or 2, is characterized in that: in described step S03, first according to the article sum under each theme of user, obtains the article theme attention rate vector of user:
user i=((topic 1,n i,1),(topic 2,n i,2),......,(topic k,n i,k))
Wherein, user irepresent user i, topic trepresent t theme, the quantity that k is the theme;
And then carry out rough similarity similarity according to this theme concern vector (i, j).
10. the potential friend recommendation method in social networks as claimed in claim 1 or 2, is characterized in that: in described step S04, first according to the article sum under user's every month each theme, to obtain in 1 year theme in each month and pays close attention to distribution table:
user i[m]={(topic 1,n i,m,1),(topic 2,n i,m,2),......,(topic k,n i,m,k)}
Wherein, m represents month, n i, m, trepresent the article number that user i delivers under the m month, theme t, the quantity that k is the theme.
CN201410333744.5A 2014-07-14 2014-07-14 Potential friend recommendation method in social networks Active CN104239399B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410333744.5A CN104239399B (en) 2014-07-14 2014-07-14 Potential friend recommendation method in social networks

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410333744.5A CN104239399B (en) 2014-07-14 2014-07-14 Potential friend recommendation method in social networks

Publications (2)

Publication Number Publication Date
CN104239399A true CN104239399A (en) 2014-12-24
CN104239399B CN104239399B (en) 2017-08-01

Family

ID=52227463

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410333744.5A Active CN104239399B (en) 2014-07-14 2014-07-14 Potential friend recommendation method in social networks

Country Status (1)

Country Link
CN (1) CN104239399B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105045822A (en) * 2015-06-26 2015-11-11 淮海工学院 Method for monitoring similar users of specific user in microblog
CN105608194A (en) * 2015-12-24 2016-05-25 成都陌云科技有限公司 Method for analyzing main characteristics in social media
CN105631749A (en) * 2015-12-24 2016-06-01 成都陌云科技有限公司 User portrait calculation method based on statistical data
CN106156117A (en) * 2015-04-07 2016-11-23 中国科学院信息工程研究所 Hidden community core communication circle detection towards particular topic finds method and system
CN106709750A (en) * 2015-11-18 2017-05-24 腾讯科技(深圳)有限公司 User recommendation method and device
CN106933943A (en) * 2017-01-17 2017-07-07 浙江工业大学 A kind of friend recommendation method based on the record time difference
CN107038226A (en) * 2017-03-31 2017-08-11 努比亚技术有限公司 A kind of information recommendation method and the network equipment
CN107866071A (en) * 2017-11-03 2018-04-03 杭州电魂网络科技股份有限公司 Game role recommends method and apparatus
CN108804438A (en) * 2017-04-26 2018-11-13 洪荣昭 friend-making matching method
CN109783740A (en) * 2019-01-24 2019-05-21 北京字节跳动网络技术有限公司 Pay close attention to the sort method and device of the page
CN109933731A (en) * 2019-03-18 2019-06-25 苏州亿歌网络科技有限公司 A kind of friend recommendation method, apparatus, equipment and storage medium
CN110909259A (en) * 2019-11-27 2020-03-24 腾讯科技(深圳)有限公司 Block chain-based user recommendation method, device, equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101770459A (en) * 2008-12-31 2010-07-07 华为技术有限公司 Method and device for commending friends
CN103312745A (en) * 2012-03-12 2013-09-18 腾讯科技(深圳)有限公司 Method, system and device for mining information of potential friend
US20140032675A1 (en) * 2011-09-27 2014-01-30 Tencent Technology (Shenzhen) Company Limited Method and system for pushing associated users in social networking service network
CN103678531A (en) * 2013-12-02 2014-03-26 三星电子(中国)研发中心 Friend recommendation method and friend recommendation device
US20140143277A1 (en) * 2011-08-03 2014-05-22 Tencent Technology (Shenzhen) Company Limited Method and device for matching friend relationship chain in instant messaging tool

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101770459A (en) * 2008-12-31 2010-07-07 华为技术有限公司 Method and device for commending friends
US20140143277A1 (en) * 2011-08-03 2014-05-22 Tencent Technology (Shenzhen) Company Limited Method and device for matching friend relationship chain in instant messaging tool
US20140032675A1 (en) * 2011-09-27 2014-01-30 Tencent Technology (Shenzhen) Company Limited Method and system for pushing associated users in social networking service network
CN103312745A (en) * 2012-03-12 2013-09-18 腾讯科技(深圳)有限公司 Method, system and device for mining information of potential friend
CN103678531A (en) * 2013-12-02 2014-03-26 三星电子(中国)研发中心 Friend recommendation method and friend recommendation device

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
DOU SHEN 等: "Latent Friend Mining from Blog Data", 《SIXTH INTERNATIONAL CONFERENCE ON DATA MINING》 *
王兵辉: "社交网络中潜在好友推荐算法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106156117B (en) * 2015-04-07 2018-05-01 中国科学院信息工程研究所 Hidden community's core communication circle detection towards particular topic finds method and system
CN106156117A (en) * 2015-04-07 2016-11-23 中国科学院信息工程研究所 Hidden community core communication circle detection towards particular topic finds method and system
CN105045822A (en) * 2015-06-26 2015-11-11 淮海工学院 Method for monitoring similar users of specific user in microblog
CN106709750A (en) * 2015-11-18 2017-05-24 腾讯科技(深圳)有限公司 User recommendation method and device
CN105608194A (en) * 2015-12-24 2016-05-25 成都陌云科技有限公司 Method for analyzing main characteristics in social media
CN105631749A (en) * 2015-12-24 2016-06-01 成都陌云科技有限公司 User portrait calculation method based on statistical data
CN106933943A (en) * 2017-01-17 2017-07-07 浙江工业大学 A kind of friend recommendation method based on the record time difference
CN107038226A (en) * 2017-03-31 2017-08-11 努比亚技术有限公司 A kind of information recommendation method and the network equipment
CN107038226B (en) * 2017-03-31 2019-10-15 努比亚技术有限公司 A kind of information recommendation method and the network equipment
CN108804438A (en) * 2017-04-26 2018-11-13 洪荣昭 friend-making matching method
CN107866071A (en) * 2017-11-03 2018-04-03 杭州电魂网络科技股份有限公司 Game role recommends method and apparatus
CN109783740A (en) * 2019-01-24 2019-05-21 北京字节跳动网络技术有限公司 Pay close attention to the sort method and device of the page
CN109933731A (en) * 2019-03-18 2019-06-25 苏州亿歌网络科技有限公司 A kind of friend recommendation method, apparatus, equipment and storage medium
CN110909259A (en) * 2019-11-27 2020-03-24 腾讯科技(深圳)有限公司 Block chain-based user recommendation method, device, equipment and storage medium

Also Published As

Publication number Publication date
CN104239399B (en) 2017-08-01

Similar Documents

Publication Publication Date Title
CN104239399A (en) Method for recommending potential friends in social network
Jin et al. Using user-generated content to explore the temporal heterogeneity in tourist mobility
Jiang et al. Author topic model-based collaborative filtering for personalized POI recommendations
Padhi et al. Quantifying potential tourist behavior in choice of destination using Google Trends
Logesh et al. Learning recency and inferring associations in location based social network for emotion induced point-of-interest recommendation.
He et al. SocoTraveler: Travel-package recommendations leveraging social influence of different relationship types
Arora et al. Movie recommendation system based on users’ similarity
Jung Contextualized mobile recommendation service based on interactive social network discovered from mobile users
Al-Taie et al. Python for graph and network analysis
Sharma et al. Ranking hotels using aspect ratings based sentiment classification and interval-valued neutrosophic TOPSIS
Mazhari et al. A user-profile-based friendship recommendation solution in social networks
CN103632290A (en) Recommendation probability fusion based hybrid recommendation method
Hu et al. Hotel Recommendation System based on Review and Context Information: a Collaborative filtering Appro.
Pahari et al. An online review-based hotel selection process using intuitionistic fuzzy TOPSIS method
Sun et al. Do Airbnb’s “Superhosts” deserve the badge? An empirical study from China
CN105913159A (en) Social network event based user's influence prediction method
Ahmedi et al. A bimodal social network analysis to recommend points of interest to tourists
Ju et al. Relationship strength estimation based on Wechat Friends Circle
CN104965883A (en) Individualized travel information screening method matching user characteristics
Saraswathi et al. Deep Learning Enabled Social Media Recommendation Based on User Comments.
He et al. Online spam review detection: A survey of literature
Xue et al. Forecasting hourly attraction tourist volume with search engine and social media data for decision support
Dang et al. Collaborative filtering in social networks: A community-based approach
Cai et al. An extension of social network group decision-making based on trustrank and personas
Choudhary et al. Community detection algorithms for recommendation systems: techniques and metrics

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant