CN106055616A - Friend recommendation method for social networking website based on named entities - Google Patents
Friend recommendation method for social networking website based on named entities Download PDFInfo
- Publication number
- CN106055616A CN106055616A CN201610357590.2A CN201610357590A CN106055616A CN 106055616 A CN106055616 A CN 106055616A CN 201610357590 A CN201610357590 A CN 201610357590A CN 106055616 A CN106055616 A CN 106055616A
- Authority
- CN
- China
- Prior art keywords
- entity
- user
- entities
- list
- conentity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 28
- 230000006855 networking Effects 0.000 title claims abstract description 13
- 230000008569 process Effects 0.000 claims description 6
- 238000000605 extraction Methods 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 abstract 1
- 238000009412 basement excavation Methods 0.000 description 2
- 238000012549 training Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/288—Entity relationship models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a friend recommendation method for social networking website based on named entities. The method comprises following steps: establishing an entity list ConEntity (ui) composed of entities during users' speaking and entities of followers in speaking, establishing an entity list InfEntity (ui) for personal information and sequencing entities of users' preference based on obtained entity list ConEntity (ui) and the entity list InfEntity (ui) to obtain an entity list for preference sequences. Through similarity between entity lists of users, friends are recommended for users with similar interests in the social networking website.
Description
Technical field
The present invention relates to natural language processing technique field, more particularly, to a kind of social network based on name entity
Stand friend recommendation method.
Background technology
In recent years, along with the fast development of the Internet, large-scale knowledge base is (such as wikipedia, DBpedia, Baidu hundred
Section etc.) occur in the Internet and be rapidly developed.These knowledge bases are rich in about the language between attribute and the entity of limbs
Justice relations etc., their appearance makes user can obtain relevant information easily.Therefore, the technology about name entity also exists
Develop rapidly and be applied to the every field of the Internet.
So-called name entity, it is simply that name, mechanism's name, place name and other all entities with entitled mark.Wider
General entity also includes numeral, date, currency, address etc..Existing technology relates generally to name the identification of entity, and name is real
The link of body and disambiguation and the relation excavation field of name entity, and the most ripe.The identification of name entity refers to,
From given a word or an article, find the word referring to name entity, we term it entity censures item,
In " Jordon is famous basket baller ", it is " Jordon " that our entity to be identified censures item.The link of name entity is
Refer to that we to censure item and the entity (showing as a page in wikipedia) in certain knowledge base determined identify
It is chained up reaching the purpose of disambiguation.Such as, " Jordon is famous basket baller " and " Jordon is that U.S.'s Berkeley is big
The professor of the research machine learning learned " in two identical denotion items " Jordon " point to is diverse two entities.And
The contact between two entities is mainly looked in the excavation of entity relationship, " Jordon " and " Berkeley during as above an example is waited
University " relation of university is " A teaches in B ".
Existing social networks friend recommendation method mainly has based on customer relationship with based on label and the big class of content two: base
Recommendation in customer relationship mainly has the common friend recommending user, it is recommended that the good friend of good friend, it is recommended that the follower etc. of follower
Deng, and work in coordination with based on interest, if i.e. user A is similar with user B vermicelli crowd, and user to be recommended has paid close attention to user A, that
User B is also recommended this user;Expansion is come, and can find the community structure residing for user based on customer relationship figure, will
Other in same corporations are not paid close attention to member and are recommended this user.Method based on label and content is mainly according to user's
Geographical location information, educational work information, or user-defined individual's label, it is recommended that other similar users.Base
The recommended intensity of the method for plinth is identical, does not has the difference according to the probability size becoming good friend and has different recommendations
Dynamics.And other traditional methods or the most sufficiently utilize speech information and the label information of user, or by user's
Good friend is confined in certain interest circle or friend circle, it is impossible to fully find the potential targets of interest of user.
Summary of the invention
The present invention provides a kind of friends in social networking sites based on name entity to recommend method, it is achieved the use in social networks
Other users of similar preference are recommended at family.
In order to reach above-mentioned technique effect, technical scheme is as follows:
A kind of friends in social networking sites based on name entity recommends method, comprises the following steps:
S1: to the user u in candidate user collection UiAnd the speech text of follower carries out pretreatment and is named reality
The identification of body and connection, draw by user uiEntity in speech and entity row of the entity composition in its follower speech
Table ConEntity (ui);
S2: to user uiCarry out personal information extraction and be named identification and the connection of entity, drawing about user uiIndividual
List of entities InfEntity (the u of people's informationi);
S3: according to the list of entities ConEntity (u obtainedi) and InfEntity (ui) to user uiThe entity of preference
It is ranked up, obtains the list of entities UserEntity (u of a preference sequencei);
S4: utilize the UserEntity (u of each useri) carrying out similarity comparison, the user choosing highest similarity enters
Row is mutually recommended.
Preferably, to user u in described step S1iAnd the speech text of follower carries out the mode of pretreatment and includes point
Word, stop words filter.
Preferably, the userspersonal information in described step S2 includes geographical location information, hobby label.
Further, the detailed process of described step S3 is as follows:
K is made to represent the significance level that user mentions for entity e:
K=countConEntity(e)+1.2countInfEntity(e)
Wherein, countConEntity(e) presentation-entity e occurrence number in set ConEntity (U),
countInfEntity(e) presentation-entity e occurrence number in set ConEntity (U);
User's preference value to entity:
Wherein, Sim (ui, e) represent user uiWith the text similarity of entity e, Entity (e) the presentation-entity page is mentioned
The entity sets in addition to e, Entity (ui)=ConEntity (ui)∩InfEntity(ui) represent that user two is relevant real
The set of body, in (e) represents the number of the entity pointing to entity e,The popularity of presentation-entity, α and β is
Weight parameter.
Further, according to user, the preference value of entity is obtained user uiThe list of entities of preference sequence, chooses it
Middle top n entity forms list of entities UserEntity (ui), during less than N entity, room goes out fills with null.
Further, the detailed process of step S4 is as follows:
The distance of two inter-entity of calculating:
Wherein, Total is the quantity of all entities in knowledge base, and g (e) has link to point to entity e in knowledge base
The set of entity, for user u1With user u2, have list of entities UserEntity (u1) and UserEntity (u2), ei∈
UserEntity(u1), ej∈UserEntity(u2), i, j ∈ [1, N], eiWith user u2List similarity:
User u1With user u2Between similarity be:
Compared with prior art, technical solution of the present invention provides the benefit that:
The present invention sets up the list of entities that the entity during the entity in being made a speech is made a speech with its follower forms by user
ConEntity(ui), set up the list of entities InfEntity (u of userspersonal informationi), and according to the list of entities obtained
ConEntity(ui) and InfEntity (ui) entity of the preference of user is ranked up obtaining the entity row of preference sequence
Table, carries out, to the user in social networks, the friend recommendation that interest is similar by the similarity degree of list of entities between user.
Accompanying drawing explanation
Fig. 1 is the inventive method flow chart.
Detailed description of the invention
Accompanying drawing being merely cited for property explanation, it is impossible to be interpreted as the restriction to this patent;
In order to the present embodiment is more preferably described, some parts of accompanying drawing have omission, zoom in or out, and do not represent actual product
Size;
To those skilled in the art, in accompanying drawing, some known features and explanation thereof may be omitted is to be appreciated that
's.
With embodiment, technical scheme is described further below in conjunction with the accompanying drawings.
Embodiment 1
As it is shown in figure 1, a kind of friends in social networking sites based on name entity recommends method, comprise the following steps:
S1: to the user u in candidate user collection UiAnd the speech text of follower carries out pretreatment and is named reality
The identification of body and connection, draw by user uiEntity in speech and entity row of the entity composition in its follower speech
Table ConEntity (ui);
S2: to user uiCarry out personal information extraction and be named identification and the connection of entity, drawing about user uiIndividual
List of entities InfEntity (the u of people's informationi);
S3: according to the list of entities ConEntity (u obtainedi) and InfEntity (ui) to user uiThe entity of preference
It is ranked up, obtains the list of entities UserEntity (u of a preference sequencei);
S4: utilize the UserEntity (u of each useri) carrying out similarity comparison, the user choosing highest similarity enters
Row is mutually recommended.
To user u in step S1iAnd the speech text of follower carries out the mode of pretreatment and includes participle, stop words mistake
Filter.
Userspersonal information in step S2 includes geographical location information, hobby label.
The detailed process of step S3 is as follows:
K is made to represent the significance level that user mentions for entity e:
K=countConEntity(e)+1.2countInfEntity(e)
Wherein, countConEntity(e) presentation-entity e occurrence number in set ConEntity (U),
countInfEntity(e) presentation-entity e occurrence number in set ConEntity (U);
User's preference value to entity:
Wherein, Sim (ui, e) represent user uiWith the text similarity of entity e, Entity (e) the presentation-entity page is mentioned
The entity sets in addition to e, Entity (ui)=ConEntity (ui)∩InfEntity(ui) represent that user two is relevant real
The set of body, in (e) represents the number of the entity pointing to entity e,The popularity of presentation-entity, α and β is
Weight parameter, can be obtained by training set training.
According to user, the preference value of entity is obtained user uiThe list of entities of preference sequence, chooses wherein top n real
Body forms list of entities UserEntity (ui), during less than N entity, room goes out fills with null.In the present embodiment Europe, N takes 30,
During less than 30 entity, room goes out fills with null.
The detailed process of step S4 is as follows:
The distance of two inter-entity of calculating:
Wherein, Total is the quantity of all entities in knowledge base, and g (e) has link to point to entity e in knowledge base
The set of entity, for user u1With user u2, have list of entities UserEntity (u1) and UserEntity (u2), ei∈
UserEntity(u1), ej∈UserEntity(u2), i, j ∈ [1, N], eiWith user u2List similarity:
User u1With user u2Between similarity be:
Obtain and can mutually recommend between the user of highest similarity.Such as user u1, calculate respectively he with
The similarity of all users in other candidate user set, then chooses front n user and carries out friend recommendation or interested
User pay close attention to recommendation.
This method sets up the entity row that the entity during the entity in being made a speech is made a speech with its follower forms by user ui
Table ConEntity (ui), sets up list of entities InfEntity (ui) of userspersonal information, and according to the list of entities obtained
ConEntity (ui) and InfEntity (ui) is ranked up obtaining the entity row of preference sequence to the entity of the preference of user
Table, carries out, to the user in social networks, the friend recommendation that interest is similar by the similarity degree of list of entities between user.
The corresponding same or analogous parts of same or analogous label;
Described in accompanying drawing, position relationship is used for the explanation of being merely cited for property, it is impossible to be interpreted as the restriction to this patent;
Obviously, the above embodiment of the present invention is only for clearly demonstrating example of the present invention, and is not right
The restriction of embodiments of the present invention.For those of ordinary skill in the field, the most also may be used
To make other changes in different forms.Here without also cannot all of embodiment be given exhaustive.All at this
Any amendment, equivalent and the improvement etc. made within the spirit of invention and principle, should be included in the claims in the present invention
Protection domain within.
Claims (6)
1. a friends in social networking sites based on name entity recommends method, it is characterised in that comprise the following steps:
S1: to the user u in candidate user collection UiAnd the speech text of follower carries out pretreatment and is named the knowledge of entity
And connect, do not draw by user uiEntity in speech and a list of entities of the entity composition in its follower speech
ConEntity(ui);
S2: to user uiCarry out personal information extraction and be named identification and the connection of entity, drawing about user uiIndividual's letter
List of entities InfEntity (the u of breathi);
S3: according to the list of entities ConEntity (u obtainedi) and InfEntity (ui) to user uiThe entity of preference carry out
Sequence, obtains the list of entities UserEntity (u of a preference sequencei);
S4: utilize the UserEntity (u of each useri) carrying out similarity comparison, the user choosing highest similarity is carried out mutually
Recommend.
Friends in social networking sites based on name entity the most according to claim 1 recommends method, it is characterised in that described step
To user u in rapid S1iAnd the speech text of follower carries out the mode of pretreatment and includes that participle, stop words filter.
Friends in social networking sites based on name entity the most according to claim 1 recommends method, it is characterised in that described step
Userspersonal information in rapid S2 includes geographical location information, hobby label.
Friends in social networking sites based on name entity the most according to claim 1 recommends method, it is characterised in that described step
The detailed process of rapid S3 is as follows:
K is made to represent the significance level that user mentions for entity e:
K=countConEntity(e)+1.2countInfEntity(e)
Wherein, countConEntity(e) presentation-entity e occurrence number in set ConEntity (U), countInfEntity(e)
Presentation-entity e occurrence number in set ConEntity (U);
User's preference value to entity:
Wherein, Sim (ui, e) represent user uiWith the text similarity of entity e, Entity (e) the presentation-entity page mention except e
Outside entity sets, Entity (ui)=ConEntity (ui)∩InfEntity(ui) represent two related entities of user
Set, in (e) represents the number of the entity pointing to entity e,The popularity of presentation-entity, α and β is weight
Parameter.
The most according to claim 4 based on name entity friends in social networking sites recommend method, it is characterised in that according to
Family obtains user u to the preference value of entityiThe list of entities of preference sequence, chooses wherein top n entity and forms list of entities
UserEntity(ui), during less than N entity, room goes out fills with null.
Friends in social networking sites based on name entity the most according to claim 5 recommends method, it is characterised in that step S4
Detailed process as follows:
The distance of two inter-entity of calculating:
Wherein, Total is the quantity of all entities in knowledge base, and g (e) is the entity having link to point to entity e in knowledge base
Set, for user u1With user u2, have list of entities UserEntity (u1) and UserEntity (u2), ei∈
UserEntity(u1), ej∈UserEntity(u2), i, j ∈ [1, N], eiWith user u2List similarity:
User u1With user u2Between similarity be:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610357590.2A CN106055616A (en) | 2016-05-25 | 2016-05-25 | Friend recommendation method for social networking website based on named entities |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610357590.2A CN106055616A (en) | 2016-05-25 | 2016-05-25 | Friend recommendation method for social networking website based on named entities |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106055616A true CN106055616A (en) | 2016-10-26 |
Family
ID=57175226
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610357590.2A Pending CN106055616A (en) | 2016-05-25 | 2016-05-25 | Friend recommendation method for social networking website based on named entities |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106055616A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108959258A (en) * | 2018-07-02 | 2018-12-07 | 昆明理工大学 | It is a kind of that entity link method is integrated based on the specific area for indicating to learn |
CN109101540A (en) * | 2018-07-02 | 2018-12-28 | 湖北衣谷电子商务有限公司 | Friend location information excavating method for pushing and system based on social networks |
CN111597455A (en) * | 2020-04-29 | 2020-08-28 | 腾讯科技(深圳)有限公司 | Social relationship establishing method and device, electronic equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040044677A1 (en) * | 2000-03-08 | 2004-03-04 | Better T.V. Technologies Ltd. | Method for personalizing information and services from various media sources |
CN1739106A (en) * | 2001-09-28 | 2006-02-22 | 皇家飞利浦电子股份有限公司 | Individual recommender profile modification using profiles of others |
CN101079714A (en) * | 2006-12-13 | 2007-11-28 | 腾讯科技(深圳)有限公司 | A method and system for recommending friends in SNS community |
CN101105795A (en) * | 2006-10-27 | 2008-01-16 | 北京搜神网络技术有限责任公司 | Network behavior based personalized recommendation method and system |
-
2016
- 2016-05-25 CN CN201610357590.2A patent/CN106055616A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040044677A1 (en) * | 2000-03-08 | 2004-03-04 | Better T.V. Technologies Ltd. | Method for personalizing information and services from various media sources |
CN1739106A (en) * | 2001-09-28 | 2006-02-22 | 皇家飞利浦电子股份有限公司 | Individual recommender profile modification using profiles of others |
CN101105795A (en) * | 2006-10-27 | 2008-01-16 | 北京搜神网络技术有限责任公司 | Network behavior based personalized recommendation method and system |
CN101079714A (en) * | 2006-12-13 | 2007-11-28 | 腾讯科技(深圳)有限公司 | A method and system for recommending friends in SNS community |
Non-Patent Citations (3)
Title |
---|
余勇: "《面向微博用户的标签及好友推荐算法研究》", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
吕润桃 等: "《异质社交网络中多通道特征融合的好友推荐模型》", 《激光杂志》 * |
吴昊: "《云计算环境下的社交网络好友推荐系统研究》", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108959258A (en) * | 2018-07-02 | 2018-12-07 | 昆明理工大学 | It is a kind of that entity link method is integrated based on the specific area for indicating to learn |
CN109101540A (en) * | 2018-07-02 | 2018-12-28 | 湖北衣谷电子商务有限公司 | Friend location information excavating method for pushing and system based on social networks |
CN108959258B (en) * | 2018-07-02 | 2021-06-18 | 昆明理工大学 | Specific field integrated entity linking method based on representation learning |
CN111597455A (en) * | 2020-04-29 | 2020-08-28 | 腾讯科技(深圳)有限公司 | Social relationship establishing method and device, electronic equipment and storage medium |
CN111597455B (en) * | 2020-04-29 | 2024-03-01 | 腾讯科技(深圳)有限公司 | Social relation establishing method and device, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Perez et al. | Graph creation and analysis for linking actors: Application to social data | |
CN103823888B (en) | Node-closeness-based social network site friend recommendation method | |
CN104239399B (en) | Potential friend recommendation method in social networks | |
CN106886518A (en) | A kind of method of microblog account classification | |
CN107133282B (en) | Improved evaluation object identification method based on bidirectional propagation | |
CN105740382A (en) | Aspect classification method for short comment texts | |
Rokade et al. | Business intelligence analytics using sentiment analysis-a survey | |
Okazaki et al. | How to mine brand Tweets: Procedural guidelines and pretest | |
Bashar et al. | A bibliometric review of online impulse buying behaviour | |
CN105045822A (en) | Method for monitoring similar users of specific user in microblog | |
CN106055616A (en) | Friend recommendation method for social networking website based on named entities | |
Ataei et al. | Pars-ABSA: an aspect-based sentiment analysis dataset for Persian | |
Mishra et al. | How delightful is Indian wellness tourism? A netnographic study | |
Tri et al. | Exploiting geotagged resources to spatial ranking by extending hits algorithm | |
Teh et al. | Exploring fine-grained sentiment values in online product reviews | |
Singh | Role of social media in ecotourism development | |
CN105468780B (en) | The normalization method and device of ProductName entity in a kind of microblogging text | |
Gliwa et al. | Finding influential bloggers | |
Mardjo | Impacts of social media’s reputation, security, privacy and information quality on Thai young adults’ purchase intention towards Facebook commerce | |
Ma | Recommendation of sustainable economic learning course based on text vector model and support vector machine | |
Peláez-Verdet et al. | The usefulness of social media analysis within scholarly publications: a study of first-tier tourism journals | |
Budzanowska-Drzewiecka | Cross-cultural differences in electronic word-of-mouth: A literature review | |
Otsuka et al. | Evaluation of the reputation network using realistic distance between facebook data | |
Cohen et al. | Sentiment analysis in microblogging: a practical implementation | |
Rodavia et al. | AutoRec: A recommender system based on social media stream |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20161026 |