CN111324807A - Collaborative filtering recommendation method based on trust degree - Google Patents

Collaborative filtering recommendation method based on trust degree Download PDF

Info

Publication number
CN111324807A
CN111324807A CN202010030767.4A CN202010030767A CN111324807A CN 111324807 A CN111324807 A CN 111324807A CN 202010030767 A CN202010030767 A CN 202010030767A CN 111324807 A CN111324807 A CN 111324807A
Authority
CN
China
Prior art keywords
user
similarity
trust
item
recommendation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010030767.4A
Other languages
Chinese (zh)
Inventor
王凯旋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202010030767.4A priority Critical patent/CN111324807A/en
Publication of CN111324807A publication Critical patent/CN111324807A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/067Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Development Economics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Physics & Mathematics (AREA)
  • Finance (AREA)
  • Economics (AREA)
  • Accounting & Taxation (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Educational Administration (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a collaborative filtering recommendation method based on credibility, which is expected to recommend items to the most mental apparatus of users in the large environment of Internet information blowout, thereby saving the precious time of the users and improving the information acquisition efficiency. The algorithm is characterized in that: on the basis of a traditional collaborative filtering algorithm, attribute features (including age factors, occupational factors and gender factors), interestingness (a certain category of items preferred by a user), trust (the satisfaction degree of the user on recommendation of a certain part of users) and other dimensions are fused to model the preference of the user, so that the cold start and sparsity problems of a user scoring matrix are relieved as much as possible, and the most valuable, most desirable and most surprised recommendation is made for the user.

Description

基于信任度的协同过滤推荐方法Trust-based collaborative filtering recommendation method

技术领域technical field

本发明涉及计算机推荐,具体涉及推荐系统、协同过滤、相似度建模、信任度建模应用于所有需要推荐场景的计算机系统。The invention relates to computer recommendation, in particular to a computer system in which a recommendation system, collaborative filtering, similarity modeling and trust modeling are applied to all recommendation scenarios.

背景技术Background technique

21世纪是一个信息爆炸的时代,随着计算机技术的发展,海量的信息给信息查找造成了巨大的挑战,如何在庞大的信息数据海洋中找到自己需要的内容,使查找更加高效,才能让用户有更好的上网体验The 21st century is an era of information explosion. With the development of computer technology, massive amounts of information have created a huge challenge for information search. How to find the content you need in the huge ocean of information data and make the search more efficient can users Have a better online experience

基于以上的市场切实需求,推荐系统应运而生,这是一种根据用户之前的行为信息发现新的需求的一种方法,目前在各电商中广泛使用的个性化推荐算法当属协同过滤推荐算法,但是用户的评分矩阵终究是一个稀疏矩阵,因此:新用户的冷启动问题,数据稀疏性和推荐准确性仍然需要持续研究改进。Based on the above practical market demands, the recommendation system came into being. This is a method to discover new demands based on the user's previous behavior information. Currently, the personalized recommendation algorithm widely used in various e-commerce companies is collaborative filtering recommendation. algorithm, but the user's rating matrix is a sparse matrix after all. Therefore, the cold start problem of new users, data sparsity and recommendation accuracy still need continuous research and improvement.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于优化现有推荐算法的不足之处,为解决数据稀疏性,用户冷启动问题,以及提升推荐准确度提供一些方法。The purpose of the present invention is to optimize the shortcomings of the existing recommendation algorithms, and to provide some methods for solving the problems of data sparsity, user cold start, and improving recommendation accuracy.

本发明对相似性的贡献点如下:The contribution points of the present invention to similarity are as follows:

结合属性特征。本发明涉及的属性特征包括年龄,职业,性别,这些特征标识了用户的生活状态。也影响着用户的喜好通过分析属性特征的相似性可以提高推荐准确性。Combine attribute features. The attribute features involved in the present invention include age, occupation, and gender, and these features identify the user's living state. It also affects the user's preferences. By analyzing the similarity of attribute features, the recommendation accuracy can be improved.

结合兴趣度。兴趣度是推荐系统的核心推荐依据,好的兴趣度模型可大大的提高推荐准确性。Combine interest. Interest degree is the core recommendation basis of recommendation system, and a good interest degree model can greatly improve the accuracy of recommendation.

结合信任度。本文将信任度分为直接信任度和间接信任度,直接信任度可以提高推荐准确性,间接信任度可以填充评分矩阵,提高准确性的同时还能改善数据稀疏性。Combined with trust. In this paper, trust is divided into direct trust and indirect trust. Direct trust can improve the recommendation accuracy, and indirect trust can fill the scoring matrix, which can improve the accuracy and data sparsity.

表1基于m个用户n个项目的m*n的用户-评分矩阵R(m,n)Table 1 User-rating matrix R(m,n) of m*n based on m users with n items

I<sub>1</sub>I<sub>1</sub> I<sub>2</sub>I<sub>2</sub> I<sub>n</sub>I<sub>n</sub> U<sub>1</sub>U<sub>1</sub> R<sub>11</sub>R<sub>11</sub> R<sub>12</sub>R<sub>12</sub> R<sub>1n</sub>R<sub>1n</sub> U<sub>2</sub>U<sub>2</sub> R<sub>21</sub>R<sub>21</sub> R<sub>22</sub>R<sub>22</sub> R<sub>2n</sub>R<sub>2n</sub> U<sub>m</sub>U<sub>m</sub> R<sub>m1</sub>R<sub>m1</sub> R<sub>m2</sub>R<sub>m2</sub> R<sub>mn</sub>R<sub>mn</sub>

本发明采用的技术方案为基于信任度的协同过滤推荐方法,该方法的实现过程如下:The technical solution adopted in the present invention is a trust-based collaborative filtering recommendation method, and the implementation process of the method is as follows:

1)用户属性特征相似性建模1) User attribute feature similarity modeling

用户属性特征相似性模型的用户属性综合性别、年龄、职业三者对用户的兴趣的影响,建立过程如下:The user attribute of the user attribute feature similarity model integrates the influence of gender, age, and occupation on the user's interest. The establishment process is as follows:

a)性别相似度建模a) Gender similarity modeling

性别不同兴趣可能天差地别,例如女性喜欢化妆品,服装,首饰的兴趣显然高于男性,而男性普遍又更倾向于电子产品。假设用户u 的性别为Su,用户v的性别为Sv,则用户u和用户v的性别相似度 S(u,v)如下所示:The interests of different genders may be very different. For example, women like cosmetics, clothing, and jewelry are obviously more interested than men, and men generally prefer electronic products. Assuming that the gender of user u is Su and the gender of user v is S v , the gender similarity S( u ,v) of user u and user v is as follows:

Figure BDA0002364206930000021
Figure BDA0002364206930000021

b)年龄相似度建模b) Age similarity modeling

年龄不同爱好也会随之不同,因此引入年龄模型来优化相似度模型,设用户u的年龄为Au,用户v的年龄为Av则用户u和用户v的年龄相似度A(u,v)如下:Different ages have different hobbies. Therefore, an age model is introduced to optimize the similarity model. Let the age of user u be A u and the age of user v to be A v , then the age similarity of user u and user v A(u,v )as follows:

Figure BDA0002364206930000022
Figure BDA0002364206930000022

c)职业相似度建模c) Occupational similarity modeling

将职业按种类分类为一个树形结构如下图2,图2中任意两个节点的长度设为1,用Height表示总长度职业a和职业b最近的父节点在职业树中的层数称为它的高度,记Ha,b用户u和用户v的职业分别用职业a和职业b表示,则职业相似性O(u,v)表示如下:The occupations are classified into a tree structure by type as shown in Figure 2. The length of any two nodes in Figure 2 is set to 1, and the height is used to represent the total length. The number of layers in the occupation tree of the nearest parent node of occupation a and occupation b is called Its height, remember H a, b The occupation of user u and user v is represented by occupation a and occupation b respectively, then occupation similarity O(u, v) is expressed as follows:

Figure BDA0002364206930000031
Figure BDA0002364206930000031

综上,将性别,年龄,职业因素综合考虑得到用户的属性相似度 sima(u,v)如下:To sum up, the user's attribute similarity sim a (u, v) is obtained by comprehensively considering gender, age, and occupation factors as follows:

sima(u,v)=αS(u,v)+βA(u,v)+γO(u,v) (4)sim a (u,v)=αS(u,v)+βA(u,v)+γO(u,v) (4)

其中α,β,γ根据具体系统动态调参;Among them, α, β, γ are dynamically adjusted according to the specific system;

2)用户兴趣特征相似性建模;2) Similarity modeling of user interest features;

上部分探讨的属性特征在冷启动情况下为提供了推荐方案,但是用户的评论足迹丰富后便可以对用户特有兴趣度进行建模,兴趣对于选择的影响异常重要,本方法所探讨的兴趣度是用户对某类项目的兴趣度,即将用户对某种项目的评价比重作为他对这类项目的兴趣度度量。设NIu,i表示用户u对i类项目的评价总数,NIu表示用户已评价的项目总数,那么用户u对i类项目的兴趣度Iu,i如下:The attribute features discussed in the above section provide a recommendation solution in the case of cold start, but after the user's comment footprint is enriched, the user's specific interest degree can be modeled. The influence of interest on selection is extremely important. The interest degree discussed in this method It is the user's interest in a certain type of item, that is, the user's evaluation ratio for a certain item is used as a measure of his interest in this type of item. Suppose NI u,i represents the total number of evaluations by user u on items of type i, and NI u represents the total number of items that the user has evaluated, then the degree of interest I u,i of user u to items of type i is as follows:

Figure BDA0002364206930000032
Figure BDA0002364206930000032

用式(5)得到用户u的兴趣度集合为Iu=(Iu,1,Iu,2,...Iu,i)表示,其中 Iuv表示用户u和用户v共同评分的项目种类数,

Figure BDA0002364206930000033
用来标识用户u 对所有项目种类的平均兴趣度,根据皮尔逊相关相似度计算得到,两用户间的兴趣相似度simI(u,v)如下:Using formula (5), the set of interest degree of user u can be obtained as I u =(I u,1 ,I u,2 ,...I u,i ), where I uv represents the item jointly rated by user u and user v number of species,
Figure BDA0002364206930000033
It is used to identify the average interest degree of user u to all item types, calculated according to the Pearson correlation similarity, and the interest similarity sim I (u, v) between two users is as follows:

Figure BDA0002364206930000041
Figure BDA0002364206930000041

3)相似性计算:3) Similarity calculation:

用户相似度的计算依赖于用户-项目评分表,每个用户兴趣用一个评分向量表示,来与表1中的每一行相映射,所以用户相似度的计算本质上便是计算用户评分向量之间的距离。用户相似度的计算方法众多,选取约束皮尔逊相关相似度和jaccard相似度进行相关计算。The calculation of user similarity depends on the user-item rating table. Each user interest is represented by a rating vector to map with each row in Table 1. Therefore, the calculation of user similarity is essentially the calculation of the relationship between user rating vectors. the distance. There are many calculation methods for user similarity, and constrained Pearson correlation similarity and jaccard similarity are selected for correlation calculation.

约束皮尔逊相关相似度:Ruv表示用户u和用户v的共同评分集合,Ru,i标识用户u对项目i的评分,Rv,i标识用户v对项目i的评分,

Figure BDA0002364206930000042
代表用户u所有评分的均值,
Figure 100002_1
标识用户v所有评分的均值,则用户 u和用户v的评分相似度用皮尔逊相关相似性表示如下:Constrained Pearson correlation similarity: R uv represents the common rating set of user u and user v, R u,i identifies user u’s rating for item i, R v,i identifies user v’s rating for item i,
Figure BDA0002364206930000042
represents the mean of all ratings of user u,
Figure 100002_1
Identifying the mean of all ratings of user v, then the similarity of ratings between user u and user v is expressed as Pearson correlation similarity as follows:

Figure BDA0002364206930000044
Figure BDA0002364206930000044

Jaccard相似度:余弦相似度和约束皮尔逊相关相似度是基于用户共同的评分项目来度量相似度的,在数据稀疏的情况下,这两种相似度计算方法都存在共同的问题,即当数据稀疏到用户u与用户v只有一个共同的评分项目,且两人都对这个项目评了相同的低分,此时说明用户u和用户v都不偏好于这个项目,但是余弦相似度与约束皮尔逊相关相似度却认为二者的相似度是最大值1,从而优先将这两个用户的喜好相互推荐。为了缓解这种情况引入Jarccard相似度,它的相似度计算方法是基于用户间共同评分的个数来衡量的,能够有效避免上述问题的发生。其计算公式如下所示:Jaccard similarity: Cosine similarity and constrained Pearson correlation similarity measure similarity based on the common rating items of users. In the case of sparse data, these two similarity calculation methods have a common problem, that is, when the data is sparse. It is sparse to the point that user u and user v have only one common rating item, and both of them rated this item with the same low score. At this time, it means that neither user u nor user v prefers this item, but the cosine similarity is related to the constraint Peel. However, the similarity between the two users is considered to be the maximum value of 1, so that the preferences of the two users are recommended to each other. In order to alleviate this situation, Jarccard similarity is introduced, and its similarity calculation method is based on the number of common scores among users, which can effectively avoid the occurrence of the above problems. Its calculation formula is as follows:

Figure BDA0002364206930000045
Figure BDA0002364206930000045

式中|Iu,v|表示用户u和用户v的共同评分的项目总数,|Iu|,|Iv|表示用户u,和用户v各自评价的项目数量,Jac(u,v)的取值范围为[0,1],值越大表明二者相关度越高。where |Iu ,v | represents the total number of items jointly rated by user u and user v, |I u |, |I v | represent the number of items evaluated by user u and user v respectively, and Jac (u, v) takes The value range is [0, 1], and the larger the value, the higher the correlation between the two.

4)用户信任度建模:4) User trust modeling:

本方法将社交中的信任因素融合到协同过滤推荐算法中,对信任的定义为:目标用户对其他用户所提出的推荐意见的安全性,真诚性,有效性的肯定。用户a对用户b做的所有推荐项目n,若用户b接受说明对这笔推荐信任,不接受为不信任,用户b对用户a的信任度为接受的笔数与n的比值。This method integrates the social trust factor into the collaborative filtering recommendation algorithm, and the definition of trust is: the target user's affirmation of the security, sincerity, and validity of the recommendation opinions put forward by other users. For all the recommended items n made by user a to user b, if user b accepts the recommendation, it indicates that he trusts the recommendation, and if he does not accept it, he does not trust the recommendation.

4.1)直接信任度计算4.1) Direct trust calculation

直接信任度是由用户主动地直接给出对其他用户的信任评分,但是此方案要求每个用户都要给出其他用户的信任评分,实际操作困难,故此本方法采用了基于用户共同评分项目的直接信任度计算,方法如下:Direct trust is the user's initiative to give the trust score of other users directly, but this scheme requires each user to give the trust score of other users, which is difficult to operate in practice. The direct trust degree is calculated as follows:

DTrustu,v=|sim(u,v)|×Jac(u,v) (9)DTrust u,v =|sim(u,v)|×Jac (u,v) (9)

4.2)间接信任度计算4.2) Indirect trust calculation

用户之间的信任关系具有主观性、弱传递性、非对称性、动态性的特征。由于信任度具有传递性,缓解评分矩阵的稀疏性问题,两用户之间的直接信任度取决于二者是否拥有共同的评分项目,有共同评分的用户之间才存在直接信任度,但是生活中往往发现如果用户u信任用户i,用户i信任用户v那么用户u对用户v的往往存在信任关系,这就是信任的传递性,但是鉴于信任度的衰减很快,为提高准确度只取到二阶信任度,计算公式如下:The trust relationship between users has the characteristics of subjectivity, weak transitivity, asymmetry and dynamics. Since the trust degree is transitive, the sparsity problem of the rating matrix is alleviated. The direct trust degree between two users depends on whether they have a common rating item. There is direct trust between users with common ratings, but in life It is often found that if user u trusts user i, and user i trusts user v, then user u often has a trust relationship with user v, which is the transitivity of trust. However, since the trust degree decays rapidly, in order to improve the accuracy, only two The order of confidence is calculated as follows:

Figure BDA0002364206930000051
Figure BDA0002364206930000051

其中,PTrustu,v表示用户u,用户v之间的间接信任度,adj(u,v) 表示满足条件DTrustu,b≥λ,DTrustb,v≥λ的中间用户的集合,λ参数是一个可调节的信用度阈值,表示只有直接信用度大于λ的中间用户才能被纳入间接信任度计算。Among them, PTrust u,v represents the indirect trust degree between user u and user v, adj(u,v) represents the set of intermediate users that satisfy the condition DTrust u,b ≥λ,DTrust b,v ≥λ, and the λ parameter is An adjustable credit threshold, indicating that only intermediate users whose direct credit is greater than λ can be included in the indirect trust calculation.

用户u和用户v之间的信任度表示为Trustu,v,那么:The trust degree between user u and user v is expressed as Trust u, v , then:

Figure BDA0002364206930000061
Figure BDA0002364206930000061

5)综上所述:5) In summary:

融合用户属性与用户兴趣度,以及评分相似度,信任度的综合相似度simz(u,v)用如下公式表示:δ,ε,μ,ρ可根据具体系统对各因素的依赖程度不同动态调参Integrating user attributes, user interest, and scoring similarity, the comprehensive similarity of trust degree sim z (u, v) is expressed by the following formula: δ, ε, μ, ρ can vary dynamically according to the degree of dependence of the specific system on each factor parameter tuning

simz(u,v)=δsim(u,v)+εsimI(u,v)+μsima(u,v)+ρTrustu,v (12)sim z (u,v)=δsim(u,v)+εsim I (u,v)+μsim a (u,v)+ρTrust u,v (12)

6)产生推荐结果:6) Generate recommended results:

根据用户u,用户v的综合相似度simz(u,v),According to the comprehensive similarity sim z (u, v) of user u and user v,

设用户v对项目i的评分为Rv,i,用户v对所有项目的评分均值为

Figure BDA0002364206930000062
Pu,i表示根据v的评分推测的用户u对项目i的评分,Nu为所有预测评分中评分最高的 top-N个集合,即为最终推荐的项目组合Let the rating of user v to item i be R v,i , and the average rating of user v to all items is
Figure BDA0002364206930000062
P u,i represents the user u's rating for item i based on the rating of v, and N u is the top-N set with the highest rating among all the predicted ratings, which is the final recommended item combination

Figure BDA0002364206930000063
Figure BDA0002364206930000063

7)对本方法的评价标准:7) Evaluation criteria for this method:

采用标准平均误差(MAE)作为评级标准来评价算法的推荐质量,在推荐算法的研究中MAE是公认的使用最广泛的评价指标,计算系统对项目的预测评分与用户对项目的实际评分的平均绝对偏差, MAE值越小表示推荐系统的推荐精度越高,计算公式如下:The standard mean error (MAE) is used as the rating standard to evaluate the recommendation quality of the algorithm. In the research of recommendation algorithms, MAE is recognized as the most widely used evaluation index. Absolute deviation. The smaller the MAE value, the higher the recommendation accuracy of the recommendation system. The calculation formula is as follows:

Figure BDA0002364206930000064
Figure BDA0002364206930000064

其中:Ru,i表示用户u对项目i的实际评分,Pu,i为根据本方法得到的用户u对项目i的预测评分。Among them: R u,i represents the actual rating of user u to item i, and P u,i is the predicted rating of user u to item i obtained according to this method.

附图说明Description of drawings

图1为本方法实施的流程图。Figure 1 is a flow chart of the implementation of the method.

图2为职业树图。Figure 2 is a career tree diagram.

具体实施方式Detailed ways

以下结合附图和实施例对本发明进行详细说明。The present invention will be described in detail below with reference to the accompanying drawings and embodiments.

基于信任度的协同过滤推荐方法,其特征在于:在传统的协同过滤算法基础上融合了属性特征,兴趣度,信任度等多种维度对用户的爱好进行建模,尽最大可能来缓解用户评分矩阵的冷启动,稀疏性问题,为用户做出最有价值,最合心意,最有惊喜的推荐。The trust-based collaborative filtering recommendation method is characterized in that: on the basis of the traditional collaborative filtering algorithm, it integrates attributes, interest, trust and other dimensions to model the user's hobbies, so as to reduce user ratings as much as possible. The cold start of the matrix, the sparsity problem, makes the most valuable, most desirable and most surprising recommendations for users.

在传统的协同过滤推荐算法的基础上融合了用户的属性相似度,将年龄,职业,性别这些可能影响用户抉择的属性融合进算法的推荐过程中Based on the traditional collaborative filtering recommendation algorithm, the user's attribute similarity is integrated, and attributes such as age, occupation, and gender that may affect the user's choice are integrated into the algorithm's recommendation process.

在传统的协同过滤算法的基础上结合了用户属性特征相似性,将用户对某一类项目的偏好充分考虑进算法中,Based on the traditional collaborative filtering algorithm, it combines the similarity of user attributes and features, and fully considers the user's preference for a certain type of item into the algorithm.

传统的协同过滤算法的基础上,用户信任度的建模采用直接信任度与间接信任度的两个维度的计算,缓解了用户评分矩阵稀疏性的问题Based on the traditional collaborative filtering algorithm, the modeling of user trust uses two dimensions of direct trust and indirect trust, which alleviates the problem of the sparsity of the user rating matrix.

最终相似度的计算方式可根据不同的场景,动态调整不同影响因子的比重参数,最大限度的取得最满足用户需求的项目。The calculation method of the final similarity can dynamically adjust the proportion parameters of different influencing factors according to different scenarios, so as to maximize the items that best meet the needs of users.

Claims (1)

1. The collaborative filtering recommendation method based on the trust degree is characterized by comprising the following steps: the method comprises the following implementation processes:
1) user attribute feature similarity modeling
The user attribute of the user attribute feature similarity model integrates the influence of gender, age and occupation on the interest of the user, and the establishing process is as follows:
a) gender similarity modeling
Suppose user u has a gender SuGender of user v is SvThen, the gender similarity S (u, v) of user u and user v) As follows:
Figure FDA0002364206920000011
b) age similarity modeling
Let the age of user u be AuAge of user v is AvThe age similarity a (u, v) of user u and user v is as follows:
Figure FDA0002364206920000012
c) occupational similarity modeling
Classifying careers into a tree structure according to categories, setting the length of any two nodes to be 1, expressing the total length of the layer number of the nearest parent node of career a and career b in the career tree by Height to be called the Height thereof, and recording Ha,bThe professions of user u and user v are denoted by profession a and profession b, respectively, and the profession similarity O (u, v) is expressed as follows:
Figure FDA0002364206920000013
in conclusion, the attribute similarity sim of the user is obtained by comprehensively considering the gender, the age and the occupational factorsa(u, v) are as follows:
sima(u,v)=αS(u,v)+βA(u,v)+γO(u,v) (4)
α, adjusting the parameters of gamma according to the specific system;
2) modeling the similarity of user interest characteristics;
provided with NIu,iIndicates the total number of evaluations, NI, of user u on the i-class itemuRepresenting the total number of the items evaluated by the user, the interest degree I of the user u in the I-type itemsu,iThe following were used:
Figure FDA0002364206920000021
obtaining the interest degree set I of the user u by using the formula (5)u=(Iu,1,Iu,2,...Iu,i) Is shown in the formula IuvThe number of categories of items that represent user u and user v to score together,
Figure FDA0002364206920000026
the method is used for identifying the average interest degree of the user u to all the item types, and the interest degree sim between the two users is obtained through calculation according to the Pearson correlation similarityI(u, v) are as follows:
Figure FDA0002364206920000022
3) similarity calculation:
the calculation of the user similarity relies on the user-item score table, each user interest is represented by a score vector to map with each row in table 1, so the calculation of the user similarity essentially calculates the distance between the user score vectors; the method for calculating the user similarity is numerous, and restricted Pearson correlation similarity and jaccard similarity are selected for correlation calculation;
and (3) constraining the Pearson correlation similarity: ruvRepresenting a common set of scores for user u and user v, Ru,iIdentifying a rating, R, for item i by user uv,iThe rating of item i by user v is identified,
Figure FDA0002364206920000023
represents the mean of all the scores of the user u,
Figure 1
identifying the mean of all scores of user v, the score similarity of user u and user v is expressed in terms of pearson's relative similarity as follows:
Figure FDA0002364206920000025
jaccard similarity: the calculation formula is as follows:
Figure FDA0002364206920000031
in the formula Iu,vI represents the total number of commonly scored items for user u and user v, Iu|,|IvI represents the number of items, Jac, evaluated by user u and user v, respectively(u,v)Has a value range of [0, 1 ]]A larger value indicates a higher correlation between the two;
4) modeling the user trust:
fusing trust factors in social contact into a collaborative filtering recommendation algorithm, wherein if the user a receives a recommendation, the recommendation is not accepted as untrustworthy, and the trust degree of the user b on the user a is the ratio of the accepted number of strokes to n, and all recommended items n made by the user a on the user b are recommended;
4.1) direct confidence calculation
The direct trust degree is that a user actively and directly gives trust scores of other users, but the scheme requires that each user gives trust scores of other users, and the actual operation is difficult, so the method adopts direct trust degree calculation based on a common user score project, and comprises the following steps:
DTrustu,v=|sim(u,v)|×Jac(u,v)(9)
4.2) Indirect confidence computation
The direct trust between two users depends on whether the two users have a common scoring item, the direct trust exists between the users with common scoring, if the user u trusts the user i, and the user i trusts the user v, the user u often has a trust relationship to the user v, which is the transitivity of trust, and only the second-order trust is taken, and the calculation formula is as follows:
Figure FDA0002364206920000032
DTrustu,b≥λ,DTrustb,v≥λ (10)
wherein, PTrustu,vRepresents the indirect trust between the user u and the user v, and adj (u, v) represents the satisfaction of the condition DTrustu,b≥λ,DTrustb,vIn ≧ λThe lambda parameter is an adjustable credit threshold value, which indicates that only the intermediate users with direct credit larger than lambda can be brought into indirect trust calculation;
the Trust level between user u and user v is denoted Trustu,vThen:
Figure FDA0002364206920000041
5) in summary, the following steps:
integrating user attributes and user interest degrees, score similarity and trust degreez(u, v) is expressed by the following formula: the delta, epsilon, mu and rho can be dynamically adjusted according to different dependence degrees of a specific system on various factors
simz(u,v)=δsim(u,v)+εsimI(u,v)+μsima(u,v)+ρTrustu,v(12)
6) Generating a recommendation result:
according to the comprehensive similarity sim of the user u and the user vz(u, v) let user v score item i be Rv,iThe average of the scores of the user v for all the items is
Figure FDA0002364206920000042
Pu,iDenotes the rating, N, of item i by user u based on the rating inference of vuThe top-N sets with the highest scores in all the prediction scores are the final recommended item combination
Figure FDA0002364206920000043
7) Evaluation criteria for the method:
the recommendation quality of the algorithm is evaluated by taking the standard average error MAE as a rating standard, the average absolute deviation of the prediction score of the computing system to the item and the actual score of the user to the item is calculated, the smaller the MAE value is, the higher the recommendation precision of the recommendation system is, and the calculation formula is as follows:
Figure FDA0002364206920000044
wherein: ru,iRepresents the actual rating, P, of user u for item iu,iAnd (4) the predicted scores of the user u on the item i are obtained according to the method.
CN202010030767.4A 2020-01-13 2020-01-13 Collaborative filtering recommendation method based on trust degree Pending CN111324807A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010030767.4A CN111324807A (en) 2020-01-13 2020-01-13 Collaborative filtering recommendation method based on trust degree

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010030767.4A CN111324807A (en) 2020-01-13 2020-01-13 Collaborative filtering recommendation method based on trust degree

Publications (1)

Publication Number Publication Date
CN111324807A true CN111324807A (en) 2020-06-23

Family

ID=71168719

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010030767.4A Pending CN111324807A (en) 2020-01-13 2020-01-13 Collaborative filtering recommendation method based on trust degree

Country Status (1)

Country Link
CN (1) CN111324807A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111523054A (en) * 2020-04-26 2020-08-11 山东师范大学 Project recommendation method and system based on active account and similar accounts
CN112364243A (en) * 2020-11-11 2021-02-12 吉林师范大学 Big data-based information recommendation system
CN112883289A (en) * 2021-04-16 2021-06-01 河北工程大学 PMF recommendation method based on social trust and tag semantic similarity
CN113407863A (en) * 2021-06-17 2021-09-17 天津大学 Personalized product service scheme recommendation method based on trust and forgetting function
CN114004673A (en) * 2021-10-29 2022-02-01 辽宁大学 A Resource Recommendation Method for Second-hand Trading Platforms Based on Hybrid Recommendation
CN114647773A (en) * 2020-12-17 2022-06-21 赣南师范大学 Improved collaborative filtering method based on multiple linear regression and third-party credit
CN114766028A (en) * 2019-11-11 2022-07-19 廉长烈 System for making customized undergarments for a customer

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105528419A (en) * 2015-11-30 2016-04-27 合肥工业大学 Personalized search method based on modeling considering user-author relation
CN109815402A (en) * 2019-01-23 2019-05-28 北京工业大学 Collaborative filtering recommendation algorithm based on user characteristics
CN109978709A (en) * 2019-04-08 2019-07-05 燕山大学 A kind of trust network construction method based on node interbehavior
US10475105B1 (en) * 2018-07-13 2019-11-12 Capital One Services, Llc Systems and methods for providing improved recommendations

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105528419A (en) * 2015-11-30 2016-04-27 合肥工业大学 Personalized search method based on modeling considering user-author relation
US10475105B1 (en) * 2018-07-13 2019-11-12 Capital One Services, Llc Systems and methods for providing improved recommendations
CN109815402A (en) * 2019-01-23 2019-05-28 北京工业大学 Collaborative filtering recommendation algorithm based on user characteristics
CN109978709A (en) * 2019-04-08 2019-07-05 燕山大学 A kind of trust network construction method based on node interbehavior

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
蒋宗礼: "基于用户特征的协同过滤推荐算法", 《计算机系统应用》 *
蒋宗礼: "融合用户相似度与信任度的协同过滤推荐算法", 《软件导刊》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114766028A (en) * 2019-11-11 2022-07-19 廉长烈 System for making customized undergarments for a customer
CN111523054A (en) * 2020-04-26 2020-08-11 山东师范大学 Project recommendation method and system based on active account and similar accounts
CN112364243A (en) * 2020-11-11 2021-02-12 吉林师范大学 Big data-based information recommendation system
CN114647773A (en) * 2020-12-17 2022-06-21 赣南师范大学 Improved collaborative filtering method based on multiple linear regression and third-party credit
CN114647773B (en) * 2020-12-17 2024-03-22 赣南师范大学 Improved collaborative filtering method based on multiple linear regression and third-party credit
CN112883289A (en) * 2021-04-16 2021-06-01 河北工程大学 PMF recommendation method based on social trust and tag semantic similarity
CN112883289B (en) * 2021-04-16 2022-05-06 河北工程大学 A PMF Recommendation Method Based on Social Trust and Tag Semantic Similarity
CN113407863A (en) * 2021-06-17 2021-09-17 天津大学 Personalized product service scheme recommendation method based on trust and forgetting function
CN114004673A (en) * 2021-10-29 2022-02-01 辽宁大学 A Resource Recommendation Method for Second-hand Trading Platforms Based on Hybrid Recommendation

Similar Documents

Publication Publication Date Title
CN111324807A (en) Collaborative filtering recommendation method based on trust degree
CN106682114B (en) Personalized recommendation method integrating user trust relationship and comment information
CN107506480B (en) A bi-level graph structure recommendation method based on review mining and density clustering
CN108665323B (en) Integration method for financial product recommendation system
CN108573041B (en) Probabilistic matrix factorization recommendation method based on weighted trust relationship
US20090125382A1 (en) Quantifying a Data Source&#39;s Reputation
CN108256093A (en) A kind of Collaborative Filtering Recommendation Algorithm based on the more interest of user and interests change
CN111681084B (en) E-commerce platform recommendation method based on social relationship influence factors
CN109598586B (en) Recommendation method based on attention model
CN106570525A (en) Method for evaluating online commodity assessment quality based on Bayesian network
CN106294758A (en) Collaborative recommendation method based on the change of user cognition degree
CN111061962A (en) A Recommendation Method Based on User Score Analysis
CN104077357A (en) User based collaborative filtering hybrid recommendation method
CN109241399A (en) A kind of personalized recommendation method combined based on trust and influence power
CN107025606B (en) An item recommendation method combining rating data and trust relationship in social network
US20160132811A1 (en) Influential Peers
CN111523055A (en) Collaborative recommendation method and system based on agricultural product characteristic attribute comment tendency
CN111639258A (en) News recommendation method based on neural network
CN105761154B (en) A social recommendation method and device
CN111475744B (en) Personalized position recommendation method based on ensemble learning
CN104572915A (en) User event relevance calculation method based on content environment enhancement
CN113836393A (en) A cold-start recommendation method based on preference adaptive meta-learning
CN114022233A (en) A Novel Product Recommendation Method
CN112364254A (en) Collaborative filtering recommendation system and method for improving user similarity
CN110309864A (en) A Method of Collaborative Filtering Recommendation Scheme Fusion of Local Similarity and Global Similarity

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200623

RJ01 Rejection of invention patent application after publication