CN110851705A - Item-based collaborative storage recommendation method and recommendation device - Google Patents

Item-based collaborative storage recommendation method and recommendation device Download PDF

Info

Publication number
CN110851705A
CN110851705A CN201910953264.1A CN201910953264A CN110851705A CN 110851705 A CN110851705 A CN 110851705A CN 201910953264 A CN201910953264 A CN 201910953264A CN 110851705 A CN110851705 A CN 110851705A
Authority
CN
China
Prior art keywords
item
neighborhood
user
similarity
representation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910953264.1A
Other languages
Chinese (zh)
Inventor
喻梅
胡悦
李雪威
于瑞国
赵满坤
徐天一
许林英
刘宏伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201910953264.1A priority Critical patent/CN110851705A/en
Publication of CN110851705A publication Critical patent/CN110851705A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9536Search customisation based on social or collaborative filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a collaborative storage recommendation method and device based on projects, wherein the method comprises the following steps: embedding the user information and the project information respectively to obtain similarity; adding the similarity into an attention mechanism to set a weighting function, and operating the formed weight and the project neighborhood to form neighborhood representation; setting a score function through project information embedding and neighborhood representation; setting a loss function using bayesian personalization-ranking optimization; and realizing the collaborative storage recommendation according to the score function and the loss function. The device comprises: the device comprises a similarity calculation module, a neighborhood representation module, a setting module and a recommendation module. The method effectively overcomes the limitation that only the connection between users is considered, can fully consider the distribution of potential influences between the items and the users, strengthens the relation between a specific item and the neighborhood, considers the potential relation between the items and the users, adopts a higher-order method, enhances the expandability of recommendation, and improves the accuracy and precision of information retrieval and results.

Description

一种基于项目的协作存储推荐方法及其推荐装置Item-based collaborative storage recommendation method and recommendation device

技术领域technical field

本发明涉及自然语言处理和信息检索领域,尤其涉及一种基于项目的协作存储推荐方法及其推荐装置。The present invention relates to the fields of natural language processing and information retrieval, in particular to an item-based collaborative storage recommendation method and a recommendation device thereof.

背景技术Background technique

推荐系统根据用户的兴趣特点与历史行为为用户推荐其所感兴趣的信息和商品,浏览大量无关的信息和产品过程会使用户淹没在信息过载的问题中,个性化推荐系统是建立在海量数据挖掘基础上的一种高级智能平台,以为用户提供完全个性化的决策支持和信息服务,为提高推荐的准确性与效果,多种推荐方法应运而生。基于内容的推荐方法是信息过滤技术的延续与发展,它是建立在项目的内容信息上作出推荐的,而不需要依据用户对项目的评价意见,更多地需要用机器学习的方法从关于内容的特征描述的事例中得到用户的兴趣资料,但其要求特征内容有良好的结构性。基于协同过滤的推荐技术是推荐系统中应用最早和最为成功的技术之一,它一般采用最近邻技术,利用用户的历史喜好信息计算用户之间的距离,然后利用目标用户的最近邻居用户对商品评价的加权评价值来预测目标用户对特定商品的喜好程度,系统从而根据这一喜好程度来对目标用户进行推荐。The recommendation system recommends the information and products of interest to the user according to the user's interest characteristics and historical behavior. The process of browsing a large amount of irrelevant information and products will make the user drowned in the problem of information overload. The personalized recommendation system is based on massive data mining. Based on an advanced intelligent platform, it provides users with fully personalized decision support and information services. In order to improve the accuracy and effect of recommendation, a variety of recommendation methods have emerged. The content-based recommendation method is the continuation and development of information filtering technology. It is based on the content information of the item to make recommendations, and does not need to be based on the user's evaluation of the item. The user's interest information is obtained in the case of the feature description, but it requires the feature content to have a good structure. The recommendation technology based on collaborative filtering is one of the earliest and most successful technologies in the recommendation system. It generally adopts the nearest neighbor technology, uses the user's historical preference information to calculate the distance between users, and then uses the target user's nearest neighbor user to evaluate the product. The weighted evaluation value of the evaluation is used to predict the preference degree of the target user for a specific commodity, and the system recommends the target user according to the preference degree.

在推荐中,利用注意力机制能投对序列学习任务带来巨大的提升作用,注意力模型可以对数据序列进行数据加权的变换,硬注意力在每个时刻只关注模型序列的某一个位置,而软注意力每次会照顾到全部的位置,每个位置的权重不同。局部注意力可视为硬注意力和软注意力在优势上的混合,不同于硬注意力,局部注意力几乎处处可微,易于训练,每次只关注一小部分的源点位置,而全局注意力每次需要扫描全部的源隐藏状态。In the recommendation, the use of the attention mechanism can greatly improve the sequence learning task. The attention model can perform data-weighted transformation on the data sequence, and the hard attention only pays attention to a certain position of the model sequence at each moment. The soft attention will take care of all the positions each time, and the weight of each position is different. Local attention can be regarded as a mixture of hard attention and soft attention in terms of advantages. Different from hard attention, local attention is almost everywhere differentiable and easy to train. It only pays attention to a small part of the source position at a time, while the global attention Attention needs to scan all source hidden states each time.

然而,现有技术如基于内容的推荐方法,他要求内容能容易的抽取成有意义的特征,要求特征内容具有良好的结构性,且用户的偏好必须能够用内容特征形式来表达,不能显示地得到其他用户的情况,而协同过滤技术在推荐过程中具有稀疏问题和可扩展问题,推荐方法可扩展性较差,还面临冷启动等诸多问题。However, in the prior art, such as content-based recommendation methods, it requires that the content can be easily extracted into meaningful features, the feature content has a good structure, and the user's preference must be able to be expressed in the form of content features, which cannot be displayed. However, the collaborative filtering technology has sparse and scalability problems in the recommendation process, the recommendation method has poor scalability, and also faces many problems such as cold start.

发明内容SUMMARY OF THE INVENTION

本发明提供了一种基于项目的协作存储推荐方法及其推荐装置,本发明用于对网络检索信息的排名和推荐,它可以有效克服只考虑用户之间联系的局限性,能够充分考虑到项目与用户之间潜在影响的分布,强化特定项目与邻域之间的关系,考虑到项目与用户之间的潜在关系,采用一种更高阶的方法,增强推荐的可扩展性,提高推荐过程中对信息检索及结果的准确度与精确性,详见下文描述:The invention provides an item-based collaborative storage recommendation method and a recommendation device. The invention is used for ranking and recommending network retrieval information, which can effectively overcome the limitation of only considering the connection between users, and can fully consider items The distribution of potential impact with users, strengthen the relationship between specific items and neighborhoods, take into account the potential relationship between items and users, adopt a higher-order method, enhance the scalability of the recommendation, improve the recommendation process The accuracy and precision of information retrieval and results are described in the following description:

一种基于项目的协作存储推荐方法,所述方法包括:An item-based collaborative storage recommendation method, the method comprising:

对用户及项目信息分别进行嵌入处理,获得相似度;Embed the user and item information respectively to obtain similarity;

将相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示;The similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the item neighborhood to form a neighborhood representation;

通过项目信息嵌入和邻域表示设置得分函数;使用贝叶斯个性化-排名优化设置损失函数;Set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization;

根据得分函数以及损失函数实现协作存储推荐。Collaborative storage recommendation is implemented according to the score function and loss function.

进一步地,所述对用户及项目信息分别进行嵌入处理,获得相似度具体为:Further, the embedding processing is performed on the user and item information respectively, and the obtained similarity is specifically:

Figure BDA0002226415490000021
Figure BDA0002226415490000021

其中,t为具有隐式反馈的项目邻域中的特定项目,N(u)为向用户u提供隐式反馈的所有项目的集合,mu为用户存储器组件切片,ei为项目存储器切片。where t is a specific item in the item neighborhood with implicit feedback, N(u) is the set of all items that provide implicit feedback to user u , mu is the user memory component slice, and ei is the item memory slice.

其中,所述将相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示具体为:Among them, the similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the item neighborhood, and the neighborhood representation is specifically:

采用自适应加权函数建立注意力机制的学习过程,对不同影响的内容和项目赋予不同的权重,学习项目的独特贡献xiutThe learning process of the attention mechanism is established with an adaptive weighting function, and different weights are given to the content and items of different influences, and the unique contribution of the learning item xiut ;

通过使用注意力机制形成的权重,将权重与邻域进行操作,以形成邻域表示siu,注意力机制形成的权重会以不同的方式有选择的加权到项目及其邻域中,以得到最终的加权后的邻域表示。By using the weights formed by the attention mechanism, the weights are manipulated with the neighborhood to form the neighborhood representation s iu , and the weights formed by the attention mechanism will be selectively weighted to the item and its neighborhood in different ways to get The final weighted neighborhood representation.

其中,所述独特贡献xiut具体为:Wherein, the unique contribution x iut is specifically:

Figure BDA0002226415490000022
Figure BDA0002226415490000022

进一步地,所述邻域表示siu具体为:Further, the neighborhood representation s iu is specifically:

siu=∑t∈N(u)xiut×ft s iu =∑ t∈N(u) x iut ×f t

其中ft是项目的嵌入矩阵的向量,×是矩阵级别的运算。where ft is a vector of the item's embedding matrix and × is a matrix-level operation.

其中,所述通过项目信息嵌入和邻域信息设置得分函数具体为:Wherein, the setting of the scoring function through item information embedding and neighborhood information is specifically:

捕获项目与用户的邻域的局部结构以及项目与用户之间的全局交互信息,在项目与用户间建立联系;Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish the connection between the project and the user;

通过非线性反映出项目与用户之间的潜在关系,以产生关注项目影响力的排名分数。The latent relationship between items and users is reflected non-linearly to generate ranking scores that focus on item influence.

一种基于项目的协作存储推荐装置,所述装置包括:An item-based collaborative storage recommendation device, the device comprising:

相似度计算模块,用于对用户及项目信息分别进行嵌入处理,获得相似度;The similarity calculation module is used to embed the user and item information respectively to obtain the similarity;

邻域表示模块,用于将相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示;The neighborhood representation module is used to add the similarity to the attention mechanism to set the weighting function, and operate the formed weight with the item neighborhood to form the neighborhood representation;

设置模块,用于通过项目信息嵌入和邻域表示设置得分函数;使用贝叶斯个性化-排名优化设置损失函数;Set module to set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization;

推荐模块,用于根据得分函数以及损失函数实现协作存储推荐。The recommendation module is used to implement collaborative storage recommendation based on the score function and loss function.

本发明提供的技术方案的有益效果是:The beneficial effects of the technical scheme provided by the present invention are:

1、本发明通过在项目级别上考虑项目对用户产生的潜在影响,更好更全面的学习可以有效提高推荐过程中对信息获取的准确度;1. The present invention can effectively improve the accuracy of information acquisition in the recommendation process by considering the potential impact of items on users at the item level, better and more comprehensive learning;

2、本发明关注项目信息及特征,能够更加全面和详细的捕获到项目对用户的吸引力,提高推荐结果的准确性,提升推荐的可扩展性,最终实验结果表明,本方法比只关注用户历史行为与用户间联系的推荐算法结果更加准确。2. The present invention pays attention to item information and features, which can capture the attractiveness of items to users more comprehensively and in detail, improve the accuracy of recommendation results, and improve the scalability of recommendation. The final experimental results show that this method is better than only focusing on users. The results of the recommendation algorithm based on the relationship between historical behavior and users are more accurate.

附图说明Description of drawings

图1为一种基于项目的协作存储推荐方法的流程图;1 is a flowchart of a project-based collaborative storage recommendation method;

图2为一种基于项目的协作存储推荐方法的整体架构图;Fig. 2 is an overall architecture diagram of an item-based collaborative storage recommendation method;

图3为一种基于项目的协作存储推荐装置的结构示意图。FIG. 3 is a schematic structural diagram of an item-based collaborative storage recommendation device.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚,下面对本发明实施方式作进一步地详细描述。In order to make the objectives, technical solutions and advantages of the present invention clearer, the embodiments of the present invention are further described in detail below.

实施例1Example 1

为了实现上述目的,本发明实施例提出了一种基于项目的协作存储推荐方法,参见图1和图2,该方法包括以下步骤:In order to achieve the above object, an embodiment of the present invention proposes an item-based collaborative storage recommendation method, see FIG. 1 and FIG. 2 , and the method includes the following steps:

101:对用户及项目信息分别进行嵌入处理,获得相似度;101: Embed the user and item information respectively to obtain similarity;

102:将得到的相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示;102: Add the obtained similarity to the attention mechanism to set a weighting function, and operate the formed weight with the item neighborhood to form a neighborhood representation;

103:通过项目信息嵌入和邻域信息设置得分函数;103: Set a score function through item information embedding and neighborhood information;

104:使用贝叶斯个性化-排名优化设置损失函数;104: Use Bayesian personalization-rank optimization to set the loss function;

105:根据得分函数以及损失函数实验评估和验证对本发明提出的基于项目的协作存储推荐方法的效果。105: Experiment to evaluate and verify the effect of the item-based collaborative storage recommendation method proposed by the present invention according to the score function and the loss function.

在一个实施例中,步骤101对用户及项目信息分别进行嵌入处理,具体步骤如下:In one embodiment, step 101 performs embedding processing on the user and item information respectively, and the specific steps are as follows:

输入的用户和项目的信息被分别嵌入到用户和项目的存储器组件中,用户的存储器组件为M,项目的存储器组件为E,用户的偏好被存储在用户存储器组件切片mu中,项目i的特征被存储在项目存储器切片ei中,通过计算得到针对特定项目和特定用户在其给定邻域中的相似度aiutThe entered user and item information is embedded in the user and item memory components, respectively, where the user's memory component is M, the item's memory component is E, the user's preferences are stored in the user memory component slice mu , and the item i's memory component is The features are stored in the item memory slice ei, and the similarity a iut in its given neighborhood for a specific item and a specific user is calculated .

在一个实施例中,步骤102在步骤101的基础上采用注意力机制(本领域技术人员人员所公知,本发明实施例对此不做赘述)设置加权函数,使用注意力机制形成的权重与邻域进行操作,形成邻域表示,具体步骤如下:In one embodiment, step 102 adopts an attention mechanism (known to those skilled in the art, which will not be repeated in this embodiment of the present invention) on the basis of step 101 to set a weighting function, and the weights formed by using the attention mechanism are related to neighbors. The domain is operated to form a neighborhood representation. The specific steps are as follows:

采用自适应加权函数来建立注意力机制的学习过程,能够对不同影响的内容和项目赋予不同的权重,凸显具有高影响力内容的重要性,学习项目的独特贡献xiut之后,通过使用注意力机制形成的权重,将权重与邻域进行操作,以形成邻域表示siu,注意力机制形成的权重会以不同的方式有选择的加权到项目及其邻域中,以得到最终的加权后的邻域表示。Using adaptive weighting function to establish the learning process of attention mechanism, it can give different weights to content and items with different influences, highlighting the importance of high-impact content, after learning the unique contribution of items x iut , by using attention The weight formed by the mechanism operates on the weight and the neighborhood to form the neighborhood representation s iu , and the weight formed by the attention mechanism will be selectively weighted to the item and its neighborhood in different ways to obtain the final weighted The neighborhood representation of .

在一个实施例中,步骤103在步骤101和步骤102的基础上通过项目信息嵌入和邻域信息设置得分函数,具体步骤如下:In one embodiment, step 103 sets a score function through item information embedding and neighborhood information on the basis of steps 101 and 102, and the specific steps are as follows:

捕获项目与用户的邻域的局部结构以及项目与用户之间的全局交互信息,在项目与用户间建立更深层次广泛的联系,之后,通过非线性的方法,能够更加全面的反映出项目与用户之间的潜在关系,以产生关注项目影响力的排名分数

Figure BDA0002226415490000041
Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish a deeper and wider connection between the project and the user. potential relationships to yield ranking scores focusing on project impact
Figure BDA0002226415490000041

在一个实施例中,步骤104设置损失函数,优化模型,具体步骤如下:In one embodiment, step 104 sets a loss function to optimize the model, and the specific steps are as follows:

使用贝叶斯个性化排名优化设置损失函数,优化模型。Use Bayesian Personalized Rank Optimization to set up a loss function and optimize the model.

在一个实施例中,步骤105对基于项目的协作存储推荐方法进行实验,具体步骤如下:In one embodiment, step 105 conducts an experiment on an item-based collaborative storage recommendation method, and the specific steps are as follows:

对实验的命中率和归一化折损累积增益进行计算从而实现对模型效果的评估,为更好地平衡这两个指标,采用八个基线对比实验,对算法的效果进行评估及验证。The hit rate of the experiment and the cumulative gain of normalized damage are calculated to evaluate the effect of the model. In order to better balance these two indicators, eight baseline comparison experiments are used to evaluate and verify the effect of the algorithm.

实施例2Example 2

下面结合具体的计算公式、实例对实施例1中的方案进行进一步地介绍,详见下文描述:The scheme in Embodiment 1 is further introduced below in conjunction with specific calculation formulas and examples, and is described in detail below:

201:对用户及项目信息分别进行嵌入处理,输入的用户和项目的信息被分别嵌入到用户和项目的存储器组件中;201: Perform embedding processing on the user and item information respectively, and the inputted user and item information is embedded in the memory components of the user and the item respectively;

其中,用户的存储器组件为M,项目的存储器组件为E,用户的偏好被存储在用户存储器组件切片mu中,项目i的特征被存储在项目存储器切片ei中,以得到针对特定项目和特定用户在其给定邻域中的相似度aiut,如公式(1)所示。where the user's memory component is M, the item's memory component is E, the user's preferences are stored in the user memory component slice mu , and the features of item i are stored in the item memory slice ei, to obtain specific items and The similarity a iut of a particular user in its given neighborhood is shown in formula (1).

Figure BDA0002226415490000051
Figure BDA0002226415490000051

其中,公式(1)中的t为具有隐式反馈的项目邻域中的特定项目,N(u)为向用户u提供隐式反馈的所有项目的集合。where t in formula (1) is a specific item in the item neighborhood with implicit feedback, and N(u) is the set of all items that provide implicit feedback to user u.

202:采用注意力机制设置加权函数,本发明实施例采用自适应加权函数来建立注意力机制的学习过程,学习项目的独特贡献xiut,如公式(2)所示。202 : Use the attention mechanism to set the weighting function. The embodiment of the present invention uses the adaptive weighting function to establish the learning process of the attention mechanism, and the unique contribution x iut of the learning item is shown in formula (2).

Figure BDA0002226415490000052
Figure BDA0002226415490000052

其中,公式(2)中的aiut为针对特定项目和特定用户在其给定邻域中的相似度。Among them, a iut in formula (2) is the similarity between a specific item and a specific user in its given neighborhood.

203:使用注意力机制形成的权重,将权重与邻域进行操作,以形成邻域表示siu,如公式(3)所示。203: Using the weights formed by the attention mechanism, operate the weights with the neighborhood to form the neighborhood representation s iu , as shown in formula (3).

siu=∑t∈N(u)xiut×ft (3)s iu =∑ t∈N(u) x iut ×f t (3)

公式(3)中的ft是项目的另一个嵌入矩阵的向量,×是矩阵级别的运.针对项目的特点,注意力机制形成的权重会以不同的方式有选择的加权到项目及其邻域中,以得到最终的加权后的邻域。In formula (3), f t is the vector of another embedding matrix of the item, and × is the matrix-level operation. According to the characteristics of the item, the weights formed by the attention mechanism will be selectively weighted to the item and its neighbors in different ways. domain to get the final weighted neighborhood.

203:通过项目信息嵌入和邻域信息设置得分函数,具体步骤如下:203: Set a score function through item information embedding and neighborhood information, the specific steps are as follows:

捕获项目与用户的邻域的局部结构以及项目与用户之间的全局交互信息,将项目与用户建立了更深层次广泛的联系,之后,通过非线性的方法,能够更加全面的反映出项目与用户之间的潜在关系,通过线性投影U以产生关注项目影响力的排名分数

Figure BDA0002226415490000053
如公式(4)、公式(5)和公式(6)所示。Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish a deeper and wider connection between the project and the user. potential relationship between, by linearly projecting U to generate ranking scores of attention item influence
Figure BDA0002226415490000053
As shown in formula (4), formula (5) and formula (6).

Figure BDA0002226415490000054
Figure BDA0002226415490000054

Figure BDA0002226415490000055
Figure BDA0002226415490000055

Figure BDA0002226415490000056
Figure BDA0002226415490000056

其中,公式(4)(5)(6)中的

Figure BDA0002226415490000057
为存储器,为第y层的相似矩阵,v、b为要学习的参数,
Figure BDA0002226415490000059
为经过项目邻域更新后的邻域,
Figure BDA00022264154900000510
为存储器之间的非线性映射,et为用户t的独特贡献,
Figure BDA0002226415490000061
为第四层的邻域矩阵,σ(x)=1/(1+exp(-x))为非线性激活sigmoid函数,y为存储器的层。Among them, in formula (4)(5)(6)
Figure BDA0002226415490000057
for memory, is the similarity matrix of the y-th layer, v and b are the parameters to be learned,
Figure BDA0002226415490000059
is the neighborhood after project neighborhood update,
Figure BDA00022264154900000510
is the nonlinear mapping between memories, e t is the unique contribution of user t,
Figure BDA0002226415490000061
is the neighborhood matrix of the fourth layer, σ(x)=1/(1+exp(-x)) is the nonlinear activation sigmoid function, and y is the memory layer.

其中,本发明中y为4,W是一个权重矩阵,它将项目的特征映射到潜在的空间,并与前一层的信息相结合,·是元素级别的操作,在获得项目与用户的元素乘积后,以线性投影的方式对学习得到的邻域表示以及参数进行操作。采用非线性激活ReLU函数

Figure BDA0002226415490000062
Among them, in the present invention, y is 4, and W is a weight matrix, which maps the features of the item to the latent space and combines it with the information of the previous layer. It is an element-level operation. After obtaining the elements of the item and the user After multiplication, the learned neighborhood representation and parameters are manipulated by linear projection. Using nonlinear activation ReLU function
Figure BDA0002226415490000062

204:采用贝叶斯个性化排名优化设置损失函数,优化推荐效果,如公式(7)所示。204: Use Bayesian personalized ranking optimization to set a loss function to optimize the recommendation effect, as shown in formula (7).

Figure BDA0002226415490000063
Figure BDA0002226415490000063

公式(7)中的σ(x)=1/(1+exp(-x))为逻辑sigmoid函数。σ(x)=1/(1+exp(-x)) in formula (7) is a logical sigmoid function.

205:通过对命中率和归一化折损累积增益的计算,可以对本发明的方法的效果进行评价和验证。205: Through the calculation of the hit rate and the normalized impairment cumulative gain, the effect of the method of the present invention can be evaluated and verified.

实施例3Example 3

本发明实施例提供了一种基于项目的协作存储推荐装置,参见图3,该装置包括:An embodiment of the present invention provides an item-based collaborative storage recommendation device. Referring to FIG. 3 , the device includes:

相似度计算模块,用于对用户及项目信息分别进行嵌入处理,获得相似度;The similarity calculation module is used to embed the user and item information respectively to obtain the similarity;

即,获得项目嵌入矩阵E、F与用户嵌入矩阵M。That is, item embedding matrices E, F and user embedding matrix M are obtained.

邻域表示模块,用于将相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示;The neighborhood representation module is used to add the similarity to the attention mechanism to set the weighting function, and operate the formed weight with the item neighborhood to form the neighborhood representation;

设置模块,用于通过项目信息嵌入和邻域表示设置得分函数;使用贝叶斯个性化-排名优化设置损失函数;Set module to set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization;

推荐模块,用于根据得分函数以及损失函数实现协作存储推荐。The recommendation module is used to implement collaborative storage recommendation based on the score function and loss function.

实施例4Example 4

在基于项目的协作存储推荐方法及其推荐装置的实验中,通过得分函数计算出每条项目的最终得分,通过得分进行排名和推荐。In the experiment of the item-based collaborative storage recommendation method and its recommendation device, the final score of each item is calculated by the score function, and the ranking and recommendation are carried out by the score.

通过实验效果可以看出,在引入项目对用户产生的潜在影响后,通过注意力机制和损失函数的优化,基于项目的协作存储推荐方法及其推荐装置具有一定的语言学意义和优秀的效果。It can be seen from the experimental results that after introducing the potential impact of items on users, through the optimization of attention mechanism and loss function, the item-based collaborative storage recommendation method and its recommendation device have certain linguistic significance and excellent effects.

在基于项目的协作存储推荐方法及其推荐装置的实验中,通过八个基线实验进行对比试验,实验中的负样本数量设置为4,在预训练中l2的权重衰减被设为0.001,在训练中l2的权重衰减被设置为0.1,内存的嵌入大小d为50,项目信息嵌入矩阵及用户信息嵌入矩阵均为随机自动生成。In the experiments of the item-based collaborative storage recommendation method and its recommendation device, eight baseline experiments are used to conduct comparative experiments, the number of negative samples in the experiment is set to 4, the weight decay of l2 The weight decay of l2 is set to 0.1, the memory embedding size d is 50, and the item information embedding matrix and user information embedding matrix are randomly and automatically generated.

本发明使用命中率(HR)、归一化折损累积增益(NDCG)值两个评价指标对基于项目的协作存储推荐方法及其推荐装置效果进行评估,命中率计算的主要目的是评估模型得到的top-N推荐列表里的项目数目在测试集中的比重,而归一化折损累积增益计算的主要目的是评估模型产生的推荐列表的效果与理想状态下产生推荐列表效果的对比。命中率(HR)的计算公式如公式(8)所示,归一化折损累积增益(NDCG)的计算公式如公式(9)所示。The present invention evaluates the item-based collaborative storage recommendation method and the effect of the recommendation device by using two evaluation indicators: hit rate (HR) and normalized impairment cumulative gain (NDCG) value. The main purpose of the hit rate calculation is to evaluate the model to obtain The proportion of the number of items in the top-N recommendation list in the test set, and the main purpose of the normalized discount cumulative gain calculation is to evaluate the comparison between the effect of the recommendation list generated by the model and the effect of the recommendation list generated under ideal conditions. The calculation formula of the hit ratio (HR) is shown in formula (8), and the calculation formula of the normalized impairment cumulative gain (NDCG) is shown in formula (9).

Figure BDA0002226415490000072
Figure BDA0002226415490000072

公式(8)中数据值|GT|是指所有测试集,NumberofHits@K是属于每个用户的前K推荐列表的测试集数量的总和。公式(9)中DCG是平均折扣累积增益,理想DCG@K是理想条件下的最大DCG值。命中率和归一化折损累积增益均为值越大效果越好。The data value |GT| in formula (8) refers to all test sets, and NumberofHits@K is the sum of the number of test sets belonging to each user's top-K recommendation list. In formula (9), DCG is the average discounted cumulative gain, and ideal DCG@K is the maximum DCG value under ideal conditions. The larger the value of the hit rate and the normalized damage cumulative gain, the better the effect.

表1为实验评价指标表Table 1 is the experimental evaluation index table

Figure BDA0002226415490000073
Figure BDA0002226415490000073

九个基线对比实验的实验效果如表1所示,SVD++是一种结合了基于邻域的相似性和潜在因子模型的混合模型。广义矩阵分解(GMF)模型是一种非线性推广的潜在因子模型的。KNN用于计算余弦项目-项目相似度的基于邻域的方法。贝叶斯个性化排序(BPR)是隐式反馈的一种矩阵分解模型。神经矩阵分解(NeuMF)通过多层感知器模型进行项目排序的一种矩阵分解模型。协同去噪自动编码器(CDAE)是一种基于项目的深度学习模型。因子项相似性模型(FISM)能够将项目-项目对的相似性矩阵进行分解,优化损失函数的一种基于邻域的模型。CMN是一种融合存储器组件与注意力机制的模型。由实验结果可以看出,使用基于项目的协作存储推荐方法及其推荐装置的实验效果的命中率和归一化折损累积增益最高,说明该模型的有效性好。The experimental results of the nine baseline comparison experiments are shown in Table 1. SVD++ is a hybrid model that combines neighborhood-based similarity and latent factor models. The generalized matrix factorization (GMF) model is a nonlinear extension of the latent factor model. KNN's neighborhood-based method for computing cosine item-item similarity. Bayesian Personalization Ranking (BPR) is a matrix factorization model with implicit feedback. Neural Matrix Factorization (NeuMF) is a matrix factorization model for item ranking through a multilayer perceptron model. Collaborative Denoising Autoencoder (CDAE) is an item-based deep learning model. The Factor Term Similarity Model (FISM) is a neighborhood-based model that optimizes the loss function by decomposing the item-item pair similarity matrix. CMN is a model that fuses memory components with attention mechanisms. It can be seen from the experimental results that the experimental effect of the item-based collaborative storage recommendation method and its recommendation device has the highest hit rate and normalized impairment cumulative gain, indicating that the model is effective.

本领域技术人员可以理解附图只是一个优选实施例的示意图,上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。Those skilled in the art can understand that the accompanying drawing is only a schematic diagram of a preferred embodiment, and the above-mentioned serial numbers of the embodiments of the present invention are only for description, and do not represent the advantages or disadvantages of the embodiments.

以上所述仅为本发明的较佳实施例,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included in the protection of the present invention. within the range.

Claims (7)

1.一种基于项目的协作存储推荐方法,其特征在于,所述方法包括:1. A project-based collaborative storage recommendation method, wherein the method comprises: 对用户及项目信息分别进行嵌入处理,获得相似度;Embed the user and item information respectively to obtain similarity; 将相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示;The similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the item neighborhood to form a neighborhood representation; 通过项目信息嵌入和邻域表示设置得分函数;使用贝叶斯个性化-排名优化设置损失函数;Set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization; 根据得分函数以及损失函数实现协作存储推荐。Collaborative storage recommendation is implemented according to the score function and loss function. 2.根据权利要求1所述的一种基于项目的协作存储推荐方法,其特征在于,所述对用户及项目信息分别进行嵌入处理,获得相似度具体为:2. The item-based collaborative storage recommendation method according to claim 1, wherein the embedding processing is performed on the user and item information respectively, and the obtained similarity is specifically:
Figure FDA0002226415480000011
Figure FDA0002226415480000011
其中,t为具有隐式反馈的项目邻域中的特定项目,N(u)为向用户u提供隐式反馈的所有项目的集合,mu为用户存储器组件切片,ei为项目存储器切片。where t is a specific item in the item neighborhood with implicit feedback, N(u) is the set of all items that provide implicit feedback to user u , mu is the user memory component slice, and ei is the item memory slice.
3.根据权利要求2所述的一种基于项目的协作存储推荐方法,其特征在于,所述将相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示具体为:3. A project-based collaborative storage recommendation method according to claim 2, wherein the similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the project neighborhood to form a neighborhood. The domain representation is specifically: 采用自适应加权函数建立注意力机制的学习过程,对不同影响的内容和项目赋予不同的权重,学习项目的独特贡献xiutThe learning process of the attention mechanism is established with an adaptive weighting function, and different weights are given to the content and items of different influences, and the unique contribution of the learning item xiut ; 通过使用注意力机制形成的权重,将权重与邻域进行操作,以形成邻域表示siu,注意力机制形成的权重会以不同的方式有选择的加权到项目及其邻域中,以得到最终的加权后的邻域表示。By using the weights formed by the attention mechanism, the weights are manipulated with the neighborhood to form the neighborhood representation s iu , and the weights formed by the attention mechanism will be selectively weighted to the item and its neighborhood in different ways to get The final weighted neighborhood representation. 4.根据权利要求3所述的一种基于项目的协作存储推荐方法,其特征在于,所述独特贡献xiut具体为:4. a kind of project-based collaborative storage recommendation method according to claim 3, is characterized in that, described unique contribution x iut is specifically: 5.根据权利要求3所述的一种基于项目的协作存储推荐方法,其特征在于,所述邻域表示siu具体为:5. a kind of item-based collaborative storage recommendation method according to claim 3, is characterized in that, described neighborhood representation s iu is specifically: siu=∑t∈N(u)xiut×ft s iu =∑ t∈N(u) x iut ×f t 其中ft是项目的嵌入矩阵的向量,×是矩阵级别的运算。where ft is a vector of the item's embedding matrix and × is a matrix-level operation. 6.根据权利要求1所述的一种基于项目的协作存储推荐方法,其特征在于,所述通过项目信息嵌入和邻域信息设置得分函数具体为:6. a kind of item-based collaborative storage recommendation method according to claim 1, is characterized in that, described by item information embedding and neighborhood information setting score function is specifically: 捕获项目与用户的邻域的局部结构以及项目与用户之间的全局交互信息,在项目与用户间建立联系;Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish the connection between the project and the user; 通过非线性反映出项目与用户之间的潜在关系,以产生关注项目影响力的排名分数。The latent relationship between items and users is reflected non-linearly to generate ranking scores that focus on item influence. 7.一种基于项目的协作存储推荐装置,其特征在于,所述装置包括:7. An item-based collaborative storage recommendation device, wherein the device comprises: 相似度计算模块,用于对用户及项目信息分别进行嵌入处理,获得相似度;The similarity calculation module is used to embed the user and item information respectively to obtain the similarity; 邻域表示模块,用于将相似度加入注意力机制中设置加权函数,将形成的权重与项目邻域进行操作,形成邻域表示;The neighborhood representation module is used to add the similarity to the attention mechanism to set the weighting function, and operate the formed weight with the item neighborhood to form the neighborhood representation; 设置模块,用于通过项目信息嵌入和邻域表示设置得分函数;使用贝叶斯个性化-排名优化设置损失函数;Set module to set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization; 推荐模块,用于根据得分函数以及损失函数实现协作存储推荐。The recommendation module is used to implement collaborative storage recommendation based on the score function and loss function.
CN201910953264.1A 2019-10-09 2019-10-09 Item-based collaborative storage recommendation method and recommendation device Pending CN110851705A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910953264.1A CN110851705A (en) 2019-10-09 2019-10-09 Item-based collaborative storage recommendation method and recommendation device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910953264.1A CN110851705A (en) 2019-10-09 2019-10-09 Item-based collaborative storage recommendation method and recommendation device

Publications (1)

Publication Number Publication Date
CN110851705A true CN110851705A (en) 2020-02-28

Family

ID=69596311

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910953264.1A Pending CN110851705A (en) 2019-10-09 2019-10-09 Item-based collaborative storage recommendation method and recommendation device

Country Status (1)

Country Link
CN (1) CN110851705A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111597432A (en) * 2020-04-03 2020-08-28 浙江工业大学 A method for fast candidate generation and recommendation based on Hash
CN115019933A (en) * 2022-06-16 2022-09-06 浙江工业大学 Amblyopia training scheme recommendation method fusing GMF and CDAE
US11869015B1 (en) 2022-12-09 2024-01-09 Northern Trust Corporation Computing technologies for benchmarking

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105740430A (en) * 2016-01-29 2016-07-06 大连理工大学 A Personalized Recommendation Method Integrating Social Information
CN109241424A (en) * 2018-08-29 2019-01-18 陕西师范大学 A kind of recommended method
CN109670121A (en) * 2018-12-18 2019-04-23 辽宁工程技术大学 Project level and feature level depth Collaborative Filtering Recommendation Algorithm based on attention mechanism
CN110060097A (en) * 2019-04-01 2019-07-26 苏州市职业大学 User behavior sequence of recommendation method based on attention mechanism and convolutional neural networks
US20190251435A1 (en) * 2018-02-09 2019-08-15 Daniel Shiebler Matching cross domain user affinity with co-embeddings
CN110232480A (en) * 2019-03-01 2019-09-13 电子科技大学 The item recommendation method and model training method realized using the regularization stream of variation

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105740430A (en) * 2016-01-29 2016-07-06 大连理工大学 A Personalized Recommendation Method Integrating Social Information
US20190251435A1 (en) * 2018-02-09 2019-08-15 Daniel Shiebler Matching cross domain user affinity with co-embeddings
CN109241424A (en) * 2018-08-29 2019-01-18 陕西师范大学 A kind of recommended method
CN109670121A (en) * 2018-12-18 2019-04-23 辽宁工程技术大学 Project level and feature level depth Collaborative Filtering Recommendation Algorithm based on attention mechanism
CN110232480A (en) * 2019-03-01 2019-09-13 电子科技大学 The item recommendation method and model training method realized using the regularization stream of variation
CN110060097A (en) * 2019-04-01 2019-07-26 苏州市职业大学 User behavior sequence of recommendation method based on attention mechanism and convolutional neural networks

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MEI YU, YUE HU等: "Paper Recommendation with Item-Level Collaborative Memory Network", 《IN PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT》 *
刘剑波等: "基于SVD++与行为分析的社交推荐", 《计算机应用》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111597432A (en) * 2020-04-03 2020-08-28 浙江工业大学 A method for fast candidate generation and recommendation based on Hash
CN115019933A (en) * 2022-06-16 2022-09-06 浙江工业大学 Amblyopia training scheme recommendation method fusing GMF and CDAE
US11869015B1 (en) 2022-12-09 2024-01-09 Northern Trust Corporation Computing technologies for benchmarking

Similar Documents

Publication Publication Date Title
Frolov et al. Tensor methods and recommender systems
US10489688B2 (en) Personalized digital image aesthetics in a digital medium environment
US8965891B1 (en) Training scoring models optimized for highly-ranked results
US20190362220A1 (en) Attentive neural collaborative filtering for modeling implicit feedback
US9589277B2 (en) Search service advertisement selection
CN108509573B (en) Book recommendation method and system based on matrix factorization collaborative filtering algorithm
CN107273438A (en) A kind of recommendation method, device, equipment and storage medium
Li et al. Efficient optimization of performance measures by classifier adaptation
CN109960749B (en) Model acquisition method, keyword generation method, apparatus, medium and computing device
CN111695024B (en) Method and system for predicting object evaluation value, and method and system for recommending object evaluation value
CN110851705A (en) Item-based collaborative storage recommendation method and recommendation device
CN114358657B (en) A method and device for job recommendation based on model fusion
CN116629983B (en) Cross-domain commodity recommendation method and system based on user preference
JP2017199355A (en) Recommendation generation
Bhattacharya et al. Intent-aware contextual recommendation system
WO2020220757A1 (en) Method and device for pushing object to user based on reinforcement learning model
CN110321490B (en) Recommendation method, device, equipment and computer readable storage medium
CN114493822A (en) A pricing method and system for user default prediction based on transfer learning
US20240004888A1 (en) Sorting method, apparatus and device, and computer storage medium
CN118013120B (en) Method, medium and equipment for optimizing products recommended to users based on cluster labels
Meena et al. A genetic algorithm approach for group recommender system based on partial rankings
Xie et al. Interactive resource recommendation with optimization by tag association and significance analysis
CN116957102A (en) Method and computing device for model training
CN109872006A (en) A scoring distribution prediction method and device
CN114298118B (en) Data processing method based on deep learning, related equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200228