CN110851705A

CN110851705A - Item-based collaborative storage recommendation method and recommendation device

Info

Publication number: CN110851705A
Application number: CN201910953264.1A
Authority: CN
Inventors: 喻梅; 胡悦; 李雪威; 于瑞国; 赵满坤; 徐天一; 许林英; 刘宏伟
Original assignee: Tianjin University
Current assignee: Tianjin University
Priority date: 2019-10-09
Filing date: 2019-10-09
Publication date: 2020-02-28

Abstract

The invention discloses a collaborative storage recommendation method and device based on projects, wherein the method comprises the following steps: embedding the user information and the project information respectively to obtain similarity; adding the similarity into an attention mechanism to set a weighting function, and operating the formed weight and the project neighborhood to form neighborhood representation; setting a score function through project information embedding and neighborhood representation; setting a loss function using bayesian personalization-ranking optimization; and realizing the collaborative storage recommendation according to the score function and the loss function. The device comprises: the device comprises a similarity calculation module, a neighborhood representation module, a setting module and a recommendation module. The method effectively overcomes the limitation that only the connection between users is considered, can fully consider the distribution of potential influences between the items and the users, strengthens the relation between a specific item and the neighborhood, considers the potential relation between the items and the users, adopts a higher-order method, enhances the expandability of recommendation, and improves the accuracy and precision of information retrieval and results.

Description

Item-based collaborative storage recommendation method and recommendation device

技术领域technical field

本发明涉及自然语言处理和信息检索领域，尤其涉及一种基于项目的协作存储推荐方法及其推荐装置。The present invention relates to the fields of natural language processing and information retrieval, in particular to an item-based collaborative storage recommendation method and a recommendation device thereof.

背景技术Background technique

推荐系统根据用户的兴趣特点与历史行为为用户推荐其所感兴趣的信息和商品，浏览大量无关的信息和产品过程会使用户淹没在信息过载的问题中，个性化推荐系统是建立在海量数据挖掘基础上的一种高级智能平台，以为用户提供完全个性化的决策支持和信息服务，为提高推荐的准确性与效果，多种推荐方法应运而生。基于内容的推荐方法是信息过滤技术的延续与发展，它是建立在项目的内容信息上作出推荐的，而不需要依据用户对项目的评价意见，更多地需要用机器学习的方法从关于内容的特征描述的事例中得到用户的兴趣资料，但其要求特征内容有良好的结构性。基于协同过滤的推荐技术是推荐系统中应用最早和最为成功的技术之一，它一般采用最近邻技术，利用用户的历史喜好信息计算用户之间的距离，然后利用目标用户的最近邻居用户对商品评价的加权评价值来预测目标用户对特定商品的喜好程度，系统从而根据这一喜好程度来对目标用户进行推荐。The recommendation system recommends the information and products of interest to the user according to the user's interest characteristics and historical behavior. The process of browsing a large amount of irrelevant information and products will make the user drowned in the problem of information overload. The personalized recommendation system is based on massive data mining. Based on an advanced intelligent platform, it provides users with fully personalized decision support and information services. In order to improve the accuracy and effect of recommendation, a variety of recommendation methods have emerged. The content-based recommendation method is the continuation and development of information filtering technology. It is based on the content information of the item to make recommendations, and does not need to be based on the user's evaluation of the item. The user's interest information is obtained in the case of the feature description, but it requires the feature content to have a good structure. The recommendation technology based on collaborative filtering is one of the earliest and most successful technologies in the recommendation system. It generally adopts the nearest neighbor technology, uses the user's historical preference information to calculate the distance between users, and then uses the target user's nearest neighbor user to evaluate the product. The weighted evaluation value of the evaluation is used to predict the preference degree of the target user for a specific commodity, and the system recommends the target user according to the preference degree.

在推荐中，利用注意力机制能投对序列学习任务带来巨大的提升作用，注意力模型可以对数据序列进行数据加权的变换，硬注意力在每个时刻只关注模型序列的某一个位置，而软注意力每次会照顾到全部的位置，每个位置的权重不同。局部注意力可视为硬注意力和软注意力在优势上的混合，不同于硬注意力，局部注意力几乎处处可微，易于训练，每次只关注一小部分的源点位置，而全局注意力每次需要扫描全部的源隐藏状态。In the recommendation, the use of the attention mechanism can greatly improve the sequence learning task. The attention model can perform data-weighted transformation on the data sequence, and the hard attention only pays attention to a certain position of the model sequence at each moment. The soft attention will take care of all the positions each time, and the weight of each position is different. Local attention can be regarded as a mixture of hard attention and soft attention in terms of advantages. Different from hard attention, local attention is almost everywhere differentiable and easy to train. It only pays attention to a small part of the source position at a time, while the global attention Attention needs to scan all source hidden states each time.

然而，现有技术如基于内容的推荐方法，他要求内容能容易的抽取成有意义的特征，要求特征内容具有良好的结构性，且用户的偏好必须能够用内容特征形式来表达，不能显示地得到其他用户的情况，而协同过滤技术在推荐过程中具有稀疏问题和可扩展问题，推荐方法可扩展性较差，还面临冷启动等诸多问题。However, in the prior art, such as content-based recommendation methods, it requires that the content can be easily extracted into meaningful features, the feature content has a good structure, and the user's preference must be able to be expressed in the form of content features, which cannot be displayed. However, the collaborative filtering technology has sparse and scalability problems in the recommendation process, the recommendation method has poor scalability, and also faces many problems such as cold start.

发明内容SUMMARY OF THE INVENTION

本发明提供了一种基于项目的协作存储推荐方法及其推荐装置，本发明用于对网络检索信息的排名和推荐，它可以有效克服只考虑用户之间联系的局限性，能够充分考虑到项目与用户之间潜在影响的分布，强化特定项目与邻域之间的关系，考虑到项目与用户之间的潜在关系，采用一种更高阶的方法，增强推荐的可扩展性，提高推荐过程中对信息检索及结果的准确度与精确性，详见下文描述：The invention provides an item-based collaborative storage recommendation method and a recommendation device. The invention is used for ranking and recommending network retrieval information, which can effectively overcome the limitation of only considering the connection between users, and can fully consider items The distribution of potential impact with users, strengthen the relationship between specific items and neighborhoods, take into account the potential relationship between items and users, adopt a higher-order method, enhance the scalability of the recommendation, improve the recommendation process The accuracy and precision of information retrieval and results are described in the following description:

一种基于项目的协作存储推荐方法，所述方法包括：An item-based collaborative storage recommendation method, the method comprising:

对用户及项目信息分别进行嵌入处理，获得相似度；Embed the user and item information respectively to obtain similarity;

将相似度加入注意力机制中设置加权函数，将形成的权重与项目邻域进行操作，形成邻域表示；The similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the item neighborhood to form a neighborhood representation;

通过项目信息嵌入和邻域表示设置得分函数；使用贝叶斯个性化-排名优化设置损失函数；Set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization;

根据得分函数以及损失函数实现协作存储推荐。Collaborative storage recommendation is implemented according to the score function and loss function.

进一步地，所述对用户及项目信息分别进行嵌入处理，获得相似度具体为：Further, the embedding processing is performed on the user and item information respectively, and the obtained similarity is specifically:

其中，t为具有隐式反馈的项目邻域中的特定项目，N(u)为向用户u提供隐式反馈的所有项目的集合，m_u为用户存储器组件切片，e_i为项目存储器切片。where t is a specific item in the item neighborhood with implicit feedback, N(u) is the set of all items that provide implicit feedback to user _u , _mu is the user memory component slice, and ei is the item memory slice.

其中，所述将相似度加入注意力机制中设置加权函数，将形成的权重与项目邻域进行操作，形成邻域表示具体为：Among them, the similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the item neighborhood, and the neighborhood representation is specifically:

采用自适应加权函数建立注意力机制的学习过程，对不同影响的内容和项目赋予不同的权重，学习项目的独特贡献x_iut；The learning process of the attention mechanism is established with an adaptive weighting function, and different weights are given to the content and items of different influences, and the unique contribution of the learning item _xiut ;

通过使用注意力机制形成的权重，将权重与邻域进行操作，以形成邻域表示s_iu，注意力机制形成的权重会以不同的方式有选择的加权到项目及其邻域中，以得到最终的加权后的邻域表示。By using the weights formed by the attention mechanism, the weights are manipulated with the neighborhood to form the neighborhood representation s _iu , and the weights formed by the attention mechanism will be selectively weighted to the item and its neighborhood in different ways to get The final weighted neighborhood representation.

其中，所述独特贡献x_iut具体为：Wherein, the unique contribution x _iut is specifically:

进一步地，所述邻域表示s_iu具体为：Further, the neighborhood representation s _iu is specifically:

s_iu＝∑_t∈N(u)x_iut×f_t s _iu =∑ _t∈N(u) x _iut ×f _t

其中f_t是项目的嵌入矩阵的向量，×是矩阵级别的运算。where _ft is a vector of the item's embedding matrix and × is a matrix-level operation.

其中，所述通过项目信息嵌入和邻域信息设置得分函数具体为：Wherein, the setting of the scoring function through item information embedding and neighborhood information is specifically:

捕获项目与用户的邻域的局部结构以及项目与用户之间的全局交互信息，在项目与用户间建立联系；Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish the connection between the project and the user;

通过非线性反映出项目与用户之间的潜在关系，以产生关注项目影响力的排名分数。The latent relationship between items and users is reflected non-linearly to generate ranking scores that focus on item influence.

一种基于项目的协作存储推荐装置，所述装置包括：An item-based collaborative storage recommendation device, the device comprising:

相似度计算模块，用于对用户及项目信息分别进行嵌入处理，获得相似度；The similarity calculation module is used to embed the user and item information respectively to obtain the similarity;

邻域表示模块，用于将相似度加入注意力机制中设置加权函数，将形成的权重与项目邻域进行操作，形成邻域表示；The neighborhood representation module is used to add the similarity to the attention mechanism to set the weighting function, and operate the formed weight with the item neighborhood to form the neighborhood representation;

设置模块，用于通过项目信息嵌入和邻域表示设置得分函数；使用贝叶斯个性化-排名优化设置损失函数；Set module to set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization;

推荐模块，用于根据得分函数以及损失函数实现协作存储推荐。The recommendation module is used to implement collaborative storage recommendation based on the score function and loss function.

本发明提供的技术方案的有益效果是：The beneficial effects of the technical scheme provided by the present invention are:

1、本发明通过在项目级别上考虑项目对用户产生的潜在影响，更好更全面的学习可以有效提高推荐过程中对信息获取的准确度；1. The present invention can effectively improve the accuracy of information acquisition in the recommendation process by considering the potential impact of items on users at the item level, better and more comprehensive learning;

2、本发明关注项目信息及特征，能够更加全面和详细的捕获到项目对用户的吸引力，提高推荐结果的准确性，提升推荐的可扩展性，最终实验结果表明，本方法比只关注用户历史行为与用户间联系的推荐算法结果更加准确。2. The present invention pays attention to item information and features, which can capture the attractiveness of items to users more comprehensively and in detail, improve the accuracy of recommendation results, and improve the scalability of recommendation. The final experimental results show that this method is better than only focusing on users. The results of the recommendation algorithm based on the relationship between historical behavior and users are more accurate.

附图说明Description of drawings

图1为一种基于项目的协作存储推荐方法的流程图；1 is a flowchart of a project-based collaborative storage recommendation method;

图2为一种基于项目的协作存储推荐方法的整体架构图；Fig. 2 is an overall architecture diagram of an item-based collaborative storage recommendation method;

图3为一种基于项目的协作存储推荐装置的结构示意图。FIG. 3 is a schematic structural diagram of an item-based collaborative storage recommendation device.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚，下面对本发明实施方式作进一步地详细描述。In order to make the objectives, technical solutions and advantages of the present invention clearer, the embodiments of the present invention are further described in detail below.

实施例1Example 1

为了实现上述目的，本发明实施例提出了一种基于项目的协作存储推荐方法，参见图1和图2，该方法包括以下步骤：In order to achieve the above object, an embodiment of the present invention proposes an item-based collaborative storage recommendation method, see FIG. 1 and FIG. 2 , and the method includes the following steps:

101：对用户及项目信息分别进行嵌入处理，获得相似度；101: Embed the user and item information respectively to obtain similarity;

102：将得到的相似度加入注意力机制中设置加权函数，将形成的权重与项目邻域进行操作，形成邻域表示；102: Add the obtained similarity to the attention mechanism to set a weighting function, and operate the formed weight with the item neighborhood to form a neighborhood representation;

103：通过项目信息嵌入和邻域信息设置得分函数；103: Set a score function through item information embedding and neighborhood information;

104：使用贝叶斯个性化-排名优化设置损失函数；104: Use Bayesian personalization-rank optimization to set the loss function;

105：根据得分函数以及损失函数实验评估和验证对本发明提出的基于项目的协作存储推荐方法的效果。105: Experiment to evaluate and verify the effect of the item-based collaborative storage recommendation method proposed by the present invention according to the score function and the loss function.

在一个实施例中，步骤101对用户及项目信息分别进行嵌入处理，具体步骤如下：In one embodiment, step 101 performs embedding processing on the user and item information respectively, and the specific steps are as follows:

输入的用户和项目的信息被分别嵌入到用户和项目的存储器组件中，用户的存储器组件为M，项目的存储器组件为E，用户的偏好被存储在用户存储器组件切片m_u中，项目i的特征被存储在项目存储器切片e_i中，通过计算得到针对特定项目和特定用户在其给定邻域中的相似度a_iut。The entered user and item information is embedded in the user and item memory components, respectively, where the user's memory component is M, the item's memory component is E, the user's preferences are stored in the user memory component slice _mu , and the item i's memory component is The features are stored in the item memory slice ei, and the similarity a _iut in its given neighborhood for a specific item and a specific user is _calculated .

在一个实施例中，步骤102在步骤101的基础上采用注意力机制(本领域技术人员人员所公知，本发明实施例对此不做赘述)设置加权函数，使用注意力机制形成的权重与邻域进行操作，形成邻域表示，具体步骤如下：In one embodiment, step 102 adopts an attention mechanism (known to those skilled in the art, which will not be repeated in this embodiment of the present invention) on the basis of step 101 to set a weighting function, and the weights formed by using the attention mechanism are related to neighbors. The domain is operated to form a neighborhood representation. The specific steps are as follows:

采用自适应加权函数来建立注意力机制的学习过程，能够对不同影响的内容和项目赋予不同的权重，凸显具有高影响力内容的重要性，学习项目的独特贡献x_iut之后，通过使用注意力机制形成的权重，将权重与邻域进行操作，以形成邻域表示s_iu，注意力机制形成的权重会以不同的方式有选择的加权到项目及其邻域中，以得到最终的加权后的邻域表示。Using adaptive weighting function to establish the learning process of attention mechanism, it can give different weights to content and items with different influences, highlighting the importance of high-impact content, after learning the unique contribution of items x _iut , by using attention The weight formed by the mechanism operates on the weight and the neighborhood to form the neighborhood representation s _iu , and the weight formed by the attention mechanism will be selectively weighted to the item and its neighborhood in different ways to obtain the final weighted The neighborhood representation of .

在一个实施例中，步骤103在步骤101和步骤102的基础上通过项目信息嵌入和邻域信息设置得分函数，具体步骤如下：In one embodiment, step 103 sets a score function through item information embedding and neighborhood information on the basis of steps 101 and 102, and the specific steps are as follows:

捕获项目与用户的邻域的局部结构以及项目与用户之间的全局交互信息，在项目与用户间建立更深层次广泛的联系，之后，通过非线性的方法，能够更加全面的反映出项目与用户之间的潜在关系，以产生关注项目影响力的排名分数

Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish a deeper and wider connection between the project and the user. potential relationships to yield ranking scores focusing on project impact

在一个实施例中，步骤104设置损失函数，优化模型，具体步骤如下：In one embodiment, step 104 sets a loss function to optimize the model, and the specific steps are as follows:

使用贝叶斯个性化排名优化设置损失函数，优化模型。Use Bayesian Personalized Rank Optimization to set up a loss function and optimize the model.

在一个实施例中，步骤105对基于项目的协作存储推荐方法进行实验，具体步骤如下：In one embodiment, step 105 conducts an experiment on an item-based collaborative storage recommendation method, and the specific steps are as follows:

对实验的命中率和归一化折损累积增益进行计算从而实现对模型效果的评估，为更好地平衡这两个指标，采用八个基线对比实验，对算法的效果进行评估及验证。The hit rate of the experiment and the cumulative gain of normalized damage are calculated to evaluate the effect of the model. In order to better balance these two indicators, eight baseline comparison experiments are used to evaluate and verify the effect of the algorithm.

实施例2Example 2

下面结合具体的计算公式、实例对实施例1中的方案进行进一步地介绍，详见下文描述：The scheme in Embodiment 1 is further introduced below in conjunction with specific calculation formulas and examples, and is described in detail below:

201：对用户及项目信息分别进行嵌入处理，输入的用户和项目的信息被分别嵌入到用户和项目的存储器组件中；201: Perform embedding processing on the user and item information respectively, and the inputted user and item information is embedded in the memory components of the user and the item respectively;

其中，用户的存储器组件为M，项目的存储器组件为E，用户的偏好被存储在用户存储器组件切片m_u中，项目i的特征被存储在项目存储器切片e_i中，以得到针对特定项目和特定用户在其给定邻域中的相似度a_iut，如公式(1)所示。where the user's memory component is M, the item's memory component is E, the user's preferences are stored in the user memory component slice _mu , and the features of item _i are stored in the item memory slice ei, to obtain specific items and The similarity a _iut of a particular user in its given neighborhood is shown in formula (1).

其中，公式(1)中的t为具有隐式反馈的项目邻域中的特定项目，N(u)为向用户u提供隐式反馈的所有项目的集合。where t in formula (1) is a specific item in the item neighborhood with implicit feedback, and N(u) is the set of all items that provide implicit feedback to user u.

202：采用注意力机制设置加权函数，本发明实施例采用自适应加权函数来建立注意力机制的学习过程，学习项目的独特贡献x_iut，如公式(2)所示。202 : Use the attention mechanism to set the weighting function. The embodiment of the present invention uses the adaptive weighting function to establish the learning process of the attention mechanism, and the unique contribution x _iut of the learning item is shown in formula (2).

其中，公式(2)中的a_iut为针对特定项目和特定用户在其给定邻域中的相似度。Among them, a _iut in formula (2) is the similarity between a specific item and a specific user in its given neighborhood.

203：使用注意力机制形成的权重，将权重与邻域进行操作，以形成邻域表示s_iu，如公式(3)所示。203: Using the weights formed by the attention mechanism, operate the weights with the neighborhood to form the neighborhood representation s _iu , as shown in formula (3).

s_iu＝∑_t∈N(u)x_iut×f_t (3)s _iu =∑ _t∈N(u) x _iut ×f _t (3)

公式(3)中的f_t是项目的另一个嵌入矩阵的向量，×是矩阵级别的运.针对项目的特点，注意力机制形成的权重会以不同的方式有选择的加权到项目及其邻域中，以得到最终的加权后的邻域。In formula (3), f _t is the vector of another embedding matrix of the item, and × is the matrix-level operation. According to the characteristics of the item, the weights formed by the attention mechanism will be selectively weighted to the item and its neighbors in different ways. domain to get the final weighted neighborhood.

203：通过项目信息嵌入和邻域信息设置得分函数，具体步骤如下：203: Set a score function through item information embedding and neighborhood information, the specific steps are as follows:

捕获项目与用户的邻域的局部结构以及项目与用户之间的全局交互信息，将项目与用户建立了更深层次广泛的联系，之后，通过非线性的方法，能够更加全面的反映出项目与用户之间的潜在关系，通过线性投影U以产生关注项目影响力的排名分数

如公式(4)、公式(5)和公式(6)所示。Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish a deeper and wider connection between the project and the user. potential relationship between, by linearly projecting U to generate ranking scores of attention item influence

As shown in formula (4), formula (5) and formula (6).

其中，公式(4)(5)(6)中的

为存储器，为第y层的相似矩阵，v、b为要学习的参数，

为经过项目邻域更新后的邻域，

为存储器之间的非线性映射，e_t为用户t的独特贡献，

为第四层的邻域矩阵，σ(x)＝1/(1+exp(-x))为非线性激活sigmoid函数，y为存储器的层。Among them, in formula (4)(5)(6)

for memory, is the similarity matrix of the y-th layer, v and b are the parameters to be learned,

is the neighborhood after project neighborhood update,

is the nonlinear mapping between memories, e _t is the unique contribution of user t,

is the neighborhood matrix of the fourth layer, σ(x)=1/(1+exp(-x)) is the nonlinear activation sigmoid function, and y is the memory layer.

其中，本发明中y为4，W是一个权重矩阵，它将项目的特征映射到潜在的空间，并与前一层的信息相结合，·是元素级别的操作，在获得项目与用户的元素乘积后，以线性投影的方式对学习得到的邻域表示以及参数进行操作。采用非线性激活ReLU函数

Among them, in the present invention, y is 4, and W is a weight matrix, which maps the features of the item to the latent space and combines it with the information of the previous layer. It is an element-level operation. After obtaining the elements of the item and the user After multiplication, the learned neighborhood representation and parameters are manipulated by linear projection. Using nonlinear activation ReLU function

204：采用贝叶斯个性化排名优化设置损失函数，优化推荐效果，如公式(7)所示。204: Use Bayesian personalized ranking optimization to set a loss function to optimize the recommendation effect, as shown in formula (7).

公式(7)中的σ(x)＝1/(1+exp(-x))为逻辑sigmoid函数。σ(x)=1/(1+exp(-x)) in formula (7) is a logical sigmoid function.

205：通过对命中率和归一化折损累积增益的计算，可以对本发明的方法的效果进行评价和验证。205: Through the calculation of the hit rate and the normalized impairment cumulative gain, the effect of the method of the present invention can be evaluated and verified.

实施例3Example 3

本发明实施例提供了一种基于项目的协作存储推荐装置，参见图3，该装置包括：An embodiment of the present invention provides an item-based collaborative storage recommendation device. Referring to FIG. 3 , the device includes:

即，获得项目嵌入矩阵E、F与用户嵌入矩阵M。That is, item embedding matrices E, F and user embedding matrix M are obtained.

实施例4Example 4

在基于项目的协作存储推荐方法及其推荐装置的实验中，通过得分函数计算出每条项目的最终得分，通过得分进行排名和推荐。In the experiment of the item-based collaborative storage recommendation method and its recommendation device, the final score of each item is calculated by the score function, and the ranking and recommendation are carried out by the score.

通过实验效果可以看出，在引入项目对用户产生的潜在影响后，通过注意力机制和损失函数的优化，基于项目的协作存储推荐方法及其推荐装置具有一定的语言学意义和优秀的效果。It can be seen from the experimental results that after introducing the potential impact of items on users, through the optimization of attention mechanism and loss function, the item-based collaborative storage recommendation method and its recommendation device have certain linguistic significance and excellent effects.

在基于项目的协作存储推荐方法及其推荐装置的实验中，通过八个基线实验进行对比试验，实验中的负样本数量设置为4，在预训练中l2的权重衰减被设为0.001，在训练中l2的权重衰减被设置为0.1，内存的嵌入大小d为50，项目信息嵌入矩阵及用户信息嵌入矩阵均为随机自动生成。In the experiments of the item-based collaborative storage recommendation method and its recommendation device, eight baseline experiments are used to conduct comparative experiments, the number of negative samples in the experiment is set to 4, the weight decay of l2 The weight decay of l2 is set to 0.1, the memory embedding size d is 50, and the item information embedding matrix and user information embedding matrix are randomly and automatically generated.

本发明使用命中率(HR)、归一化折损累积增益(NDCG)值两个评价指标对基于项目的协作存储推荐方法及其推荐装置效果进行评估，命中率计算的主要目的是评估模型得到的top-N推荐列表里的项目数目在测试集中的比重，而归一化折损累积增益计算的主要目的是评估模型产生的推荐列表的效果与理想状态下产生推荐列表效果的对比。命中率(HR)的计算公式如公式(8)所示，归一化折损累积增益(NDCG)的计算公式如公式(9)所示。The present invention evaluates the item-based collaborative storage recommendation method and the effect of the recommendation device by using two evaluation indicators: hit rate (HR) and normalized impairment cumulative gain (NDCG) value. The main purpose of the hit rate calculation is to evaluate the model to obtain The proportion of the number of items in the top-N recommendation list in the test set, and the main purpose of the normalized discount cumulative gain calculation is to evaluate the comparison between the effect of the recommendation list generated by the model and the effect of the recommendation list generated under ideal conditions. The calculation formula of the hit ratio (HR) is shown in formula (8), and the calculation formula of the normalized impairment cumulative gain (NDCG) is shown in formula (9).

公式(8)中数据值|GT|是指所有测试集，NumberofHits@K是属于每个用户的前K推荐列表的测试集数量的总和。公式(9)中DCG是平均折扣累积增益，理想DCG@K是理想条件下的最大DCG值。命中率和归一化折损累积增益均为值越大效果越好。The data value |GT| in formula (8) refers to all test sets, and NumberofHits@K is the sum of the number of test sets belonging to each user's top-K recommendation list. In formula (9), DCG is the average discounted cumulative gain, and ideal DCG@K is the maximum DCG value under ideal conditions. The larger the value of the hit rate and the normalized damage cumulative gain, the better the effect.

表1为实验评价指标表Table 1 is the experimental evaluation index table

九个基线对比实验的实验效果如表1所示，SVD++是一种结合了基于邻域的相似性和潜在因子模型的混合模型。广义矩阵分解(GMF)模型是一种非线性推广的潜在因子模型的。KNN用于计算余弦项目-项目相似度的基于邻域的方法。贝叶斯个性化排序(BPR)是隐式反馈的一种矩阵分解模型。神经矩阵分解(NeuMF)通过多层感知器模型进行项目排序的一种矩阵分解模型。协同去噪自动编码器(CDAE)是一种基于项目的深度学习模型。因子项相似性模型(FISM)能够将项目-项目对的相似性矩阵进行分解，优化损失函数的一种基于邻域的模型。CMN是一种融合存储器组件与注意力机制的模型。由实验结果可以看出，使用基于项目的协作存储推荐方法及其推荐装置的实验效果的命中率和归一化折损累积增益最高，说明该模型的有效性好。The experimental results of the nine baseline comparison experiments are shown in Table 1. SVD++ is a hybrid model that combines neighborhood-based similarity and latent factor models. The generalized matrix factorization (GMF) model is a nonlinear extension of the latent factor model. KNN's neighborhood-based method for computing cosine item-item similarity. Bayesian Personalization Ranking (BPR) is a matrix factorization model with implicit feedback. Neural Matrix Factorization (NeuMF) is a matrix factorization model for item ranking through a multilayer perceptron model. Collaborative Denoising Autoencoder (CDAE) is an item-based deep learning model. The Factor Term Similarity Model (FISM) is a neighborhood-based model that optimizes the loss function by decomposing the item-item pair similarity matrix. CMN is a model that fuses memory components with attention mechanisms. It can be seen from the experimental results that the experimental effect of the item-based collaborative storage recommendation method and its recommendation device has the highest hit rate and normalized impairment cumulative gain, indicating that the model is effective.

本领域技术人员可以理解附图只是一个优选实施例的示意图，上述本发明实施例序号仅仅为了描述，不代表实施例的优劣。Those skilled in the art can understand that the accompanying drawing is only a schematic diagram of a preferred embodiment, and the above-mentioned serial numbers of the embodiments of the present invention are only for description, and do not represent the advantages or disadvantages of the embodiments.

以上所述仅为本发明的较佳实施例，并不用以限制本发明，凡在本发明的精神和原则之内，所作的任何修改、等同替换、改进等，均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included in the protection of the present invention. within the range.

Claims

1. A project-based collaborative storage recommendation method, wherein the method comprises:

Embed the user and item information respectively to obtain similarity;

The similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the item neighborhood to form a neighborhood representation;

Set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization;

Collaborative storage recommendation is implemented according to the score function and loss function.

2. The item-based collaborative storage recommendation method according to claim 1, wherein the embedding processing is performed on the user and item information respectively, and the obtained similarity is specifically:

where t is a specific item in the item neighborhood with implicit feedback, N(u) is the set of all items that provide implicit feedback to user _u , _mu is the user memory component slice, and ei is the item memory slice.

3. A project-based collaborative storage recommendation method according to claim 2, wherein the similarity is added to the attention mechanism to set a weighting function, and the formed weight is operated with the project neighborhood to form a neighborhood. The domain representation is specifically:

The learning process of the attention mechanism is established with an adaptive weighting function, and different weights are given to the content and items of different influences, and the unique contribution of the learning item _xiut ;

By using the weights formed by the attention mechanism, the weights are manipulated with the neighborhood to form the neighborhood representation s _iu , and the weights formed by the attention mechanism will be selectively weighted to the item and its neighborhood in different ways to get The final weighted neighborhood representation.

4. a kind of project-based collaborative storage recommendation method according to claim 3, is characterized in that, described unique contribution x _iut is specifically:

5. a kind of item-based collaborative storage recommendation method according to claim 3, is characterized in that, described neighborhood representation s _iu is specifically:

s _iu =∑ _t∈N(u) x _iut ×f _t

where _ft is a vector of the item's embedding matrix and × is a matrix-level operation.

6. a kind of item-based collaborative storage recommendation method according to claim 1, is characterized in that, described by item information embedding and neighborhood information setting score function is specifically:

Capture the local structure of the neighborhood of the project and the user and the global interaction information between the project and the user, and establish the connection between the project and the user;

The latent relationship between items and users is reflected non-linearly to generate ranking scores that focus on item influence.

7. An item-based collaborative storage recommendation device, wherein the device comprises:

The similarity calculation module is used to embed the user and item information respectively to obtain the similarity;

The neighborhood representation module is used to add the similarity to the attention mechanism to set the weighting function, and operate the formed weight with the item neighborhood to form the neighborhood representation;

Set module to set score function through item information embedding and neighborhood representation; set loss function using Bayesian personalization-rank optimization;

The recommendation module is used to implement collaborative storage recommendation based on the score function and loss function.