WO2023098098A1 - Tag-aware recommendation method based on attention mechanism and hypergraph convolution - Google Patents

Tag-aware recommendation method based on attention mechanism and hypergraph convolution Download PDF

Info

Publication number
WO2023098098A1
WO2023098098A1 PCT/CN2022/106913 CN2022106913W WO2023098098A1 WO 2023098098 A1 WO2023098098 A1 WO 2023098098A1 CN 2022106913 W CN2022106913 W CN 2022106913W WO 2023098098 A1 WO2023098098 A1 WO 2023098098A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
item
hypergraph
tag
users
Prior art date
Application number
PCT/CN2022/106913
Other languages
French (fr)
Chinese (zh)
Inventor
王海艳
尤恺翔
骆健
Original Assignee
南京邮电大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京邮电大学 filed Critical 南京邮电大学
Publication of WO2023098098A1 publication Critical patent/WO2023098098A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the invention belongs to the application fields of information service and computer software technology, and in particular relates to a label-aware recommendation method based on attention mechanism and hypergraph convolution.
  • the present invention introduces a hypergraph to model the high-order relationship between nodes, and utilizes the hypergraph volume
  • the transfer of information in the neural network is completed actively, and the attention mechanism is used to reflect the importance of different information, so as to effectively alleviate the problem of high-level information loss in traditional recommendation methods.
  • the present invention is a label-aware recommendation method based on attention mechanism and hypergraph convolution.
  • the label-aware recommendation method is constructed on the user side and the project side through the direct interaction relationship between users and items and the brief interaction relationship between users, items, and labels.
  • the information of high-order relationship responses is extracted through hypergraph convolution, and the attention mechanism is used to distinguish information of different importance levels, and recommendations are made through the obtained feature representations.
  • the specific steps are as follows:
  • Step 1 Initialize the feature representation of users and items to get u, i;
  • Step 2 Obtain the user feature u tag and the item feature i tag represented by the tag according to the interaction relationship between the user and the item and the tag respectively;
  • Step 3 Construct three interactive bipartite graphs G user- item , G user- tag , and G item-tag according to the interaction relationship between users and items, users and tags , and items and tags;
  • Step 4 According to the bipartite graph of users and items obtained in step 3, respectively construct the hypergraph structure of the user side and the item side represented by the direct interaction relationship
  • Step 5 Based on the bipartite graphs of users and tags and items and tags obtained in step 3, respectively construct the hypergraph structure of the user side and item side represented by the tag relationship
  • Step 6 Use the two hypergraphs obtained in step 4 for the features u and i obtained in step 1 Perform hypergraph convolution respectively to obtain neighborhood feature representations u 1 and i 1 of the updated direct relationship representations on the user side and item side;
  • Step 7 For the features u and i obtained in step 1, the two hypergraphs obtained in step 5 Carry out hypergraph convolution respectively to obtain neighborhood feature representations u 2 and i 2 represented by updated tag relations on the user side and item side;
  • Step 8 Use the attention mechanism to process the features obtained in steps 2, 6, and 7 to obtain the weights of different feature representations to obtain the final user and item feature representations u * , i * ;
  • a further improvement of the present invention is: in step 2, according to the number of times the user tags and the number of times the item is tagged, the tag representation features of the user and the item are initialized and normalized.
  • step 4 according to the bipartite graph of users and items, the hypergraphs of the user side and the item side represented by the direct interaction relationship are respectively constructed.
  • the user side Take the user side as an example: if there is a path between two items m and n, and the number of users passing through the path is less than k, then these two items are reachable neighbors of order k; for item m, if it has reachable neighbors of order k Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of item n.
  • step 5 according to the bipartite graph of users and tags, and items and tags, hypergraphs on the user side and item side represented by the tag relationship are respectively constructed.
  • step 5 the bipartite graph of users and tags, and items and tags, hypergraphs on the user side and item side represented by the tag relationship are respectively constructed.
  • step 5 the idea is similar to step 4, but instead of user-item relationships, user-tag and item-tag relationships are used respectively.
  • user-tag and item-tag relationships are used respectively.
  • the user side Take the user side as an example: if there is a path between two labels m and n, and the number of users passing through the path is less than k, then these two labels are k-order reachable neighbors; for label m, if it has k-order reachable neighbors Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of label n.
  • the present invention uses the characteristics of the hypergraph data structure to represent the high-order relationship of users and items, and utilizes hypergraph convolution to update information. Compared with the graph neural network, it can reduce the loss of information in the process of information transmission, fully Utilize high-order interaction relationships to obtain domain feature representations.
  • the present invention combines the attention mechanism to effectively use multiple features to represent features with different degrees of importance, and avoid high-value information from being overwhelmed by low-value information.
  • the method of splicing the features of the user side and the item side is used first, and then processed by the fully connected layer and the Sigmoid activation function, which reduces the impact of the information loss problem caused by the direct feature inner product.
  • Figure 1 is a diagram of the relationship between users, items, and tags.
  • Figure 2 is a variety of hypergraph convolutions representing hypergraphs.
  • Figure 3 is the multi-feature processing process based on the attention mechanism.
  • Figure 4 is the score prediction process.
  • the present invention introduces a hypergraph to model the high-order relationship between nodes, uses hypergraph convolution to complete the information transfer in the neural network, and uses the attention mechanism to reflect the different information Importance, so as to effectively alleviate the problem of high-level information loss in traditional recommendation methods.
  • Figure 1 shows the interactive relationship between users, items, and tags, including the direct interaction between users and items, and the relationship between users-tags and items-tags.
  • the present invention introduces attention mechanism and hypergraph convolution.
  • the present invention is a label-aware recommendation method based on attention mechanism and hypergraph convolution.
  • This method uses the direct interaction relationship between users and items and the brief interaction relationship between users, items, and labels in the user Construct different hypergraphs on the side and the project side, extract the information of high-order relational responses through hypergraph convolution, and use the attention mechanism to distinguish information of different importance levels, and make recommendations through the obtained feature representations, including the following steps :
  • Step 1 Initialize features.
  • the initial feature u is established on the user side based on personal information such as user ID, gender, and age
  • the initial feature i is established on the project side based on product information.
  • the label-aware part is not involved.
  • the number of times the user marks the label represents the feature
  • the item-label feature uses in and Respectively represent the number of times that user u and item i are marked with p, and ⁇ represents the normalization operation.
  • Step 3 Construct three interactive bipartite graphs G user-item , G user- tag , and G item-tag according to the interaction relationship between users and items, users and tags , and items and tags.
  • Figure 1 can be regarded as a combined representation of three bipartite graphs.
  • the purpose of establishing a bipartite graph is to construct a hypergraph structure.
  • the bipartite graph of users and items uses the direct interaction between users and items, such as the rating matrix.
  • Step 4 According to the bipartite graph of users and items, respectively construct the hypergraphs of the user side and the item side represented by the direct interaction relationship.
  • the user side Take the user side as an example: if there is a path between two items m and n, and the number of users passing through the path is less than k, then these two items are reachable neighbors of order k; for item m, if there is a reachable neighbor of order k Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of item n.
  • k-order reachable users are regarded as a set, and the users on the set are regarded as nodes, and the set can be regarded as a hyperedge, thus constructing a hypergraph
  • the project side is the same as the user side, building a hypergraph Specifically: if there is a path between two users u and v, and the number of items passed in the path is less than k, then the two users are k-order reachable neighbors; for user u, if it has a k-order reachable neighbor v And item m interacts directly with u, then item m is the k-level reachable item of user u.
  • its k-level reachable items are regarded as a set, and the items on the set are used as nodes, and the set can be regarded as super edges, thus constructing the hypergraph
  • Step 5 According to the bipartite graphs of users and tags and items and tags, respectively construct the hypergraphs on the user side and item side represented by the tag relationship.
  • the user side Take the user side as an example: if there is a path between two labels m and n, and the number of users passing through the path is less than k, then these two labels are k-order reachable neighbors; for label m, if it has k-order reachable neighbors Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of label n.
  • the two labels are k-order reachable neighbors; for label m, if it has a k-order reachable neighbor n And item p interacts directly with m, then item p is the k-order reachable item of label n.
  • the k-order reachable items are regarded as a set, and the items on the set are used as nodes, and the set can be regarded as super edges, thus constructing the hypergraph
  • the incidence matrix of represents H d-user , H d-item .
  • hypergraph convolution can be expressed as:
  • the spectral method is used for hypergraph convolution.
  • ⁇ (l) represents the learnable parameter matrix on layer l
  • is the activation function
  • D v is the degree matrix of nodes
  • DE is the degree matrix of hyperedges.
  • the multiplication operation of represents the user-side direct interaction relation represents the hypergraph
  • the multiplication operation with H d-user represents Aggregation from hyperedge features to node features.
  • ResNet the original features are added to each layer of the hypergraph convolution to retain the influence of the previous features, so as to prevent the initial features from being reflected due to the excessive influence of the neighbor features.
  • hypergraph convolution can be expressed as:
  • ⁇ (l) represents the learnable parameter matrix on the l layer
  • is the activation function
  • D v is the degree matrix of the node
  • DE is the degree matrix of the hyperedge
  • the multiplication operation represents the project-side direct interaction relation represents the hypergraph The aggregation from the node feature to the hyperedge feature, and the multiplication operation with the H d-item represent Aggregation from hyperedge features to node features.
  • Step 7 Similar to Step 6, get the hypergraph The incidence matrix of represents H t-user , H t-item .
  • User-side hypergraph convolution can be expressed as:
  • ⁇ (l) represents the learnable parameter matrix on the l layer
  • is the activation function
  • D v is the degree matrix of the node
  • DE is the degree matrix of the hyperedge.
  • hypergraph The incidence matrix of , H (V, E), where V is the set of nodes, and E is the set of hyperedges.
  • the multiplication operation of represents the user-side label represents the hypergraph Aggregation from node features to hyperedge features
  • multiplication operation with H t-user represents Aggregation from hyperedge features to node features.
  • the item-side hypergraph convolution can be expressed as:
  • ⁇ (l) represents the learnable parameter matrix on the l layer
  • is the activation function
  • D v is the degree matrix of the node
  • DE is the degree matrix of the hyperedge
  • the multiplication operation represents the item side label represents the hypergraph
  • the multiplication operation with H t-item represents Aggregation from hyperedge features to node features.
  • Step 8 As shown in Figure 3, according to the features u tag , i tag , u 1 , i 1 , u 2 , and i 2 obtained in steps 2, 6, and 7, use the attention mechanism to obtain the user-side and item-side The final features represent u * , i * .
  • u tag has different dimensions from u 1 and u 2 , it is not suitable for direct addition operation. Therefore, they are respectively spliced to obtain u 1-tag and u 2-tag .
  • i * ⁇ (i,1)i 1-tag + ⁇ (i,2)u 2-tag .
  • the items are recommended to the user by Top-K sorting.
  • the present invention introduces hypergraph convolution to mine high-order relations for feature extraction.
  • the attention mechanism is used to assign weights to the features obtained by the user-item direct interaction hypergraph and the label-aware hypergraph construction, which can better distinguish information of different importance.
  • the method proposed in the present invention can fully extract the features in the direct user-item interaction relationship and the interaction relationship with tags, effectively improving the performance of the recommendation method.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A tag-aware recommendation method based on an attention mechanism and hypergraph convolution. The tag-aware recommendation method constructs different hypergraphs respectively on a user side and an item side according to direct interaction relationships between users and items and indirect interaction relationships among the users, the items and tags, extracts information reflected by high-order relationships by means of the hypergraph convolution, distinguishes information having different degrees of importance by means of the attention mechanism, and carries out recommendation by means of obtained feature representation. The method introduces the hypergraph convolution to mine the high-order relationships for feature extraction, and uses the attention mechanism to allocate weights to features obtained from hypergraphs of user-item direct interaction and hypergraphs constructed by tag-awareness, so that information having different degrees of importance can be better distinguished. By combining the hypergraph convolution and the attention mechanism, the method can fully extract features from the user-item direct interaction relationships and from the interaction relationships with the tags, thus improving the performance of the recommendation method.

Description

一种基于注意力机制与超图卷积的标签感知推荐方法A label-aware recommendation method based on attention mechanism and hypergraph convolution 技术领域technical field
本发明属于信息服务、计算机软件技术应用领域,具体的说是涉及一种基于注意力机制与超图卷积的标签感知推荐方法。The invention belongs to the application fields of information service and computer software technology, and in particular relates to a label-aware recommendation method based on attention mechanism and hypergraph convolution.
背景技术Background technique
随着网络各种信息资源的数量飞速增长,如何从巨大的数据量中将符合需求的资源或商品推荐给用户越来越受到工业界和学术界的关注,这就需求服务提供者要有合适的推荐方法。为了提高推荐准确性,许多互联网服务提供者采用了用户生成内容(UGC)的标签系统,允许用户主动给商品、视频、信息等打上标签。传统的如协同过滤等推荐方法难以体现用户和项目的复杂的、多种的、高阶的交互关系,因此效果不佳。随着深度学习的方法的引入,基于图神经网络的推荐方法因可以反应拓扑结构而得到关注。然而,普通的图结构仅仅通过边来反应结点之间的两两关系,却无法反应三者及三者以上的关系。With the rapid growth of the number of various information resources on the Internet, how to recommend resources or products that meet the needs to users from the huge amount of data has attracted more and more attention from the industry and academia, which requires service providers to have appropriate recommended method. In order to improve the accuracy of recommendations, many Internet service providers have adopted a user-generated content (UGC) tagging system, which allows users to actively tag products, videos, information, etc. Traditional recommendation methods such as collaborative filtering are difficult to reflect the complex, diverse, and high-level interaction between users and items, so the effect is not good. With the introduction of deep learning methods, recommendation methods based on graph neural networks have attracted attention because they can reflect topology. However, the ordinary graph structure only reflects the pairwise relationship between nodes through edges, but cannot reflect the relationship between three or more than three.
发明内容Contents of the invention
为了解决标签感知推荐中用户、项目、标签的复杂高阶关系无法使用传统的图神经网络来体现的问题,本发明引入了超图来建模结点之间的高阶关系,利用超图卷积完成神经网络中信息的传递,并使用注意力机制来反应不同信息的重要程度,以此有效缓解传统推荐方法中高阶信息丢失的问题。In order to solve the problem that the complex high-order relationship of users, items, and tags in tag-aware recommendation cannot be represented by the traditional graph neural network, the present invention introduces a hypergraph to model the high-order relationship between nodes, and utilizes the hypergraph volume The transfer of information in the neural network is completed actively, and the attention mechanism is used to reflect the importance of different information, so as to effectively alleviate the problem of high-level information loss in traditional recommendation methods.
为了达到上述目的,本发明是通过以下技术方案实现的:In order to achieve the above object, the present invention is achieved through the following technical solutions:
本发明是一种基于注意力机制与超图卷积的标签感知推荐方法,标签感知推荐方法通过用户与项目的直接交互关系和用户、项目、标签的简介交互关系分别在用户侧和项目侧构建不同的超图,通过超图卷积提取高阶关系反应的信息,并采用注意力机制对不同重要程度的信息做出区分,通过得到的特征表示进行推荐,具体包括如下步骤:The present invention is a label-aware recommendation method based on attention mechanism and hypergraph convolution. The label-aware recommendation method is constructed on the user side and the project side through the direct interaction relationship between users and items and the brief interaction relationship between users, items, and labels. For different hypergraphs, the information of high-order relationship responses is extracted through hypergraph convolution, and the attention mechanism is used to distinguish information of different importance levels, and recommendations are made through the obtained feature representations. The specific steps are as follows:
步骤1:初始化用户与项目的特征表示得到u、i;Step 1: Initialize the feature representation of users and items to get u, i;
步骤2:根据用户和项目分别与标签的交互关系得到标签表示的用户特征u tag和项目特征i tagStep 2: Obtain the user feature u tag and the item feature i tag represented by the tag according to the interaction relationship between the user and the item and the tag respectively;
步骤3:根据用户与项目、用户与标签、项目与标签的交互关系构建三个交互二部图G user-item、G user-tag、G item-tagStep 3: Construct three interactive bipartite graphs G user- item , G user- tag , and G item-tag according to the interaction relationship between users and items, users and tags , and items and tags;
步骤4:根据步骤3得到的用户与项目二部图分别构建直接交互关系表示的用户侧和项目侧的超图结构
Figure PCTCN2022106913-appb-000001
Step 4: According to the bipartite graph of users and items obtained in step 3, respectively construct the hypergraph structure of the user side and the item side represented by the direct interaction relationship
Figure PCTCN2022106913-appb-000001
步骤5:根步骤3得到的用户与标签、项目与标签二部图分别构建标签关系表示的用户侧和项目侧的超图结构
Figure PCTCN2022106913-appb-000002
Step 5: Based on the bipartite graphs of users and tags and items and tags obtained in step 3, respectively construct the hypergraph structure of the user side and item side represented by the tag relationship
Figure PCTCN2022106913-appb-000002
步骤6:对步骤1得到的特征u、i用步骤4得到的两个超图
Figure PCTCN2022106913-appb-000003
Figure PCTCN2022106913-appb-000004
分别进行超图卷积,从而得到用户侧和项目侧更新后的直接关系表示的邻域特征表示u 1、i 1
Step 6: Use the two hypergraphs obtained in step 4 for the features u and i obtained in step 1
Figure PCTCN2022106913-appb-000003
Figure PCTCN2022106913-appb-000004
Perform hypergraph convolution respectively to obtain neighborhood feature representations u 1 and i 1 of the updated direct relationship representations on the user side and item side;
步骤7:对步骤1得到的特征u、i步骤5到的两个超图
Figure PCTCN2022106913-appb-000005
分别进行超图卷积,从而得到用户侧和项目侧更新后的标签关系表示的邻域特征表示u 2、i 2
Step 7: For the features u and i obtained in step 1, the two hypergraphs obtained in step 5
Figure PCTCN2022106913-appb-000005
Carry out hypergraph convolution respectively to obtain neighborhood feature representations u 2 and i 2 represented by updated tag relations on the user side and item side;
步骤8:使用注意力机制对步骤2、步骤6、步骤7得到的特征进行处理以得到不同特征表示的权重来得到最终的用户与项目特征表示u *、i *Step 8: Use the attention mechanism to process the features obtained in steps 2, 6, and 7 to obtain the weights of different feature representations to obtain the final user and item feature representations u * , i * ;
步骤9:根据第八步得到的用户与项目特征表示进行拼接得到z=[u *;i *],将其输入到全连接层中并利用Sigmoid函数获得预测概率,根据评分进行推荐。 Step 9: Concatenate the user and item feature representations obtained in the eighth step to obtain z=[u * ; i * ], input it into the fully connected layer and use the Sigmoid function to obtain the prediction probability, and make recommendations based on the score.
本发明的进一步改进在于:在步骤2中,根据用户打的标签的次数和项目被打标签的次数,初始化用户和项目的标签表示特征,并进行归一化处理。A further improvement of the present invention is: in step 2, according to the number of times the user tags and the number of times the item is tagged, the tag representation features of the user and the item are initialized and normalized.
本发明的进一步改进在于:在步骤4中:根据用户与项目二部图分别构建直接交互关系表示的用户侧和项目侧的超图。以用户侧为例:若两个项目m、 n之间存在路径,且路径中经过的用户数小于k,则这两个项目为k阶可到达邻居;对于项目m,若其有k阶可到达邻居n且用户u与m直接交互,则用户u是项目n的k阶可达用户。对于每个项目,其k阶可达用户视为一个集合,集合上用户作为结点,集合即可视为超边,因而构建超图
Figure PCTCN2022106913-appb-000006
项目侧与用户侧同理,构建超图
Figure PCTCN2022106913-appb-000007
The further improvement of the present invention lies in: in step 4: according to the bipartite graph of users and items, the hypergraphs of the user side and the item side represented by the direct interaction relationship are respectively constructed. Take the user side as an example: if there is a path between two items m and n, and the number of users passing through the path is less than k, then these two items are reachable neighbors of order k; for item m, if it has reachable neighbors of order k Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of item n. For each item, its k-order reachable users are regarded as a set, and the users on the set are regarded as nodes, and the set can be regarded as a hyperedge, thus constructing a hypergraph
Figure PCTCN2022106913-appb-000006
The project side is the same as the user side, building a hypergraph
Figure PCTCN2022106913-appb-000007
本发明的进一步改进在于:在步骤5中根据用户与标签、项目与标签二部图分别构建标签关系表示的用户侧和项目侧的超图。思想与第四步类似,但是并非采用用户-项目关系,而是分别采用用户-标签和项目-标签关系。以用户侧为例:若两个标签m、n之间存在路径,且路径中经过的用户数小于k,则这两个标签为k阶可到达邻居;对于标签m,若其有k阶可到达邻居n且用户u与m直接交互,则用户u是标签n的k阶可达用户。对于每个标签,其k阶可达用户视为一个集合,集合上用户作为结点,集合即可视为超边,因而构建超图
Figure PCTCN2022106913-appb-000008
项目侧与用户侧同理,构建超图
Figure PCTCN2022106913-appb-000009
The further improvement of the present invention lies in that: in step 5, according to the bipartite graph of users and tags, and items and tags, hypergraphs on the user side and item side represented by the tag relationship are respectively constructed. The idea is similar to step 4, but instead of user-item relationships, user-tag and item-tag relationships are used respectively. Take the user side as an example: if there is a path between two labels m and n, and the number of users passing through the path is less than k, then these two labels are k-order reachable neighbors; for label m, if it has k-order reachable neighbors Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of label n. For each label, its k-order reachable users are regarded as a set, and users on the set are regarded as nodes, and the set can be regarded as a hyperedge, thus constructing a hypergraph
Figure PCTCN2022106913-appb-000008
The project side is the same as the user side, building a hypergraph
Figure PCTCN2022106913-appb-000009
本发明的有益效果是:The beneficial effects of the present invention are:
(1)本发明利用超图数据结构的特点表示用户、项目的高阶关系,并利用超图卷积来更新信息,与图神经网络相比,可以在信息传递过程中减少信息的损失,充分利用高阶的交互关系以获取领域特征表示。(1) The present invention uses the characteristics of the hypergraph data structure to represent the high-order relationship of users and items, and utilizes hypergraph convolution to update information. Compared with the graph neural network, it can reduce the loss of information in the process of information transmission, fully Utilize high-order interaction relationships to obtain domain feature representations.
(2)本发明结合了注意力机制,有效地用多种特征表示中区分不同重要程度的特征,避免了高价值信息被低价值信息喧宾夺主。(2) The present invention combines the attention mechanism to effectively use multiple features to represent features with different degrees of importance, and avoid high-value information from being overwhelmed by low-value information.
(3)在标签感知推荐中,利用用户、项目、标签三者之间的关系进行特征表示和关系建模,充分利用多种信息以进行特征表示。(3) In tag-aware recommendation, the relationship between users, items, and tags is used for feature representation and relationship modeling, and a variety of information is fully utilized for feature representation.
(4)在结果预测中采用先对用户侧和项目侧的特征进行拼接再用全连接层和Sigmoid激活函数来处理的方法,减少了直接将特征内积导致的信息丢失问题的影响。(4) In the result prediction, the method of splicing the features of the user side and the item side is used first, and then processed by the fully connected layer and the Sigmoid activation function, which reduces the impact of the information loss problem caused by the direct feature inner product.
附图说明Description of drawings
图1是用户、项目、标签关系图。Figure 1 is a diagram of the relationship between users, items, and tags.
图2是多种表示超图的超图卷积。Figure 2 is a variety of hypergraph convolutions representing hypergraphs.
图3是基于注意力机制的多特征处理过程。Figure 3 is the multi-feature processing process based on the attention mechanism.
图4是评分预测过程。Figure 4 is the score prediction process.
具体实施方式Detailed ways
以下将以图式揭露本发明的实施方式,为明确说明起见,许多实务上的细节将在以下叙述中一并说明。然而,应了解到,这些实务上的细节不应用以限制本发明。也就是说,在本发明的部分实施方式中,这些实务上的细节是非必要的。此外,为简化图式起见,一些习知惯用的结构与组件在图式中将以简单的示意的方式绘示之。Embodiments of the present invention will be disclosed in the following diagrams. For the sake of clarity, many practical details will be described together in the following description. It should be understood, however, that these practical details should not be used to limit the invention. That is, in some embodiments of the invention, these practical details are not necessary. In addition, for the sake of simplifying the drawings, some well-known and commonly used structures and components will be shown in a simple schematic manner in the drawings.
为了解决标签感知推荐中的问题,本发明引入了超图来建模结点之间的高阶关系,利用超图卷积完成神经网络中信息的传递,并使用注意力机制来反应不同信息的重要程度,以此有效缓解传统推荐方法中高阶信息丢失的问题。图1给出用户、项目、标签之间的交互关系,其中既有用户与项目的直接交互关系,又有用户-标签和项目-标签的关系。为了从复杂的关系中提取特征进行推荐,本发明引入了注意力机制和超图卷积。In order to solve the problem in tag-aware recommendation, the present invention introduces a hypergraph to model the high-order relationship between nodes, uses hypergraph convolution to complete the information transfer in the neural network, and uses the attention mechanism to reflect the different information Importance, so as to effectively alleviate the problem of high-level information loss in traditional recommendation methods. Figure 1 shows the interactive relationship between users, items, and tags, including the direct interaction between users and items, and the relationship between users-tags and items-tags. In order to extract features from complex relations for recommendation, the present invention introduces attention mechanism and hypergraph convolution.
图2-3所示,本发明是一种基于注意力机制与超图卷积的标签感知推荐方法,该方法通过用户与项目的直接交互关系和用户、项目、标签的简介交互关系分别在用户侧和项目侧构建不同的超图,通过超图卷积提取高阶关系反应的信息,并采用注意力机制对不同重要程度的信息做出区分,通过得到的特征表示进行推荐,具体包括如下步骤:As shown in Figures 2-3, the present invention is a label-aware recommendation method based on attention mechanism and hypergraph convolution. This method uses the direct interaction relationship between users and items and the brief interaction relationship between users, items, and labels in the user Construct different hypergraphs on the side and the project side, extract the information of high-order relational responses through hypergraph convolution, and use the attention mechanism to distinguish information of different importance levels, and make recommendations through the obtained feature representations, including the following steps :
步骤1:初始化特征。这里在用户侧根据用户ID、性别、年龄等个人信息建立初始特征u,在项目侧根据商品信息建立初始特征i。在这一步中,不涉及标签感知的部分。Step 1: Initialize features. Here, the initial feature u is established on the user side based on personal information such as user ID, gender, and age, and the initial feature i is established on the project side based on product information. In this step, the label-aware part is not involved.
步骤2:用元组F=(U,I,T,A)来表示用户集合U、项目集合I、标签集合T和三者关系表示集合A。在用户-标签特征中,将用户对标签的标记次数表示特征
Figure PCTCN2022106913-appb-000010
同样的,项目-标签特征用
Figure PCTCN2022106913-appb-000011
其中
Figure PCTCN2022106913-appb-000012
Figure PCTCN2022106913-appb-000013
分别表示用户u和项目i打上标记p的次数,σ代表归一化操作。
Step 2: Use the tuple F=(U, I, T, A) to represent the user set U, the item set I, the label set T and the three-way relationship to represent the set A. In the user-label feature, the number of times the user marks the label represents the feature
Figure PCTCN2022106913-appb-000010
Similarly, the item-label feature uses
Figure PCTCN2022106913-appb-000011
in
Figure PCTCN2022106913-appb-000012
and
Figure PCTCN2022106913-appb-000013
Respectively represent the number of times that user u and item i are marked with p, and σ represents the normalization operation.
步骤3:根据用户与项目、用户与标签、项目与标签的交互关系构建三个交互二部图G user-item、G user-tag、G item-tag。图1可视为三个二部图的结合表示,建立二部图,是为了构建超图结构,其中用户与项目的二部图使用用户与项目的直接交互,如评分矩阵。 Step 3: Construct three interactive bipartite graphs G user-item , G user- tag , and G item-tag according to the interaction relationship between users and items, users and tags , and items and tags. Figure 1 can be regarded as a combined representation of three bipartite graphs. The purpose of establishing a bipartite graph is to construct a hypergraph structure. The bipartite graph of users and items uses the direct interaction between users and items, such as the rating matrix.
步骤4:根据用户与项目二部图分别构建直接交互关系表示的用户侧和项目侧的超图。以用户侧为例:若两个项目m、n之间存在路径,且路径中经过的用户数小于k,则这两个项目为k阶可到达邻居;对于项目m,若其有k阶可到达邻居n且用户u与m直接交互,则用户u是项目n的k阶可达用户。对于每个项目,其k阶可达用户视为一个集合,集合上用户作为结点,集合即可视为超边,因而构建超图
Figure PCTCN2022106913-appb-000014
项目侧与用户侧同理,构建超图
Figure PCTCN2022106913-appb-000015
具体为:若两个用户u、v之间存在路径,且路径中经过的项目数小于k,则这两个用户为k阶可到达邻居;对于用户u,若其有k阶可到达邻居v且项目m与u直接交互,则项目m是用户u的k阶可达项目,对于每个用户,其k阶可达项目视为一个集合,集合上项目作为结点,集合即可视为超边,因而构建超图
Figure PCTCN2022106913-appb-000016
Step 4: According to the bipartite graph of users and items, respectively construct the hypergraphs of the user side and the item side represented by the direct interaction relationship. Take the user side as an example: if there is a path between two items m and n, and the number of users passing through the path is less than k, then these two items are reachable neighbors of order k; for item m, if there is a reachable neighbor of order k Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of item n. For each item, its k-order reachable users are regarded as a set, and the users on the set are regarded as nodes, and the set can be regarded as a hyperedge, thus constructing a hypergraph
Figure PCTCN2022106913-appb-000014
The project side is the same as the user side, building a hypergraph
Figure PCTCN2022106913-appb-000015
Specifically: if there is a path between two users u and v, and the number of items passed in the path is less than k, then the two users are k-order reachable neighbors; for user u, if it has a k-order reachable neighbor v And item m interacts directly with u, then item m is the k-level reachable item of user u. For each user, its k-level reachable items are regarded as a set, and the items on the set are used as nodes, and the set can be regarded as super edges, thus constructing the hypergraph
Figure PCTCN2022106913-appb-000016
步骤5:根据用户与标签、项目与标签二部图分别构建标签关系表示的用户侧和项目侧的超图。以用户侧为例:若两个标签m、n之间存在路径,且路径中经过的用户数小于k,则这两个标签为k阶可到达邻居;对于标签m,若 其有k阶可到达邻居n且用户u与m直接交互,则用户u是标签n的k阶可达用户。对于每个标签,其k阶可达用户视为一个集合,集合上用户作为结点,集合即可视为超边,因而构建超图
Figure PCTCN2022106913-appb-000017
项目侧与用户侧同理,构建超图
Figure PCTCN2022106913-appb-000018
具体为:若两个标签m、n之间存在路径,且路径中经过的项目数小于k,则这两个标签为k阶可到达邻居;对于标签m,若其有k阶可到达邻居n且项目p与m直接交互,则项目p是标签n的k阶可达项目,对于每个标签,其k阶可达项目视为一个集合,集合上项目作为结点,集合即可视为超边,因而构建超图
Figure PCTCN2022106913-appb-000019
Step 5: According to the bipartite graphs of users and tags and items and tags, respectively construct the hypergraphs on the user side and item side represented by the tag relationship. Take the user side as an example: if there is a path between two labels m and n, and the number of users passing through the path is less than k, then these two labels are k-order reachable neighbors; for label m, if it has k-order reachable neighbors Reach neighbor n and user u directly interacts with m, then user u is the k-order reachable user of label n. For each label, its k-order reachable users are regarded as a set, and users on the set are regarded as nodes, and the set can be regarded as a hyperedge, thus constructing a hypergraph
Figure PCTCN2022106913-appb-000017
The project side is the same as the user side, building a hypergraph
Figure PCTCN2022106913-appb-000018
Specifically: if there is a path between two labels m and n, and the number of items passed in the path is less than k, then the two labels are k-order reachable neighbors; for label m, if it has a k-order reachable neighbor n And item p interacts directly with m, then item p is the k-order reachable item of label n. For each label, its k-order reachable items are regarded as a set, and the items on the set are used as nodes, and the set can be regarded as super edges, thus constructing the hypergraph
Figure PCTCN2022106913-appb-000019
步骤6:将超图结构使用关联矩阵的形式表达H=(V,E),其中V为结点集合,E为超边集合,使用如下方法来表示结点v是否在超边e上:Step 6: Express the hypergraph structure in the form of an incidence matrix H=(V, E), wherein V is a node set, E is a hyperedge set, and use the following method to indicate whether the node v is on the hyperedge e:
Figure PCTCN2022106913-appb-000020
Figure PCTCN2022106913-appb-000020
因此,得到超图
Figure PCTCN2022106913-appb-000021
的关联矩阵表示H d-user、H d-item
Therefore, the hypergraph
Figure PCTCN2022106913-appb-000021
The incidence matrix of represents H d-user , H d-item .
以用户侧为例,超图卷积可以表示成:Taking the user side as an example, hypergraph convolution can be expressed as:
Figure PCTCN2022106913-appb-000022
Figure PCTCN2022106913-appb-000022
这里采用了频谱的方法进行超图卷积。其中Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点的度矩阵,D E为超边的度矩阵。H d-user为超图
Figure PCTCN2022106913-appb-000023
的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合。与
Figure PCTCN2022106913-appb-000024
的相乘操作表示用户侧直接交互关系表示超图
Figure PCTCN2022106913-appb-000025
上从结点特征到超边特征的聚合,与H d-user的相乘操作表示
Figure PCTCN2022106913-appb-000026
上从超边特征到结点特征的聚合。同时,根据ResNet的思想,在超图卷积的每一层中都会通过加上原始特征以保留之前的特征的影响,以防止邻居特征影响过大而导致初始特征无法得到体现。
Here, the spectral method is used for hypergraph convolution. where Θ (l) represents the learnable parameter matrix on layer l, σ is the activation function, D v is the degree matrix of nodes, and DE is the degree matrix of hyperedges. H d-user is a hypergraph
Figure PCTCN2022106913-appb-000023
The incidence matrix of , H=(V, E), where V is the set of nodes, and E is the set of hyperedges. and
Figure PCTCN2022106913-appb-000024
The multiplication operation of represents the user-side direct interaction relation represents the hypergraph
Figure PCTCN2022106913-appb-000025
On the aggregation from node features to hyperedge features, the multiplication operation with H d-user represents
Figure PCTCN2022106913-appb-000026
Aggregation from hyperedge features to node features. At the same time, according to the idea of ResNet, the original features are added to each layer of the hypergraph convolution to retain the influence of the previous features, so as to prevent the initial features from being reflected due to the excessive influence of the neighbor features.
项目侧与用户侧同理,超图卷积可以表示成:The project side is the same as the user side, and hypergraph convolution can be expressed as:
Figure PCTCN2022106913-appb-000027
Figure PCTCN2022106913-appb-000027
其中:Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点的度矩阵,D E为超边的度矩阵。H d-item为超图
Figure PCTCN2022106913-appb-000028
的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合。与
Figure PCTCN2022106913-appb-000029
的相乘操作表示项目侧直接交互关系表示超图
Figure PCTCN2022106913-appb-000030
上从结点特征到超边特征的聚合,与H d-item的相乘操作表示
Figure PCTCN2022106913-appb-000031
上从超边特征到结点特征的聚合。
Among them: Θ (l) represents the learnable parameter matrix on the l layer, σ is the activation function, D v is the degree matrix of the node, and DE is the degree matrix of the hyperedge. H d-item is a hypergraph
Figure PCTCN2022106913-appb-000028
The incidence matrix of , H=(V, E), where V is the set of nodes, and E is the set of hyperedges. and
Figure PCTCN2022106913-appb-000029
The multiplication operation represents the project-side direct interaction relation represents the hypergraph
Figure PCTCN2022106913-appb-000030
The aggregation from the node feature to the hyperedge feature, and the multiplication operation with the H d-item represent
Figure PCTCN2022106913-appb-000031
Aggregation from hyperedge features to node features.
由此,在超图卷积的每一层上,通过结点-超边-结点的形式进行信息更新,以超图神经网络的形式从高阶关系中提取信息。Thus, on each layer of hypergraph convolution, information is updated in the form of node-hyperedge-node, and information is extracted from high-order relations in the form of hypergraph neural network.
步骤7:与步骤6相似,得到超图
Figure PCTCN2022106913-appb-000032
的关联矩阵表示H t-user、H t-item
Step 7: Similar to Step 6, get the hypergraph
Figure PCTCN2022106913-appb-000032
The incidence matrix of represents H t-user , H t-item .
用户侧超图卷积可以表示成:User-side hypergraph convolution can be expressed as:
Figure PCTCN2022106913-appb-000033
Figure PCTCN2022106913-appb-000033
其中:Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点的度矩阵,D E为超边的度矩阵。
Figure PCTCN2022106913-appb-000034
为超图
Figure PCTCN2022106913-appb-000035
的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合。与
Figure PCTCN2022106913-appb-000036
的相乘操作表示用户侧标签表示超图
Figure PCTCN2022106913-appb-000037
上从结点特征到超边特征的聚合,与H t-user的相乘操作表示
Figure PCTCN2022106913-appb-000038
上从超边特征到结点特征的聚合。
Among them: Θ (l) represents the learnable parameter matrix on the l layer, σ is the activation function, D v is the degree matrix of the node, and DE is the degree matrix of the hyperedge.
Figure PCTCN2022106913-appb-000034
hypergraph
Figure PCTCN2022106913-appb-000035
The incidence matrix of , H=(V, E), where V is the set of nodes, and E is the set of hyperedges. and
Figure PCTCN2022106913-appb-000036
The multiplication operation of represents the user-side label represents the hypergraph
Figure PCTCN2022106913-appb-000037
Aggregation from node features to hyperedge features, multiplication operation with H t-user represents
Figure PCTCN2022106913-appb-000038
Aggregation from hyperedge features to node features.
项目侧超图卷积可以表示成:The item-side hypergraph convolution can be expressed as:
Figure PCTCN2022106913-appb-000039
Figure PCTCN2022106913-appb-000039
其中:Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点 的度矩阵,D E为超边的度矩阵。H t-item为超图
Figure PCTCN2022106913-appb-000040
的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合。与
Figure PCTCN2022106913-appb-000041
的相乘操作表示项目侧标签表示超图
Figure PCTCN2022106913-appb-000042
上从结点特征到超边特征的聚合,与H t-item的相乘操作表示
Figure PCTCN2022106913-appb-000043
上从超边特征到结点特征的聚合。
Among them: Θ (l) represents the learnable parameter matrix on the l layer, σ is the activation function, D v is the degree matrix of the node, and DE is the degree matrix of the hyperedge. H t-item is a hypergraph
Figure PCTCN2022106913-appb-000040
The incidence matrix of , H=(V, E), where V is the set of nodes, and E is the set of hyperedges. and
Figure PCTCN2022106913-appb-000041
The multiplication operation represents the item side label represents the hypergraph
Figure PCTCN2022106913-appb-000042
On the aggregation from node features to hyperedge features, the multiplication operation with H t-item represents
Figure PCTCN2022106913-appb-000043
Aggregation from hyperedge features to node features.
步骤8:如图3所示,根据步骤2、步骤6、步骤7得到的特征u tag、i tag、u 1、i 1、u 2、i 2,使用注意力机制得到用户侧和项目侧的最终特征表示u *、i *Step 8: As shown in Figure 3, according to the features u tag , i tag , u 1 , i 1 , u 2 , and i 2 obtained in steps 2, 6, and 7, use the attention mechanism to obtain the user-side and item-side The final features represent u * , i * .
以用户侧为例,因为u tag与u 1、u 2的维度不同,不适合直接进行相加操作。因此将它们分别进行拼接操作得到u 1-tag和u 2-tagTaking the user side as an example, since u tag has different dimensions from u 1 and u 2 , it is not suitable for direct addition operation. Therefore, they are respectively spliced to obtain u 1-tag and u 2-tag .
得到u 1-tag和u 2-tag后,再使用注意力机制将两种特征表示处理成最终特征表示: After getting u 1-tag and u 2-tag , use the attention mechanism to process the two feature representations into the final feature representation:
a(u,k)=W Ttanh(Wu k-tag+b 2) a(u,k)=W T tanh(Wu k-tag +b 2 )
Figure PCTCN2022106913-appb-000044
Figure PCTCN2022106913-appb-000044
根据注意力机制得到权重可以获得用户侧最终特征表示u * Obtaining the weight according to the attention mechanism can obtain the final feature representation u * on the user side
u *=α(u,1)u 1-tag+α(u,2)u 2-tag u * = α(u,1)u 1-tag + α(u,2)u 2-tag
将i tag与i 1、i 2分别进行拼接操作得到i 1-tag和i 2-tag,再使用注意力机制将两种特征表示处理成一种特征表示: Splicing i tag with i 1 and i 2 respectively to obtain i 1-tag and i 2-tag , and then use the attention mechanism to process the two feature representations into one feature representation:
a(i,k)=W Ttanh(Wi k-tag+b 2) a(i,k)=W T tanh(Wi k-tag +b 2 )
Figure PCTCN2022106913-appb-000045
Figure PCTCN2022106913-appb-000045
根据注意力机制得到权重获得项目侧最终特征表示i *Get the weight according to the attention mechanism to obtain the final feature representation i * on the item side:
i *=α(i,1)i 1-tag+α(i,2)u 2-tagi * = α(i,1)i 1-tag + α(i,2)u 2-tag .
步骤9:如图4所示,根据u *与i *进行拼接操作得到z=[u *;i *],,并采 用Sigmoid函数作为激活函数得到用户-项目概率预测以此进行推荐。 Step 9: As shown in Figure 4, according to the concatenation operation of u * and i * , z=[u * ; i * ] is obtained, and the Sigmoid function is used as the activation function to obtain the user-item probability prediction for recommendation.
Figure PCTCN2022106913-appb-000046
Figure PCTCN2022106913-appb-000046
根据得到的概率预测,将项目进行Top-K排序推荐给用户。According to the obtained probability prediction, the items are recommended to the user by Top-K sorting.
使用交叉熵函数作为损失函数进行模型训练:Use the cross-entropy function as the loss function for model training:
Figure PCTCN2022106913-appb-000047
Figure PCTCN2022106913-appb-000047
其中X为训练样本集合。where X is the training sample set.
本发明引入了超图卷积来挖掘高阶关系以进行特征提取。同时,采用注意力机制对于用户-项目直接交互的超图与标签感知构建的超图获得的特征进行权重分配,能够更好地区分不同重要程度的信息。通过巧妙地结合超图卷积和注意力机制,本发明提出的方法可以充分提取用户-项目的直接交互关系和与标签的交互关系中的特征,有效提高推荐方法的性能。The present invention introduces hypergraph convolution to mine high-order relations for feature extraction. At the same time, the attention mechanism is used to assign weights to the features obtained by the user-item direct interaction hypergraph and the label-aware hypergraph construction, which can better distinguish information of different importance. By cleverly combining hypergraph convolution and attention mechanism, the method proposed in the present invention can fully extract the features in the direct user-item interaction relationship and the interaction relationship with tags, effectively improving the performance of the recommendation method.
以上所述仅为本发明的实施方式而已,并不用于限制本发明。对于本领域技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原理的内所作的任何修改、等同替换、改进等,均应包括在本发明的权利要求范围之内。The above descriptions are only embodiments of the present invention, and are not intended to limit the present invention. Various modifications and variations of the present invention will occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention shall be included in the scope of the claims of the present invention.

Claims (8)

  1. 一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:标签感知推荐方法通过用户与项目的直接交互关系和用户、项目、标签的简介交互关系分别在用户侧和项目侧构建不同的超图,通过超图卷积提取高阶关系反应的信息,并采用注意力机制对不同重要程度的信息做出区分,通过得到的特征表示进行推荐,具体包括如下步骤:A label-aware recommendation method based on attention mechanism and hypergraph convolution, characterized in that: the label-aware recommendation method uses the direct interaction relationship between users and items and the brief interaction relationship between users, items, and labels on the user side and item side respectively Construct different hypergraphs, extract the information of high-order relationship responses through hypergraph convolution, and use the attention mechanism to distinguish information of different importance levels, and make recommendations through the obtained feature representations, including the following steps:
    步骤1:初始化用户u与项目i的特征表示,得到用户u的初始特征u和项目i的初始特征i;Step 1: Initialize the feature representation of user u and item i, and obtain the initial feature u of user u and the initial feature i of item i;
    步骤2:根据用户和项目分别与标签的交互关系得到标签表示的用户特征u tag和项目特征i tagStep 2: Obtain the user feature u tag and the item feature i tag represented by the tag according to the interaction relationship between the user and the item and the tag respectively;
    步骤3:根据用户与项目、用户与标签、项目与标签的交互关系构建三个交互二部图G user-item、G user-tag、G item-tagStep 3: Construct three interactive bipartite graphs G user-item , G user- tag , and G item-tag according to the interaction relationship between users and items, users and tags , and items and tags;
    步骤4:根据步骤3得到的用户与项目二部图分别构建直接交互关系表示的用户侧和项目侧的超图结构
    Figure PCTCN2022106913-appb-100001
    Step 4: According to the bipartite graph of users and items obtained in step 3, respectively construct the hypergraph structure of the user side and the item side represented by the direct interaction relationship
    Figure PCTCN2022106913-appb-100001
    步骤5:根步骤3得到的用户与标签、项目与标签二部图分别构建标签关系表示的用户侧和项目侧的超图结构
    Figure PCTCN2022106913-appb-100002
    Step 5: Based on the bipartite graphs of users and tags and items and tags obtained in step 3, respectively construct the hypergraph structure of the user side and item side represented by the tag relationship
    Figure PCTCN2022106913-appb-100002
    步骤6:对步骤1得到的特征u、i用步骤4得到的两个超图
    Figure PCTCN2022106913-appb-100003
    Figure PCTCN2022106913-appb-100004
    分别进行超图卷积,从而得到用户侧和项目侧更新后的直接关系表示的邻域特征表示u 1、i 1
    Step 6: Use the two hypergraphs obtained in step 4 for the features u and i obtained in step 1
    Figure PCTCN2022106913-appb-100003
    Figure PCTCN2022106913-appb-100004
    Perform hypergraph convolution respectively to obtain neighborhood feature representations u 1 and i 1 of the updated direct relationship representations on the user side and item side;
    步骤7:对步骤1得到的特征u、i步骤5到的两个超图
    Figure PCTCN2022106913-appb-100005
    分别进行超图卷积,从而得到用户侧和项目侧更新后的标签关系表示的邻域特征表示u 2、i 2
    Step 7: For the features u and i obtained in step 1, the two hypergraphs obtained in step 5
    Figure PCTCN2022106913-appb-100005
    Carry out hypergraph convolution respectively to obtain neighborhood feature representations u 2 and i 2 represented by updated tag relations on the user side and item side;
    步骤8:使用注意力机制对步骤2、步骤6、步骤7得到的特征进行处理以得到不同特征表示的权重来得到最终的用户与项目特征表示u *、i *Step 8: Use the attention mechanism to process the features obtained in steps 2, 6, and 7 to obtain the weights of different feature representations to obtain the final user and item feature representations u * , i * ;
    步骤9:根据第八步得到的用户与项目特征表示进行拼接得到z=[u *;i *], 将其输入到全连接层中并利用Sigmoid函数获得预测概率,根据评分进行推荐。 Step 9: Concatenate the user and item feature representations obtained in the eighth step to obtain z=[u * ; i * ], input it into the fully connected layer and use the Sigmoid function to obtain the prediction probability, and make recommendations based on the score.
  2. 根据权利要求1所述一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:所述步骤4中用户侧的超图结构
    Figure PCTCN2022106913-appb-100006
    的构建具体为:若两个项目m、n之间存在路径,且路径中经过的用户数小于k,则这两个项目为k阶可到达邻居;对于项目m,若其有k阶可到达邻居n且用户u与m直接交互,则用户u是项目n的k阶可达用户,对于每个项目,其k阶可达用户视为一个集合,集合上用户作为结点,集合即可视为超边,因而构建超图
    Figure PCTCN2022106913-appb-100007
    A label-aware recommendation method based on attention mechanism and hypergraph convolution according to claim 1, characterized in that: the hypergraph structure on the user side in step 4
    Figure PCTCN2022106913-appb-100006
    The specific construction of is as follows: if there is a path between two items m and n, and the number of users passing through the path is less than k, then these two items are reachable neighbors of order k; for item m, if it has reachable neighbors of order k Neighbor n and user u directly interacts with m, then user u is the k-order reachable user of project n. For each project, its k-order reachable users are regarded as a set, and the users on the set are regarded as nodes, and the set can be visualized is a hyperedge, thus constructing a hypergraph
    Figure PCTCN2022106913-appb-100007
  3. 根据权利要求2所述一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:所述步骤4中项目侧的超图结构
    Figure PCTCN2022106913-appb-100008
    的构建具体为:
    A label-aware recommendation method based on attention mechanism and hypergraph convolution according to claim 2, characterized in that: the hypergraph structure on the item side in step 4
    Figure PCTCN2022106913-appb-100008
    The build is specifically:
    若两个用户u、v之间存在路径,且路径中经过的项目数小于k,则这两个用户为k阶可到达邻居;对于用户u,若其有k阶可到达邻居v且项目m与u直接交互,则项目m是用户u的k阶可达项目,对于每个用户,其k阶可达项目视为一个集合,集合上项目作为结点,集合即可视为超边,因而构建超图
    Figure PCTCN2022106913-appb-100009
    If there is a path between two users u and v, and the number of items passing through the path is less than k, then the two users are k-order reachable neighbors; for user u, if it has k-order reachable neighbor v and item m Interact directly with u, then item m is the k-level reachable item of user u, for each user, its k-level reachable items are regarded as a set, and the items on the set are used as nodes, and the set can be regarded as a hyperedge, so build hypergraph
    Figure PCTCN2022106913-appb-100009
  4. 根据权利要求1所述一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:所述步骤5中用户侧的超图结构
    Figure PCTCN2022106913-appb-100010
    的构建具体为:若两个标签m、n之间存在路径,且路径中经过的用户数小于k,则这两个标签为k阶可到达邻居;对于标签m,若其有k阶可到达邻居n且用户u与m直接交互,则用户u是标签n的k阶可达用户,对于每个标签,其k阶可达用户视为一个集合,集合上用户作为结点,集合即可视为超边,因而构建超图
    Figure PCTCN2022106913-appb-100011
    A label-aware recommendation method based on attention mechanism and hypergraph convolution according to claim 1, characterized in that: the hypergraph structure on the user side in step 5
    Figure PCTCN2022106913-appb-100010
    The specific construction of is: if there is a path between two labels m and n, and the number of users passing through the path is less than k, then the two labels are reachable neighbors of order k; for label m, if there is a reachable neighbor of order k Neighbor n and user u interacts directly with m, then user u is a k-order reachable user of label n. For each label, its k-order reachable users are regarded as a set, and users on the set are regarded as nodes, and the set can be viewed is a hyperedge, thus constructing a hypergraph
    Figure PCTCN2022106913-appb-100011
  5. 根据权利要求4所述一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:所述步骤5中项目侧的超图结构
    Figure PCTCN2022106913-appb-100012
    的构建具体为:若两个标签m、n之间存在路径,且路径中经过的项目数小于k,则这两个标签为k阶可到达邻居;对于标签m,若其有k阶可到达邻居n且项目p与m直接交互,则项目p是标签n的k阶可达项目,对于每个标签,其k阶可达项目视为一个集合,集合上项目作为结点,集合即可视为超边,因而构建超图
    Figure PCTCN2022106913-appb-100013
    A label-aware recommendation method based on attention mechanism and hypergraph convolution according to claim 4, characterized in that: the hypergraph structure on the item side in step 5
    Figure PCTCN2022106913-appb-100012
    The construction of is as follows: if there is a path between two labels m and n, and the number of items passing through the path is less than k, then these two labels are reachable neighbors of order k; for label m, if there is a reachable neighbor of order k Neighbor n and item p interacts directly with m, then item p is a k-order reachable item of label n. For each label, its k-order reachable items are regarded as a set, and the items on the set are used as nodes, and the set can be visualized is a hyperedge, thus constructing a hypergraph
    Figure PCTCN2022106913-appb-100013
  6. 根据权利要求1所述一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:A label-aware recommendation method based on attention mechanism and hypergraph convolution according to claim 1, characterized in that:
    所述步骤6中用户侧超图卷积表示为:In the step 6, the hypergraph convolution on the user side is expressed as:
    Figure PCTCN2022106913-appb-100014
    Figure PCTCN2022106913-appb-100014
    其中:Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点的度矩阵,D E为超边的度矩阵。H d-user为超图
    Figure PCTCN2022106913-appb-100015
    的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合,与
    Figure PCTCN2022106913-appb-100016
    的相乘操作表示用户侧直接交互关系表示超图
    Figure PCTCN2022106913-appb-100017
    上从结点特征到超边特征的聚合,与H d-user的相乘操作表示
    Figure PCTCN2022106913-appb-100018
    上从超边特征到结点特征的聚合;
    Among them: Θ (l) represents the learnable parameter matrix on the l layer, σ is the activation function, D v is the degree matrix of the node, and DE is the degree matrix of the hyperedge. H d-user is a hypergraph
    Figure PCTCN2022106913-appb-100015
    The incidence matrix of , H=(V, E), where V is the set of nodes, E is the set of hyperedges, and
    Figure PCTCN2022106913-appb-100016
    The multiplication operation of represents the user-side direct interaction relation represents the hypergraph
    Figure PCTCN2022106913-appb-100017
    On the aggregation from node features to hyperedge features, the multiplication operation with H d-user represents
    Figure PCTCN2022106913-appb-100018
    aggregation from hyperedge features to node features;
    所述步骤6中项目侧超图卷积表示为:In the step 6, the hypergraph convolution on the project side is expressed as:
    Figure PCTCN2022106913-appb-100019
    Figure PCTCN2022106913-appb-100019
    其中:Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点的度矩阵,D E为超边的度矩阵,H d-item为超图
    Figure PCTCN2022106913-appb-100020
    的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合,与
    Figure PCTCN2022106913-appb-100021
    的相乘操作表示项目侧直接交互关系表示超图
    Figure PCTCN2022106913-appb-100022
    上从结点特征到超边特征的聚合,与H d-item的相乘 操作表示
    Figure PCTCN2022106913-appb-100023
    上从超边特征到结点特征的聚合。
    Among them: Θ (l) represents the learnable parameter matrix on the lth layer, σ is the activation function, D v is the degree matrix of the node, D E is the degree matrix of the hyperedge, and H d-item is the hypergraph
    Figure PCTCN2022106913-appb-100020
    The incidence matrix of , H=(V, E), where V is the set of nodes, E is the set of hyperedges, and
    Figure PCTCN2022106913-appb-100021
    The multiplication operation represents the project-side direct interaction relation represents the hypergraph
    Figure PCTCN2022106913-appb-100022
    The aggregation from the node feature to the hyperedge feature, and the multiplication operation with the H d-item represent
    Figure PCTCN2022106913-appb-100023
    Aggregation from hyperedge features to node features.
  7. 根据权利要求1所述一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:A label-aware recommendation method based on attention mechanism and hypergraph convolution according to claim 1, characterized in that:
    所述步骤7中用户侧超图卷积表示为:In the step 7, the hypergraph convolution on the user side is expressed as:
    Figure PCTCN2022106913-appb-100024
    Figure PCTCN2022106913-appb-100024
    其中:Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点的度矩阵,D E为超边的度矩阵,
    Figure PCTCN2022106913-appb-100025
    为超图
    Figure PCTCN2022106913-appb-100026
    的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合,与
    Figure PCTCN2022106913-appb-100027
    的相乘操作表示用户侧标签表示超图
    Figure PCTCN2022106913-appb-100028
    上从结点特征到超边特征的聚合,与H t-user的相乘操作表示
    Figure PCTCN2022106913-appb-100029
    上从超边特征到结点特征的聚合;
    Among them: Θ (l) represents the learnable parameter matrix on the l-th layer, σ is the activation function, D v is the degree matrix of the node, D E is the degree matrix of the hyperedge,
    Figure PCTCN2022106913-appb-100025
    hypergraph
    Figure PCTCN2022106913-appb-100026
    The incidence matrix of , H=(V, E), where V is the set of nodes, E is the set of hyperedges, and
    Figure PCTCN2022106913-appb-100027
    The multiplication operation of represents the user-side label represents the hypergraph
    Figure PCTCN2022106913-appb-100028
    Aggregation from node features to hyperedge features, multiplication operation with H t-user represents
    Figure PCTCN2022106913-appb-100029
    aggregation from hyperedge features to node features;
    所述步骤7中项目侧超图卷积表示为:The hypergraph convolution on the item side in step 7 is expressed as:
    Figure PCTCN2022106913-appb-100030
    Figure PCTCN2022106913-appb-100030
    其中:Θ (l)表示第l层上的可学习的参数矩阵,σ为激活函数,D v为结点的度矩阵,D E为超边的度矩阵。H t-item为超图
    Figure PCTCN2022106913-appb-100031
    的关联矩阵,H=(V,E),其中V为结点集合,E为超边集合,与
    Figure PCTCN2022106913-appb-100032
    的相乘操作表示项目侧标签表示超图
    Figure PCTCN2022106913-appb-100033
    上从结点特征到超边特征的聚合,与H t-item的相乘操作表示
    Figure PCTCN2022106913-appb-100034
    上从超边特征到结点特征的聚合。
    Among them: Θ (l) represents the learnable parameter matrix on the l layer, σ is the activation function, D v is the degree matrix of the node, and DE is the degree matrix of the hyperedge. H t-item is a hypergraph
    Figure PCTCN2022106913-appb-100031
    The incidence matrix of , H=(V, E), where V is the set of nodes, E is the set of hyperedges, and
    Figure PCTCN2022106913-appb-100032
    The multiplication operation represents the item side label represents the hypergraph
    Figure PCTCN2022106913-appb-100033
    On the aggregation from node features to hyperedge features, the multiplication operation with H t-item represents
    Figure PCTCN2022106913-appb-100034
    Aggregation from hyperedge features to node features.
  8. 根据权利要求1所述一种基于注意力机制与超图卷积的标签感知推荐方法,其特征在于:所述步骤8中用户侧和项目侧特征处理具体为:A label-aware recommendation method based on attention mechanism and hypergraph convolution according to claim 1, characterized in that: the user-side and item-side feature processing in the step 8 is specifically:
    根据步骤2、步骤6、步骤7得到的特征u tag、i tag、u 1、i 1、u 2、i 2,使用注意力机制得到用户侧和项目侧的最终特征表示u *、i *,其中: According to the features u tag , i tag , u 1 , i 1 , u 2 , i 2 obtained in steps 2, 6, and 7, use the attention mechanism to obtain the final feature representations u * , i * on the user side and the item side, in:
    将u tag与u 1、u 2分别进行拼接操作得到u 1-tag和u 2-tag,再使用注意力机制将两种特征表示处理成一种特征表示: Concatenate u tag with u 1 and u 2 respectively to obtain u 1-tag and u 2-tag , and then use the attention mechanism to process the two feature representations into one feature representation:
    a(u,k)=W Ttanh(Wu k-tag+b 2) a(u,k)=W T tanh(Wu k-tag +b 2 )
    Figure PCTCN2022106913-appb-100035
    Figure PCTCN2022106913-appb-100035
    根据注意力机制得到权重获得用户侧最终特征表示u *Obtain the weight according to the attention mechanism to obtain the final feature representation u * on the user side:
    u *=α(u,1)u 1-tag+α(u,2)u 2-tagu * = α(u, 1)u 1-tag + α(u, 2)u 2-tag ;
    将i tag与i 1、i 2分别进行拼接操作得到i 1-tag和i 2-tag,再使用注意力机制将两种特征表示处理成一种特征表示: Splicing i tag with i 1 and i 2 respectively to obtain i 1-tag and i 2-tag , and then use the attention mechanism to process the two feature representations into one feature representation:
    a(i,k)=W Ttanh(Wi k-tag+b 2) a(i,k)=W T tanh(Wi k-tag +b 2 )
    Figure PCTCN2022106913-appb-100036
    Figure PCTCN2022106913-appb-100036
    根据注意力机制得到权重获得项目侧最终特征表示i *Get the weight according to the attention mechanism to obtain the final feature representation i * on the item side:
    i *=α(i,1)i 1-tag+α(i,2)u 2-tagi * = α(i,1)i 1-tag + α(i,2)u 2-tag .
PCT/CN2022/106913 2021-12-02 2022-07-21 Tag-aware recommendation method based on attention mechanism and hypergraph convolution WO2023098098A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111461962.3A CN114117142A (en) 2021-12-02 2021-12-02 Label perception recommendation method based on attention mechanism and hypergraph convolution
CN202111461962.3 2021-12-02

Publications (1)

Publication Number Publication Date
WO2023098098A1 true WO2023098098A1 (en) 2023-06-08

Family

ID=80365627

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/106913 WO2023098098A1 (en) 2021-12-02 2022-07-21 Tag-aware recommendation method based on attention mechanism and hypergraph convolution

Country Status (2)

Country Link
CN (1) CN114117142A (en)
WO (1) WO2023098098A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116845889A (en) * 2023-09-01 2023-10-03 东海实验室 Hierarchical hypergraph neural network-based power load prediction method
CN116894097A (en) * 2023-09-04 2023-10-17 中南大学 Knowledge graph label prediction method based on hypergraph modeling
CN117034778A (en) * 2023-08-28 2023-11-10 黑龙江省网络空间研究中心(黑龙江省信息安全测评中心、黑龙江省国防科学技术研究院) Method for inverting aboveground biomass based on hypershaper-transducer structure
CN117788122A (en) * 2024-02-23 2024-03-29 山东科技大学 Goods recommendation method based on heterogeneous graph neural network
CN118013131A (en) * 2024-04-10 2024-05-10 南昌大学 Method, device and medium for recommending graph neural network data based on grading enhancement
CN118210916A (en) * 2024-05-20 2024-06-18 华东交通大学 Scientific literature recommendation method based on hypergraph attention and enhanced contrast learning
CN118485502A (en) * 2024-07-16 2024-08-13 北京茄豆网络科技有限公司 Method, device, equipment and storage medium for generating personalized custom commodity label

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114117142A (en) * 2021-12-02 2022-03-01 南京邮电大学 Label perception recommendation method based on attention mechanism and hypergraph convolution
CN114817663B (en) * 2022-05-05 2023-02-17 杭州电子科技大学 Service modeling and recommendation method based on class perception graph neural network

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210004402A1 (en) * 2018-08-14 2021-01-07 Tencent Technology (Shenzhen) Company Limited Method for making music recommendations and related computing device, and medium thereof
CN112487200A (en) * 2020-11-25 2021-03-12 吉林大学 Improved deep recommendation method containing multi-side information and multi-task learning
CN113434778A (en) * 2021-07-20 2021-09-24 陕西师范大学 Recommendation method based on regularization framework and attention mechanism
CN113672811A (en) * 2021-08-24 2021-11-19 广东工业大学 Hypergraph convolution collaborative filtering recommendation method and system based on topology information embedding and computer readable storage medium
CN114117142A (en) * 2021-12-02 2022-03-01 南京邮电大学 Label perception recommendation method based on attention mechanism and hypergraph convolution

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11651273B2 (en) * 2018-04-19 2023-05-16 National University Of Singapore Machine learning using partial order hypergraphs
CN112417219B (en) * 2020-11-16 2022-07-01 吉林大学 Hyper-graph convolution-based hyper-edge link prediction method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210004402A1 (en) * 2018-08-14 2021-01-07 Tencent Technology (Shenzhen) Company Limited Method for making music recommendations and related computing device, and medium thereof
CN112487200A (en) * 2020-11-25 2021-03-12 吉林大学 Improved deep recommendation method containing multi-side information and multi-task learning
CN113434778A (en) * 2021-07-20 2021-09-24 陕西师范大学 Recommendation method based on regularization framework and attention mechanism
CN113672811A (en) * 2021-08-24 2021-11-19 广东工业大学 Hypergraph convolution collaborative filtering recommendation method and system based on topology information embedding and computer readable storage medium
CN114117142A (en) * 2021-12-02 2022-03-01 南京邮电大学 Label perception recommendation method based on attention mechanism and hypergraph convolution

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117034778A (en) * 2023-08-28 2023-11-10 黑龙江省网络空间研究中心(黑龙江省信息安全测评中心、黑龙江省国防科学技术研究院) Method for inverting aboveground biomass based on hypershaper-transducer structure
CN116845889A (en) * 2023-09-01 2023-10-03 东海实验室 Hierarchical hypergraph neural network-based power load prediction method
CN116845889B (en) * 2023-09-01 2023-12-22 东海实验室 Hierarchical hypergraph neural network-based power load prediction method
CN116894097A (en) * 2023-09-04 2023-10-17 中南大学 Knowledge graph label prediction method based on hypergraph modeling
CN116894097B (en) * 2023-09-04 2023-12-22 中南大学 Knowledge graph label prediction method based on hypergraph modeling
CN117788122A (en) * 2024-02-23 2024-03-29 山东科技大学 Goods recommendation method based on heterogeneous graph neural network
CN117788122B (en) * 2024-02-23 2024-05-10 山东科技大学 Goods recommendation method based on heterogeneous graph neural network
CN118013131A (en) * 2024-04-10 2024-05-10 南昌大学 Method, device and medium for recommending graph neural network data based on grading enhancement
CN118013131B (en) * 2024-04-10 2024-06-11 南昌大学 Method, device and medium for recommending graph neural network data based on grading enhancement
CN118210916A (en) * 2024-05-20 2024-06-18 华东交通大学 Scientific literature recommendation method based on hypergraph attention and enhanced contrast learning
CN118485502A (en) * 2024-07-16 2024-08-13 北京茄豆网络科技有限公司 Method, device, equipment and storage medium for generating personalized custom commodity label

Also Published As

Publication number Publication date
CN114117142A (en) 2022-03-01

Similar Documents

Publication Publication Date Title
WO2023098098A1 (en) Tag-aware recommendation method based on attention mechanism and hypergraph convolution
WO2022161380A1 (en) Model training method and apparatus, and image retrieval method and apparatus
Zhang et al. Attributed network embedding via subspace discovery
Li et al. Image sentiment prediction based on textual descriptions with adjective noun pairs
CN113378048A (en) Personalized recommendation method based on multi-view knowledge graph attention network
US20230049817A1 (en) Performance-adaptive sampling strategy towards fast and accurate graph neural networks
Wang et al. Modelling local and global dependencies for next-item recommendations
Chen et al. Generative dynamic link prediction
CN115760279A (en) Knowledge graph and multi-head attention-based dual-target cross-domain recommendation method and system
Amara et al. Cross-network representation learning for anchor users on multiplex heterogeneous social network
CN115329215A (en) Recommendation method and system based on self-adaptive dynamic knowledge graph in heterogeneous network
CN114742071B (en) Cross-language ideas object recognition analysis method based on graph neural network
Yan et al. Soft-self and hard-cross graph attention network for knowledge graph entity alignment
Wu et al. Heterogeneous representation learning and matching for few-shot relation prediction
Zeng et al. Collaborative filtering via heterogeneous neural networks
CN114579852A (en) Recommendation method based on graph convolution network and attention mechanism
CN114048754A (en) Chinese short text classification method integrating context information graph convolution
Zhao et al. A novel embedding learning framework for relation completion and recommendation based on graph neural network and multi-task learning
Zhang et al. Bilinear graph neural network-enhanced Web services classification
Ren et al. Improving fraud detection via imbalanced graph structure learning
Meng et al. Regional bullying text recognition based on two-branch parallel neural networks
Le et al. Enhancing Anchor Link Prediction in Information Networks through Integrated Embedding Techniques
Jian et al. An improved memory networks based product model classification method
Chen et al. Hierarchical mixed neural network for joint representation learning of social-attribute network
Chen et al. Pne: Label embedding enhanced network embedding

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22899918

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE