CN116186385A

CN116186385A - Service recommendation method based on decoupling characterization learning and graph neural network

Info

Publication number: CN116186385A
Application number: CN202211634971.2A
Authority: CN
Inventors: 王东京; 万峰; 俞东进; 张新; 邓水光
Original assignee: Hangzhou Dianzi University
Current assignee: Hangzhou Dianzi University
Priority date: 2022-12-19
Filing date: 2022-12-19
Publication date: 2023-05-30

Abstract

The invention discloses a service recommendation method based on decoupling characterization learning and a graphic neural network, which takes a user interaction log recorded by online application as a data source to acquire a complete historical interaction sequence and a corresponding category sequence of a user, firstly, respectively capturing long-term stable interests and short-term dynamic interests of the user in a personalized degree, and then capturing popular interests of the user by utilizing service category information. The method aims at the problems that the implicit decoupling-based method relies on the number of potential interests set manually, is difficult to adapt to different application scenes, and does not consider different online application service class factors, long-term interests, short-term interests and popular interests of users are learned based on complete sequences, nearest sequences and class sequences respectively, the defect that the number of potential interests is set manually is avoided, the influence of different online application class factors is considered, the popular interests of the users are captured as the basis of the interpretation of recommendation results, and application providers are helped to adjust recommendation strategies in a targeted manner.

Description

A service recommendation method based on decoupled representation learning and graph neural network

技术领域Technical Field

本发明属于推荐系统技术领域，具体涉及一种基于解耦表征学习与图神经网络的服务推荐方法。The present invention belongs to the technical field of recommendation systems, and specifically relates to a service recommendation method based on decoupled representation learning and graph neural networks.

背景技术Background Art

近年来，随着互联网技术的快速普及，人们日常生活中对于在线应用服务(如在线音乐、视频流媒体、电子购物等)的使用需求日益增加。推荐系统利用用户的历史行为记录建模用户画像，帮助在线应用提供商有针对性的为用户推荐可能感兴趣的服务，具有很大的潜在经济价值。例如网易云音乐会根据用户的听歌记录生成用户画像，进而为用户推荐潜在的具有会员标识的歌曲来吸引用户升级会员；天猫购物平台会根据用户的购买、搜索等行为推荐相似的商品来帮助用户选购商品，从而提升购买转化率。In recent years, with the rapid popularization of Internet technology, people's daily life has an increasing demand for online application services (such as online music, video streaming, e-shopping, etc.). Recommendation systems use users' historical behavior records to model user portraits, helping online application providers to recommend services that users may be interested in in a targeted manner, which has great potential economic value. For example, NetEase Cloud Music will generate user portraits based on users' listening records, and then recommend potential songs with member logos to users to attract users to upgrade their membership; Tmall shopping platform will recommend similar products based on users' purchase, search and other behaviors to help users choose products, thereby improving purchase conversion rates.

随着深度学习技术的研究日益深入，目前很多基于用户历史行为序列建模用户偏好的关注点是用户与交互服务之间的关联关系，例如是通过序列模式来捕获用户长短期兴趣、通过建立图结构来捕获用户与服务之间的二阶与高阶联系，学习用户的整体兴趣特征。但是用户的行为是多种因素作用的结果，在个性化程度上存在长期的稳定兴趣和短期的随机动态兴趣，在服务受欢迎程度上存在从众的流行度兴趣，不同的兴趣会对用户的未来行为产生不同程度的影响。With the deepening of research on deep learning technology, many user preference models based on user historical behavior sequences currently focus on the relationship between users and interactive services, such as capturing users' long-term and short-term interests through sequence patterns, capturing the second-order and high-order connections between users and services through the establishment of graph structures, and learning users' overall interest characteristics. However, user behavior is the result of multiple factors. In terms of personalization, there are long-term stable interests and short-term random dynamic interests. In terms of service popularity, there are popular interests that conform to the crowd. Different interests will have different degrees of influence on users' future behavior.

为了通过用户的历史行为记录捕获用户在不同方面的潜在兴趣，许多方法采用隐式解耦的方式，首先设定潜在兴趣个数作为解耦表征学习的前提，然后利用图神经网络等特征学习方法学习不同方面的潜在兴趣，最后得到用户的整体偏好。现有方法的不足之处在于，没有从整体上分析潜在兴趣的主要分布情况，难以根据实际应用推荐场景调整潜在兴趣个数，并且忽略了不通在线服务的独有特点(如类别因素)，影响在线应用提供商的经济收益。In order to capture users' potential interests in different aspects through their historical behavior records, many methods adopt implicit decoupling. First, the number of potential interests is set as the premise of decoupling representation learning, and then feature learning methods such as graph neural networks are used to learn different aspects of potential interests, and finally the overall preference of users is obtained. The shortcomings of existing methods are that they do not analyze the main distribution of potential interests as a whole, making it difficult to adjust the number of potential interests according to actual application recommendation scenarios, and ignoring the unique characteristics of different online services (such as category factors), which affects the economic benefits of online application providers.

发明内容Summary of the invention

本发明针对基于隐式解耦表征学习的方法依赖于人工设置的潜在兴趣个数，难以适应不同服务场景，且没有考虑不同在线应用类别因素的问题，提出了一种基于解耦表征学习与图神经网络的服务推荐方法。首先在个性化程度上分别捕获用户长期的稳定兴趣和短期的动态兴趣，然后利用服务类别信息捕获用户的从众兴趣。Aiming at the problem that the implicit decoupled representation learning-based method relies on the number of potential interests set manually, is difficult to adapt to different service scenarios, and does not consider the factors of different online application categories, this paper proposes a service recommendation method based on decoupled representation learning and graph neural network. First, the user's long-term stable interests and short-term dynamic interests are captured in terms of personalization, and then the user's herd interests are captured using service category information.

本发明的具体步骤是：The specific steps of the present invention are:

步骤(1).以在线应用记录的用户交互日志为数据源，收集用户的完整历史交互序列以及对应类别序列。所述的完整历史交互序列是指用户已经产生的对于服务的交互记录(如点击、收藏、评分等行为)，所述的对应类别序列指每个服务对应的类别信息构建的序列。定义I为所有服务的集合，服务个数为n；定义C为所有服务对应类别的集合，类别个数为k；定义用户u的历史交互序列为T＝t₁,t₂,…,t_l-1,t_l,t_l+1，其中t_i表示第i次交互的记录，包含服务与对应的类别的二元组，即t_i＝(I_i,C_i)，l表示序列长度，t_l+1表示待预测的下一次交互。Step (1). Using the user interaction log recorded by the online application as the data source, collect the user's complete historical interaction sequence and the corresponding category sequence. The complete historical interaction sequence refers to the interaction record that the user has generated for the service (such as clicks, favorites, ratings, etc.), and the corresponding category sequence refers to the sequence constructed by the category information corresponding to each service. Define I as the set of all services, the number of services is n; define C as the set of categories corresponding to all services, the number of categories is k; define the historical interaction sequence of user u as T = t ₁ , t ₂ , ..., t _l-1 , t _l , t _l+1 , where _ti represents the record of the i-th interaction, including a binary tuple of the service and the corresponding category, that is, _ti = (I _i , C _i ), l represents the sequence length, and t _l+1 represents the next interaction to be predicted.

步骤(2).将服务集合I与类别集合C输入嵌入层得到每个服务以及对应类别的嵌入特征表示。所述的嵌入层是指通过one-hot编码器和多层感知机得到多维度的特征向量来表示每个服务或者类别，定义嵌入层得到的嵌入特征表示为两个特征矩阵：X_i∈R^n×d为服务特征矩阵，X_c∈R^k×d为类别特征矩阵，其中n表示服务个数，k表示类别个数，d表示特征维度。Step (2). Input the service set I and the category set C into the embedding layer to obtain the embedded feature representation of each service and the corresponding category. The embedding layer refers to a multi-dimensional feature vector obtained by a one-hot encoder and a multi-layer perceptron to represent each service or category. The embedded feature representation obtained by the embedding layer is defined as two feature matrices: _Xi∈Rn ^×d is the service feature matrix, and _Xc∈Rk ^×d is the category feature matrix, where n represents the number of services, k represents the number of categories, and d represents the feature dimension.

步骤(3).为了引入类别因素对用户兴趣进行解耦学习，设计了基于用户交互序列的长期兴趣和短期兴趣建模网络，以及基于类别的用户从众兴趣建模模块，包括以下三个步骤：Step (3). In order to introduce category factors to decouple user interests, a long-term interest and short-term interest modeling network based on user interaction sequences and a category-based user herd interest modeling module are designed, which includes the following three steps:

步骤(3.1).首先基于完整历史交互序列中的服务序列构建超图，经过超图卷积层后捕获用户的长期兴趣。Step (3.1). First, a hypergraph is constructed based on the service sequence in the complete historical interaction sequence, and the user's long-term interests are captured after the hypergraph convolution layer.

1)定义服务超图为GI∈(V_i,E_i)，V_i和E_i分别为顶点集和超边集，超图邻接矩阵H_i∈{0,1}^m×n，以一个用户的历史行为序列中的服务序列建立超边(超边是指能够连接多个顶点的特殊边)，m个用户会建立m条超边，服务序列中包含的服务对应的超图邻接矩阵H_i位置会设置为1，否则为0。例如第一个用户与第一个服务产生交互记录，则H_i(0,0)＝1。1) Define the service hypergraph as GI∈(V _i ,E _i ), where V _i and E _i are vertex sets and hyperedge sets respectively, and the hypergraph adjacency matrix H _i ∈{0,1} ^m×n . Hyperedges are established with the service sequence in a user's historical behavior sequence (hyperedges are special edges that can connect multiple vertices). m users will establish m hyperedges, and the hypergraph adjacency matrix H _i corresponding to the service included in the service sequence will be set to 1, otherwise it will be 0. For example, if the first user has an interaction record with the first service, then H _i (0,0)＝1.

2)定义超图的顶点度矩阵D_iv∈R^n×n和超边度矩阵D_ie∈R^m×m，他们分别是关于顶点度和超边度的对角矩阵，其中顶点v∈V_i的度

超边e∈E_i的度

2) Define the vertex degree matrix _Div∈Rn ^×n and the hyperedge degree matrix _Die∈Rm ^×m of the hypergraph, which are diagonal matrices of vertex degree and hyperedge degree respectively, where the degree of vertex _v∈Vi

The degree of hyperedge e∈E _i

3)定义超图卷积神经网络用于特征学习与信息传播，相邻网络层之间的计算方式为

其中

表示服务序列对应的服务特征表示，W_i∈R^m×m表示权重矩阵，σ(·)表示非线性激活函数。然后使用平均池化策略得到服务的特征表示：3) Define a hypergraph convolutional neural network for feature learning and information propagation. The calculation method between adjacent network layers is

in

represents the service feature representation corresponding to the service sequence, W _i ∈ R ^m×m represents the weight matrix, and σ(·) represents the nonlinear activation function. Then the average pooling strategy is used to obtain the feature representation of the service:

其中L表示卷积网络层数，最后使用平均策略聚合序列中的特征信息得到用户长期兴趣h_lo：Where L represents the number of convolutional network layers, and finally the feature information in the sequence is aggregated using the average strategy to obtain the user's long-term interest h _lo :

其中l表示用户历史交互序列长度。Where l represents the length of the user's historical interaction sequence.

步骤(3.2).然后基于最近历史交互序列中的服务序列捕获用户的短期兴趣，具体计算过程如下。Step (3.2). Then capture the user's short-term interest based on the service sequence in the recent historical interaction sequence. The specific calculation process is as follows.

1)针对用户完整交互序列T＝t₁,t₂,…,t_l-1,t_l，选择最近的交互子序列T^s＝t_l-a,…,t_l-1,t_l，a表示选择交互记录的个数，可以根据不同应用场景进行设置。然后将子序列对应的服务特征序列通过双向LSTM编码层，用于结合时间信息，捕获用户的短期兴趣表示

其中

表示初始服务特征表示。m个用户短期兴趣表示的特征矩阵为H，f表特征聚合方法。1) For the user's complete interaction sequence T = t ₁ , t ₂ , ..., t _l-1 , t _l , select the most recent interaction subsequence T ^s = t _la , ..., t _l-1 , t _l , where a represents the number of selected interaction records, which can be set according to different application scenarios. Then the service feature sequence corresponding to the subsequence is passed through a bidirectional LSTM encoding layer to combine time information and capture the user's short-term interest representation.

in

represents the initial service feature representation. The feature matrix of m users’ short-term interest representation is H, and f represents the feature aggregation method.

2)以用户为顶点建立用户图GU，顶点之间产生边连接的条件是用户之间存在相同最近交互服务，利用图表征学习捕获不同用户之间的潜在联系，具体传播方式如下：2) A user graph GU is established with users as vertices. The condition for edge connection between vertices is that there are the same recent interactive services between users. Graph representation learning is used to capture the potential connections between different users. The specific propagation method is as follows:

H^(l+1)＝σ(D^-1AH^l)H ^(l+1) =σ(D ^-1 AH ^l )

其中D和A分别是用户图GU的度矩阵和单位矩阵,σ(·)是激活函数。同样使用平均池化策略得到用户短期兴趣表示h_sh∈H：Where D and A are the degree matrix and identity matrix of the user graph GU, respectively, and σ(·) is the activation function. The average pooling strategy is also used to obtain the user's short-term interest representation h _sh ∈H:

步骤(3.3).为了适应不同应用的数据特点，本方法引入服务类别信息作为数据基础来捕获用户针对类别流行度所产生的从众兴趣。与步骤(3.1)相似，以完整历史交互序列中的类别序列构建类别超图GC∈(V_c,E_c)，V_c和E_c分别为顶点集和超边集，超图邻接矩阵H_c∈{0,1}^m×k。定义类别超图卷积计算：

其中

表示类别序列对应的类别特征表示，D_cv∈R^k×k和D_ce∈R^m×m分别表示类别超图的顶点度矩阵和超边度矩阵，W_c∈R^m×m表示权重矩阵，σ(·)表示非线性激活函数，通过平均池化层后得到用户的从众兴趣h_co。Step (3.3). In order to adapt to the data characteristics of different applications, this method introduces service category information as the data basis to capture the user's herd interest in category popularity. Similar to step (3.1), the category sequence in the complete historical interaction sequence is used to construct a category hypergraph GC∈(V _c ,E _c ), where V _c and E _c are the vertex set and hyperedge set respectively, and the hypergraph adjacency matrix H _c ∈{0,1} ^m×k . Define the category hypergraph convolution calculation:

in

represents the category feature representation corresponding to the category sequence, D _cv ∈R ^k×k and D _ce ∈R ^m×m represent the vertex degree matrix and hyperedge degree matrix of the category hypergraph respectively, W _c ∈R ^m×m represents the weight matrix, σ(·) represents the nonlinear activation function, and the user’s herd interest h _co is obtained after the average pooling layer.

步骤(4).在步骤(3)的基础上，分别将学习的用户长期兴趣h_lo、短期兴趣h_sh和从众兴趣h_co通过多兴趣聚合层进行特征融合，得到最终的用户兴趣表示h，融合方式如下：Step (4). Based on step (3), the learned user long-term interest h _lo , short-term interest h _sh and herd interest h _co are respectively fused through the multi-interest aggregation layer to obtain the final user interest representation h. The fusion method is as follows:

h＝W[h_lo||h_sh||h_co]+bh＝W[h _lo ||h _sh ||h _co ]+b

其中W和b分别表示权重矩阵和偏置项，[·||·||·]表示特征向量在维度上的拼接。Where W and b represent the weight matrix and bias term respectively, and [·||·||·] represents the concatenation of feature vectors in dimension.

步骤(5).基于步骤(4)中学习得到的用户兴趣h，执行服务推荐任务。针对备选服务集合I，每个服务的推荐评分z由服务特征向量x∈X_i和用户兴趣特征向量h的内积运算得到，即z_i＝x^Th，使用softmax函数将z进行归一化得到最终的服务评分

然后选择评分最高的top-N个服务作为推荐列表中的候选服务。推荐任务学习的目标函数表示为交叉熵损失：Step (5). Based on the user interest h learned in step (4), perform the service recommendation task. For the candidate service set I, the recommendation score z of each service is obtained by the inner product operation of the service feature vector x∈X _i and the user interest feature vector h, that is, z _i = x ^T h. The softmax function is used to normalize z to obtain the final service score

Then the top-N services with the highest scores are selected as candidate services in the recommendation list. The objective function of recommendation task learning is expressed as cross entropy loss:

其中

是指目标服务在给定交互记录序列T情况下会产生交互的概率。in

It refers to the probability that the target service will interact with a given interaction record sequence T.

本发明相对于现有技术而言，具有以下有益效果：利用本方法对用户兴趣的解耦学习，可以分别学习用户的长期兴趣、短期兴趣和从众兴趣，最大程度避免人工设置潜在兴趣个数的弊端，并且考虑不同在线应用类别因素的影响，捕获用户的从众兴趣作为推荐结果可解释性的基础，帮助应用提供商有针对性的调整推荐策略。Compared with the prior art, the present invention has the following beneficial effects: by utilizing the decoupled learning of user interests, the user's long-term interests, short-term interests and herd interests can be learned separately, avoiding the disadvantages of manually setting the number of potential interests to the greatest extent, and considering the influence of different online application category factors, capturing the user's herd interests as the basis for the interpretability of recommendation results, helping application providers to adjust recommendation strategies in a targeted manner.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本发明实施例一种基于解耦表征学习与图神经网络的服务推荐方法框架图。FIG1 is a framework diagram of a service recommendation method based on decoupled representation learning and graph neural network according to an embodiment of the present invention.

图2为本发明实施例中基于最近序列的特征嵌入模块图。FIG. 2 is a diagram of a feature embedding module based on a recent sequence in an embodiment of the present invention.

具体实施方式DETAILED DESCRIPTION

需要说明的是，在不冲突的情况下，本发明中的实施例及实施例中的特征可以相互组合。It should be noted that, in the absence of conflict, the embodiments of the present invention and the features in the embodiments may be combined with each other.

下面结合附图对本发明技术方案进行进一步的解释和说明。The technical solution of the present invention is further explained and illustrated below in conjunction with the accompanying drawings.

本实施例提供了一种基于解耦表征学习与图神经网络的服务推荐方法，方法的整体执行框架如图1所示。This embodiment provides a service recommendation method based on decoupled representation learning and graph neural network. The overall execution framework of the method is shown in Figure 1.

步骤(2).将服务集合I与类别集合C输入嵌入层得到每个服务以及对应类别的嵌入特征表示。所述的嵌入层是指通过one-hot编码器和多层感知机得到多维度的特征向量来表示每个服务或者类别，定义嵌入层得到的嵌入特征表示为两个特征矩阵：X_i∈R^n×d为服务特征矩阵，X_c∈R^k×d为类别特征矩阵，其中n和k分别表示服务和类别个数，d表示特征维度。Step (2). Input the service set I and the category set C into the embedding layer to obtain the embedded feature representation of each service and the corresponding category. The embedding layer refers to a multi-dimensional feature vector obtained by a one-hot encoder and a multi-layer perceptron to represent each service or category. The embedded feature representation obtained by the embedding layer is defined as two feature matrices: _Xi∈Rn ^×d is the service feature matrix, and _Xc∈Rk ^×d is the category feature matrix, where n and k represent the number of services and categories respectively, and d represents the feature dimension.

步骤(3.1).首先基于完整历史交互序列中的服务序列T_i＝I₁,I₂,…,I_l-1,I_l构建超图，经过超图卷积层后捕获用户的长期兴趣表示为h_lo。Step (3.1). First, a hypergraph is constructed based on the service sequence T _i =I ₁ ,I ₂ ,…,I _l-1 ,I _l in the complete historical interaction sequence. After the hypergraph convolution layer, the long-term interests of users are captured and represented as h _lo .

1)定义服务超图为GI∈(V_i,E_i)，V_i和E_i分别为顶点集和超边集，超图邻接矩阵H_i∈{0,1}^m×n，以一个用户的历史行为序列中的服务序列建立超边(超边是指能够连接多个顶点的特殊边)，m个用户会建立m条超边，服务序列中包含的服务对应的超图邻接矩阵Hi位置会设置为1，否则为0。若第一个用户与第一个服务产生交互记录，则H_i(0,0)＝1。例如有两个用户u₁和u₂的历史交互记录中的服务序列分别为{I₁,I₂,I₂,I₃}和{I₂,I₄,I₂,I₅}，则对应的服务超图邻接矩阵为：1) Define the service hypergraph as GI∈(V _i ,E _i ), where V _i and E _i are vertex sets and hyperedge sets respectively, and the hypergraph adjacency matrix H _i ∈{0,1} ^m×n . Hyperedges are established with the service sequence in a user's historical behavior sequence (hyperedges refer to special edges that can connect multiple vertices). m users will establish m hyperedges, and the hypergraph adjacency matrix Hi corresponding to the service included in the service sequence will be set to 1, otherwise it will be 0. If the first user generates an interaction record with the first service, then H _i (0,0)＝1. For example, if the service sequences in the historical interaction records of two users u ₁ and u ₂ are {I ₁ ,I ₂ ,I ₂ ,I ₃ } and {I ₂ ,I ₄ ,I ₂ ,I ₅ } respectively, then the corresponding service hypergraph adjacency matrix is:

超边e∈E_i的度

The degree of hyperedge e∈E _i

其中

in

其中L表示卷积网络层数，

表示第l层的服务特征矩阵，最后使用平均策略聚合序列中的特征信息得到用户长期兴趣h_lo：Where L represents the number of convolutional network layers,

represents the service feature matrix of the lth layer, and finally uses the average strategy to aggregate the feature information in the sequence to obtain the user's long-term interest h _lo :

其中l表示用户历史交互序列长度,X_i,t表示服务序列中第t个服务的特征表示。Where l represents the length of the user's historical interaction sequence, and Xi _,t represents the feature representation of the tth service in the service sequence.

步骤(3.2).然后基于最近历史交互序列中的服务序列捕获用户的短期兴趣，会话嵌入聚合模块如图2所示，具体计算过程如下。Step (3.2). Then, based on the service sequence in the recent historical interaction sequence, the user's short-term interest is captured. The session embedding aggregation module is shown in Figure 2. The specific calculation process is as follows.

其中

表示初始服务特征表示。m个用户短期兴趣表示的特征矩阵为H∈R^m×d，f表示双向LSTM特征聚合方法。1) For the user's complete interaction sequence T = t ₁ , t ₂ , ..., t _l-1 , t _l , select the most recent interaction subsequence T ^s = t _la , ..., t _l-1 , t _l , where a represents the number of selected interaction records, which can be set according to different application scenarios. Then the service feature sequence corresponding to the subsequence is passed through a bidirectional LSTM encoding layer to combine time information and capture the user's short-term interest representation.

in

represents the initial service feature representation. The feature matrix of m users’ short-term interest representation is H∈R ^m×d , and f represents the bidirectional LSTM feature aggregation method.

2)以用户为顶点建立用户图GU，顶点之间产生边连接的条件是用户之间存在相同最近交互服务，例如用户u₁和u₂的最近服务序列分别为{I₂,I₃}和{I₂,I₅}，存在相同服务I₂，因此用户u₁和u₂会产生边连接。利用图表征学习捕获不同用户之间的潜在联系，具体传播方式如下：2) Build a user graph GU with users as vertices. The condition for edge connections between vertices is that there are the same recent interactive services between users. For example, the recent service sequences of users u ₁ and u ₂ are {I ₂ ,I ₃ } and {I ₂ ,I ₅ } respectively. There is the same service I ₂ , so users u ₁ and u ₂ will have edge connections. Graph representation learning is used to capture the potential connections between different users. The specific propagation method is as follows:

H^(l+1)＝σ(D^-1AH^l)H ^(l+1) =σ(D ^-1 AH ^l )

其中D∈R^m×m和A∈R^m×m分别是用户图GU的度矩阵和单位矩阵,σ(·)是激活函数。同样使用平均池化策略得到用户短期兴趣表示h_sh∈H：Where D∈R ^m×m and A∈R ^m×m are the degree matrix and identity matrix of the user graph GU respectively, and σ(·) is the activation function. The average pooling strategy is also used to obtain the user short-term interest representation h _sh ∈H:

其中L表示用户图卷积神经网络的层数，

表示第l层的用户短期兴趣特征表示。Where L represents the number of layers of the user graph convolutional neural network,

Represents the user's short-term interest feature representation at the lth layer.

步骤(3.3).为了适应不同应用的数据特点，本方法引入服务类别信息作为数据基础来捕获用户针对类别流行度所产生的从众兴趣h_co。与步骤(3.1)相似，以完整历史交互序列中的类别序列T_c＝C₁,C₂,…,C_l-1,C_l构建类别超图GC∈(V_c,E_c)，V_c和E_c分别为顶点集和超边集，超图邻接矩阵H_c∈{0,1}^m×k。例如有两个用户u₁和u₂的历史交互记录中的类别序列分别为{C₁,C₁,C₁,C₂}和{C₂,C₁,C₂,C₃}，则对应的类别超图邻接矩阵为：Step (3.3). In order to adapt to the data characteristics of different applications, this method introduces service category information as the data basis to capture the user's herd interest h _co generated by category popularity. Similar to step (3.1), the category sequence T _c =C ₁ ,C ₂ ,…,C _l-1 ,C _l in the complete historical interaction sequence is used to construct a category hypergraph GC∈(V _c ,E _c ), where V _c and E _c are vertex sets and hyperedge sets respectively, and the hypergraph adjacency matrix H _c ∈{0,1} ^m×k . For example, there are two users u ₁ and u ₂ whose category sequences in their historical interaction records are {C ₁ ,C ₁ ,C ₁ ,C ₂ } and {C ₂ ,C ₁ ,C ₂ ,C ₃ } respectively, then the corresponding category hypergraph adjacency matrix is:

定义类别超图卷积计算：

其中

表示类别序列对应的类别特征表示，D_cv∈R^k×k和D_ce∈R^m×m分别表示类别超图的顶点度矩阵和超边度矩阵，W_c∈R^m×m表示权重矩阵，σ(·)表示非线性激活函数，通过平均池化层后得到用户的从众兴趣h_co。Define category hypergraph convolution calculation:

in

h＝W[h_lo||h_sh||h_co]+bh＝W[h _lo ||h _sh ||h _co ]+b

其中W∈R^d×3d和b∈R^d×1分别表示权重矩阵和偏置项，[·||·||·]表示特征向量在维度上的拼接。where W∈Rd ^×3d and b∈Rd ^×1 represent the weight matrix and bias term respectively, and [·||·||·] represents the concatenation of feature vectors in dimension.

步骤(5).基于步骤(4)中学习得到的用户兴趣h，执行服务推荐任务。针对备选服务集合I，每个服务的推荐评分z由服务特征向量x∈X_i和用户兴趣特征向量h的内积运算得到，即z_i＝x^Th，使用softmax函数将z进行归一化得到最终的服务分数

其中

是指目标服务在给定交互记录序列T情况下会产生交互的概率。在结果的可解释性方面，如用户u₁的服务序列和类别序列分别为{I₁,I₂,I₃}和{C₁,C₁,C₂}，若用户u₁实际上下一个交互的物品是I₄(类别是C₁)，仅根据服务历史无法对结果进行解释，但是从类别数据上可以看出用户u₁频繁交互类别为C₁的服务，则可以侧重于用户从众兴趣对推荐结果进行解释。in

It refers to the probability that the target service will interact with a given interaction record sequence T. In terms of the interpretability of the results, if the service sequence and category sequence of user u ₁ are {I ₁ , I ₂ , I ₃ } and {C ₁ , C ₁ , C ₂ } respectively, if the next item that user u ₁ actually interacts with is I ₄ (category C ₁ ), the result cannot be explained based on the service history alone. However, from the category data, it can be seen that user u ₁ frequently interacts with services of category C ₁ , so the recommendation results can be explained by focusing on the user's herd interest.

本方法从用户历史交互记录数据为基础，在基于解耦表征学习的基础上分别捕获用户长期兴趣和短期兴趣，然后引入服务类别流行度因素，构建了类别超图卷积神经网络捕获用户对于服务类别的从众兴趣，提升了推荐模型适应不同应用场景和对于结果可解释性的能力。This method is based on the user's historical interaction record data, and captures the user's long-term and short-term interests respectively based on decoupled representation learning. Then, the service category popularity factor is introduced to construct a category hypergraph convolutional neural network to capture the user's herd interest in service categories, thereby improving the recommendation model's ability to adapt to different application scenarios and the interpretability of the results.

Claims

1. A service recommendation method based on decoupled representation learning and graph neural network, characterized by comprising the following steps:

S1. Collect the user's complete historical interaction sequence and corresponding category sequence, where the complete historical interaction sequence refers to the user's interaction record with the service, and the corresponding category sequence refers to the sequence constructed by the category information corresponding to each service;

S2. Input the set of services and the set of categories into the embedding layer to obtain the embedded feature representation of each service and the corresponding category. The embedding layer refers to obtaining a multi-dimensional feature vector through a one-hot encoder and a multi-layer perceptron to represent each service or category. The embedded feature representation obtained by the embedding layer is defined as two feature matrices: _Xi∈Rn ^×d is the service feature matrix, and _Xc∈Rk ^×d is the category feature matrix, where n represents the number of services, k represents the number of categories, and d represents the feature dimension;

S3: Construct a long-term interest and short-term interest modeling network based on user interaction sequences, as well as a category-based user herd interest modeling module

S3-1, build a hypergraph based on the service sequence in the complete historical interaction sequence, and capture the user's long-term interests after the hypergraph convolution layer;

S3-2, capturing users’ short-term interests based on service sequences in recent historical interaction sequences;

S3-3, introduce service category information as the data basis to capture the user's herd interest in category popularity;

S4. Based on step S3, the learned user long-term interest h _lo , short-term interest h _sh and herd interest h _co are respectively fused through the multi-interest aggregation layer to obtain the final user interest representation h. The fusion method is as follows:

h＝W[h _lo ||h _sh ||h _co ]+b

Where W and b represent the weight matrix and bias term respectively, and [·||·||·] represents the concatenation of feature vectors in dimension;

S5. Based on the user interest h learned in step S4, perform the service recommendation task. For the candidate service set I, the recommendation score z of each service is obtained by the inner product operation of the service feature vector x∈X _i and the user interest feature vector h, that is, z _i =x ^T h. Use the softmax function to normalize z to obtain the final service score.

in

2. According to claim 1, the service recommendation method based on decoupled representation learning and graph neural network is characterized in that the user's complete historical interaction sequence and the corresponding category sequence are based on the user interaction log recorded by the online application as the data source, and the interaction record includes clicks, favorites, and ratings.

3. According to the service recommendation method based on decoupled representation learning and graph neural network in claim 1, it is characterized in that I is defined as the set of all services, the number of services is n; C is defined as the set of categories corresponding to all services, the number of categories is k; the historical interaction sequence of user u is defined as T = t ₁ , t ₂ , ..., t _l-1 , t _l , t _l+1 , where _ti represents the record of the i-th interaction, including a tuple of the service and the corresponding category, that is, _ti = (I _i , C _i ), l represents the sequence length, and t _l+ represents the next interaction to be predicted.

4. The service recommendation method based on decoupled representation learning and graph neural network according to claim 3 is characterized in that the step S3-1 is specifically as follows:

S3-1-1. Define the service hypergraph as GI∈(V _i ,E _i ), where V _i and E _i are vertex sets and hyperedge sets respectively, and the hypergraph adjacency matrix H _i ∈{0,1} ^m×n . Hyperedges are established with the service sequence in a user's historical behavior sequence. The hyperedge refers to a special edge that can connect multiple vertices. m users will establish m hyperedges. The position of the hypergraph adjacency matrix H corresponding to the service included in the service sequence will be set to 1, otherwise it will be 0.

S3-1-2. Define the vertex degree matrix _Div∈Rn ^×n and the hyperedge degree matrix Die∈Rm _× ^m of the hypergraph. The vertex degree matrix _Div∈Rn ^× n and the hyperedge degree matrix _Die∈Rm ^×m are diagonal matrices of vertex degree and hyperedge degree, respectively. The degree of vertex _v∈Vi is

The degree of hyperedge e∈E _i

S3-1-3. Define the hypergraph convolutional neural network for feature learning and information propagation. The calculation method between adjacent network layers is:

in

represents the service feature representation corresponding to the service sequence, _Wi represents the weight matrix, σ(·) represents the nonlinear activation function, and then the average pooling strategy is used to obtain the feature representation of the service:

Where L represents the number of convolutional network layers, and finally the feature information in the sequence is aggregated using the average strategy to obtain the user's long-term interest h _lo :

Where l represents the length of the user's historical interaction sequence.

5. According to the service recommendation method based on decoupled representation learning and graph neural network according to claim 4, it is characterized in that the specific method of step S3-2 is as follows:

S3-2-1. For the complete user interaction sequence T = t ₁ , t ₂ , ..., t _l-1 , t _l , select the most recent interaction subsequence T ^s = t _la , ..., t _l-1 , t _l , where a represents the number of selected interaction records. Then, the service feature sequence corresponding to the subsequence is passed through a bidirectional LSTM encoding layer to combine time information and capture the user's short-term interest representation.

in

represents the initial service feature representation, the feature matrix of m users’ short-term interest representation is H, and f represents the feature aggregation method;

S3-2-2. Build a user graph GU with users as vertices. The condition for edge connections between vertices is that there are the same recent interactive services between users. Use graph representation learning to capture the potential connections between different users. The specific propagation method is as follows:

H ^(l+1) =σ(D ^-1 AH ^l )

Where D and A are the degree matrix and identity matrix of the user graph GU, respectively, σ(·) is the activation function, and the average pooling strategy is also used to obtain the user's short-term interest representation h _sh ∈H:

6. The service recommendation method based on decoupled representation learning and graph neural network according to claim 5, characterized in that the specific method of step S3-3 is as follows:

Construct a category hypergraph GC∈( _Vc , _Ec ) with the category sequence in the complete historical interaction sequence, where _Vc and _Ec are the vertex set and hyperedge set respectively, and the hypergraph adjacency matrix _Hc∈ {0,1} ^m×k . Define the category hypergraph convolution calculation:

in

represents the category feature representation corresponding to the category sequence, D _cv ∈R ^k×k and D _ce ∈R ^m×m represent the vertex degree matrix and edge degree matrix of the category hypergraph respectively, W _c represents the weight matrix, σ(·) represents the nonlinear activation function, and the user’s herd interest h _co is obtained after the average pooling layer.