CN116186385A - Service recommendation method based on decoupling characterization learning and graph neural network - Google Patents
Service recommendation method based on decoupling characterization learning and graph neural network Download PDFInfo
- Publication number
- CN116186385A CN116186385A CN202211634971.2A CN202211634971A CN116186385A CN 116186385 A CN116186385 A CN 116186385A CN 202211634971 A CN202211634971 A CN 202211634971A CN 116186385 A CN116186385 A CN 116186385A
- Authority
- CN
- China
- Prior art keywords
- service
- user
- sequence
- category
- feature
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 32
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 13
- 238000012512 characterization method Methods 0.000 title abstract 2
- 230000003993 interaction Effects 0.000 claims abstract description 57
- 230000007774 longterm Effects 0.000 claims abstract description 19
- 239000011159 matrix material Substances 0.000 claims description 50
- 244000144980 herd Species 0.000 claims description 17
- 230000006870 function Effects 0.000 claims description 15
- 239000013598 vector Substances 0.000 claims description 12
- 230000004913 activation Effects 0.000 claims description 9
- 230000006399 behavior Effects 0.000 claims description 9
- 238000011176 pooling Methods 0.000 claims description 9
- 238000004364 calculation method Methods 0.000 claims description 8
- 230000002776 aggregation Effects 0.000 claims description 7
- 238000004220 aggregation Methods 0.000 claims description 7
- 238000013527 convolutional neural network Methods 0.000 claims description 5
- 230000002457 bidirectional effect Effects 0.000 claims description 4
- 230000002452 interceptive effect Effects 0.000 claims description 4
- 238000007500 overflow downdraw method Methods 0.000 claims description 3
- 230000007547 defect Effects 0.000 abstract 1
- 238000010586 diagram Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 101001121408 Homo sapiens L-amino-acid oxidase Proteins 0.000 description 1
- 102100026388 L-amino-acid oxidase Human genes 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
技术领域Technical Field
本发明属于推荐系统技术领域,具体涉及一种基于解耦表征学习与图神经网络的服务推荐方法。The present invention belongs to the technical field of recommendation systems, and specifically relates to a service recommendation method based on decoupled representation learning and graph neural networks.
背景技术Background Art
近年来,随着互联网技术的快速普及,人们日常生活中对于在线应用服务(如在线音乐、视频流媒体、电子购物等)的使用需求日益增加。推荐系统利用用户的历史行为记录建模用户画像,帮助在线应用提供商有针对性的为用户推荐可能感兴趣的服务,具有很大的潜在经济价值。例如网易云音乐会根据用户的听歌记录生成用户画像,进而为用户推荐潜在的具有会员标识的歌曲来吸引用户升级会员;天猫购物平台会根据用户的购买、搜索等行为推荐相似的商品来帮助用户选购商品,从而提升购买转化率。In recent years, with the rapid popularization of Internet technology, people's daily life has an increasing demand for online application services (such as online music, video streaming, e-shopping, etc.). Recommendation systems use users' historical behavior records to model user portraits, helping online application providers to recommend services that users may be interested in in a targeted manner, which has great potential economic value. For example, NetEase Cloud Music will generate user portraits based on users' listening records, and then recommend potential songs with member logos to users to attract users to upgrade their membership; Tmall shopping platform will recommend similar products based on users' purchase, search and other behaviors to help users choose products, thereby improving purchase conversion rates.
随着深度学习技术的研究日益深入,目前很多基于用户历史行为序列建模用户偏好的关注点是用户与交互服务之间的关联关系,例如是通过序列模式来捕获用户长短期兴趣、通过建立图结构来捕获用户与服务之间的二阶与高阶联系,学习用户的整体兴趣特征。但是用户的行为是多种因素作用的结果,在个性化程度上存在长期的稳定兴趣和短期的随机动态兴趣,在服务受欢迎程度上存在从众的流行度兴趣,不同的兴趣会对用户的未来行为产生不同程度的影响。With the deepening of research on deep learning technology, many user preference models based on user historical behavior sequences currently focus on the relationship between users and interactive services, such as capturing users' long-term and short-term interests through sequence patterns, capturing the second-order and high-order connections between users and services through the establishment of graph structures, and learning users' overall interest characteristics. However, user behavior is the result of multiple factors. In terms of personalization, there are long-term stable interests and short-term random dynamic interests. In terms of service popularity, there are popular interests that conform to the crowd. Different interests will have different degrees of influence on users' future behavior.
为了通过用户的历史行为记录捕获用户在不同方面的潜在兴趣,许多方法采用隐式解耦的方式,首先设定潜在兴趣个数作为解耦表征学习的前提,然后利用图神经网络等特征学习方法学习不同方面的潜在兴趣,最后得到用户的整体偏好。现有方法的不足之处在于,没有从整体上分析潜在兴趣的主要分布情况,难以根据实际应用推荐场景调整潜在兴趣个数,并且忽略了不通在线服务的独有特点(如类别因素),影响在线应用提供商的经济收益。In order to capture users' potential interests in different aspects through their historical behavior records, many methods adopt implicit decoupling. First, the number of potential interests is set as the premise of decoupling representation learning, and then feature learning methods such as graph neural networks are used to learn different aspects of potential interests, and finally the overall preference of users is obtained. The shortcomings of existing methods are that they do not analyze the main distribution of potential interests as a whole, making it difficult to adjust the number of potential interests according to actual application recommendation scenarios, and ignoring the unique characteristics of different online services (such as category factors), which affects the economic benefits of online application providers.
发明内容Summary of the invention
本发明针对基于隐式解耦表征学习的方法依赖于人工设置的潜在兴趣个数,难以适应不同服务场景,且没有考虑不同在线应用类别因素的问题,提出了一种基于解耦表征学习与图神经网络的服务推荐方法。首先在个性化程度上分别捕获用户长期的稳定兴趣和短期的动态兴趣,然后利用服务类别信息捕获用户的从众兴趣。Aiming at the problem that the implicit decoupled representation learning-based method relies on the number of potential interests set manually, is difficult to adapt to different service scenarios, and does not consider the factors of different online application categories, this paper proposes a service recommendation method based on decoupled representation learning and graph neural network. First, the user's long-term stable interests and short-term dynamic interests are captured in terms of personalization, and then the user's herd interests are captured using service category information.
本发明的具体步骤是:The specific steps of the present invention are:
步骤(1).以在线应用记录的用户交互日志为数据源,收集用户的完整历史交互序列以及对应类别序列。所述的完整历史交互序列是指用户已经产生的对于服务的交互记录(如点击、收藏、评分等行为),所述的对应类别序列指每个服务对应的类别信息构建的序列。定义I为所有服务的集合,服务个数为n;定义C为所有服务对应类别的集合,类别个数为k;定义用户u的历史交互序列为T=t1,t2,…,tl-1,tl,tl+1,其中ti表示第i次交互的记录,包含服务与对应的类别的二元组,即ti=(Ii,Ci),l表示序列长度,tl+1表示待预测的下一次交互。Step (1). Using the user interaction log recorded by the online application as the data source, collect the user's complete historical interaction sequence and the corresponding category sequence. The complete historical interaction sequence refers to the interaction record that the user has generated for the service (such as clicks, favorites, ratings, etc.), and the corresponding category sequence refers to the sequence constructed by the category information corresponding to each service. Define I as the set of all services, the number of services is n; define C as the set of categories corresponding to all services, the number of categories is k; define the historical interaction sequence of user u as T = t 1 , t 2 , ..., t l-1 , t l , t l+1 , where ti represents the record of the i-th interaction, including a binary tuple of the service and the corresponding category, that is, ti = (I i , C i ), l represents the sequence length, and t l+1 represents the next interaction to be predicted.
步骤(2).将服务集合I与类别集合C输入嵌入层得到每个服务以及对应类别的嵌入特征表示。所述的嵌入层是指通过one-hot编码器和多层感知机得到多维度的特征向量来表示每个服务或者类别,定义嵌入层得到的嵌入特征表示为两个特征矩阵:Xi∈Rn×d为服务特征矩阵,Xc∈Rk×d为类别特征矩阵,其中n表示服务个数,k表示类别个数,d表示特征维度。Step (2). Input the service set I and the category set C into the embedding layer to obtain the embedded feature representation of each service and the corresponding category. The embedding layer refers to a multi-dimensional feature vector obtained by a one-hot encoder and a multi-layer perceptron to represent each service or category. The embedded feature representation obtained by the embedding layer is defined as two feature matrices: Xi∈Rn ×d is the service feature matrix, and Xc∈Rk ×d is the category feature matrix, where n represents the number of services, k represents the number of categories, and d represents the feature dimension.
步骤(3).为了引入类别因素对用户兴趣进行解耦学习,设计了基于用户交互序列的长期兴趣和短期兴趣建模网络,以及基于类别的用户从众兴趣建模模块,包括以下三个步骤:Step (3). In order to introduce category factors to decouple user interests, a long-term interest and short-term interest modeling network based on user interaction sequences and a category-based user herd interest modeling module are designed, which includes the following three steps:
步骤(3.1).首先基于完整历史交互序列中的服务序列构建超图,经过超图卷积层后捕获用户的长期兴趣。Step (3.1). First, a hypergraph is constructed based on the service sequence in the complete historical interaction sequence, and the user's long-term interests are captured after the hypergraph convolution layer.
1)定义服务超图为GI∈(Vi,Ei),Vi和Ei分别为顶点集和超边集,超图邻接矩阵Hi∈{0,1}m×n,以一个用户的历史行为序列中的服务序列建立超边(超边是指能够连接多个顶点的特殊边),m个用户会建立m条超边,服务序列中包含的服务对应的超图邻接矩阵Hi位置会设置为1,否则为0。例如第一个用户与第一个服务产生交互记录,则Hi(0,0)=1。1) Define the service hypergraph as GI∈(V i ,E i ), where V i and E i are vertex sets and hyperedge sets respectively, and the hypergraph adjacency matrix H i ∈{0,1} m×n . Hyperedges are established with the service sequence in a user's historical behavior sequence (hyperedges are special edges that can connect multiple vertices). m users will establish m hyperedges, and the hypergraph adjacency matrix H i corresponding to the service included in the service sequence will be set to 1, otherwise it will be 0. For example, if the first user has an interaction record with the first service, then H i (0,0)=1.
2)定义超图的顶点度矩阵Div∈Rn×n和超边度矩阵Die∈Rm×m,他们分别是关于顶点度和超边度的对角矩阵,其中顶点v∈Vi的度超边e∈Ei的度 2) Define the vertex degree matrix Div∈Rn ×n and the hyperedge degree matrix Die∈Rm ×m of the hypergraph, which are diagonal matrices of vertex degree and hyperedge degree respectively, where the degree of vertex v∈Vi The degree of hyperedge e∈E i
3)定义超图卷积神经网络用于特征学习与信息传播,相邻网络层之间的计算方式为其中表示服务序列对应的服务特征表示,Wi∈Rm×m表示权重矩阵,σ(·)表示非线性激活函数。然后使用平均池化策略得到服务的特征表示:3) Define a hypergraph convolutional neural network for feature learning and information propagation. The calculation method between adjacent network layers is in represents the service feature representation corresponding to the service sequence, W i ∈ R m×m represents the weight matrix, and σ(·) represents the nonlinear activation function. Then the average pooling strategy is used to obtain the feature representation of the service:
其中L表示卷积网络层数,最后使用平均策略聚合序列中的特征信息得到用户长期兴趣hlo:Where L represents the number of convolutional network layers, and finally the feature information in the sequence is aggregated using the average strategy to obtain the user's long-term interest h lo :
其中l表示用户历史交互序列长度。Where l represents the length of the user's historical interaction sequence.
步骤(3.2).然后基于最近历史交互序列中的服务序列捕获用户的短期兴趣,具体计算过程如下。Step (3.2). Then capture the user's short-term interest based on the service sequence in the recent historical interaction sequence. The specific calculation process is as follows.
1)针对用户完整交互序列T=t1,t2,…,tl-1,tl,选择最近的交互子序列Ts=tl-a,…,tl-1,tl,a表示选择交互记录的个数,可以根据不同应用场景进行设置。然后将子序列对应的服务特征序列通过双向LSTM编码层,用于结合时间信息,捕获用户的短期兴趣表示其中表示初始服务特征表示。m个用户短期兴趣表示的特征矩阵为H,f表特征聚合方法。1) For the user's complete interaction sequence T = t 1 , t 2 , ..., t l-1 , t l , select the most recent interaction subsequence T s = t la , ..., t l-1 , t l , where a represents the number of selected interaction records, which can be set according to different application scenarios. Then the service feature sequence corresponding to the subsequence is passed through a bidirectional LSTM encoding layer to combine time information and capture the user's short-term interest representation. in represents the initial service feature representation. The feature matrix of m users’ short-term interest representation is H, and f represents the feature aggregation method.
2)以用户为顶点建立用户图GU,顶点之间产生边连接的条件是用户之间存在相同最近交互服务,利用图表征学习捕获不同用户之间的潜在联系,具体传播方式如下:2) A user graph GU is established with users as vertices. The condition for edge connection between vertices is that there are the same recent interactive services between users. Graph representation learning is used to capture the potential connections between different users. The specific propagation method is as follows:
H(l+1)=σ(D-1AHl)H (l+1) =σ(D -1 AH l )
其中D和A分别是用户图GU的度矩阵和单位矩阵,σ(·)是激活函数。同样使用平均池化策略得到用户短期兴趣表示hsh∈H:Where D and A are the degree matrix and identity matrix of the user graph GU, respectively, and σ(·) is the activation function. The average pooling strategy is also used to obtain the user's short-term interest representation h sh ∈H:
步骤(3.3).为了适应不同应用的数据特点,本方法引入服务类别信息作为数据基础来捕获用户针对类别流行度所产生的从众兴趣。与步骤(3.1)相似,以完整历史交互序列中的类别序列构建类别超图GC∈(Vc,Ec),Vc和Ec分别为顶点集和超边集,超图邻接矩阵Hc∈{0,1}m×k。定义类别超图卷积计算:其中表示类别序列对应的类别特征表示,Dcv∈Rk×k和Dce∈Rm×m分别表示类别超图的顶点度矩阵和超边度矩阵,Wc∈Rm×m表示权重矩阵,σ(·)表示非线性激活函数,通过平均池化层后得到用户的从众兴趣hco。Step (3.3). In order to adapt to the data characteristics of different applications, this method introduces service category information as the data basis to capture the user's herd interest in category popularity. Similar to step (3.1), the category sequence in the complete historical interaction sequence is used to construct a category hypergraph GC∈(V c ,E c ), where V c and E c are the vertex set and hyperedge set respectively, and the hypergraph adjacency matrix H c ∈{0,1} m×k . Define the category hypergraph convolution calculation: in represents the category feature representation corresponding to the category sequence, D cv ∈R k×k and D ce ∈R m×m represent the vertex degree matrix and hyperedge degree matrix of the category hypergraph respectively, W c ∈R m×m represents the weight matrix, σ(·) represents the nonlinear activation function, and the user’s herd interest h co is obtained after the average pooling layer.
步骤(4).在步骤(3)的基础上,分别将学习的用户长期兴趣hlo、短期兴趣hsh和从众兴趣hco通过多兴趣聚合层进行特征融合,得到最终的用户兴趣表示h,融合方式如下:Step (4). Based on step (3), the learned user long-term interest h lo , short-term interest h sh and herd interest h co are respectively fused through the multi-interest aggregation layer to obtain the final user interest representation h. The fusion method is as follows:
h=W[hlo||hsh||hco]+bh=W[h lo ||h sh ||h co ]+b
其中W和b分别表示权重矩阵和偏置项,[·||·||·]表示特征向量在维度上的拼接。Where W and b represent the weight matrix and bias term respectively, and [·||·||·] represents the concatenation of feature vectors in dimension.
步骤(5).基于步骤(4)中学习得到的用户兴趣h,执行服务推荐任务。针对备选服务集合I,每个服务的推荐评分z由服务特征向量x∈Xi和用户兴趣特征向量h的内积运算得到,即zi=xTh,使用softmax函数将z进行归一化得到最终的服务评分然后选择评分最高的top-N个服务作为推荐列表中的候选服务。推荐任务学习的目标函数表示为交叉熵损失:Step (5). Based on the user interest h learned in step (4), perform the service recommendation task. For the candidate service set I, the recommendation score z of each service is obtained by the inner product operation of the service feature vector x∈X i and the user interest feature vector h, that is, z i = x T h. The softmax function is used to normalize z to obtain the final service score Then the top-N services with the highest scores are selected as candidate services in the recommendation list. The objective function of recommendation task learning is expressed as cross entropy loss:
其中是指目标服务在给定交互记录序列T情况下会产生交互的概率。in It refers to the probability that the target service will interact with a given interaction record sequence T.
本发明相对于现有技术而言,具有以下有益效果:利用本方法对用户兴趣的解耦学习,可以分别学习用户的长期兴趣、短期兴趣和从众兴趣,最大程度避免人工设置潜在兴趣个数的弊端,并且考虑不同在线应用类别因素的影响,捕获用户的从众兴趣作为推荐结果可解释性的基础,帮助应用提供商有针对性的调整推荐策略。Compared with the prior art, the present invention has the following beneficial effects: by utilizing the decoupled learning of user interests, the user's long-term interests, short-term interests and herd interests can be learned separately, avoiding the disadvantages of manually setting the number of potential interests to the greatest extent, and considering the influence of different online application category factors, capturing the user's herd interests as the basis for the interpretability of recommendation results, helping application providers to adjust recommendation strategies in a targeted manner.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1为本发明实施例一种基于解耦表征学习与图神经网络的服务推荐方法框架图。FIG1 is a framework diagram of a service recommendation method based on decoupled representation learning and graph neural network according to an embodiment of the present invention.
图2为本发明实施例中基于最近序列的特征嵌入模块图。FIG. 2 is a diagram of a feature embedding module based on a recent sequence in an embodiment of the present invention.
具体实施方式DETAILED DESCRIPTION
需要说明的是,在不冲突的情况下,本发明中的实施例及实施例中的特征可以相互组合。It should be noted that, in the absence of conflict, the embodiments of the present invention and the features in the embodiments may be combined with each other.
下面结合附图对本发明技术方案进行进一步的解释和说明。The technical solution of the present invention is further explained and illustrated below in conjunction with the accompanying drawings.
本实施例提供了一种基于解耦表征学习与图神经网络的服务推荐方法,方法的整体执行框架如图1所示。This embodiment provides a service recommendation method based on decoupled representation learning and graph neural network. The overall execution framework of the method is shown in Figure 1.
步骤(1).以在线应用记录的用户交互日志为数据源,收集用户的完整历史交互序列以及对应类别序列。所述的完整历史交互序列是指用户已经产生的对于服务的交互记录(如点击、收藏、评分等行为),所述的对应类别序列指每个服务对应的类别信息构建的序列。定义I为所有服务的集合,服务个数为n;定义C为所有服务对应类别的集合,类别个数为k;定义用户u的历史交互序列为T=t1,t2,…,tl-1,tl,tl+1,其中ti表示第i次交互的记录,包含服务与对应的类别的二元组,即ti=(Ii,Ci),l表示序列长度,tl+1表示待预测的下一次交互。Step (1). Using the user interaction log recorded by the online application as the data source, collect the user's complete historical interaction sequence and the corresponding category sequence. The complete historical interaction sequence refers to the interaction record that the user has generated for the service (such as clicks, favorites, ratings, etc.), and the corresponding category sequence refers to the sequence constructed by the category information corresponding to each service. Define I as the set of all services, the number of services is n; define C as the set of categories corresponding to all services, the number of categories is k; define the historical interaction sequence of user u as T = t 1 , t 2 , ..., t l-1 , t l , t l+1 , where ti represents the record of the i-th interaction, including a binary tuple of the service and the corresponding category, that is, ti = (I i , C i ), l represents the sequence length, and t l+1 represents the next interaction to be predicted.
步骤(2).将服务集合I与类别集合C输入嵌入层得到每个服务以及对应类别的嵌入特征表示。所述的嵌入层是指通过one-hot编码器和多层感知机得到多维度的特征向量来表示每个服务或者类别,定义嵌入层得到的嵌入特征表示为两个特征矩阵:Xi∈Rn×d为服务特征矩阵,Xc∈Rk×d为类别特征矩阵,其中n和k分别表示服务和类别个数,d表示特征维度。Step (2). Input the service set I and the category set C into the embedding layer to obtain the embedded feature representation of each service and the corresponding category. The embedding layer refers to a multi-dimensional feature vector obtained by a one-hot encoder and a multi-layer perceptron to represent each service or category. The embedded feature representation obtained by the embedding layer is defined as two feature matrices: Xi∈Rn ×d is the service feature matrix, and Xc∈Rk ×d is the category feature matrix, where n and k represent the number of services and categories respectively, and d represents the feature dimension.
步骤(3).为了引入类别因素对用户兴趣进行解耦学习,设计了基于用户交互序列的长期兴趣和短期兴趣建模网络,以及基于类别的用户从众兴趣建模模块,包括以下三个步骤:Step (3). In order to introduce category factors to decouple user interests, a long-term interest and short-term interest modeling network based on user interaction sequences and a category-based user herd interest modeling module are designed, which includes the following three steps:
步骤(3.1).首先基于完整历史交互序列中的服务序列Ti=I1,I2,…,Il-1,Il构建超图,经过超图卷积层后捕获用户的长期兴趣表示为hlo。Step (3.1). First, a hypergraph is constructed based on the service sequence T i =I 1 ,I 2 ,…,I l-1 ,I l in the complete historical interaction sequence. After the hypergraph convolution layer, the long-term interests of users are captured and represented as h lo .
1)定义服务超图为GI∈(Vi,Ei),Vi和Ei分别为顶点集和超边集,超图邻接矩阵Hi∈{0,1}m×n,以一个用户的历史行为序列中的服务序列建立超边(超边是指能够连接多个顶点的特殊边),m个用户会建立m条超边,服务序列中包含的服务对应的超图邻接矩阵Hi位置会设置为1,否则为0。若第一个用户与第一个服务产生交互记录,则Hi(0,0)=1。例如有两个用户u1和u2的历史交互记录中的服务序列分别为{I1,I2,I2,I3}和{I2,I4,I2,I5},则对应的服务超图邻接矩阵为:1) Define the service hypergraph as GI∈(V i ,E i ), where V i and E i are vertex sets and hyperedge sets respectively, and the hypergraph adjacency matrix H i ∈{0,1} m×n . Hyperedges are established with the service sequence in a user's historical behavior sequence (hyperedges refer to special edges that can connect multiple vertices). m users will establish m hyperedges, and the hypergraph adjacency matrix Hi corresponding to the service included in the service sequence will be set to 1, otherwise it will be 0. If the first user generates an interaction record with the first service, then H i (0,0)=1. For example, if the service sequences in the historical interaction records of two users u 1 and u 2 are {I 1 ,I 2 ,I 2 ,I 3 } and {I 2 ,I 4 ,I 2 ,I 5 } respectively, then the corresponding service hypergraph adjacency matrix is:
2)定义超图的顶点度矩阵Div∈Rn×n和超边度矩阵Die∈Rm×m,他们分别是关于顶点度和超边度的对角矩阵,其中顶点v∈Vi的度超边e∈Ei的度 2) Define the vertex degree matrix Div∈Rn ×n and the hyperedge degree matrix Die∈Rm ×m of the hypergraph, which are diagonal matrices of vertex degree and hyperedge degree respectively, where the degree of vertex v∈Vi The degree of hyperedge e∈E i
3)定义超图卷积神经网络用于特征学习与信息传播,相邻网络层之间的计算方式为其中表示服务序列对应的服务特征表示,Wi∈Rm×m表示权重矩阵,σ(·)表示非线性激活函数。然后使用平均池化策略得到服务的特征表示:3) Define a hypergraph convolutional neural network for feature learning and information propagation. The calculation method between adjacent network layers is in represents the service feature representation corresponding to the service sequence, W i ∈ R m×m represents the weight matrix, and σ(·) represents the nonlinear activation function. Then the average pooling strategy is used to obtain the feature representation of the service:
其中L表示卷积网络层数,表示第l层的服务特征矩阵,最后使用平均策略聚合序列中的特征信息得到用户长期兴趣hlo:Where L represents the number of convolutional network layers, represents the service feature matrix of the lth layer, and finally uses the average strategy to aggregate the feature information in the sequence to obtain the user's long-term interest h lo :
其中l表示用户历史交互序列长度,Xi,t表示服务序列中第t个服务的特征表示。Where l represents the length of the user's historical interaction sequence, and Xi ,t represents the feature representation of the tth service in the service sequence.
步骤(3.2).然后基于最近历史交互序列中的服务序列捕获用户的短期兴趣,会话嵌入聚合模块如图2所示,具体计算过程如下。Step (3.2). Then, based on the service sequence in the recent historical interaction sequence, the user's short-term interest is captured. The session embedding aggregation module is shown in Figure 2. The specific calculation process is as follows.
1)针对用户完整交互序列T=t1,t2,…,tl-1,tl,选择最近的交互子序列Ts=tl-a,…,tl-1,tl,a表示选择交互记录的个数,可以根据不同应用场景进行设置。然后将子序列对应的服务特征序列通过双向LSTM编码层,用于结合时间信息,捕获用户的短期兴趣表示其中表示初始服务特征表示。m个用户短期兴趣表示的特征矩阵为H∈Rm×d,f表示双向LSTM特征聚合方法。1) For the user's complete interaction sequence T = t 1 , t 2 , ..., t l-1 , t l , select the most recent interaction subsequence T s = t la , ..., t l-1 , t l , where a represents the number of selected interaction records, which can be set according to different application scenarios. Then the service feature sequence corresponding to the subsequence is passed through a bidirectional LSTM encoding layer to combine time information and capture the user's short-term interest representation. in represents the initial service feature representation. The feature matrix of m users’ short-term interest representation is H∈R m×d , and f represents the bidirectional LSTM feature aggregation method.
2)以用户为顶点建立用户图GU,顶点之间产生边连接的条件是用户之间存在相同最近交互服务,例如用户u1和u2的最近服务序列分别为{I2,I3}和{I2,I5},存在相同服务I2,因此用户u1和u2会产生边连接。利用图表征学习捕获不同用户之间的潜在联系,具体传播方式如下:2) Build a user graph GU with users as vertices. The condition for edge connections between vertices is that there are the same recent interactive services between users. For example, the recent service sequences of users u 1 and u 2 are {I 2 ,I 3 } and {I 2 ,I 5 } respectively. There is the same service I 2 , so users u 1 and u 2 will have edge connections. Graph representation learning is used to capture the potential connections between different users. The specific propagation method is as follows:
H(l+1)=σ(D-1AHl)H (l+1) =σ(D -1 AH l )
其中D∈Rm×m和A∈Rm×m分别是用户图GU的度矩阵和单位矩阵,σ(·)是激活函数。同样使用平均池化策略得到用户短期兴趣表示hsh∈H:Where D∈R m×m and A∈R m×m are the degree matrix and identity matrix of the user graph GU respectively, and σ(·) is the activation function. The average pooling strategy is also used to obtain the user short-term interest representation h sh ∈H:
其中L表示用户图卷积神经网络的层数,表示第l层的用户短期兴趣特征表示。Where L represents the number of layers of the user graph convolutional neural network, Represents the user's short-term interest feature representation at the lth layer.
步骤(3.3).为了适应不同应用的数据特点,本方法引入服务类别信息作为数据基础来捕获用户针对类别流行度所产生的从众兴趣hco。与步骤(3.1)相似,以完整历史交互序列中的类别序列Tc=C1,C2,…,Cl-1,Cl构建类别超图GC∈(Vc,Ec),Vc和Ec分别为顶点集和超边集,超图邻接矩阵Hc∈{0,1}m×k。例如有两个用户u1和u2的历史交互记录中的类别序列分别为{C1,C1,C1,C2}和{C2,C1,C2,C3},则对应的类别超图邻接矩阵为:Step (3.3). In order to adapt to the data characteristics of different applications, this method introduces service category information as the data basis to capture the user's herd interest h co generated by category popularity. Similar to step (3.1), the category sequence T c =C 1 ,C 2 ,…,C l-1 ,C l in the complete historical interaction sequence is used to construct a category hypergraph GC∈(V c ,E c ), where V c and E c are vertex sets and hyperedge sets respectively, and the hypergraph adjacency matrix H c ∈{0,1} m×k . For example, there are two users u 1 and u 2 whose category sequences in their historical interaction records are {C 1 ,C 1 ,C 1 ,C 2 } and {C 2 ,C 1 ,C 2 ,C 3 } respectively, then the corresponding category hypergraph adjacency matrix is:
定义类别超图卷积计算:其中表示类别序列对应的类别特征表示,Dcv∈Rk×k和Dce∈Rm×m分别表示类别超图的顶点度矩阵和超边度矩阵,Wc∈Rm×m表示权重矩阵,σ(·)表示非线性激活函数,通过平均池化层后得到用户的从众兴趣hco。Define category hypergraph convolution calculation: in represents the category feature representation corresponding to the category sequence, D cv ∈R k×k and D ce ∈R m×m represent the vertex degree matrix and hyperedge degree matrix of the category hypergraph respectively, W c ∈R m×m represents the weight matrix, σ(·) represents the nonlinear activation function, and the user’s herd interest h co is obtained after the average pooling layer.
步骤(4).在步骤(3)的基础上,分别将学习的用户长期兴趣hlo、短期兴趣hsh和从众兴趣hco通过多兴趣聚合层进行特征融合,得到最终的用户兴趣表示h,融合方式如下:Step (4). Based on step (3), the learned user long-term interest h lo , short-term interest h sh and herd interest h co are respectively fused through the multi-interest aggregation layer to obtain the final user interest representation h. The fusion method is as follows:
h=W[hlo||hsh||hco]+bh=W[h lo ||h sh ||h co ]+b
其中W∈Rd×3d和b∈Rd×1分别表示权重矩阵和偏置项,[·||·||·]表示特征向量在维度上的拼接。where W∈Rd ×3d and b∈Rd ×1 represent the weight matrix and bias term respectively, and [·||·||·] represents the concatenation of feature vectors in dimension.
步骤(5).基于步骤(4)中学习得到的用户兴趣h,执行服务推荐任务。针对备选服务集合I,每个服务的推荐评分z由服务特征向量x∈Xi和用户兴趣特征向量h的内积运算得到,即zi=xTh,使用softmax函数将z进行归一化得到最终的服务分数然后选择评分最高的top-N个服务作为推荐列表中的候选服务。推荐任务学习的目标函数表示为交叉熵损失:Step (5). Based on the user interest h learned in step (4), perform the service recommendation task. For the candidate service set I, the recommendation score z of each service is obtained by the inner product operation of the service feature vector x∈X i and the user interest feature vector h, that is, z i = x T h. The softmax function is used to normalize z to obtain the final service score Then the top-N services with the highest scores are selected as candidate services in the recommendation list. The objective function of recommendation task learning is expressed as cross entropy loss:
其中是指目标服务在给定交互记录序列T情况下会产生交互的概率。在结果的可解释性方面,如用户u1的服务序列和类别序列分别为{I1,I2,I3}和{C1,C1,C2},若用户u1实际上下一个交互的物品是I4(类别是C1),仅根据服务历史无法对结果进行解释,但是从类别数据上可以看出用户u1频繁交互类别为C1的服务,则可以侧重于用户从众兴趣对推荐结果进行解释。in It refers to the probability that the target service will interact with a given interaction record sequence T. In terms of the interpretability of the results, if the service sequence and category sequence of user u 1 are {I 1 , I 2 , I 3 } and {C 1 , C 1 , C 2 } respectively, if the next item that user u 1 actually interacts with is I 4 (category C 1 ), the result cannot be explained based on the service history alone. However, from the category data, it can be seen that user u 1 frequently interacts with services of category C 1 , so the recommendation results can be explained by focusing on the user's herd interest.
本方法从用户历史交互记录数据为基础,在基于解耦表征学习的基础上分别捕获用户长期兴趣和短期兴趣,然后引入服务类别流行度因素,构建了类别超图卷积神经网络捕获用户对于服务类别的从众兴趣,提升了推荐模型适应不同应用场景和对于结果可解释性的能力。This method is based on the user's historical interaction record data, and captures the user's long-term and short-term interests respectively based on decoupled representation learning. Then, the service category popularity factor is introduced to construct a category hypergraph convolutional neural network to capture the user's herd interest in service categories, thereby improving the recommendation model's ability to adapt to different application scenarios and the interpretability of the results.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211634971.2A CN116186385A (en) | 2022-12-19 | 2022-12-19 | Service recommendation method based on decoupling characterization learning and graph neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202211634971.2A CN116186385A (en) | 2022-12-19 | 2022-12-19 | Service recommendation method based on decoupling characterization learning and graph neural network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116186385A true CN116186385A (en) | 2023-05-30 |
Family
ID=86437430
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202211634971.2A Pending CN116186385A (en) | 2022-12-19 | 2022-12-19 | Service recommendation method based on decoupling characterization learning and graph neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116186385A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117131438A (en) * | 2023-10-27 | 2023-11-28 | 深圳市迪博企业风险管理技术有限公司 | Litigation document analysis method, model training method, device, equipment and medium |
-
2022
- 2022-12-19 CN CN202211634971.2A patent/CN116186385A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117131438A (en) * | 2023-10-27 | 2023-11-28 | 深圳市迪博企业风险管理技术有限公司 | Litigation document analysis method, model training method, device, equipment and medium |
CN117131438B (en) * | 2023-10-27 | 2024-02-13 | 深圳市迪博企业风险管理技术有限公司 | Litigation document analysis method, model training method, device, equipment and medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN114817663B (en) | Service modeling and recommendation method based on class perception graph neural network | |
CN108304556A (en) | The personalized recommendation method being combined with collaborative filtering based on content | |
CN113064968B (en) | A social media sentiment analysis method and system based on tensor fusion network | |
CN112348191A (en) | Knowledge base completion method based on multi-mode representation learning | |
CN107105320A (en) | A kind of Online Video temperature Forecasting Methodology and system based on user emotion | |
CN112559878B (en) | A Sequence Recommendation System and Recommendation Method Based on Graph Neural Network | |
CN113051468B (en) | Movie recommendation method and system based on knowledge graph and reinforcement learning | |
CN116204729A (en) | A cross-domain group intelligent recommendation method based on hypergraph neural network | |
CN115545834B (en) | Personalized service recommendation method based on graph neural network and metadata | |
CN116186385A (en) | Service recommendation method based on decoupling characterization learning and graph neural network | |
Chakder et al. | Graph network based approaches for multi-modal movie recommendation system | |
CN116561426A (en) | Personalized recommendation method based on graph comparison learning and negative interest propagation | |
CN115689639A (en) | Commercial advertisement click rate prediction method based on deep learning | |
CN116701766A (en) | Research on Graph-Coupled Time Interval Networks for Serialized Recommendation | |
CN113742562B (en) | Video recommendation method and device, electronic equipment and storage medium | |
CN114048396A (en) | NQI service recommendation method and system based on heterogeneous information network and deep learning | |
Yang et al. | Effective isp tuning framework based on user preference feedback | |
CN118964748B (en) | Online group recommendation method and system based on multi-scale contrastive learning twin tower model | |
Joshi et al. | A Personalized Video Recommendation Model Based on Multi-Graph Neural Network and Attention Mechanism | |
Lim et al. | A multimedia recommendation model based on collaborative graph | |
Yang et al. | Recommended Method based on Noise Reduction Self Encoder | |
Sang et al. | Feature Interaction Fusion Self-Distillation Network For CTR Prediction | |
Yuan | Personalized online learning recommendation system based on collaborative filtering algorithm | |
Wang et al. | MIG: Addressing the Cold-Start Problem in Task Recommendations Through Enhanced Meta Embeddings | |
Zhou et al. | Dual Attention-based Interest Network for Personalized Recommendation System |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |