CN114169975A

CN114169975A - Method and system for shopping network product recommendation based on random walk heterogeneous attention

Info

Publication number: CN114169975A
Application number: CN202111551744.9A
Authority: CN
Inventors: 郭昆; 刘俊杰; 张鹏
Original assignee: Fuzhou University
Current assignee: Fuzhou University
Priority date: 2021-12-17
Filing date: 2021-12-17
Publication date: 2022-03-11
Anticipated expiration: 2041-12-17
Also published as: CN114169975B

Abstract

The invention relates to a shopping network commodity recommendation system based on random walk heterogeneous attention, which comprises a heterogeneous network construction module, a random walk module, an intra-class attention coefficient calculation module, an inter-class attention coefficient calculation module, a node expression vector optimization module and a clustering and commodity recommendation module, wherein the random walk module is used for calculating intra-class attention coefficients of different classes; the shopping network is modeled into a heterogeneous network comprising user nodes and commodity nodes, heterogeneous information is fused by using random walk and heterogeneous attention, and finally communities formed by related commodities are obtained by combining a clustering algorithm, so that the method is used for personalized recommendation to improve commodity recommendation rationality.

Description

Shopping network commodity recommendation method and system based on random walk heterogeneous attention

Technical Field

The invention relates to the field of big data analysis, in particular to a shopping network commodity recommendation method and system based on random walk heterogeneous attention.

Background

With the development of the times and the advancement of technology, especially the information technology in the third industrial leather, many daily lives of people are shifted to online, and online shopping is a typical example. The online shopping is connected with consumers and producers through the Internet, so that the consumers can purchase psychographic products without going out of households, and the producers can sell the products to the consumers all over the country. The convenience of online shopping makes the online shopping continuously developed, and in the twenty-one of 2021, the final bargain amount of the online mart of the Tianmao reaches 5403 billion yuan. The rapid development also makes the shopping network very complicated, and how to carry out accurate commodity recommendation on users has important significance for consumers and producers. However, the traditional shopping network commodity recommendation method does not perform refined modeling on the shopping network, mainly recommends similar commodities of the same kind according to the purchased commodities, and is worthless for the user who has purchased the commodities.

Disclosure of Invention

In view of this, the invention aims to provide a shopping network commodity recommendation method and system based on random walk heterogeneous attention, which perform more refined modeling on a shopping network, construct a heterogeneous network including user nodes and commodity nodes, and effectively improve the commodity recommendation accuracy by considering heterogeneous information.

In order to achieve the purpose, the invention adopts the following technical scheme:

a shopping network commodity recommendation method based on random walk heterogeneous attention comprises the following steps:

step S1, constructing a heterogeneous network containing user nodes and commodity nodes according to the shopping records of the users;

step S2, traversing all nodes on the heterogeneous network based on the heterogeneous network, taking each node as a starting point of the migration, performing random migration for a plurality of times, recording the access times of each node in the migration process, arranging the access times in a descending order, and selecting the front node as a closely related neighbor node set of the starting point according to a preset proportion;

step S3: for each node, respectively executing in a user node neighbor set commodity node neighbor set to obtain a user expression vector and a commodity expression vector;

step S4, calculating attention coefficients of user representation vectors and commodity representation vectors for each node, representing different importance of user node neighbors and commodity node neighbors to the migration starting point, and finally performing weighted aggregation to generate final representation vectors of the migration starting point;

step S5, calculating a loss function and performing backward propagation according to the final expression vector, and optimizing by using a random gradient descent method to obtain an optimized node expression vector;

step S6: according to the optimized node expression vector, the expression vector of the commodity node is processed by using a K-means algorithm to obtain a commodity set with closely related inner parts, an interested commodity set is determined from commodities in a shopping record of a user who wants to recommend the commodities, and the commodities in the commodity set are recommended to the user.

Further, the step S3 is specifically:

step S31: classifying the closely related neighbor node sets obtained in the step S2 according to the types of the nodes, dividing the closely related neighbor node sets into a user node set and a commodity node set, and calculating the intra-class attention coefficient in each type;

step S32: in the same node type, the information of different nodes is aggregated by using attention, which is called an in-class attention mechanism; in this way, a representation vector representing the influence of the neighbor of the type is generated;

step S33: on the basis of the attention coefficient calculated at step S32, a representative vector representing such neighbors is calculated using the following formula.

Where σ denotes a non-linear sigmoid function and W is a weight matrix.

Further, the step S32 isThe body is as follows: if the type of the currently processed node is t, the node v in the type_iAnd node v_jThe attention coefficient therebetween is calculated as follows:

where | | | denotes join operation, W is a weight matrix, LeakReLU is an activation function, and vector α is a parameter that can be learned in training, h_iIs node v_iThe hidden layer vector of (1).

Further, the step S33 uses a multi-head attention mechanism, and the final output is an average of multiple attention head output vectors or is directly spliced.

Further, the step S4 is specifically:

step S41: aggregating the user node expression vector and the commodity node expression vector, wherein the specific calculation formula is as follows:

wherein the vector γ is one of the parameters learned in training; w is the weight matrix used for the linear transformation, and b is the offset vector that plays a role in the offset; attention coefficient a of node type t_tNormalization is performed using a softmax function;

step S42: node v_iThe final representation vector is represented by each node type representation vector z_i ^tBased on inter-class attention coefficient a_tAnd generating weighted aggregation.

Further, the step S5 specifically includes the following steps:

for unsupervised associated commodity set discovery tasks, the following loss function is used as the objective function for training:

wherein

Is shown at the sum node v_iSet of closely related nodes N_iAnd the node type is t. Theta represents the parameters of the model, and the definition of p (j | i; theta) is shown in the following formula

Using a negative sampling technique to speed up, the final loss function is as follows:

where node k is a set of nodes that are not present

Negative sample node in (1).

Further, the step S5 specifically includes the following steps:

and (3) processing the expression vectors of the commodity nodes by using a K-means algorithm, and minimizing a cost function by using an iterative optimization method to obtain a commodity set with closely related internal parts, wherein the optimization function is as follows:

wherein u is_jRepresenting the vector, x, for the center in the set of items C_iThe expression vector of each commodity in the commodity set is shown, and n is the number of commodities in the commodity set.

After each commodity set is obtained through iterative optimization, an interested commodity set is determined from commodities in a shopping record of a user who wants to recommend the commodities, and the commodities in the commodity set are recommended to the user.

A shopping network commodity recommendation system based on random walk heterogeneous attention comprises a heterogeneous network construction module, a random walk module, an intra-class attention coefficient calculation module, an inter-class attention coefficient calculation module, a node expression vector optimization module and a clustering and commodity recommendation module,

the heterogeneous network construction module starts from a user who wants to recommend commodities, constructs a heterogeneous network comprising user nodes and commodity nodes according to shopping records, and executes the heterogeneous network construction module once every preset time;

the random walk module executes random walk on the heterogeneous network generated by the heterogeneous network construction module;

the intra-class attention coefficient calculation module is used for respectively executing neighbor centralized execution on the commodity node neighbor set of the user node neighbor set for each node on the basis of the output of the random walk module; in the user node neighbor set, calculating the attention coefficient of each node by using an attention layer, finally weighting and aggregating the vectors of each node, generating a user representation vector representing the user node neighbor, and representing the influence of the neighbor node of the user node type on the wandering starting point; similarly, commodity expression vectors are independently generated in the commodity node set to express the influence of neighbor nodes of the commodity node type on the wandering starting point; the intra-class attention coefficient calculation module outputs two vectors which are a user representation vector and a commodity representation vector respectively;

the inter-class attention coefficient calculation module calculates the attention coefficients of user representation vectors and commodity representation vectors by using an attention layer for each node on the basis of the output of the intra-class attention calculation module, represents different importance of user node neighbors and commodity node neighbors to the migration starting point, and finally performs weighted aggregation to generate a final representation vector of the migration starting point;

the node expression vector optimization module calculates a loss function and performs back propagation by using a final expression vector output by the inter-class attention coefficient calculation module, optimizes parameters in the inter-class attention coefficient calculation module and the intra-class attention coefficient calculation module by using a random gradient descent method, designs a loss function based on positive and negative samples, and performs iterative optimization in an unsupervised mode;

the clustering and commodity recommending module processes the expression vectors of the commodity nodes by using a K-means algorithm on the basis of the node expression vectors optimized by the node expression vector optimizing module to obtain a commodity set with closely related inner parts, determines an interested commodity set from commodities in a shopping record of a user who needs commodity recommendation, and recommends the commodities in the commodity set to the user.

Further, the random walk module specifically includes: traversing all nodes on the heterogeneous network, taking each node as a starting point of the migration, performing multiple random migrations, recording the access times of each node in the migration process, arranging the access times in a descending order, and selecting the previous node as a closely related neighbor node set of the starting point; the selected proportion is controlled by a hyper-parameter, and a proper balance is searched in insufficient information and introduced noise; and for the close neighbor node set of each node, separating the nodes according to the user nodes and the commodity nodes.

Compared with the prior art, the invention has the following beneficial effects:

1. according to the invention, a shopping network is more finely modeled, a heterogeneous network comprising user nodes and commodity nodes is constructed, and the accuracy of commodity recommendation is effectively improved by considering heterogeneous information;

2. the invention does not need prior knowledge to preset the meta path, and is more convenient in practical application.

Drawings

FIG. 1 is a flow chart of the method of the present invention.

Detailed Description

The invention is further explained below with reference to the drawings and the embodiments.

Referring to fig. 1, the present invention provides a shopping network commodity recommendation method based on random walk heterogeneous attention, and provides a system, including: the system comprises a heterogeneous network construction module, a random walk module, an intra-class attention coefficient calculation module, an inter-class attention coefficient calculation module, a node expression vector optimization module and a clustering and commodity recommendation module;

step S1: the heterogeneous network construction module constructs a heterogeneous network comprising user nodes and commodity nodes according to shopping records from users who want to recommend commodities. Since the user may continuously purchase new commodities, the heterogeneous network construction module needs to be executed at intervals, so that the accuracy of subsequent commodity recommendation is improved;

step S2: the random walk module performs random walks on the heterogeneous network generated at step S1. Specifically, all nodes on the heterogeneous network are traversed, and each node is taken as a starting point of the migration, and multiple random migrations are performed. And recording the access times of each node in the walking process, arranging the nodes in a descending order, and selecting the previous node as a closely related neighbor node set of a starting point. The selected ratio is controlled by a hyper-parameter to find a suitable balance between insufficient information and introduced noise. And for the close neighbor node set of each node, the nodes are separated according to the user node and the commodity node, so that the processing of subsequent steps is facilitated.

Step S3: the intra-class attention coefficient calculation module is respectively executed in the user node neighbor set commodity node neighbor set for each node on the basis of the output of the random walk module in the step S2. In the user node neighbor set, the attention layer is used for calculating the attention coefficient of each node, finally, the vectors of each node are weighted and aggregated, and a user representation vector representing the user node neighbor is generated and represents the influence of the neighbor node of the user node type on the wandering starting point. Similarly, commodity expression vectors are independently generated in the commodity node set and represent the influence of the neighbor nodes of the commodity node types on the wandering starting point. The intra-class attention coefficient calculation module outputs two vectors, namely a user representation vector and a commodity representation vector.

Step S4: the inter-class attention coefficient calculating module calculates, for each node, the attention coefficients of the user representation vector and the commodity representation vector using the attention layer on the basis of the output of the intra-class attention calculating module in step S3, and represents different importance of the user node neighbor and the commodity node neighbor to the migration start point, and finally performs weighted aggregation to generate a final representation vector of the migration start point. By means of the method of classifying and processing according to the node types, heterogeneous information that nodes in a heterogeneous network have different types is considered, so that node expression vectors are generated more finely, and the quality of the node expression vectors is improved.

Step S5: the nodes represent a vector optimization module, the nodes represent vectors generated in the step S4, a loss function is calculated and propagated in reverse, and parameters in the inter-class attention coefficient calculation module in the step S3 and the intra-class attention coefficient calculation module in the step S4 are optimized by using a stochastic gradient descent method. By designing a loss function based on positive and negative samples, iterative optimization can be carried out in an unsupervised mode, and particularly, the node representation vector is similar to the nodes of the positive samples and is not similar to the nodes of the negative samples.

Step S6: the clustering and commodity recommending module processes the expression vectors of the commodity nodes by using a K-means algorithm on the basis of the optimized node expression vectors in the step S5 to obtain a commodity set with closely related inner parts. And aiming at the user who needs to recommend the commodities, determining an interested commodity set from the commodities in the shopping record of the user, and recommending the commodities in the commodity set to the user.

Preferably, in this embodiment, the step S2 specifically includes the following steps:

step S21: each node on the heterogeneous network can be used as a starting point of random walk, and a neighbor node set closely related to the node is searched through multiple random walks. Meanwhile, the accessed nodes and the access times are recorded in the walking process and are arranged in a descending order according to the accessed frequency.

Step S22: from the result obtained in step S21, the node with the highest access frequency is selected as the input of the subsequent processing. The specific selection ratio is controlled by the hyperparameter β. For example, when the value of the parameter β is set to 0.1, only the top 10% of the nodes obtained in step S21 are selected as the input of the subsequent processing.

Preferably, in this embodiment, the step S3 specifically includes the following steps:

step S31: and classifying the closely related node sets output in the step S22 according to the types of the nodes, dividing the node sets into a user node set and a commodity node set, and calculating the intra-class attention coefficient in each type.

Step S32: within the same node type, we use attention to aggregate information of different nodes, which is called an intra-class attention mechanism. In this way, a representation vector representing this type of neighbor influence is generated. Assuming that the type of the currently processed node is t, the node v in the type_iAnd node v_jThe attention coefficient therebetween is calculated as follows:

where | | represents join operations, W is a weight matrix, LeakReLU is an activation function, and vector α is a parameter that may be learned in training. h is_iIs node v_iThe hidden layer vector of (1).

Where σ denotes a non-linear sigmoid function and W is a weight matrix. In addition, in order to make the result more stable, a multi-head attention mechanism is used, and the final output can be the average of a plurality of attention head output vectors or directly spliced.

Preferably, in this embodiment, the step S4 specifically includes the following steps:

step S41: and aggregating the user node representation vectors and the commodity node representation vectors output in the step S33 by using another attention layer, wherein a specific calculation formula is as follows:

where the vector gamma is one of the parameters learned in the training. W is the weight matrix used for the linear transformation and b is the offset vector that acts as the offset. Attention coefficient a of node type t_tLike the injection force coefficient in the formula (4-1)

Also, the softmax function is used for normalization.

Preferably, in this embodiment, the step S5 specifically includes the following steps:

step S51: for unsupervised associated commodity set discovery tasks, the following loss function is used as the objective function for training:

wherein

Is shown at the sum node v_iSet of closely related nodes N_iAnd the node type is t. Theta represents the parameters of the model, and the definition of p (j | i; theta) is shown in the following formula.

Step S52: in practice, the calculation of the formula in step S51 is time consuming, and uses a negative sampling technique to speed up, and the final loss function is as follows:

where node k is a set of nodes that are not present

Negative sample node in (1).

Step S6: and (3) processing the expression vectors of the commodity nodes by using a K-means algorithm, and minimizing a cost function by using an iterative optimization method to obtain a commodity set with closely related internal parts, wherein the optimization function is as follows:

wherein u is_jRepresenting the vector, x, for the center in the set of items C_iRepresenting vectors of all commodities in the commodity set, wherein n is the number of the commodities in the commodity set;

The above description is only a preferred embodiment of the present invention, and all equivalent changes and modifications made in accordance with the claims of the present invention should be covered by the present invention.

Claims

1. A shopping network commodity recommendation method based on random walk heterogeneous attention is characterized by comprising the following steps:

2. The shopping network commodity recommendation method based on random walk heterogeneous attention of claim 1, wherein the step S3 is specifically:

Where σ denotes a non-linear sigmoid function and W is a weight matrix.

3. The shopping network commodity recommendation method based on random walk heterogeneous attention of claim 1, wherein the step S32 is specifically: if the type of the currently processed node is t, the node v in the type_iAnd node v_jThe attention coefficient therebetween is calculated as follows:

4. The shopping network commodity recommendation method based on random walk heterogeneous attention of claim 1, wherein the step S33 uses a multi-head attention mechanism, and the final output is an average of a plurality of attention head output vectors or is directly spliced.

5. The shopping network commodity recommendation method based on random walk heterogeneous attention of claim 1, wherein the step S4 is specifically:

wherein the vector γ is one of the parameters learned in training; w is the weight matrix used for the linear transformation, and b is the offset vector that plays a role in the offset; attention of node type tCoefficient a_tNormalization is performed using a softmax function;

6. The shopping network commodity recommendation method based on random walk heterogeneous attention according to claim 1, wherein the step S5 specifically comprises the steps of:

wherein

where node k is a set of nodes that are not present

Negative sample node in (1).

7. The shopping network commodity recommendation method based on random walk heterogeneous attention according to claim 1, wherein the step S6 specifically comprises the steps of:

8. A shopping network commodity recommendation system based on random walk heterogeneous attention is characterized by comprising a heterogeneous network construction module, a random walk module, an intra-class attention coefficient calculation module, an inter-class attention coefficient calculation module, a node expression vector optimization module and a clustering and commodity recommendation module,

the clustering and commodity recommending module processes the expression vectors of the commodity nodes by using a K-means algorithm, for example, on the basis of the node expression vectors optimized by the node expression vector optimizing module to obtain a commodity set with closely related inner parts, determines an interested commodity set from commodities in a shopping record of a user who needs commodity recommendation, and recommends the commodities in the commodity set to the user.

9. The shopping network commodity recommendation system based on random walk heterogeneous attention of claim 8, wherein the random walk module is specifically: traversing all nodes on the heterogeneous network, taking each node as a starting point of the migration, performing multiple random migrations, recording the access times of each node in the migration process, arranging the access times in a descending order, and selecting the previous node as a closely related neighbor node set of the starting point; the selected proportion is controlled by a hyper-parameter, and a proper balance is searched in insufficient information and introduced noise; and for the close neighbor node set of each node, separating the nodes according to the user nodes and the commodity nodes.