CN112347362A

CN112347362A - Personalized recommendation method based on graph self-encoder

Info

Publication number: CN112347362A
Application number: CN202011283015.5A
Authority: CN
Inventors: 吴国栋; 刘玉良; 李方; 涂立静; 李景霞; 王伟娜
Original assignee: Anhui Agricultural University AHAU
Current assignee: Anhui Agricultural University AHAU
Priority date: 2020-11-16
Filing date: 2020-11-16
Publication date: 2021-02-09
Anticipated expiration: 2040-11-16
Also published as: CN112347362B

Abstract

The invention discloses a personalized recommendation method based on a graph self-encoder, which comprises the steps of constructing an adjacent matrix by utilizing the interaction behavior of a user and an article, carrying out normalization processing on the adjacent matrix, carrying out convolution operation by using a graph convolution network, and obtaining hidden layer representation of a node; obtaining an initial feature vector of each node by using a user comment text and an article description text as a source of node information, aggregating neighbor node features by using a graph attention network, and updating the node information; constructing a full-connection network by using the attribute characteristics of users and articles to calculate to obtain hidden layer characteristics; and splicing the hidden layer characteristics to obtain new node information, constructing a full-connection network, encoding, reconstructing scores of the user on the articles as prediction scores by using a bilinear decoder, and generating a recommended article list by adopting Top-N recommendation aiming at the obtained prediction scores. The invention can more accurately help to analyze the preference degree of the user to the articles and find the attention point of the user, thereby carrying out more effective recommendation.

Description

Personalized recommendation method based on graph self-encoder

Technical Field

The invention relates to the technical field of text classification, deep learning and recommendation system research, in particular to a personalized recommendation method based on a graph self-encoder.

Background

With the continuous development of internet technology, network information is growing explosively, but the rapid growth also brings information overload problem. Therefore, the process of accurately finding out the real interest of the user from a large amount of information is of great importance, and the proposal of the personalized recommendation algorithm is used for solving the problem and gradually becomes a great hot field in the development prospect of the internet.

The traditional recommendation method such as the collaborative filtering recommendation technology generally utilizes the scoring data of the user on the articles to acquire the user preference, although the algorithm is simple and easy to implement, the input data is single, and other useful historical behavior data of the user are not fully utilized, so that the user preference information cannot be intuitively and comprehensively acquired, and the recommendation effect is to be improved urgently. With the development of deep learning, more and more neural network algorithms are applied to a recommendation system, and therefore, a recommendation method using a graph neural network is proposed in the patent. The graph structure can contain a large amount of information, not only contains information of each node, but also contains interactive information between the node and a neighbor node, and the information contained in the node and the interactive information between the node and the neighbor node can be more prominently displayed through a graph convolution network and a graph attention network and the aggregation updating of the node in the graph. The graph neural network is applied to the recommendation algorithm, and the preference degree of the user on the articles can be accurately obtained through a deep learning method, so that the recommendation accuracy is improved.

In the existing recommendation technology, user preference is generally captured and recommended based on scoring information or interactive information such as user browsing and purchasing records, but the attribute characteristics of a user and an article are rarely considered and fused, and comment information of the user on the article is rarely considered, so that the recommendation effect is not ideal enough. The following problems are mainly faced in the current collaborative filtering-based recommendation system:

(1) only single scoring data, attribute information of the user and the article, and comment information of the user on the article are used less;

(2) the same score given by the user does not identify the user's points of interest well. For example, when a user scores 5 items of the same type, some users pay attention to quality, and some users pay attention to price, it is difficult for a general recommendation system to recognize such differences.

Disclosure of Invention

The invention aims to make up the defects of the prior art and provides a personalized recommendation method based on a graph self-encoder. And Top-N recommendation is adopted, so that the accuracy and recall rate of recommendation are improved.

The invention is realized by the following technical scheme:

a personalized recommendation method based on a graph self-encoder comprises the following specific steps:

step 1, constructing an encoder fusing a graph convolution network, a graph attention network and a full-connection network, and encoding user related information by aggregating information of neighbor nodes;

step 2, constructing an encoder fusing a graph convolution network, a graph attention network and a full-connection network, and encoding article related information by aggregating information of neighbor nodes;

step 3, constructing a bilinear decoder by using the information related to the user and the article coded in the

steps

1 and 2, and reconstructing the scoring condition of the user on the article;

and 4, performing Top-N sequencing on the articles by using the scores of the articles reconstructed by the user in the step 3, and selecting the Top N articles to recommend to the user.

The encoder for constructing the fusion graph convolution network, the graph attention network and the full-connection network in the step 1 encodes the user related information by aggregating the information of the neighbor nodes, and specifically comprises the following steps:

step 1.1, constructing a GCN graph convolution network, and learning hidden layer representation of a user node by aggregating neighbor node information;

step 1.1.1, establishing a user-article sparse adjacency matrix for interaction conditions under different scores by using scores of the interaction conditions of the user and the articles, and carrying out normalization processing on the adjacency matrix:

wherein the content of the first and second substances,

representing a normalized user-item adjacency matrix with a score of i, D_userRepresenting a diagonal matrix of degrees for each user node,

representing an unnormalized user-item adjacency matrix;

step 1.1.2, constructing a GCN layer for convolution operation for interaction conditions under different grades to obtain hidden layer characteristics H of each user node^GCN_user：

Where σ (·) denotes the ReLU activation function; the | indicates a splicing operation and,

wherein: w_i ^GCN_userRepresenting a weight parameter with a score of i for performing convolution operation on the user node;

step 1.2, constructing a GAT (generic object transform) graph attention network, and learning hidden layer representation from comments of a user on an article;

step 1.2.1, expressing the comment text into a comment feature vector by using word2vec and an average vectorization method by using the comment text of a user:

wherein, U_iThe node represents the comment feature vector, U (word), of the ith user node_i) Word used by user_iUsing word2vec to vectorize to obtain word vectors, wherein N is the number of words in the comment text;

the same is that:

wherein, I_jThe nodes represent descriptive feature vectors, I (word), for the jth item node_j) Word used for describing articles_jUsing word2vec to vectorize to obtain word vectors, wherein N is the number of words in the comment text;

step 1.2.2, compute node U_iWith its neighbour node I_jThe correlation degree of (c):

wherein the content of the first and second substances,

is the node U after normalization processing_iAnd node I_jThe degree of correlation of (c); i is_j∈N(U_i)，N(U_i) Representing the item set purchased by the ith user; w is a^GAT_userWeight parameter for transformation of user node information, a_userAs a weight parameter, LeakyReLU (-) is an activation function; exp (·) represents an exponential function with e as base;

step 1.2.3, aggregating neighbor node information for updating user node U_iNode information of (2):

step 1.3, constructing a full-connection network, and learning hidden layer representation from user attribute characteristics;

step 1.3.1, normalization processing is carried out on continuous data by utilizing attribute feature information of a user, one-hot coding is carried out on discrete data, and the length of the processed attribute feature vector is aligned with the length of the attribute feature vector of the article in a way of filling zero at the tail.

Step 1.3.2, the processed user attribute feature vector passes through a full-connection network to obtain hidden layer features H of each user attribute^Dense_user：

H^Dense_user＝σ(P^user·W^Dense_user+b^Dense_user) (8)

Where σ (·) denotes the ReLU activation function; p^userRepresenting the user attribute characteristics processed in the step 1.3.1; w^Dense_userA weight parameter representing a fully connected network for handling user characteristics; b^Dense_userA bias term representing a fully connected network for processing user characteristics;

step 1.4, learning hidden layer characteristics H from different information in step 1.1, step 1.2 and step 1.3^GCN_user、H^GAT_user、H^Dense_userSplicing together as hidden layer characteristic of user, and obtaining coding result E of coder through a full-connection network for coding^user：

E^user＝σ([H^GCN_user|H^GAT_user|H^Dense_user]·W^E_user+b^E_user) (9)

Where σ (·) denotes the ReLU activation function; i represents a splicing operation; w^E_userA weight parameter representing a fully connected network for encoding user information; b^E_userA bias term representing a fully connected network for encoding user information;

the encoder for constructing the fusion graph convolution network, the graph attention network and the full-connection network in the step 2 encodes the article related information by aggregating the information of the neighbor nodes, and specifically comprises the following steps:

step 2.1, constructing a GCN graph convolution network, and learning hidden layer representation of an article node by aggregating information of neighbor nodes (user nodes);

step 2.1.1, constructing an article-user sparse adjacency matrix for the interaction conditions under different scores by using the interaction conditions (scores) of the user and the article, and normalizing the adjacency matrix:

wherein the content of the first and second substances,

expressing the normalized article-user adjacency matrix with score i, D_itemRepresenting a diagonal matrix of degrees of each item node,

representing an unnormalized item-user adjacency matrix;

step 2.1.2, constructing a GCN layer for convolution operation for interaction conditions under different grades to obtain hidden layer characteristics H of each article node^GCN_item：

Where σ (·) denotes the ReLU activation function; i represents splicingIn the operation of the method, the operation,

W_i ^GCN_itemrepresenting a weight parameter with a score of i for performing convolution operation on the article node;

step 2.2, constructing a GAT graph attention network, and learning hidden layer representation from comments of users on articles;

step 2.2.1, by using the comment text of the user, expressing the comment text into a comment feature vector by using word2vec and an average vectorization method:

the same is that:

step 2.2.2, compute node I_jWith its neighbour node U_iThe correlation degree of (c):

wherein the content of the first and second substances,

is the node I after normalization processing_jAnd node U_iThe degree of correlation of (c); u shape_i∈N(I_j)，N(I_j) A set of users representing purchases of a jth item; w is a^GAT_itemWeight parameter for item node information transformation, a_itemAs a weight parameter, LeakyReLU (-) is an activation function; exp (·) represents an exponential function with e as base;

step 2.2.3, aggregating neighbor node information for updating item node I_jNode information of (2):

step 2.3, constructing a full-connection network, and learning hidden layer representation from the article attribute characteristics;

step 2.3.1, normalization processing is carried out on the continuous data by utilizing the attribute feature information of the article, one-hot coding is carried out on the discrete data, and the processed attribute feature vector length is aligned with the attribute feature vector length of the user in a front zero filling mode;

step 2.3.2, the processed article attribute feature vectors pass through a full-connection network to obtain hidden layer features H of each article attribute^Dense_item：

H^Dense_item＝σ(P^item·W^Dense_item+b^Dense_item) (17)

Where σ (·) denotes the ReLU activation function; p^itemRepresenting the attribute characteristics of the article processed in the step 2.3.1; w^Dense_itemA weight parameter representing a fully connected network for processing characteristics of the item; b^Dense_itemA bias term representing a fully connected network for processing item features;

step 2.4, learning hidden layer characteristics H from different information in step 2.1, step 2.2 and step 2.3^GCN_item、H^GAT_item、H^Dense_itemSplicing together as hidden layer characteristic of the article and obtaining the coding result E of the coder through a full-connection network for coding^item：

E^item＝σ([H^GCN_item|H^GAT_item|H^Dense_item]·W^E_item+b^E_item) (18)

Where σ (·) denotes the ReLU activation function; i represents a splicing operation; w^E_itemA weight parameter representing a fully connected network for encoding item information; b^E_itemA bias term representing a fully connected network for encoding item information;

step 3, constructing a bilinear decoder by using the information about the user and the article coded in step 1 and step 2, and reconstructing the scoring condition of the user on the article, specifically calculating as follows:

y_hat＝(embedding1|embedding2)·W_classifier (19)

wherein y _ hat represents the rating of the item by the reconstructed user; i represents a splicing operation, and:

embedding1＝sum(E^userW₁)*E^item (20)

embedding2＝sum(E^userW₂)*E^item (21)

where denotes the hadamard product and sum (-) denotes summing each row of the matrix; w_classifier、W₁And W₂Are the weight parameters in a bilinear decoder.

The invention has the advantages that: 1. the method and the system not only utilize the scores as the interaction condition of the user and the articles, but also utilize the comment text content of the user to the articles and the attribute characteristics of the user and the articles, and integrate various information for discovering the user interest, and compared with the traditional recommendation method using single score data, the method and the system can carry out more reasonable and accurate recommendation;

2. the encoder part (graph convolution network, graph attention network and full connection network) constructed by the invention can effectively obtain the preference degree of the user to the article;

3. according to the method, the mutual information and comment text information in the graph are aggregated by using a graph convolution network and a graph attention network, the attribute characteristics are processed by using a full-connection network to obtain new node information of each node, the full-connection network is used for coding, and finally the score of a user on an article is reconstructed by a bilinear decoder.

Drawings

FIG. 1 is a user graph convolutional network model;

FIG. 2 is a user graph attention network model;

FIG. 3 is a fully connected network that handles user attribute features;

FIG. 4 is a user encoder that merges a graph convolution network, a graph attention network, and a fully connected network;

FIG. 5 is a diagram of a convolutional network model (article);

FIG. 6 is a diagram of an attention network model (article);

FIG. 7 is a fully connected network that handles item attribute features;

FIG. 8 is an article encoder incorporating a graph convolution network, a graph attention network, and a fully connected network;

fig. 9 is a schematic structural diagram of the present invention.

Detailed Description

A personalized recommendation method based on a graph self-encoder is characterized in that an adjacency matrix is constructed by utilizing the interaction behavior of a user and an article, normalization processing is carried out on the adjacency matrix, convolution operation is carried out by using a graph convolution network, and hidden layer representation of a node is obtained; aggregating the characteristics of the neighbor nodes by using the comment text and the article description text and using the attention network, thereby updating the node information; constructing a full-connection network by using the attribute characteristics of users and articles to calculate to obtain hidden layer characteristics; and splicing the hidden layer characteristics obtained by the calculation of the three networks to obtain new node information, constructing a full-connection network, and encoding the information. And then reconstructing the user's score for the item using a bilinear decoder. And selecting the items with high preference degree according to the reconstructed prediction scores of the user on each item to generate a recommendation list.

As shown in fig. 9, in this embodiment, a method for personalized recommendation based on a graph self-encoder is performed according to the following steps:

step 1.1, constructing a GCN graph convolution network, and learning hidden layer representation of a user node by aggregating neighbor node (commodity node) information, as shown in FIG. 1.

Step 1.1.1, constructing a user-article sparse adjacency matrix for the interaction conditions under different scores by using the interaction conditions (scores) of the user and the articles, and normalizing the adjacency matrix:

wherein the content of the first and second substances,

representing an unnormalized user-item adjacency matrix.

wherein: w_i ^GCN_userAnd representing the weight parameter with the score of i for performing convolution operation on the user node.

Step 1.2, constructing a GAT (goal oriented technology) graph attention network, and learning hidden layer representation from comments of users on articles, as shown in figure 2.

similarly:

wherein, I_jThe nodes represent descriptive feature vectors, I (word), for the jth item node_j) Word used for describing articles_jAnd (5) using word2vec to vectorize the obtained word vector, wherein N is the number of words in the comment text.

wherein the content of the first and second substances,

is the node U after normalization processing_iAnd node I_jThe degree of correlation of (c); i is_j∈N(U_i)，N(U_i) Representing the item set purchased by the ith user; w is a^GAT_userWeight parameter for transformation of user node information, a_userAs a weight parameter, LeakyReLU (-) is an activation function; exp (. cndot.) represents an exponential function with e as base.

step 1.3, constructing a full-connection network, and learning hidden layer representation from the user attribute characteristics, as shown in fig. 3.

H^Dense_user＝σ(P^user·W^Dense_user+b^Dense_user) (8)

Where σ (·) denotes the ReLU activation function; p^userRepresenting the user attribute characteristics processed in the step 1.3.1; w^Dense_userA weight parameter representing a fully connected network for handling user characteristics; b^Dense_userA bias term representing a fully connected network for handling user characteristics.

E^user＝σ([H^GCN_user|H^GAT_user|H^Dense_user]·W^E_user+b^E_user) (9)

Where σ (·) denotes the ReLU activation function; i represents a splicing operation; w^E_userA weight parameter representing a fully connected network for encoding user information; b^E_userAn offset term representing a fully connected network used to encode user information. As shown in fig. 4.

and 2.1, constructing a GCN graph convolution network, and learning hidden layer representation of the article nodes by aggregating information of neighbor nodes (user nodes), as shown in FIG. 5.

wherein the content of the first and second substances,

representing an unnormalized item-user adjacency matrix.

Where σ (·) denotes the ReLU activation function;the | indicates a splicing operation and,

W_i ^GCN_itema weight parameter is represented that scores i below for the convolution operation on the item node.

And 2.2, constructing a GAT (goal oriented technology) graph attention network, and learning hidden layer representation from the comments of the user on the article, as shown in FIG. 6.

similarly:

wherein the content of the first and second substances,

is the node I after normalization processing_jAnd node U_iThe degree of correlation of (c); u shape_i∈N(I_j)，N(I_j) A set of users representing purchases of a jth item; w is a^GAT_itemWeight parameter for item node information transformation, a_itemAs a weight parameter, LeakyReLU (-) is an activation function; exp (. cndot.) represents an exponential function with e as base.

and 2.3, constructing a full-connection network, and learning hidden layer representation from the article attribute characteristics, as shown in fig. 7.

And 2.3.1, performing normalization processing on the continuous data by utilizing the attribute feature information of the article, performing one-hot coding on the discrete data, and aligning the length of the processed attribute feature vector with the length of the attribute feature vector of the user in a front zero filling mode.

H^Dense_item＝σ(P^item·W^Dense_item+b^Dense_item) (17)

Where σ (·) denotes the ReLU activation function; p^itemRepresenting the attribute characteristics of the article processed in the step 2.3.1; w^Dense_itemA weight parameter representing a fully connected network for processing characteristics of the item; b^Dense_itemA bias term representing a fully connected network for processing characteristics of an item.

E^item＝σ([H^GCN_item|H^GAT_item|H^Dense_item]·W^E_item+b^E_item) (18)

Where σ (·) denotes the ReLU activation function; i represents a splicing operation; w^E_itemA weight parameter representing a fully connected network for encoding item information; b^E_itemA bias term representing a fully connected network for encoding item information. As shown in fig. 8.

And 3, constructing a bilinear decoder, and reconstructing the scoring condition of the user on the article.

Constructing a bilinear decoder, reconstructing the scoring condition of the user on the article, and specifically calculating as follows:

y_hat＝(embedding1|embedding2)·W_classifier (19)

embedding1＝sum(E^userW₁)*E^item (20)

embedding2＝sum(E^userW₂)*E^item (21)

Claims

1. A personalized recommendation method based on a graph self-encoder is characterized in that: the method comprises the following specific steps:

step 3, constructing a bilinear decoder by using the information related to the user and the article coded in the steps 1 and 2, and reconstructing the scoring condition of the user on the article;

2. The personalized recommendation method based on graph self-encoder as claimed in claim 1, wherein: the encoder for constructing the fusion graph convolution network, the graph attention network and the full-connection network in the step 1 encodes the user related information by aggregating the information of the neighbor nodes, and specifically comprises the following steps:

wherein the content of the first and second substances,

representing an unnormalized user-item adjacency matrix;

step 1.1.2, interaction conditions under different scores are obtainedConstructing a GCN layer according to conditions to carry out convolution operation to obtain hidden layer characteristics H of each user node^GCN_user：

the same is that:

wherein, I_jThe nodes represent descriptive feature vectors, I (word), for the jth item node_j) For articles describedWord_jUsing word2vec to vectorize to obtain word vectors, wherein N is the number of words in the comment text;

wherein the content of the first and second substances,

is the node U after normalization processing_iAnd node I_jThe degree of correlation of (c); i is_j∈N(U_i)，N(U_i) Representing the item set purchased by the ith user; w is a^GAT_userWeight parameter for transformation of user node information, a_userFor the weight parameter, Leaky ReLU (-) is the activation function; exp (·) represents an exponential function with e as base;

step 1.3.1, carrying out normalization processing on continuous data by utilizing attribute feature information of a user, carrying out one-hot coding on discrete data, and aligning the length of a processed attribute feature vector with the length of an attribute feature vector of an article in a way of filling zero at the tail;

H^Dense_user＝σ(P^user·W^Dense_user+b^Dense_user) (8)

Where σ (·) denotes the ReLU activation function; p^userRepresenting the user attribute characteristics processed in the step 1.3.1; w^Dense ^_userA weight parameter representing a fully connected network for handling user characteristics; b^Dense_userA bias term representing a fully connected network for processing user characteristics;

step 1.4, learning hidden layer characteristics H from different information in step 1.1, step 1.2 and step 1.3^GCN_user、H^GAT ^_user、H^Dense_userSplicing together as hidden layer characteristic of user, and obtaining coding result E of coder through a full-connection network for coding^user：

E^user＝σ([H^GCN_user|H^GAT_user|H^Dense_user]·W^E_user+b^E_user) (9)

Where σ (·) denotes the ReLU activation function; i represents a splicing operation; w^E_userA weight parameter representing a fully connected network for encoding user information; b^E_userAn offset term representing a fully connected network used to encode user information.

3. The personalized recommendation method based on graph self-encoder as claimed in claim 2, wherein: the encoder for constructing the fusion graph convolution network, the graph attention network and the full-connection network in the step 2 encodes the article related information by aggregating the information of the neighbor nodes, and specifically comprises the following steps:

wherein the content of the first and second substances,

representing an unnormalized item-user adjacency matrix;

wherein: w_i ^GCN_itemRepresenting a weight parameter with a score of i for performing convolution operation on the article node;

wherein, U_iThe node represents the ith user sectionComment feature vector of points, U (word)_i) Word used by user_iUsing word2vec to vectorize to obtain word vectors, wherein N is the number of words in the comment text;

the same is that:

wherein the content of the first and second substances,

is the node I after normalization processing_jAnd node U_iThe degree of correlation of (c); u shape_i∈N(I_j)，N(I_j) A set of users representing purchases of a jth item; w is a^GAT_itemWeight parameter for item node information transformation, a_itemFor the weight parameter, Leaky ReLU (-) is the activation function; exp (·) represents an exponential function with e as base;

H^Dense_item＝σ(P^item·W^Dense_item+b^Dense_item) (17)

Where σ (·) denotes the ReLU activation function; p^itemRepresenting the attribute characteristics of the article processed in the step 2.3.1; w^Dense ^_itemA weight parameter representing a fully connected network for processing characteristics of the item; b^Dense_itemA bias term representing a fully connected network for processing item features;

step 2.4, learning hidden layer characteristics H from different information in step 2.1, step 2.2 and step 2.3^GCN_item、H^GAT ^_item、H^Dense_itemSplicing together as hidden layer characteristic of the article and obtaining the coding result E of the coder through a full-connection network for coding^item：

E^item＝σ([H^GCN_item|H^GAT_item|H^Dense_item]·W^E_item+b^E_item) (18)

Where σ (·) denotes the ReLU activation function; i represents a splicing operation; w^E_itemA weight parameter representing a fully connected network for encoding item information; b^E_itemA bias term representing a fully connected network for encoding item information.

4. The personalized recommendation method based on graph self-encoder as claimed in claim 3, wherein: step 3, constructing a bilinear decoder by using the information about the user and the article coded in step 1 and step 2, and reconstructing the scoring condition of the user on the article, specifically calculating as follows:

y_hat＝(embedding1|embedding2)·W_classifier (19)

embedding1＝sum(E^userW₁)*E^item (20)

embedding2＝sum(E^userW₂)*E^item (21)