CN114880583A

CN114880583A - Cross-domain social recommendation method based on self-supervision learning

Info

Publication number: CN114880583A
Application number: CN202210511621.0A
Authority: CN
Inventors: 吴乐; 罗苏常; 杨永晖; 张琨; 孙培杰; 洪日昌; 张志强; 周俊
Original assignee: Hefei University of Technology
Current assignee: Hefei University of Technology
Priority date: 2022-05-11
Filing date: 2022-05-11
Publication date: 2022-08-09
Anticipated expiration: 2042-05-11
Also published as: CN114880583B

Abstract

The invention discloses a cross-domain social product recommendation method based on self-supervision learning, which comprises the following steps: 1. constructing heterogeneous data: a scoring matrix of the product by the user and a social relationship matrix between the user and the user; 2. processing the heterogeneous data through a heterogeneous graph construction layer to obtain a heterogeneous graph network of a user-product; 3. obtaining a user cooperation matrix and a product cooperation matrix through a single hot coding layer; 4. performing feature propagation through the graph convolution layer; 5. constructing a node feature aggregation layer; 6. constructing a node prediction layer to recommend products; 7. and constructing a self-supervision learning layer to carry out mutual information maximization learning on the local characteristics and the global characteristics of the nodes until the product recommendation effect of the social cold start user in the social domain is optimal. The method can fully mine the global information of the whole abnormal graph, reduce the dependence of the model on the number of bridge users and learn better bridge user characteristics, thereby improving the product recommendation performance on social cold-start users.

Description

Cross-domain social recommendation method based on self-supervision learning

Technical Field

The invention relates to the field of cross-domain recommendation and social recommendation, in particular to a cross-domain social recommendation method based on self-supervision learning.

Background

Social recommendation effectively alleviates the problems of data sparsity, system cold start and the like. According to social homogeneity theory, users tend to buy products recommended by their friends rather, and there are often more similar interest preferences among friends. Therefore, compared with the traditional collaborative filtering recommendation algorithm, the social recommendation method has more information so as to more effectively model the interest preference of the user and improve the recommendation performance of the recommendation system. However, current social recommendation methods all have a strict constraint that a user must have a social relationship based on historical product interaction records, so that the user can learn the potential interest and preference of the user, which is not in accordance with a real application scenario. In a real scenario, most users often have only social relationships without historical product interaction records, and we define this group as a social cold-start user. Therefore, how to recommend products to a large number of social cold-start users becomes a difficult problem which needs to be solved urgently at present.

One possible approach is to use the idea of cross-domain recommendation to address the problem of product recommendation to a large number of social cold-start users. The main idea of cross-domain recommendation is to use a source domain with richer information content as an auxiliary domain and transfer valuable information to a target domain, so as to improve the performance of a recommendation system on the target domain. However, the current cross-domain recommendation methods are not suitable for the proposed cross-domain social recommendation scenario, because the methods depend on the number of bridge users to a large extent, which may cause that when the number of bridge users is small, the model becomes fragile, and the recommendation effect on the social cold-start user is not accurate. Therefore, how to capture more information from a very limited bridge user is the focus of the present invention.

Disclosure of Invention

The invention provides a cross-domain social recommendation method based on self-supervision learning, aiming at overcoming the defects of the prior art, so that the global information of the whole abnormal graph can be fully mined, the dependence of a model on the number of bridge users is reduced, and the bridge user representation with better learning is realized, thereby improving the product recommendation performance on social cold-start users.

The invention adopts the following technical scheme for solving the technical problems:

the invention relates to a cross-domain social product recommendation method based on self-supervision learning, which is characterized by comprising the following steps of:

step 1, constructing heterogeneous data, comprising: a scoring matrix R of the product by the user and a social relationship matrix S between the users are as follows:

let U ^I Represents a set of users over an information domain, and

wherein the content of the first and second substances,

indicating the a-th user in the information field,

representing the b-th user in the information domain, M represents the total number of users in the information domain, a is more than or equal to 1, and b is more than or equal to M;

let V denote the product set on the information field, and V ═ V ₁ ,…v _i ,…v _j ,…v _L In which v is _i Indicating the ith product in the information field, v _j Indicating the jth product in the information field, L indicating the letterTotal number of products on the domain;

let R _ai Representing the a-th user

For the ith product v _i The scoring value of (2) is that the scoring matrix of the product by the user on the information domain is R ═ { R ═ R _ai } _M×L ；

Let U ^S Represents a set of users on a social domain,

wherein the content of the first and second substances,

representing the c-th user in the social domain,

representing the d-th user in the social domain, N representing the total number of users in the social domain, c being more than or equal to 1, and d being more than or equal to N;

order S _cd Represents the c-th user

For the d-th user

If there is a connection between two users, let S _cd If not, let S _cd 0, the social relationship matrix between users in the social domain is S ═ S _cd } _N×N ；

Step 2, utilizing a bridge user set U coexisting in an information domain and a social domain ^B ＝U ^I ∩U ^S Constructing a heterogeneous graph network G ═ G ^I ∪G ^S Wherein G is ^I Representing the set U of all users in the information domain ^I Graph network Structure with product set V, G ^S Representing a set U of all users in a social domain ^S A graph network structure for collecting U-shaped bridge users ^B The total number of users in (1) is T;

and 3, obtaining a node cooperation matrix through single hot coding, wherein the method comprises the following steps: user cooperation matrix, product cooperation matrix:

step 3.1, constructing a user cooperation matrix P ═ P in a single hot coding mode for all users in the heterogeneous graph network G ₁ ,…,p _a ,…,p _M-T ,…,p _b ,…,p _c ,…,p _M ,…,p _d ,…,p _M+N-T And P ∈ R ^(M+N-T)×d Wherein p is _a Representing a d-dimensional cooperation vector of an a-th user in the user cooperation matrix P, wherein the a-th user belongs to a non-bridge user in the information domain; p is a radical of _b Representing the d-dimensional cooperation vector of the b-th user in the user cooperation matrix P, wherein the b-th user belongs to the bridge user, P _d Representing a d-dimensional cooperation vector of a d-th user in the user cooperation matrix P, wherein the d-th user belongs to a non-bridge user in the social domain;

step 3.2, constructing a product cooperation matrix Q ═ Q in a single-hot coding mode for all products in the heteromorphic graph network G ₁ ,…,q _i ,…q _L And Q ∈ R ^L×d Wherein q is _i A d-dimensional collaborative vector representing an ith product;

and 4, carrying out characteristic propagation through the graph volume layer:

step 4.1, making the graph convolution layers have K convolution layers, and initializing the number K of the current convolution layers to be 0;

step 4.2, obtaining the d-dimensional cooperation vector p of the a-th non-bridge user in the information domain according to the formula (1), the formula (2) and the formula (3) respectively _a D-dimensional cooperation vector p of the b-th bridge user _b D-dimensional cooperation vector p of the d-th non-bridge user in social domain _d After passing through the (k + 1) th graph convolution layer, the corresponding user characteristic vector is obtained

Obtaining a d-dimensional cooperative vector q of the ith product according to the formula (4) _i The product characteristic vector obtained after passing through the (k + 1) th graph convolution layer

In the formula (1), R _a Representing the set of products that the a-th non-bridge user interacted with,

is the d-dimensional cooperation vector p of the a-th non-bridge user in the information domain _a When k is 0, let the user characteristic vector output from the k-th convolution layer

A user feature vector is initialized randomly;

in the formula (2), R _b Representing the set of products, S, interacted by the b-th bridge user _b Representing a set of users with social connections to the b-th bridge user,

is d-dimensional cooperation vector p of the b-th bridge user _b When the user characteristic vector output by the k-th convolutional layer is equal to 0,

a user feature vector is initialized randomly;

in the formula (3), S _d Representing a set of users for which the d-th non-bridge user has social connections,

the user feature vector is output by the d-th non-bridge user in the social domain at the kth layer convolutional layer, when k is 0,

a user feature vector is initialized randomly;

in the formula (4), R _i Representing a set of users who have interacted with the ith product,

is the d-dimensional cooperation vector q of the ith product _i When the characteristic vector of the product output by the k-th layer convolution layer is 0,

the feature vector of the product is initialized randomly;

4.3, assigning K +1 to K, judging whether K is greater than K, and if so, executing the step 5; otherwise, returning to the step 4.2 for sequential execution;

step 5, constructing a node feature aggregation layer:

performing feature aggregation on the K-layer user feature vector and the product feature vector according to the formula (5) to obtain a feature vector p of any user u in the heterogeneous network G _u And the feature vector q of the ith product _i ：

In the formula (5), AVG represents the average pooling operation,

a feature vector representing the user u output by the kth convolutional layer,

i-th product q representing the output of the k-th convolutional layer _i The feature vector of (2);

step 6, constructing a node prediction layer for product recommendation:

step 6.1, calculating the prediction score value of the user u on the ith product according to the formula (6)

Thereby obtaining a prediction scoring matrix of the product by the user

In the formula (6), <, > represents a vector inner product operation;

step 6.2, establishing a supervision loss function L according to the formula (7) _R ：

In formula (7), σ (z) is a nonlinear activation function, and θ ═ P, Q]Is the parameter to be optimized,. psi. ^I ＝{(u,i,j)|u∈U ^I ,i∈R _u ,j∈(V-R _u ) Is the training data of all users on the information domain; r _u Representing a set of products representing the u-th user interaction;

representing the predicted value of credit of the jth product to the jth user;

and 7, constructing an automatic supervision learning layer to carry out mutual information maximization learning on the local characteristics and the global characteristics of the nodes:

step 7.1, constructing the d user in the social domain according to the formula (8) and the formula (9)

Local feature h of _d And the a-th user in the information domain

Local feature h of _ai ：

h _d ＝σ(p _d ) (8)

h _ai ＝[σ(p _a )，σ(q _i )] (9)

In the formula (8), sigma is sigmoid nonlinear activation function;

in the formula (9), [, ] is the splicing operation of the vectors;

step 7.2, constructing global features g of all users in the social domain according to the formula (10) and the formula (11) ^S And global characteristics g of all users in the information domain ^I ：

Step 7.3, constructing a discriminator D on the social domain according to the formula (12) and the formula (13) _S And a discriminator D on the information field _I ：

In the formula (12), h _S Local features expressed as users on the social domain; w _S ∈R ^d×d Is a trainable weight matrix, (h) _S ,g ^S ) Is a positive sample pair on the social domain;

in the formula (13), h _I Local features expressed as users on the information domain; w _I ∈R ^2d×2d Is a trainable weight matrix, (h) _I ,g ^I ) Is a positive sample pair over the information field;

and 7.4, randomly adding and deleting the same number of edges of the heterogeneous graph network G to form a disturbing graph

And obtaining local features on the fictitious social domain according to the processes of the step 4, the step 5 and the step 7.1

And local features on fictitious information domains

Thus forming negative sample pairs on the social domain

And negative example pairs on the information field

Step 7.5, constructing a social domain self-supervision loss function L according to the formula (14) and the formula (15) _S And information Domain autonomous loss function L _I ：

And 7.5, constructing a total loss function L according to the formula (16):

L＝L _R +αL _s +λL _I (16)

in the formula (16), α and λ are hyper-parameters for balancing two kinds of the auto-supervised loss functions;

7.6, solving all the loss functions L by using a gradient descent method to ensure that L converges to a minimum value, thereby obtaining an optimal parameter theta ^* ＝[P ^* ,Q ^* ]Further obtain the optimal user pre-predictionEvaluation matrix

And predicting a scoring matrix according to the optimal user

And recommending the product to the non-bridge user in the social domain.

Compared with the prior art, the invention has the beneficial effects that:

1. the invention provides a cross-domain social recommendation method based on self-supervision, which aims at social cold start users, utilizes bridge users to construct a heterogeneous graph network of an information domain and a social domain, learns potential cooperative information between the social cold start users and products through graph convolution operation, and utilizes a self-supervision learning method to construct supervision signals of nodes in a heterogeneous graph, fully excavates global characteristics of the heterogeneous graph, effectively improves the representation precision of the users and the products, and improves the recommendation precision on the social cold start users.

2. The invention processes the user set and the product set in a single hot coding mode, can effectively expand data dimensionality and is convenient for fast matrix operation.

3. The method and the system utilize the bridge user to construct the information domain and the heteromorphic network on the social domain to construct the connection between the social cold start user and the product, thereby relieving the problem of social cold start.

4. The invention adopts an automatic supervision optimization method, captures more overall information of the heterogeneous graph in a mode of maximizing the mutual information of local characteristics and overall characteristics of user nodes in the heterogeneous graph, and improves the node representation precision.

Drawings

FIG. 1 is a flowchart of a cross-domain social recommendation method based on self-supervised learning according to the present invention.

Detailed Description

In the embodiment, the cross-domain social recommendation method based on the self-supervision learning considers the problem that the number of bridge users is excessively depended on in cross-domain recommendation, establishes the connection between the social cold-starting users and products by establishing a heterogeneous graph network, realizes the mutual information maximization of the local features and the global features of heterogeneous graph nodes by an self-supervision optimization method, and captures more global features of heterogeneous graphs so as to realize more accurate product recommendation of the social cold-starting users. Specifically, as shown in fig. 1, the method comprises the following steps:

let U ^I Represents a set of users over an information domain, and

wherein the content of the first and second substances,

indicating the a-th user in the information field,

let V denote the product set on the information field, and V ═ V ₁ ,…v _i ,…v _j ,…v _L In which v is _i Indicating the ith product in the information field, v _j Indicating the jth product in the information field, and L indicating the total number of products in the information field;

let R _ai Representing the a-th user

Let U ^S Represents a set of users on a social domain,

wherein, the first and the second end of the pipe are connected with each other,

representing the c-th user in the social domain,

order S _cd Represents the c-th user

For the d-th user

Step 2, utilizing a bridge user set U coexisting in an information domain and a social domain ^B ＝U ^I ∩U ^S Constructing a heterogeneous graph network G ═ G ^I ∪G ^S Wherein G is ^I Representing the set U of all users in the information domain ^I Graph network structure, G, with product set V ^S Representing a set U of all users in a social domain ^S A graph network structure for collecting U-shaped bridge users ^B The total number of users in (1) is T;

step 3.1, constructing a user cooperation matrix P ═ P in a single hot coding mode for all users in the heterogeneous graph network G ₁ ,…,p _a ,…,p _M-T ,…,p _b ,…,p _c ,…,p _M ,…,p _d ,…,p _M+N-T And P ∈ R ^(M+N-T)×d Wherein p is _a Representing a d-dimensional cooperation vector of an a-th user in the user cooperation matrix P, wherein the user belongs to a non-bridge user on the information domain; p is a radical of _b Representing the d-dimensional cooperation vector of the b-th user in the cooperation matrix P of the users belonging to the bridge user, P _d Representing the d-dimensional cooperation vector of the d-th user in the user cooperation matrix P, wherein the user belongs to the social domainNon-bridge users;

and 4, carrying out characteristic propagation through the graph volume layer:

In the formula (1), R _a Representing the collection of products that the a-th non-bridge user has interacted with,

A user feature vector is initialized randomly;

a user feature vector is initialized randomly;

in formula (3), S _d Representing a set of users for which the d-th non-bridge user has social connections,

a user feature vector is initialized randomly;

the feature vector of the product is initialized randomly;

step 5, constructing a node feature aggregation layer:

In the formula (5), AVG represents the average pooling operation,

a feature vector representing the user u output by the kth convolutional layer,

step 6, constructing a node prediction layer for product recommendation:

step 6.1, calculating the prediction scoring value of the user u on the ith product according to the formula (6)

Thereby obtaining a prediction scoring matrix of the product by the user

In the formula (6), <, > represents a vector inner product operation;

In formula (7), σ (z) is a nonlinear activation function, and θ ═ P, Q]Is the parameter to be optimized,. psi. ^I ＝{(u,i,j)|u∈U ^I ,i∈R _u ,j∈(V-R _u ) The training data of all users in the information domain; r _u Representing a set of products representing the u-th user interaction;

representing the predicted value of credit of the jth product to the jth user;

Local feature h of _d And the a-th user in the information domain

Local feature h of _ai ：

h _d ＝σ(p _d ) (8)

h _ai ＝[σ(p _a )，σ(q _i )] (9)

In the formula (8), sigma is sigmoid nonlinear activation function;

in the formula (9), [, ] is the splicing operation of the vectors;

And local features on the imaginary information domain

Thus forming negative sample pairs on the social domain

And negative example pairs on the information field

And 7.5, constructing a total loss function L according to the formula (16):

L＝L _R +αL _S +λL _I (16)

7.6, solving all loss functions L by using a gradient descent method to ensure that L converges to a minimum value, thereby obtaining an optimal parameter theta ^* ＝[P ^* ,Q ^* ]Further obtain the optimal user prediction scoring matrix

And predicting a scoring matrix according to the optimal user

And recommending the product to the non-bridge user in the social domain. The method excavates the inter-domain collaborative information in a very limited bridge user, optimizes the model in a self-supervision learning mode, and fully excavates the global characteristics of the heterogeneous map, thereby realizing more accurate product recommendation of the social cold start user.

Example (b):

in order to verify the effectiveness of the method, the invention performs data processing on two public data sets: epionins, Dianping, to meet the experimental conditions of the present invention.

For the product recommendation task, the invention adopts Hit Ratio (HR) and Normalized counted graphical Gain (NDCG) as evaluation criteria. The invention selects 3 methods for effect comparison, namely EMCDR, NSCR and BiTGCF.

TABLE 1 social Cold Start user product recommendation results on Epines dataset by the method and comparison method of the present invention

TABLE 2 social Cold Start user product recommendation results on Dianping datasets by the method of the present invention and the comparison method

Specifically, table 1 and table 2 show the experimental results on the epipons and Dianping data sets, respectively, and it can be seen that on the two data sets, the method proposed by the present invention is superior to the 3 methods of comparison in both HR and NDCG indexes on social cold-start users.

Claims

1. A cross-domain social product recommendation method based on self-supervision learning is characterized by comprising the following steps:

let U ^I Represents a set of users over an information domain, and

wherein the content of the first and second substances,

indicating the a-th user in the information field,

let R _ai Representing the a-th user

Let U ^S Represents a set of users on a social domain,

wherein the content of the first and second substances,

representing the c-th user in the social domain,

order S _cd Represents the c-th user

For the d-th user

and 4, carrying out characteristic propagation through the graph volume layer:

step 4.2, obtaining the d-dimensional cooperation vector p of the a-th non-bridge user in the information domain according to the formula (1), the formula (2) and the formula (3) respectively _a D-dimensional cooperation vector p of the b-th bridge user _b D-dimensional collaborative direction of the d-th non-bridge user in social domainQuantity p _d After passing through the (k + 1) th graph convolution layer, the corresponding user characteristic vector is obtained

A user feature vector is initialized randomly;

in the formula (2), R _b Represents the product set interacted by the b-th bridge user, S _b Representing the b-th bridge userA collection of users having a social connection,

a user feature vector is initialized randomly;

a user feature vector is initialized randomly;

a randomly initialized product feature vector;

4.3, assigning K +1 to K, judging whether K is more than K, and if so, executing step 5; otherwise, returning to the step 4.2 for sequential execution;

step 5, constructing a node feature aggregation layer:

In the formula (5), AVG represents the average pooling operation,

a feature vector representing the user u output by the kth convolutional layer,

step 6, constructing a node prediction layer for product recommendation:

Thereby obtaining a prediction scoring matrix of the product by the user

In the formula (6), <, > represents a vector inner product operation;

In formula (7), σ (z) is a nonlinear activation function, and θ ═ P, Q]Is the parameter to be optimized,. psi. ^I ＝{(u,i,j)|u∈U ^I ,i∈R _u ,j∈(V-R _u ) Is aTraining data of all users on the information domain; r _u Representing a set of products representing the u-th user interaction;

representing the predicted value of credit of the jth product to the jth user;

Local feature h of _d And the a-th user in the information domain

Local feature h of _ai ：

h _d ＝σ(p _d ) (8)

h _ai ＝[σ(p _a ),σ(q _i )] (9)

In the formula (8), sigma is sigmoid nonlinear activation function;

in the formula (9), [, ] is the splicing operation of the vectors;

And local features on the imaginary information domain

Thus forming negative sample pairs on the social domain

And negative example pairs on the information field

Step 7.5, constructing a social domain self-supervision loss function L according to the formula (14) and the formula (15) _S And information domain self-monitoringGovernor loss function L _I ：

And 7.5, constructing a total loss function L according to the formula (16):

L＝L _R +αL _S +λL _I (16)

7.6, solving all the loss functions L by using a gradient descent method to ensure that L converges to a minimum value, thereby obtaining an optimal parameter theta ^* ＝[P ^* ,Q ^* ]Further obtain the optimal user prediction scoring matrix

And predicting a scoring matrix according to the optimal user

And recommending the product to the non-bridge user in the social domain.