CN115422442A - Adversarial autoencoder transfer learning method for cold-start recommendation - Google Patents

Adversarial autoencoder transfer learning method for cold-start recommendation

Info

Publication number
CN115422442A
Authority
CN
China
Prior art keywords
user
positive
hypergraph
warm
negative
Prior art date
Legal status
Granted
Application number
CN202210976839.3A
Other languages
Chinese (zh)
Other versions
CN115422442B (en)
Inventor
吴汉瑞
龙锦益
李诺思
Current Assignee
Jinan University
Original Assignee
Jinan University
Priority date
Filing date
Publication date
Application filed by Jinan University filed Critical Jinan University
Priority to CN202210976839.3A
Publication of CN115422442A
Application granted
Publication of CN115422442B
Active legal status
Anticipated expiration legal status

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2457 Query processing with adaptation to user needs
    • G06F16/24578 Query processing with adaptation to user needs using ranking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/06 Buying, selling or leasing transactions
    • G06Q30/0601 Electronic shopping [e-shopping]
    • G06Q30/0631 Item recommendations

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Computational Linguistics (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an adversarial autoencoder transfer learning method for cold-start recommendation, which handles the item cold-start problem in five steps. First, a positive hypergraph and a negative hypergraph are constructed. A multilayer perceptron is then used to obtain the regular features of warm users and cold-start users. A hypergraph autoencoder is constructed to obtain the positive and negative features of warm users and items, respectively, and to reconstruct the positive and negative hypergraphs. A matching discriminator is developed to minimize the classification loss of the warm users' positive and negative features and the distribution gap between the warm users' positive and regular features. In this way the warm users' positive features are connected with the regular features, the positive features of warm users and items are related through the positive hypergraph, and a relation between the cold-start users' regular features and the items' positive features is found. The Euclidean distances between the cold-start users' regular features and the items' positive features are calculated and ranked, the Top-K items are recommended to each user, and the superiority of the method is verified on several real-world data sets using precision, recall, NDCG and hit rate. The method effectively alleviates the cold-start problem and provides more accurate and personalized recommendations for users.

Description

Adversarial autoencoder transfer learning method for cold-start recommendation
Technical Field
The invention relates to the technical field of information recommendation, and in particular to an adversarial autoencoder transfer learning method for cold-start recommendation.
Background
With the continuous development of the Internet, people enjoy the convenience of computer technology while generating massive amounts of data. The information overload caused by such large-scale and complex data makes it harder for users to capture useful information in tasks such as information retrieval and electronic commerce. As an information filtering tool, the recommendation system plays a crucial role in providing accurate and personalized recommendations for users and has received wide attention over the past decade.
Traditional recommendation methods based on collaborative filtering rely heavily on the historical interactions between users and items and cannot provide effective recommendations for new users or new items, i.e. they suffer from the cold-start problem. Researchers have proposed deep-learning methods that address the cold-start problem by discovering the latent relations between users and items. Recently, hypergraph learning has received wide attention for its excellent ability to model complex data, and researchers have introduced it into recommender systems. In a hypergraph, one hyperedge can connect multiple nodes; in the interaction data of a recommender system, one item can interact with multiple users. A hypergraph can therefore naturally express the high-order relations between users and items: an entry of 1 in the hypergraph denotes an interaction and 0 denotes no interaction, and a hypergraph convolutional network can then be used to learn high-level feature representations of users and items.
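For illustration, such a hypergraph incidence matrix can be built directly from interaction records; the tiny interaction list below is hypothetical and only demonstrates the 0/1 encoding described above (Python sketch):

```python
import numpy as np

# Hypothetical interactions as (user index, item index) pairs.
# Each item column acts as a hyperedge connecting every user who interacted with that item.
interactions = [(0, 0), (0, 2), (1, 2), (2, 1), (2, 2)]
n_users, n_items = 3, 3

H = np.zeros((n_users, n_items), dtype=np.int8)
for u, i in interactions:
    H[u, i] = 1  # 1 = interaction, 0 = no interaction

print(H)
# [[1 0 1]
#  [0 0 1]
#  [0 1 1]]
# The column of item 2 connects users 0, 1 and 2 through a single hyperedge.
```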
Domain adaptation aims to leverage knowledge from a source domain to assist the learning task of a target domain, and its main challenge is to reduce the domain gap between the source and target domains. Existing work tends to estimate the distribution difference between the two domains with methods such as maximum mean discrepancy, optimal transport and adversarial losses, but these approaches are neither accurate nor personalized enough when dealing with the user cold-start and item cold-start recommendation problems.
Therefore, an adversarial autoencoder transfer learning method for cold-start recommendation is needed to solve the above problems.
Disclosure of Invention
The invention aims to provide an adversarial autoencoder transfer learning method for cold-start recommendation, so as to provide more accurate and personalized recommendations for users and to solve the user cold-start and item cold-start problems.
In order to achieve the above purpose, the invention provides the following technical scheme: an adversarial autoencoder transfer learning method for cold-start recommendation, comprising the following steps:
S1, constructing a hypergraph to model the interaction information between users and items, and designing a positive hypergraph and a negative hypergraph from the original hypergraph;
S2, designing a multilayer perceptron network to obtain the regular feature representations of warm users and cold-start users;
S3, constructing hypergraph autoencoders for the positive hypergraph and the negative hypergraph respectively, acquiring the positive and negative feature representations of warm users and items, regarding the positive features as source data and the regular features as target data, and reconstructing the positive and negative hypergraphs to preserve the association information between users and items;
S4, constructing a matching discriminator, assigning pseudo-labels to the positive and negative features of warm users, and minimizing the classification loss of the warm users' positive and negative features and the distribution gap between the positive features and the regular features;
S5, calculating the Euclidean distances between the regular features of cold-start users and the item features, ranking them, recommending the Top-K items to each user, and computing the precision, recall, NDCG and hit rate;
The step S1 includes:
S1.1, preprocessing the user rating matrix, deleting items with a rating of 0, and keeping the remaining items as interaction records;
S1.2, splitting the users into warm users and cold-start users at a ratio of 9:1, and constructing a hypergraph from the interaction information between the warm users and the items, specifically
R ∈ {0,1}^(n_w × m),
where n_w denotes the number of warm users, m denotes the total number of items, R(i, j) = 1 indicates that warm user i has interacted with item j, and R(i, j) = 0 otherwise;
S1.3, the positive hypergraph R⁺ is equal to the original hypergraph R and represents the items the users prefer, and items with R(i, j) = 0 are randomly selected from the original hypergraph R to construct the negative hypergraph R⁻, which has the form
R⁻ ∈ {0,1}^(n_w × m),
with R⁻(i, j) = 1 marking a selected non-interacted item;
the step S2 includes:
s2.1, constructing a multilayer perceptron neural network to obtain the conventional characteristics of the warm user and the cold start user, wherein the specific formula is as follows:
Figure BDA0003798787200000033
where h (-) denotes a fully connected neural network,
Figure BDA0003798787200000034
representing the characteristics of users, the trust relations of the users exist in the original data, the trust relations are used as the trust relations, n represents the total number of the users, d represents the dimension of the characteristics, phi represents the trainable network weight and has
U=[U w ,U c ]
Figure BDA0003798787200000035
U w And U c Representing the original characteristics of a warm user and a cold start user respectively,
Figure BDA0003798787200000036
and
Figure BDA0003798787200000037
conventional characterizations representing warm and cold start users, respectively.
The step S3 includes:
s3.1, the relation between the user and the article is represented by a hypergraph, and the hypergraph self-encoder is used for learning high-level feature representation of the warm user and the article on the hypergraph, wherein the specific formula is as follows:
Figure BDA0003798787200000038
wherein f is + Is a hypergraph self-encoder, T is the characteristic of the article,
Figure BDA0003798787200000039
in order to reconstruct the hypergraph,
Figure BDA00037987872000000310
is a positive feature of warming the user,
Figure BDA0003798787200000041
is a positive feature of the article;
s3.2, reconstructing a positive hypergraph by using the positive characteristics of the warm user and the characteristics of the article acquired in the step S3.1, wherein the loss of the reconstructed positive hypergraph is represented as:
Figure BDA0003798787200000042
wherein
Figure BDA0003798787200000043
Is a binary cross entropy function.
S3.3, learning high-level feature representation of warm users and articles by using a hypergraph self-encoder on the negative hypergraph, wherein a specific formula is as follows:
Figure BDA0003798787200000044
wherein f is - Is a hypergraph self-encoder.
Figure BDA0003798787200000045
In order to reconstruct the hypergraph,
Figure BDA0003798787200000046
is a negative feature of a warm user,
Figure BDA0003798787200000047
is a negative characteristic of the article;
s3.4, reconstructing a negative hypergraph by using the negative characteristics of the warm user and the characteristics of the object acquired in the step S3.3, wherein the loss of the reconstructed negative hypergraph can be expressed as follows:
Figure BDA0003798787200000048
the step S4 includes:
s4.1 Positive characteristics for warming users respectively
Figure BDA0003798787200000049
And negative characteristic
Figure BDA00037987872000000410
Assigning pseudo-labels, i.e. setting
Figure BDA00037987872000000411
Assign a pseudo label as
Figure BDA00037987872000000412
Wherein n is w The number of warm users, d is the characteristic dimension;
the positive characteristics comprise preference information of the user, and the negative characteristics comprise information which is not interested by the user;
s4.2, positive feature to bridge Warm user and regular feature, setting
Figure BDA00037987872000000413
Assign a pseudo label as
Figure BDA00037987872000000414
S4.3, a matching discriminator is designed to minimize the classification loss of the warm users' positive and negative features and the distribution gap between the positive features and the regular features; its loss combines a classification term L_y, in which the classifier G_y(·) predicts the pseudo-label values y_j⁺ and y_j⁻ from x_j⁺ and x_j⁻, the j-th row vectors of U_w⁺ and U_w⁻ respectively, and an inter-domain term L_c, in which the feature matcher G_c(·) measures the distribution gap between the positive features U_w⁺ and the regular features U_w^f; a is a trainable parameter;
s4.4, by step S4.2, the positive and negative characteristics of the warm user are separated, the positive characteristics are connected with the regular characteristics, thereby bridging the regular characteristics of the cold start user and the item characteristic image recommendation, introducing a gradient back propagation layer, following the following formula:
Figure BDA00037987872000000512
s4.5, integrating the steps, designing an overall loss function of the model for training, wherein the formula is as follows:
Figure BDA00037987872000000513
where β and η are adjustable parameters to balance the weights between the three loss functions;
s4.6, repeating the steps S4.3 and S4.4 until convergence, and connecting the positive characteristic and the conventional characteristic of the warm user;
the positive characteristics of the warm user and the positive characteristics of the article are connected through a positive hypergraph, the normal characteristics of the warm user and the normal characteristics of the cold start user have similar distribution, and the normal characteristics of the cold start user and the positive characteristics of the article are observed and recorded.
The step S5 includes:
s5.1, discovering the relation between the cold start user and the articles, calculating the Euclidean distance between the conventional characteristics and the article characteristics of the cold start user, recommending Top-K articles to the user, and evaluating the performance of the method by adopting the following four indexes:
A. precision: the ratio of the total number of the items in the recommendation list is calculated, all users are averaged, and the larger the value is, the better the value is, the calculation formula is as follows:
Figure BDA00037987872000000514
wherein N is ts Is the number of cold start users, L i Is the ith cold start user's true favorite item,
Figure BDA0003798787200000061
is the recommended Top-K items;
B. the recall ratio is as follows: as the proportion of correctly recommended articles to the total number of articles to be recommended, the larger the value is, the better the value is, and the calculation formula is as follows:
Figure BDA0003798787200000062
c, NDCG: for measuring the superiority of the recommendation list, when the result with high relevance appears at a more front position, the higher the index is, the calculation formula is as follows:
Figure BDA0003798787200000063
wherein r is i Is the correlation of the ith item, the user prefers the item i then r i The value is 1, otherwise 0, and the value of IDCG ensures that the value of NDCG can be atBetween 0 and 1;
D. the hit rate: as a commonly used index for measuring the recall ratio, the larger the value is, the better the value is, and the calculation formula is as follows:
Figure BDA0003798787200000064
in the technical scheme, the invention provides the following technical effects and advantages:
1. The negative hypergraph is introduced to learn negative high-dimensional feature representations, which assist in obtaining the high-dimensional feature representations of users and items, so that the proposed model can effectively alleviate the cold-start problem;
2. The invention applies the hypergraph autoencoder to the positive and negative hypergraphs to obtain positive and negative feature representations respectively, and further develops a matching discriminator to minimize the classification loss of the positive and negative features and the distribution gap between the positive and regular features, so that the features are enriched while remaining distinguishable;
3. The invention outperforms existing methods on several real-world data sets and thus provides more accurate and personalized recommendations for users.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings needed in the embodiments are briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained by those skilled in the art from these drawings without creative effort.
Fig. 1 is a schematic flow chart of the adversarial autoencoder transfer learning method for cold-start recommendation of the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the present invention is described below in further detail with reference to the accompanying drawings.
As shown in Fig. 1, the invention provides an adversarial autoencoder transfer learning method for cold-start recommendation, which comprises the following steps:
S1, constructing a hypergraph to model the interaction information between users and items, and designing a positive hypergraph and a negative hypergraph from the original hypergraph;
S1.1, preprocessing the user rating matrix, deleting items with a rating of 0, and keeping the remaining items as interaction records;
S1.2, splitting the users into warm users and cold-start users at a ratio of 9:1, and constructing a hypergraph from the interaction information between the warm users and the items, specifically
R ∈ {0,1}^(n_w × m),
where n_w denotes the number of warm users, m denotes the total number of items, R(i, j) = 1 indicates that warm user i has interacted with item j, and R(i, j) = 0 otherwise;
S1.3, the positive hypergraph R⁺ is equal to the original hypergraph R and represents the items the users prefer, and items with R(i, j) = 0 are randomly selected from the original hypergraph R to construct the negative hypergraph R⁻, which has the form
R⁻ ∈ {0,1}^(n_w × m),
with R⁻(i, j) = 1 marking a selected non-interacted item;
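A minimal Python sketch of step S1 is given below. It assumes a dense rating matrix as input; the function name, the random seed and, in particular, the number of negative items sampled per user (here, as many as the user's interactions) are illustrative assumptions, since the text above only states that items with R(i, j) = 0 are randomly selected.

```python
import numpy as np

def build_hypergraphs(ratings: np.ndarray, warm_ratio: float = 0.9, seed: int = 0):
    """Split users 9:1 into warm / cold-start and build the positive and negative hypergraphs."""
    rng = np.random.default_rng(seed)
    n_users = ratings.shape[0]
    perm = rng.permutation(n_users)
    n_warm = int(warm_ratio * n_users)
    warm_idx, cold_idx = perm[:n_warm], perm[n_warm:]

    # Original hypergraph R: 1 where a warm user has rated (interacted with) an item.
    R = (ratings[warm_idx] > 0).astype(np.float32)

    # Positive hypergraph equals R; the negative hypergraph randomly samples,
    # for each warm user, non-interacted items (entries with R(i, j) = 0).
    R_pos = R.copy()
    R_neg = np.zeros_like(R)
    for u in range(R.shape[0]):
        zeros = np.flatnonzero(R[u] == 0)
        k = min(int(R[u].sum()), zeros.size)   # assumption: one negative per positive
        if k > 0:
            R_neg[u, rng.choice(zeros, size=k, replace=False)] = 1.0
    return R, R_pos, R_neg, warm_idx, cold_idx
```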
s2, designing a multi-layer perceptron network to obtain conventional feature representations of warm users and cold start users;
s2.1, constructing a multilayer perceptron neural network to obtain the conventional characteristics of the warm user and the cold start user, wherein the specific formula is as follows:
Figure BDA0003798787200000082
where h (-) represents a fully connected neural network,
Figure BDA0003798787200000083
representing the characteristics of users, the trust relations of the users exist in the original data, the trust relations are used as the trust relations, n represents the total number of the users, d represents the dimension of the characteristics, phi represents the trainable network weight and has
U=[U w ,U c ]
Figure BDA0003798787200000084
U w And U c Representing the original characteristics of a warm user and a cold start user respectively,
Figure BDA0003798787200000085
and
Figure BDA0003798787200000086
conventional characterization representative of warm and cold start users, respectively;
S3, constructing hypergraph autoencoders for the positive hypergraph and the negative hypergraph respectively, acquiring the positive and negative feature representations of warm users and items, regarding the positive features as source data and the regular features as target data, and reconstructing the positive and negative hypergraphs to preserve the association information between users and items;
S3.1, the relations between users and items are represented by the hypergraph, and a hypergraph autoencoder is used on the positive hypergraph to learn high-level feature representations of the warm users and the items, according to
(U_w⁺, T⁺, R̂⁺) = f⁺(R⁺, T),
where f⁺ is the hypergraph autoencoder, T denotes the item features, R̂⁺ denotes the reconstructed positive hypergraph, U_w⁺ denotes the positive features of warm users, and T⁺ denotes the positive features of items;
S3.2, the positive hypergraph is reconstructed from the positive features of the warm users and the item features acquired in step S3.1, and the loss for reconstructing the positive hypergraph is expressed as
L_rec⁺ = BCE(R̂⁺, R⁺),
where BCE(·, ·) is the binary cross-entropy function;
S3.3, on the negative hypergraph, a hypergraph autoencoder is likewise used to learn high-level feature representations of the warm users and the items, according to
(U_w⁻, T⁻, R̂⁻) = f⁻(R⁻, T),
where f⁻ is the hypergraph autoencoder, R̂⁻ denotes the reconstructed negative hypergraph, U_w⁻ denotes the negative features of warm users, and T⁻ denotes the negative features of items;
S3.4, the negative hypergraph is reconstructed from the negative features of the warm users and the item features acquired in step S3.3, and the loss for reconstructing the negative hypergraph can be expressed as
L_rec⁻ = BCE(R̂⁻, R⁻);
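The following is a simplified sketch of the positive-branch hypergraph autoencoder f⁺ of steps S3.1–S3.4 (the negative branch f⁻ is identical but takes R⁻). A plain linear encoder and an inner-product decoder trained with binary cross-entropy are assumptions; the text above does not disclose the exact layer structure.

```python
import torch
import torch.nn as nn

class HypergraphAutoencoder(nn.Module):
    """Encodes warm users and items from a hypergraph and reconstructs the hypergraph."""
    def __init__(self, n_items: int, item_feat_dim: int, emb_dim: int = 64):
        super().__init__()
        self.user_enc = nn.Linear(n_items, emb_dim)        # rows of R -> user embeddings
        self.item_enc = nn.Linear(item_feat_dim, emb_dim)  # item features T -> item embeddings

    def forward(self, R: torch.Tensor, T: torch.Tensor):
        U_emb = self.user_enc(R)                   # positive (or negative) user features
        T_emb = self.item_enc(T)                   # positive (or negative) item features
        R_hat = torch.sigmoid(U_emb @ T_emb.t())   # reconstructed hypergraph
        return U_emb, T_emb, R_hat

# Reconstruction loss, e.g. for the positive branch:
# U_pos, T_pos, R_hat = model(R_pos, T)
# L_rec_pos = nn.BCELoss()(R_hat, R_pos)
```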
s4, constructing a matching discriminator, distributing pseudo labels for the positive and negative characteristics of the warm user, and minimizing the classification loss of the positive and negative characteristics of the warm user and the distribution difference between the positive characteristics and the conventional characteristics;
s4.1 Positive characteristics for warming users respectively
Figure BDA0003798787200000098
And negative characteristics
Figure BDA0003798787200000099
Assigning pseudo-labels, i.e. setting
Figure BDA00037987872000000910
Assign a pseudo label as
Figure BDA00037987872000000911
Wherein n is w The number of warm users, d is the characteristic dimension;
the positive characteristics comprise preference information of the user, and the negative characteristics comprise information which is not interested by the user;
s4.2, positive feature to bridge Warm user and regular feature, i.e. settings
Figure BDA00037987872000000912
Assign a pseudo label as
Figure BDA00037987872000000913
S4.3, a matching discriminator is designed to minimize the classification loss of the warm users' positive and negative features and the distribution gap between the positive features and the regular features; its loss combines a classification term L_y, in which the classifier G_y(·) predicts the pseudo-label values y_j⁺ and y_j⁻ from x_j⁺ and x_j⁻, the j-th row vectors of U_w⁺ and U_w⁻ respectively, and an inter-domain term L_c, in which the feature matcher G_c(·) measures the distribution gap between the positive features U_w⁺ and the regular features U_w^f; a is a trainable parameter;
s4.4, by step S4.2, the positive and negative characteristics of the warm user are separated, the positive characteristics are connected with the regular characteristics, thereby bridging the regular characteristics of the cold-start user and the item characteristic image recommendation, introducing a gradient back propagation layer, following the following formula:
Figure BDA00037987872000001012
s4.5, integrating the steps, designing an overall loss function of the model for training, wherein the formula is as follows:
Figure BDA00037987872000001013
where β and η are adjustable parameters to balance the weights between the three loss functions;
s4.6, repeating the steps S4.3 and S4.4 until convergence, and connecting the positive characteristic and the conventional characteristic of the warm user;
the positive characteristics of the warm user and the positive characteristics of the article are connected through a positive hypergraph, the conventional characteristics of the warm user and the conventional characteristics of the cold start user have similar distribution, and the relationship between the conventional characteristics of the cold start user and the positive characteristics of the article is observed and recorded;
s5, calculating Euclidean distances between the conventional features and the article features of the cold-start user, sequencing, recommending Top-K articles to the user, and calculating precision, recall rate, NDCG and hit rate;
s5.1, discovering the relation between the cold start user and the articles, calculating the Euclidean distance between the conventional characteristics and the article characteristics of the cold start user, recommending Top-K articles to the user, and evaluating the performance of the method by adopting the following four indexes:
A. precision: the ratio of the total number of the items in the recommendation list is calculated, all users are averaged, and the larger the value is, the better the value is, the calculation formula is as follows:
Figure BDA0003798787200000111
wherein N is ts Is coldNumber of active users, L i Is the ith cold start user's true favorite item,
Figure BDA0003798787200000112
is the recommended Top-K items;
B. and (4) recall rate: as the proportion of correctly recommended articles to the total number of articles to be recommended, the larger the value is, the better the value is, and the calculation formula is as follows:
Figure BDA0003798787200000113
c, NDCG: for measuring the superiority of the recommendation list, when the result with high relevance appears at a more advanced position, the higher the index is, the calculation formula is as follows:
Figure BDA0003798787200000114
wherein r is i Is the correlation of the ith item, the user prefers the item i then r i A value of 1, otherwise 0, while the value of IDCG ensures that the value of NDCG can be between 0 and 1;
D. the hit rate: as a commonly used index for measuring the recall ratio, the larger the value is, the better the value is, and the calculation formula is as follows:
Figure BDA0003798787200000115
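A numpy sketch of the evaluation in step S5: items are ranked by the Euclidean distance between a cold-start user's regular features and the item features, and the four metrics are computed over the Top-K list. Details not fixed by the text above, such as the log base of the DCG and the per-user hit definition, follow common usage and are assumptions.

```python
import numpy as np

def evaluate_cold_start(user_feats, item_feats, liked_items, K=20):
    """user_feats: (N_ts, d) regular features of cold-start users;
    item_feats: (m, d) item features; liked_items: list of sets of ground-truth item ids."""
    precisions, recalls, ndcgs, hits = [], [], [], []
    for u, liked in zip(user_feats, liked_items):
        dist = np.linalg.norm(item_feats - u, axis=1)   # Euclidean distance to every item
        topk = np.argsort(dist)[:K]                     # recommend the K closest items
        rel = np.array([1.0 if i in liked else 0.0 for i in topk])
        n_hit = rel.sum()
        precisions.append(n_hit / K)
        recalls.append(n_hit / max(len(liked), 1))
        dcg = (rel / np.log2(np.arange(2, K + 2))).sum()
        ideal = np.ones(min(len(liked), K))
        idcg = (ideal / np.log2(np.arange(2, len(ideal) + 2))).sum()
        ndcgs.append(dcg / idcg if idcg > 0 else 0.0)
        hits.append(1.0 if n_hit > 0 else 0.0)
    return tuple(float(np.mean(x)) for x in (precisions, recalls, ndcgs, hits))
```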
s5.2, using the Ciao data set for a user cold start recommendation problem, collecting scores and opinions of the user on various products from the Ciao official website, deleting the items with the score of 0, and finally selecting 2153 users and 8000 products, wherein the user is subjected to the following steps of 9:1 divided into warm users and cold start users;
s5.3, comparing the Top-20 prediction result on the Ciao with CMF (David cortex.2018. Cold-start criteria in collectible matrix factorization. ArXiv prediction arXiv:1809.00366 (2018)) and Heater (Ziwei Zhu, shahin Sefati, parsa Saadatpana, and James conduit.2020. Recommendation for new users and new items via random tracking and mixing-of-experiments transformation. In ACM SIGIR Conference Research and Development Information retrieval.1121-1130);
TABLE 1 (Top-20 results on Ciao)
The experimental results are shown in Table 1, where Algorithm 1 denotes the verification results of the method provided by the invention, Algorithm 2 those of CMF, and Algorithm 3 those of Heater; as can be seen from Table 1, the invention outperforms the other algorithms on most evaluation indexes;
while certain exemplary embodiments of the present invention have been described above by way of illustration only, it will be apparent to those of ordinary skill in the art that the described embodiments may be modified in various different ways without departing from the spirit and scope of the present invention. Accordingly, the drawings and description are illustrative in nature and are not to be construed as limiting the scope of the invention.

Claims (6)

1. An adversarial autoencoder transfer learning method for cold-start recommendation, characterized by comprising the following steps:
S1, constructing a hypergraph to model the interaction information between users and items, and designing a positive hypergraph and a negative hypergraph from the original hypergraph;
S2, designing a multilayer perceptron network to obtain the regular feature representations of warm users and cold-start users;
S3, constructing hypergraph autoencoders for the positive hypergraph and the negative hypergraph respectively, acquiring the positive and negative feature representations of warm users and items, regarding the positive features as source data and the regular features as target data, and reconstructing the positive and negative hypergraphs to preserve the association information between users and items;
S4, constructing a matching discriminator, assigning pseudo-labels to the positive and negative features of warm users, and minimizing the classification loss of the warm users' positive and negative features and the distribution gap between the positive features and the regular features;
S5, calculating the Euclidean distances between the regular features of cold-start users and the item features, ranking them, recommending the Top-K items to each user, and computing the precision, recall, NDCG and hit rate.
2. The adversarial autoencoder transfer learning method for cold-start recommendation according to claim 1, wherein the step S1 includes:
S1.1, preprocessing the user rating matrix, deleting items with a rating of 0, and keeping the remaining items as interaction records;
S1.2, splitting the users into warm users and cold-start users at a ratio of 9:1, and constructing a hypergraph from the interaction information between the warm users and the items, specifically
R ∈ {0,1}^(n_w × m),
where n_w denotes the number of warm users, m denotes the total number of items, R(i, j) = 1 indicates that warm user i has interacted with item j, and R(i, j) = 0 otherwise;
S1.3, the positive hypergraph R⁺ is equal to the original hypergraph R and represents the items the users prefer, and items with R(i, j) = 0 are randomly selected from the original hypergraph R to construct the negative hypergraph R⁻, which has the form
R⁻ ∈ {0,1}^(n_w × m),
with R⁻(i, j) = 1 marking a selected non-interacted item.
3. The adversarial autoencoder transfer learning method for cold-start recommendation according to claim 1, wherein the step S2 includes:
S2.1, constructing a multilayer perceptron network to obtain the regular features of warm users and cold-start users, according to
U^f = h(U; Φ),
where h(·) denotes a fully connected neural network, U ∈ ℝ^(n × d) denotes the user features (the users' trust relations, which are available in the raw data, are used as these features), n denotes the total number of users, d denotes the feature dimension, Φ denotes the trainable network weights, and
U = [U_w, U_c],  U^f = [U_w^f, U_c^f],
where U_w and U_c denote the raw features of warm users and cold-start users, and U_w^f and U_c^f denote the regular feature representations of warm users and cold-start users, respectively.
4. The adversarial autoencoder transfer learning method for cold-start recommendation according to claim 1, wherein the step S3 includes:
S3.1, the relations between users and items are represented by the hypergraph, and a hypergraph autoencoder is used on the positive hypergraph to learn high-level feature representations of the warm users and the items, according to
(U_w⁺, T⁺, R̂⁺) = f⁺(R⁺, T),
where f⁺ is the hypergraph autoencoder, T denotes the item features, R̂⁺ denotes the reconstructed positive hypergraph, U_w⁺ denotes the positive features of warm users, and T⁺ denotes the positive features of items;
S3.2, the positive hypergraph is reconstructed from the positive features of the warm users and the item features acquired in step S3.1, and the loss for reconstructing the positive hypergraph is expressed as
L_rec⁺ = BCE(R̂⁺, R⁺),
where BCE(·, ·) is the binary cross-entropy function;
S3.3, on the negative hypergraph, a hypergraph autoencoder is likewise used to learn high-level feature representations of the warm users and the items, according to
(U_w⁻, T⁻, R̂⁻) = f⁻(R⁻, T),
where f⁻ is the hypergraph autoencoder, R̂⁻ denotes the reconstructed negative hypergraph, U_w⁻ denotes the negative features of warm users, and T⁻ denotes the negative features of items;
S3.4, the negative hypergraph is reconstructed from the negative features of the warm users and the item features acquired in step S3.3, and the loss for reconstructing the negative hypergraph can be expressed as
L_rec⁻ = BCE(R̂⁻, R⁻).
5. The adversarial autoencoder transfer learning method for cold-start recommendation according to claim 1, wherein the step S4 includes:
S4.1, pseudo-labels are assigned to the positive features U_w⁺ ∈ ℝ^(n_w × d) and the negative features U_w⁻ ∈ ℝ^(n_w × d) of the warm users respectively, i.e. the positive features are assigned the pseudo-label 1 and the negative features the pseudo-label 0, where n_w is the number of warm users and d is the feature dimension; the positive features contain the users' preference information, and the negative features contain information the users are not interested in;
S4.2, to bridge the positive features and the regular features of the warm users, the regular features U_w^f are likewise assigned the pseudo-label 1;
S4.3, a matching discriminator is designed to minimize the classification loss of the warm users' positive and negative features and the distribution gap between the positive features and the regular features; its loss combines a classification term L_y, in which the classifier G_y(·) predicts the pseudo-label values y_j⁺ and y_j⁻ from x_j⁺ and x_j⁻, the j-th row vectors of U_w⁺ and U_w⁻ respectively, and an inter-domain term L_c, in which the feature matcher G_c(·) measures the distribution gap between the positive features U_w⁺ and the regular features U_w^f; a is a trainable parameter;
s4.4, by step S4.2, the positive and negative characteristics of the warm user are separated, the positive characteristics are connected with the regular characteristics, thereby bridging the regular characteristics of the cold start user and the item characteristic image recommendation, introducing a gradient back propagation layer, following the following formula:
Figure FDA00037987871900000412
s4.5, integrating the steps, designing a total loss function of the model for training, wherein the formula is as follows:
Figure FDA00037987871900000413
where β and η are adjustable parameters to balance the weights between the three loss functions;
s4.6, repeating the steps S4.3 and S4.4 until convergence, and connecting the positive characteristic and the conventional characteristic of the warm user;
the positive characteristics of the warm user and the positive characteristics of the article are connected through a positive supergraph, the normal characteristics of the warm user and the normal characteristics of the cold start user have similar distribution, and the normal characteristics of the cold start user and the positive characteristics of the article are observed and recorded.
6. The adversarial autoencoder transfer learning method for cold-start recommendation according to claim 1, wherein the step S5 includes:
S5.1, to discover the relations between the cold-start users and the items, the Euclidean distance between each cold-start user's regular features and the item features is calculated, the Top-K items are recommended to the user, and the performance of the method is evaluated with the following four metrics:
A. Precision: the proportion of correctly recommended items in the Top-K recommendation list, averaged over all cold-start users; the larger the better. It is computed as
Precision@K = (1 / N_ts) · Σ_{i=1..N_ts} |L_i ∩ Rec_i| / K,
where N_ts is the number of cold-start users, L_i is the set of items the i-th cold-start user truly likes, and Rec_i is the recommended Top-K list;
B. Recall: the proportion of correctly recommended items among all items that should be recommended; the larger the better. It is computed as
Recall@K = (1 / N_ts) · Σ_{i=1..N_ts} |L_i ∩ Rec_i| / |L_i|;
C. NDCG: measures the ranking quality of the recommendation list, and is higher when highly relevant results appear closer to the top. It is computed as
NDCG@K = DCG@K / IDCG@K,  DCG@K = Σ_{i=1..K} r_i / log2(i + 1),
where r_i is the relevance of the item at position i (r_i = 1 if the user prefers the item and 0 otherwise), and dividing by IDCG, the DCG of the ideal ranking, ensures that NDCG lies between 0 and 1;
D. Hit rate: a commonly used metric related to recall; the larger the better. It is computed as the fraction of cold-start users for whom at least one truly liked item appears in the recommended Top-K list.
CN202210976839.3A 2022-08-15 2022-08-15 Adversarial autoencoder transfer learning method for cold-start recommendation Active CN115422442B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210976839.3A CN115422442B (en) Adversarial autoencoder transfer learning method for cold-start recommendation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210976839.3A CN115422442B (en) Adversarial autoencoder transfer learning method for cold-start recommendation

Publications (2)

Publication Number Publication Date
CN115422442A true CN115422442A (en) 2022-12-02
CN115422442B CN115422442B (en) 2024-01-19

Family

ID=84198973

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210976839.3A Active CN115422442B (en) 2022-08-15 2022-08-15 Cold start recommendation-oriented countermeasure self-coding migration learning method

Country Status (1)

Country Link
CN (1) CN115422442B (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106600065A (en) * 2016-12-16 2017-04-26 中山大学 Individualized learning path extraction and jointing method and system based on directed hypergraph
US20180276540A1 (en) * 2017-03-22 2018-09-27 NextEv USA, Inc. Modeling of the latent embedding of music using deep neural network
US20190325343A1 (en) * 2018-04-19 2019-10-24 National University Of Singapore Machine learning using partial order hypergraphs
CN111881350A (en) * 2020-07-23 2020-11-03 清华大学 Recommendation method and system based on mixed graph structured modeling
CN113672693A (en) * 2021-08-23 2021-11-19 东北林业大学 Label recommendation method of online question and answer platform based on knowledge graph and label association

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
毛明松; 张富国: "User cold-start recommendation method based on multiple-graph ranking" (基于多重图排序的用户冷启动推荐方法), Computer Engineering (计算机工程), no. 05 *

Also Published As

Publication number Publication date
CN115422442B (en) 2024-01-19

Similar Documents

Publication Publication Date Title
Yu et al. A cross-domain collaborative filtering algorithm with expanding user and item features via the latent factor space of auxiliary domains
Zhou et al. Userrec: A user recommendation framework in social tagging systems
CN108460619B (en) Method for providing collaborative recommendation model fusing explicit and implicit feedback
CN108647996A (en) A kind of personalized recommendation method and system based on Spark
Liu et al. Exploiting aesthetic preference in deep cross networks for cross-domain recommendation
CN109241366B (en) Hybrid recommendation system and method based on multitask deep learning
CN108345697A (en) Wisdom course towards group of college students recommends method, system and storage medium
Xie et al. A hybrid semantic item model for recipe search by example
CN113918833A (en) Product recommendation method realized through graph convolution collaborative filtering of social network relationship
Ng et al. CrsRecs: a personalized course recommendation system for college students
Liang et al. Personalized recommender system based on item taxonomy and folksonomy
CN112818256B (en) Recommendation method based on neural collaborative filtering
Wu et al. Modeling uncertainty driven curiosity for social recommendation
CN102866997B (en) The treating method and apparatus of user data
CN115422442A (en) Adversarial autoencoder transfer learning method for cold-start recommendation
Nazari et al. Scalable and data-independent multi-agent recommender system using social networks analysis
CN113377973B (en) Article recommendation method based on countermeasures hash
Banouar et al. Enriching SPARQL queries by user preferences for results adaptation
Kirsch Social information retrieval
Liu et al. Deep cross networks with aesthetic preference for cross-domain recommendation
Melamed et al. MarCol: A market-based recommender system
Hu et al. Utilizing users' tipping points in E-commerce Recommender systems
Liu et al. Multi-domain collaborative recommendation with feature selection
Mohan et al. Recommendation system in business intelligence solutions for grocery shops: Challenges and perspective
Chao et al. Collaborative Filtering and Leaders' Advice Based Recommendation System for Cold Start Users

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant