CN113159892B - A product recommendation method based on multimodal product feature fusion - Google Patents

A product recommendation method based on multimodal product feature fusion

Info

Publication number
CN113159892B
CN113159892B (granted publication) · CN113159892A (application publication) · CN202110444726.4A (application number)
Authority
CN
China
Prior art keywords
commodity
user
representation
word
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN202110444726.4A
Other languages
Chinese (zh)
Other versions
CN113159892A (en)
Inventor
蔡国永
宋亚飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guilin University of Electronic Technology
Original Assignee
Guilin University of Electronic Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guilin University of Electronic Technology
Priority to CN202110444726.4A
Publication of CN113159892A
Application granted
Publication of CN113159892B

Classifications

    • G06Q 30/0631: Item recommendations (electronic shopping [e-shopping])
    • G06F 16/9535: Search customisation based on user profiles and personalisation
    • G06F 40/216: Parsing using statistical methods
    • G06F 40/289: Phrasal analysis, e.g. finite state techniques or chunking
    • G06N 3/045: Combinations of networks
    • G06N 3/08: Learning methods
    • G06N 7/01: Probabilistic graphical models, e.g. probabilistic networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Computing Systems (AREA)
  • Accounting & Taxation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Molecular Biology (AREA)
  • Evolutionary Computation (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention belongs to the field of commodity recommendation, and in particular relates to a commodity recommendation method based on multimodal commodity feature fusion. The method comprises: constructing a user-commodity bipartite graph from the sequence of commodities a user has purchased, and obtaining vector representations of user nodes and commodity nodes through graph convolution; extracting features from the review text received by each commodity with a convolutional neural network to obtain a vector representation of the commodity reviews; extracting features from the title and description of each commodity with a convolutional neural network to obtain a vector representation of the commodity content; concatenating the vector representations of the commodity node, the reviews and the content to obtain the final representation of the commodity, and taking the vector representation of the user node as the final representation of the user. By exploiting the multimodal features of commodities, the invention greatly alleviates the data sparsity problem in commodity recommendation and improves recommendation accuracy.

Description

A product recommendation method based on multimodal product feature fusion

Technical Field

The invention relates to a commodity recommendation method and belongs to the field of commodity recommendation.

Background Art

When modelling commodities, most current commodity recommendation methods use only the commodity id to extract the collaborative signal implicit in user-commodity interactions. This usually suffers from a severe data sparsity problem, which greatly limits the performance of the recommender system. Although some works incorporate review information to capture the commodity features contained in reviews and thereby alleviate data sparsity, the title and description of the commodity itself are rarely exploited. Review information, however, is written by users; because users differ in expression habits and points of interest, different reviews are usually of different informativeness and may even contain a great deal of noise. Unlike reviews, commodity titles and descriptions are usually written by merchants: they contain more, and more comprehensive, commodity features and are expressed more professionally and accurately. Therefore, when modelling commodities for recommendation, combining the commodity title and description with the commodity id and review information helps achieve better recommendation performance.

Summary of the Invention

In view of the above problems, the present invention proposes a commodity recommendation method based on multimodal commodity feature fusion. The method comprises:

S1: Construct a user-commodity bipartite graph from the sequence of commodities the user has purchased historically, and obtain vector representations of user nodes and commodity nodes through graph convolution;

S2: Obtain the review document of each commodity, and extract the vector representation of the commodity reviews with a convolutional neural network;

S3: Obtain the title and description of each commodity, and extract the vector representation of the commodity content with a convolutional neural network;

S4: Obtain the final representation of the user and the final representation of the commodity;

S5: Compute the similarity between the user and the commodity;

S6: Optimize the parameters of the proposed method with the Bayesian personalized ranking (BPR) loss.

Further, constructing the user-commodity bipartite graph in S1 comprises:

S11: Obtain the sequence of commodities historically purchased by the user from implicit or explicit feedback, construct the user-commodity bipartite graph from this purchase sequence, and represent it with the user-commodity adjacency matrix A ∈ R^((n_u+n_p)×(n_u+n_p)), where n_u and n_p are the numbers of users and commodities respectively, R ∈ R^(n_u×n_p) is the user-commodity interaction matrix and R^T is the transpose of R:

A = [[0, R], [R^T, 0]];

S12: To exploit the information of each node itself in the user-commodity bipartite graph, add an identity matrix I to A; meanwhile, to avoid vanishing or exploding gradients during training, normalize with the diagonal degree matrix D, whose diagonal entries are the degrees of the nodes in the user-commodity bipartite graph, thereby obtaining the normalized adjacency matrix Â = D^(-1/2)(A + I)D^(-1/2).

Further, obtaining the vector representations of the user nodes and the commodity nodes in S1 comprises:

S13: Perform neighbor propagation and aggregation on the user-commodity bipartite graph through graph convolution to obtain the vector representation of each user node and each commodity node.

Further, the specific steps of the graph convolution in S13 are:

S131: Convert the unique id of each user and each commodity into a dense vector through an embedding layer, obtaining the user feature vector e_u ∈ R^d and the commodity feature vector e_p ∈ R^d, where d is the dimension of the feature vectors;

S132: Build an embedding table E^(0) ∈ R^((n_u+n_p)×d) to represent the feature matrix of the user-commodity bipartite graph;

S133: Use t layers of graph convolution to aggregate the features of the node neighbors, where the propagation process is defined as E^(t) = σ(Â E^(t-1) W^(t)), with W^(t) a trainable weight matrix and σ the LeakyReLU activation;

S134: Through the t layers of graph convolution, obtain the t feature matrices E^(1) to E^(t); concatenate these t feature matrices to obtain the final feature matrix E = E^(1) ‖ E^(2) ‖ … ‖ E^(t); then split E into two parts E_u and E_p, which serve as the vector representations e_u of the user nodes and e_p of the commodity nodes, respectively.

Further, extracting the vector representation of the commodity reviews in S2 comprises:

S21: Integrate all reviews received by each commodity into a review document of that commodity, and preprocess the review document by word segmentation, lemmatization, removal of stop words, and removal of words that occur extremely frequently or extremely rarely;

S22: Extract features from the commodity review document with a text feature extractor to obtain the vector representation a_r of the commodity reviews.

Further, the specific steps of the text feature extractor in S22 are:

S221: Represent the word sequence of the input text as [w_1, w_2, …, w_l], where l is the length of the input text;

S222: Convert the word sequence of S221 into a sequence of word vectors [v_1, v_2, …, v_l], v_i ∈ R^(d_v), through a word embedding layer, where d_v is the word embedding dimension;

S223: Process the word vector sequence with a convolutional neural network to obtain the sequence of context word vectors [c_1, c_2, …, c_l], where the context representation c_i of the i-th word is computed as c_i = LeakyReLU(W_t × v_(i-k):(i+k) + b_t);

S224: Use an attention mechanism to compute a weight [α_1, α_2, …, α_l] for the context word vectors, then multiply the context word vectors by the corresponding weights and sum them to obtain the final representation of the input text, a = Σ_(i=1..l) α_i c_i, where α_i is computed from the context representation c_i using the trainable weight matrix W_a, the bias b_a and the attention query vector q.

Further, extracting the vector representation of the commodity content in S3 comprises:

S31: Obtain the title and description of the commodity, and preprocess this text by word segmentation, lemmatization, removal of stop words, and removal of words that occur extremely frequently or extremely rarely;

S32: Extract features from the commodity title and description with the same text feature extractor as in S22 to obtain the vector representation a_t of the commodity content.

Further, obtaining the final user representation and the final commodity representation in S4 comprises:

S41: Concatenate the vector representation e_p of the commodity node, the vector representation a_r of the reviews and the vector representation a_t of the content to obtain the final representation p of the commodity; take the vector representation e_u of the user node as the final representation u of the user.

Further, computing the similarity between the user and the commodity in S5 comprises:

Computing the similarity between the user and the commodity as the dot product of the final user representation and the final commodity representation: ŷ_(u,p) = u^T p.

Further, the Bayesian personalized ranking loss used in S6 to optimize the parameters of the proposed method is: L = Σ_((u,i,j)∈O) -ln σ(ŷ_(u,i) - ŷ_(u,j)) + λ‖θ‖².

Brief Description of the Drawings

FIG. 1 is a schematic flow chart of the commodity recommendation method of the present invention.

FIG. 2 is a schematic structural diagram of the commodity recommendation method of the present invention.

Detailed Description of the Embodiments

The present invention is further described below with reference to the accompanying drawings. The following embodiments are only intended to illustrate the technical solution of the present invention more clearly, and are not intended to limit the scope of protection of the present invention.

As shown in FIG. 1, the present invention provides a commodity recommendation method based on multimodal commodity feature fusion, comprising the following steps:

Step 1: Construct a user-commodity bipartite graph from the sequence of commodities the user has purchased historically, and obtain the vector representations of the user nodes and the commodity nodes through graph convolution.

Specifically, the graph convolution proceeds as follows:

First, the user-commodity bipartite graph is constructed from the historical interactions between users and commodities and is represented with the user-commodity adjacency matrix A ∈ R^((n_u+n_p)×(n_u+n_p)), where n_u and n_p are the numbers of users and commodities respectively, R ∈ R^(n_u×n_p) is the user-commodity interaction matrix and R^T is the transpose of R:

A = [[0, R], [R^T, 0]].

To exploit the information of each node itself in the user-commodity bipartite graph, an identity matrix I is added to A. Meanwhile, to avoid vanishing or exploding gradients during training, normalization is performed with the diagonal degree matrix D, whose diagonal entries are the degrees of the nodes in the user-commodity bipartite graph, yielding the normalized adjacency matrix Â = D^(-1/2)(A + I)D^(-1/2).
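As an illustrative aid (not part of the patent text), a minimal NumPy sketch of this construction step, assuming a dense 0/1 interaction matrix R and the variable names used below:

    import numpy as np

    def build_normalized_adjacency(R: np.ndarray) -> np.ndarray:
        """Build the normalized adjacency matrix of the user-commodity bipartite graph.

        R is the (n_u x n_p) user-commodity interaction matrix,
        R[u, p] = 1 if user u purchased commodity p, else 0.
        """
        n_u, n_p = R.shape
        # Block adjacency matrix A = [[0, R], [R^T, 0]].
        A = np.zeros((n_u + n_p, n_u + n_p))
        A[:n_u, n_u:] = R
        A[n_u:, :n_u] = R.T
        # Add the identity matrix so each node also uses its own information.
        A_tilde = A + np.eye(n_u + n_p)
        # Normalize with the diagonal degree matrix D to keep gradient scales stable.
        d_inv_sqrt = np.power(A_tilde.sum(axis=1), -0.5)
        D_inv_sqrt = np.diag(d_inv_sqrt)
        return D_inv_sqrt @ A_tilde @ D_inv_sqrt

    # Toy usage: 3 users and 4 commodities.
    R = np.array([[1, 0, 1, 0],
                  [0, 1, 0, 0],
                  [1, 1, 0, 1]], dtype=float)
    A_hat = build_normalized_adjacency(R)
    print(A_hat.shape)  # (7, 7)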

Then, the unique id of each user and each commodity is converted into a dense vector through an embedding layer, yielding the user feature vector e_u ∈ R^d and the commodity feature vector e_p ∈ R^d, where d is the dimension of the feature vectors. The following embedding table E^(0) is built to represent the feature matrix of the user-commodity bipartite graph:

E^(0) = [e_(u_1), …, e_(u_(n_u)), e_(p_1), …, e_(p_(n_p))].

Then, t layers of graph convolution are used to aggregate the features of the node neighbors, where the propagation process is defined as:

E^(t) = σ(Â E^(t-1) W^(t)),

where W^(t) is a trainable weight matrix and σ is the LeakyReLU activation function.

Through the t layers of graph convolution, the t feature matrices E^(1) to E^(t) are obtained. These t feature matrices are concatenated to obtain the final feature matrix E = E^(1) ‖ E^(2) ‖ … ‖ E^(t), and E is then split into two parts, E_u and E_p, which serve as the vector representations e_u of the user nodes and e_p of the commodity nodes, respectively.
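An illustrative PyTorch sketch of the t-layer propagation and layer-wise concatenation described above; the module and variable names are assumptions, and Â is passed in as a dense tensor for simplicity:

    import torch
    import torch.nn as nn

    class GraphConvEncoder(nn.Module):
        """t-layer graph convolution over the user-commodity bipartite graph."""

        def __init__(self, n_users: int, n_items: int, dim: int, n_layers: int):
            super().__init__()
            self.n_users, self.n_items = n_users, n_items
            # E^(0): id embeddings of all users and commodities.
            self.embedding = nn.Embedding(n_users + n_items, dim)
            # One trainable weight matrix W^(t) per propagation layer.
            self.weights = nn.ModuleList(
                [nn.Linear(dim, dim, bias=False) for _ in range(n_layers)])
            self.act = nn.LeakyReLU()

        def forward(self, a_hat: torch.Tensor):
            e = self.embedding.weight                  # E^(0)
            layer_outputs = []
            for w in self.weights:
                e = self.act(a_hat @ w(e))             # E^(t) = LeakyReLU(A_hat E^(t-1) W^(t))
                layer_outputs.append(e)
            # Concatenate the t layer outputs, then split into user / commodity parts.
            e_all = torch.cat(layer_outputs, dim=1)
            return torch.split(e_all, [self.n_users, self.n_items], dim=0)

    # Toy usage: identity matrix as a stand-in for the real normalized adjacency.
    encoder = GraphConvEncoder(n_users=3, n_items=4, dim=8, n_layers=2)
    e_user, e_item = encoder(torch.eye(7))
    print(e_user.shape, e_item.shape)  # torch.Size([3, 16]) torch.Size([4, 16])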

Step 2: Integrate all reviews received by each commodity into a review document of that commodity; preprocess the review document by word segmentation, lemmatization, removal of stop words, and removal of words that occur extremely frequently or extremely rarely; then process the review document with the text feature extractor to obtain the vector representation a_r of the commodity reviews.
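An illustrative Python sketch of this preprocessing step, assuming English reviews, NLTK tools for tokenization, stop words and lemmatization, and example frequency thresholds that the patent does not specify:

    from collections import Counter

    import nltk
    from nltk.corpus import stopwords
    from nltk.stem import WordNetLemmatizer
    from nltk.tokenize import word_tokenize

    for pkg in ("punkt", "stopwords", "wordnet"):
        nltk.download(pkg, quiet=True)

    def preprocess_documents(docs, min_count=5, max_doc_freq=0.9):
        """Tokenize, lemmatize, drop stop words and very frequent / very rare words."""
        lemmatizer = WordNetLemmatizer()
        stops = set(stopwords.words("english"))
        tokenized = []
        for doc in docs:
            tokens = [lemmatizer.lemmatize(w.lower())
                      for w in word_tokenize(doc) if w.isalpha()]
            tokenized.append([w for w in tokens if w not in stops])
        # Frequency statistics used to filter extremely frequent / extremely rare words.
        counts = Counter(w for doc in tokenized for w in doc)
        doc_freq = Counter(w for doc in tokenized for w in set(doc))
        n_docs = max(len(tokenized), 1)
        keep = {w for w in counts
                if counts[w] >= min_count and doc_freq[w] / n_docs <= max_doc_freq}
        return [[w for w in doc if w in keep] for doc in tokenized]

    # Toy usage on two tiny review documents.
    docs = ["Great sound quality, the CD arrived quickly!",
            "The sound is great but the packaging was damaged."]
    print(preprocess_documents(docs, min_count=1, max_doc_freq=1.0))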

Specifically, the text feature extractor proceeds as follows:

First, word embedding: the word sequence of the input text is represented as [w_1, w_2, …, w_l], where l is the length of the input text, and is then converted through a word embedding layer into a sequence of word vectors [v_1, v_2, …, v_l], v_i ∈ R^(d_v), where d_v is the word embedding dimension.

To exploit the local context information in the input text, the word vector sequence is processed with a convolutional neural network, yielding the sequence of context word vectors [c_1, c_2, …, c_l], where the context representation c_i of the i-th word is computed as:

c_i = LeakyReLU(W_t × v_(i-k):(i+k) + b_t),

where v_(i-k):(i+k) is the concatenation of the word embeddings from the (i-k)-th word to the (i+k)-th word, and W_t and b_t are the convolution kernel and bias, respectively.

Considering that different words in the input text are of different informativeness, an attention mechanism is used to compute a weight [α_1, α_2, …, α_l] for the context word vectors; the context word vectors are then multiplied by the corresponding weights and summed to obtain the final representation of the input text, a = Σ_(i=1..l) α_i c_i, where the weight α_i is computed from the context representation c_i using the trainable weight matrix W_a, the bias b_a and the attention query vector q.
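An illustrative PyTorch sketch of this text feature extractor; since the exact attention scoring formula is shown only as a figure in the patent, a common tanh-based scoring with softmax normalization is assumed here:

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class TextFeatureExtractor(nn.Module):
        """Word embedding -> convolution over a (2k+1)-word window -> attention pooling."""

        def __init__(self, vocab_size: int, emb_dim: int, out_dim: int, k: int = 1):
            super().__init__()
            self.embedding = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
            # Convolution over a window of 2k+1 words yields the context vector c_i.
            self.conv = nn.Conv1d(emb_dim, out_dim, kernel_size=2 * k + 1, padding=k)
            # Attention parameters W_a, b_a (inside the Linear layer) and query vector q.
            self.attn_proj = nn.Linear(out_dim, out_dim)
            self.query = nn.Parameter(torch.randn(out_dim))

        def forward(self, word_ids: torch.Tensor) -> torch.Tensor:
            v = self.embedding(word_ids)                     # (batch, seq_len, emb_dim)
            c = F.leaky_relu(self.conv(v.transpose(1, 2)))   # (batch, out_dim, seq_len)
            c = c.transpose(1, 2)                            # (batch, seq_len, out_dim)
            # Attention weights alpha_i over the context word vectors (assumed tanh scoring).
            scores = torch.tanh(self.attn_proj(c)) @ self.query
            alpha = torch.softmax(scores, dim=1)             # (batch, seq_len)
            # Final text representation a = sum_i alpha_i * c_i.
            return (alpha.unsqueeze(-1) * c).sum(dim=1)      # (batch, out_dim)

    # Toy usage: a batch of two documents of 6 word ids each.
    extractor = TextFeatureExtractor(vocab_size=100, emb_dim=32, out_dim=16, k=1)
    a = extractor(torch.randint(1, 100, (2, 6)))
    print(a.shape)  # torch.Size([2, 16])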

Step 3: Obtain the title and description of each commodity; preprocess this text by word segmentation, lemmatization, removal of stop words, and removal of words that occur extremely frequently or extremely rarely; then process the title and description with the same text feature extractor as in Step 2 to obtain the vector representation a_t of the commodity content.

Step 4: Concatenate the vector representation e_p of the commodity node, the vector representation a_r of the reviews and the vector representation a_t of the content to obtain the final representation p of the commodity; take the vector representation e_u of the user node as the final representation u of the user.

Step 5: Compute the similarity between the user and the commodity as the dot product of the final user representation and the final commodity representation:

ŷ_(u,p) = u^T p.
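An illustrative sketch of Steps 4 and 5, concatenating the three commodity representations and scoring with a dot product; the dimensions are example values, and the user dimension must match the fused commodity dimension for the dot product to be defined:

    import torch

    def fuse_and_score(e_u, e_p, a_r, a_t):
        """p = [e_p ; a_r ; a_t], u = e_u, similarity y_hat = u^T p (computed per row)."""
        p = torch.cat([e_p, a_r, a_t], dim=-1)   # final commodity representation
        u = e_u                                   # final user representation
        return (u * p).sum(dim=-1)                # dot product for each user-commodity pair

    # Toy usage: 5 candidate pairs; the user vector matches the fused dimension 16 + 8 + 8.
    e_u = torch.randn(5, 32)
    e_p, a_r, a_t = torch.randn(5, 16), torch.randn(5, 8), torch.randn(5, 8)
    print(fuse_and_score(e_u, e_p, a_r, a_t).shape)  # torch.Size([5])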

Step 6: Optimize the parameters of the proposed method with the Bayesian personalized ranking (BPR) loss:

L = Σ_((u,i,j)∈O) -ln σ(ŷ_(u,i) - ŷ_(u,j)) + λ‖θ‖²,

where O = {(u, i, j) | i ∈ P_u^+, j ∈ P_u^-} denotes the pairwise training data, P_u^+ denotes the set of commodities purchased by user u and P_u^- denotes the set of commodities not purchased by user u; σ is the sigmoid function; θ denotes all trainable model parameters; and λ controls the L2 regularization strength to prevent overfitting.
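An illustrative PyTorch sketch of this BPR objective; the negative sampling of unpurchased commodities and the regularization weight are assumptions:

    import torch
    import torch.nn.functional as F

    def bpr_loss(pos_scores, neg_scores, params, reg_lambda=1e-4):
        """-ln sigma(y_ui - y_uj) summed over sampled (u, i, j) triples, plus L2 regularization."""
        ranking = -F.logsigmoid(pos_scores - neg_scores).sum()
        l2 = sum(p.pow(2).sum() for p in params)
        return ranking + reg_lambda * l2

    # Toy usage: scores for 4 sampled (user, purchased commodity, unpurchased commodity) triples.
    pos = torch.tensor([2.1, 0.3, 1.5, 0.8], requires_grad=True)
    neg = torch.tensor([1.0, 0.9, 0.2, 1.1], requires_grad=True)
    loss = bpr_loss(pos, neg, params=[pos, neg])
    loss.backward()
    print(loss.item())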

The experimental datasets are the CDs_and_Vinyl, Movies_and_TV and Books subsets of the Amazon review dataset. The following table describes the statistics of the three datasets:

[Table: statistics of the three datasets]

For each dataset, 70% of all interactions are used as the training set, 10% as the validation set and 20% as the test set.

Recall@K and NDCG@K are selected as the evaluation metrics; in the experiments, K = 20.
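An illustrative Python sketch of the two metrics for a single user, using the common binary-relevance definitions of Recall@K and NDCG@K, which the patent does not spell out:

    import math

    def recall_at_k(ranked_items, relevant_items, k=20):
        """Fraction of the user's held-out items that appear in the top-k recommendation list."""
        hits = sum(1 for item in ranked_items[:k] if item in relevant_items)
        return hits / len(relevant_items) if relevant_items else 0.0

    def ndcg_at_k(ranked_items, relevant_items, k=20):
        """DCG of the top-k list divided by the ideal DCG."""
        dcg = sum(1.0 / math.log2(rank + 2)
                  for rank, item in enumerate(ranked_items[:k]) if item in relevant_items)
        idcg = sum(1.0 / math.log2(rank + 2) for rank in range(min(len(relevant_items), k)))
        return dcg / idcg if idcg > 0 else 0.0

    # Toy usage: items ranked by predicted score vs. the user's held-out test items.
    ranked = ["p3", "p7", "p1", "p9", "p2"]
    relevant = {"p1", "p2", "p8"}
    print(recall_at_k(ranked, relevant, k=5), ndcg_at_k(ranked, relevant, k=5))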

The selected comparison methods are BPRMF, NGCF and DeepCoNN. The following table shows the corresponding experimental results:

[Table: experimental results of the compared methods on the three datasets]

As can be seen from the experimental results, the method provided by the present invention outperforms the comparison methods on all three datasets.

Claims (7)

1. A commodity recommendation method based on multimodal commodity feature fusion, characterized by comprising the following steps:
1.1 constructing a user-commodity bipartite graph, and obtaining vector representations of user nodes and commodity nodes through graph convolution;
1.2 obtaining the review document of a commodity, and extracting the vector representation of the commodity reviews through a convolutional neural network, wherein the vector representation of the commodity reviews is extracted as follows: all reviews received by each commodity are integrated into a review document of that commodity, and the review document is preprocessed by word segmentation, lemmatization, removal of stop words, and removal of words occurring extremely frequently or extremely rarely; features are then extracted from the commodity review document with a text feature extractor to obtain the vector representation a_r of the commodity reviews; the text feature extractor comprises the following steps: first, the word sequence of the input text is represented as [w_1, w_2, …, w_l], where l is the length of the input text; the word sequence is then converted through a word embedding layer into a sequence of word vectors [v_1, v_2, …, v_l], v_i ∈ R^(d_v), where d_v is the word embedding dimension; the word vector sequence is then processed with a convolutional neural network to obtain the sequence of context word vectors [c_1, c_2, …, c_l], where the context representation c_i of the i-th word is computed as c_i = LeakyReLU(W_t × v_(i-k):(i+k) + b_t); finally, an attention mechanism is used to compute a weight [α_1, α_2, …, α_l] for the context word vectors, and the context word vectors are multiplied by the corresponding weights and summed to obtain the final representation of the input text a = Σ_(i=1..l) α_i c_i, where α_i is computed from the context representation c_i using the trainable weight matrix W_a, the bias b_a and the attention query vector q;
1.3 obtaining the title and description of the commodity, and extracting the vector representation of the commodity content through a convolutional neural network, wherein the vector representation of the commodity content is extracted as follows: first, the title and description of the commodity are obtained and preprocessed by word segmentation, lemmatization, removal of stop words, and removal of words occurring extremely frequently or extremely rarely; features are then extracted from the commodity title and description with the text feature extractor to obtain the vector representation a_t of the commodity content;
1.4 obtaining a final user representation and a final commodity representation;
1.5 computing the similarity between the user and the commodity;
1.6 optimizing the parameters of the method through the Bayesian personalized ranking loss.
2. The commodity recommendation method based on multimodal commodity feature fusion according to claim 1, wherein the vector representations of the user nodes and the commodity nodes in 1.1 are obtained as follows:
2.1 constructing a user-commodity bipartite graph from the historical records of commodities purchased by users;
2.2 performing neighbor propagation and aggregation on the user-commodity bipartite graph through graph convolution to obtain the vector representations of the user nodes and the commodity nodes.
3. The commodity recommendation method based on multimodal commodity feature fusion according to claim 2, wherein the user-commodity bipartite graph in 2.1 is constructed by the following steps:
3.1 constructing a user-commodity bipartite graph from the historical interaction records of users and commodities, represented with the user-commodity adjacency matrix A ∈ R^((n_u+n_p)×(n_u+n_p)), where n_u and n_p are the numbers of users and commodities, R ∈ R^(n_u×n_p) is the user-commodity interaction matrix, R^T is the transpose of R, and A = [[0, R], [R^T, 0]];
3.2 to exploit the information of the nodes themselves in the user-commodity bipartite graph, adding an identity matrix I to A; meanwhile, to avoid vanishing or exploding gradients during training, performing normalization with the diagonal degree matrix D, whose diagonal entries are the degrees of the nodes in the user-commodity bipartite graph, thereby obtaining Â = D^(-1/2)(A + I)D^(-1/2).
4. The commodity recommendation method based on multimodal commodity feature fusion according to claim 2, wherein the specific steps of the graph convolution in 2.2 are as follows:
4.1 converting the unique id of each user and each commodity into a dense vector through an embedding layer, obtaining the user feature vector e_u ∈ R^d and the commodity feature vector e_p ∈ R^d, where d is the dimension of the feature vectors;
4.2 building an embedding table E^(0) to represent the feature matrix of the user-commodity bipartite graph;
4.3 using t layers of graph convolution to aggregate the features of the node neighbors, wherein the propagation process is defined as E^(t) = σ(Â E^(t-1) W^(t));
4.4 through the t layers of graph convolution, obtaining the t feature matrices E^(1) to E^(t); concatenating these t feature matrices to obtain the final feature matrix E = E^(1) ‖ E^(2) ‖ … ‖ E^(t); then dividing E into two parts E_u and E_p, which serve as the vector representations e_u of the user nodes and e_p of the commodity nodes, respectively.
5. The commodity recommendation method based on multimodal commodity feature fusion according to claim 1, wherein the final user representation and the final commodity representation in 1.4 are obtained as follows:
5.1 concatenating the vector representation e_p of the commodity node, the vector representation a_r of the reviews and the vector representation a_t of the content to obtain the final representation p of the commodity; taking the vector representation e_u of the user node as the final representation u of the user.
6. The commodity recommendation method based on multimodal commodity feature fusion according to claim 1, wherein the similarity between the user and the commodity is computed as ŷ_(u,p) = u^T p, where u is the final representation of the user, u^T is the transpose of u, and p is the final representation of the commodity.
7. The commodity recommendation method based on multimodal commodity feature fusion according to claim 1, wherein the parameters of the method are optimized through the Bayesian personalized ranking loss: L = Σ_((u,i,j)∈O) -ln σ(ŷ_(u,i) - ŷ_(u,j)) + λ‖θ‖², where O = {(u, i, j) | i ∈ P_u^+, j ∈ P_u^-} denotes the pairwise training data, P_u^+ denotes the set of commodities purchased by user u, and P_u^- denotes the set of commodities not purchased by user u; σ is the sigmoid function; θ denotes all trainable model parameters; and λ controls the L2 regularization strength to prevent overfitting.
CN202110444726.4A 2021-04-24 2021-04-24 A product recommendation method based on multimodal product feature fusion Expired - Fee Related CN113159892B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110444726.4A CN113159892B (en) 2021-04-24 2021-04-24 A product recommendation method based on multimodal product feature fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110444726.4A CN113159892B (en) 2021-04-24 2021-04-24 A product recommendation method based on multimodal product feature fusion

Publications (2)

Publication Number Publication Date
CN113159892A CN113159892A (en) 2021-07-23
CN113159892B true CN113159892B (en) 2022-05-06

Family

ID=76870143

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110444726.4A Expired - Fee Related CN113159892B (en) 2021-04-24 2021-04-24 A product recommendation method based on multimodal product feature fusion

Country Status (1)

Country Link
CN (1) CN113159892B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114155512A (en) * 2021-12-07 2022-03-08 南京理工大学 Fatigue detection method and system based on multi-feature fusion of 3D convolutional network
CN114936901B (en) * 2022-05-21 2024-05-28 山东大学 Visual perception recommendation method and system based on cross-modal semantic reasoning and fusion
CN114943588B (en) * 2022-06-15 2024-07-02 厦门大学 Commodity recommendation method based on neural network noise data
CN117786234B (en) * 2024-02-28 2024-04-26 云南师范大学 Multimode resource recommendation method based on two-stage comparison learning


Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106227815A (en) * 2015-07-22 2016-12-14 Tcl集团股份有限公司 The personalized application program function of a kind of multi-modal clue recommends method and system thereof
EP3557499A1 (en) * 2018-04-20 2019-10-23 Facebook, Inc. Assisting users with efficient information sharing among social connections
CN109559209A (en) * 2019-01-18 2019-04-02 深圳创新奇智科技有限公司 A kind of electric business clothes based on multi-modal information, which are worn, takes recommended method
CN110263256A (en) * 2019-06-21 2019-09-20 西安电子科技大学 Personalized recommendation method based on multi-modal heterogeneous information
CN111222332A (en) * 2020-01-06 2020-06-02 华南理工大学 A Product Recommendation Method Combining Attention Network and User Sentiment

Also Published As

Publication number Publication date
CN113159892A (en) 2021-07-23

Similar Documents

Publication Publication Date Title
CN113159892B (en) A product recommendation method based on multimodal product feature fusion
CN110648163B (en) Recommendation algorithm based on user comments
CN113468227A (en) Information recommendation method, system, device and storage medium based on graph neural network
WO2023065859A1 (en) Item recommendation method and apparatus, and storage medium
CN109033294B (en) Mixed recommendation method for integrating content information
CN109241424A (en) A kind of recommended method
EP3300002A1 (en) Method for determining the similarity of digital images
CN108563755A (en) A kind of personalized recommendation system and method based on bidirectional circulating neural network
CN107357793A (en) Information recommendation method and device
CN112085525A (en) User network purchasing behavior prediction research method based on hybrid model
CN116150480A (en) User personalized demand prediction method integrating multi-mode comment information
CN114238758A (en) A user portrait prediction method based on multi-source cross-border data fusion
CN112734519B (en) Commodity recommendation method based on convolution self-encoder network
CN114330291A (en) Text recommendation system based on dual attention mechanism
CN113159891A (en) Commodity recommendation method based on fusion of multiple user representations
CN112818256B (en) A recommendation method based on neural collaborative filtering
CN111930926B (en) Personalized recommendation algorithm combined with comment text mining
Gu et al. Fashion coordinates recommendation based on user behavior and visual clothing style
CN117745371A (en) A fair recommendation method and system based on conditional diffusion model
CN110321565B (en) Real-time text emotion analysis method, device and equipment based on deep learning
CN115204967A (en) Recommendation method integrating implicit feedback of long-term and short-term interest representation of user
CN115344794A (en) A tourist attraction recommendation method based on knowledge graph semantic embedding
CN108984551A (en) A kind of recommended method and system based on the multi-class soft cluster of joint
CN114022233A (en) A Novel Product Recommendation Method
CN116738035B (en) Recommendation rearrangement method based on window sliding

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
EE01 Entry into force of recordation of patent licensing contract

Application publication date: 20210723

Assignee: Guangxi wisdom Valley Technology Co.,Ltd.

Assignor: GUILIN University OF ELECTRONIC TECHNOLOGY

Contract record no.: X2022450000202

Denomination of invention: A Product Recommendation Method Based on Multimodal Product Feature Fusion

Granted publication date: 20220506

License type: Common License

Record date: 20221125

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220506