CN108280738A

CN108280738A - Method of Commodity Recommendation based on image and socialized label

Info

Publication number: CN108280738A
Application number: CN201711324898.8A
Authority: CN
Inventors: 赵伟; 黄若谷; 管子玉; 王泉
Original assignee: Xidian University
Current assignee: Xidian University
Priority date: 2017-12-13
Filing date: 2017-12-13
Publication date: 2018-07-13

Abstract

The invention discloses a kind of Method of Commodity Recommendation based on image and socialized label, mainly solves the problems, such as that existing recommendation method cannot capture the preference of user well.Its implementation is：1. extracting color characteristic, textural characteristics and the shape feature of commodity picture；2. designing optimal semantic space：According to user, the contact of label and commodity picture between any two builds bipartite graph and neighbor relationships figure, and by user, label and commodity picture, these three data are mapped in the same space, i.e., optimal semantic space；3. in optimal semantic space according to the Euclidean distance between user and commodity picture to commodity picture according to from being closely ranked up to remote sequence, the top n commodity picture nearest apart from user is recommended into user.The present invention can more accurately calculate the similarity between picture, and the effective complicated relational structure handled between three class object of label data obtains and preferably recommends performance, can be used for the personalized recommendation system of e-commerce.

Description

Commodity recommendation method based on images and social labels

Technical Field

The invention belongs to the technical field of data mining, and particularly relates to a commodity recommendation method which can be used for an electronic commerce personalized recommendation system.

Background

In recent years, with the increasing scale of electronic commerce, the number and types of products are rapidly increasing, and customers need to spend a lot of time to find the products they want to buy. This process of browsing through large amounts of unrelated information and products will undoubtedly result in a constant loss of consumers who are overwhelmed with a huge amount of information. To solve this problem, personalized recommendation systems have been developed. The personalized recommendation system is an active information service system established on the basis of mass data mining, and is used for helping an e-commerce website to provide completely personalized decision support and information service for shopping of customers.

Most of the traditional recommendation systems utilize user scoring data, use a collaborative filtering algorithm to perform calculation screening, and then recommend the user. However, the rating data cannot intuitively reflect the preference of the user, such as rating of a movie, and only whether the user likes the movie or not but not the specific attention point of the user to the movie can be known by using the rating data of the user. Therefore, recommendation systems based on user scoring data do not capture user specific preferences well.

Disclosure of Invention

The invention aims to provide a commodity recommendation method based on images and social labels to better capture specific preferences of a user aiming at the defects of the prior art.

The technical scheme of the invention is realized as follows:

technical principle

The social label can reflect the preference of the user more directly, because the label provides a meaningful description for the resource, namely the commodity picture, for example, for the picture of the clothes, whether the clothes is 'rural wind' or 'sweet fair maiden' can be known through the label, therefore, the label embodies the semantic understanding of the user for the commodity, and implies the preference of the user. In addition, since tag data has three types of objects: the user, the label and the commodity picture have complex association structures, so that the commodity recommendation cannot be performed by using the traditional collaborative filtering algorithm. Therefore, the invention designs a multi-class associated object dimension reduction algorithm based on a graph, which constructs a bipartite graph for the relationship among a user, a label and a commodity picture, constructs a neighbor relationship graph for a commodity picture set, and maps the three graphs into the same space on the premise of keeping the structures of the two graphs, wherein the space is called as a semantic space because the semantic association relationship of label data is kept. After the optimal semantic space is obtained, the Euclidean distance between the user and the commodity picture is calculated in the semantic space, the commodity corresponding to the picture closest to the user is recommended to the user, and the recommended commodity is displayed in a picture form.

Second, the technical scheme

According to the principle, the technical scheme of the invention comprises the following steps:

(1) extracting content features, namely color features, texture features and shape features, of the commodity picture;

(2) designing an optimal semantic space:

2a) learning a weight parameter vector between the three characteristics of the commodity pictures by using a multi-core learning algorithm to obtain the final similarity between the two commodity pictures, and fusing the weight parameter vector and the calculated similarity between the commodity pictures to obtain a similarity matrix;

2b) defining the tag data as (u)_i,d_j,t_k) U is a user set, D is a commodity picture set, T is a label set, U_iIndicating the ith in UDoor, d_jRepresents the jth commodity picture, t, in D_kDenotes the kth tag in T, tag data (u)_i,d_j,t_k) Representing user u_iUse of the tag t_kTo mark picture d_j；

2c) Constructing a weighted bipartite graph for the relation between the user and the label in the label data, constructing a weighted bipartite graph for the relation between the label and the commodity picture in the label data, constructing a non-weighted bipartite graph for the relation between the user and the commodity picture in the label data, and constructing a neighbor relation graph for the commodity picture set;

2d) on the premise of keeping two graph structures of a bipartite graph and a neighbor relation graph in the step 2c), mapping three data of a user, a label and a commodity picture into the same space, wherein the space is the optimal semantic space;

(3) in the optimal semantic space obtained in the step (2), the commodity pictures are sorted from near to far according to the Euclidean distance between the user and the commodity pictures, the first N commodity pictures closest to the user are recommended to the user, and the range of N is (1, + ∞).

The invention has the following beneficial effects:

1. the invention can reflect the preference of the user more directly and obtain better recommendation performance because of using the label data instead of the user scoring data.

2. According to the method, the features of the three pictures are extracted, the weight relation among the features is learned by utilizing a multi-core learning technology, and the picture similarity after weighted fusion among the features is calculated.

3. According to the method, the bipartite graph and the neighbor relation graph are constructed to establish the complex relation among the user, the label and the commodity picture, so that the problem of data sparsity is solved, and the recommendation method is more effective and accurate.

4. The semantic space designed by the invention can effectively process the complex association structure among three types of objects of the label data, namely the user, the label and the commodity picture, and the traditional collaborative filtering algorithm can not process the label data.

Drawings

FIG. 1 is a general flow chart of an implementation of the present invention;

FIG. 2 is a semantic space sub-flow diagram of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the detailed description and specific examples, while indicating the scope of the invention, are intended for purposes of illustration only and are not intended to limit the scope of the invention.

Referring to fig. 1, the implementation steps of the invention are as follows:

step 1, extracting content features, namely color features, texture features and shape features, of a commodity picture;

1.1) extracting color features of the commodity picture:

establishing a color histogram, and for the picture Q, counting the color histogram on a color vector space, wherein the color histogram is a one-dimensional discrete function, namely:

in the formula, n_kObtaining a color histogram H (k) of the image Q by taking the number of pixels with a color characteristic value of k, N as the total number of image pixels and l as the number of the color characteristic value;

1.2) extracting the texture features of the commodity picture:

describing texture features of the picture by extracting scale invariant feature sift descriptors;

1.2.1) let the picture be I (x, y), (x, y) is the spatial scale coordinate, then the picture scale space L (x, y, σ) is:

L(x,y,σ)＝G(x,y,σ)*I(x,y)，

wherein G (x, y, σ) is a gaussian function, σ is a scale parameter, and x represents a multiplier;

1.2.2) calculating the difference of gaussians D (x, y, sigma) of adjacent scale images:

D(x,y,σ)＝L(x,y,k₁σ)-L(x,y,σ)

wherein k is₁GetL (-) is a scale space;

after the Gaussian difference of the images with adjacent scales is calculated, a series of Gaussian difference images are obtained;

1.2.3) solving extreme points in the space of the Gaussian difference image, and respectively comparing a pixel point in each Gaussian difference image with all adjacent points: if a certain pixel point is larger or smaller than all the adjacent points, the pixel point is an extreme point, and the Gaussian difference of the extreme point is as follows:

wherein,the extreme value of Z is represented by (x, y, sigma) which is the offset of the pixel point, and D (·) is a Gaussian difference function;

1.2.4) determining the extreme point, it is necessary to doPerforming curve fitting on the DoG function in the scale space to screen extreme points and remove low-contrast points, namely for each candidate extreme pointAnd (4) judging:

if it is notIf the candidate extreme point is less than 0.03, judging that the candidate extreme point is an extreme point with low contrast, removing the candidate extreme point, and otherwise, keeping the candidate extreme point;

1.2.5) removing the extreme point with low contrast, and then removing the extreme point with abnormal principal curvature, because an extreme value of a poorly defined gaussian difference has a larger principal curvature at a place crossing an edge and a smaller principal curvature at a direction perpendicular to the edge, and ideally, the principal curvatures of the extreme points in any directions are the same, it is necessary to detect whether the principal curvature is below a certain threshold r, and in order to detect whether the principal curvature is below a certain threshold r, only the following inequality needs to be detected:

wherein, tr (H) represents the trace of the matrix H, det (H) represents the determinant of the matrix H, the matrix H is a Hession matrix, and the expression is as follows:

wherein D is_xxIs the result of twice derivation of D (x, y, σ) of a scale image in the x direction in DoG space; d_yyIs the result of twice derivation of D (x, y, σ) of a scale image in the y direction in DoG space; d_xyThe method is characterized in that D (x, y, sigma) of an image with a certain scale is derived once in the x direction of a DoG space and then derived once in the y direction; when the inequalities are satisfied, the extreme point is retained, otherwise, the extreme point is removed and retainedThe remaining extreme points are the characteristic points to be found;

1.2.6) after the positions of the image feature points are determined, a direction needs to be assigned to the image feature points, and the step is realized by solving the gradient of the neighborhood of each feature point, namely defining the gradient amplitude m (x, y) and the gradient direction theta (x, y) as follows:

θ(x,y)＝tan^-1(L(x,y+1)-L(x,y-1))/(L(x+1,y)-L(x-1,y))

wherein L (-) is a scale space function;

1.2.7) taking an image feature point as a center, defining a region, constructing a direction histogram by using gradient amplitudes and gradient directions of all points in the region, wherein the gradient amplitude is a vertical axis, the gradient direction is a horizontal axis, selecting a gradient direction corresponding to an item with the maximum gradient amplitude from the direction histogram as a main direction of the feature point, and if the gradient amplitudes corresponding to other gradient directions are more than 80% of the gradient amplitudes corresponding to the main direction, also taking the gradient direction as an auxiliary direction of the image feature point;

1.2.8) after the image characteristic points are detected, determining the descriptors of the characteristic points according to the following steps:

firstly, rotating the neighborhood of the feature point by theta by taking the feature point as a circle center⁰Where θ is the direction of the feature point;

secondly, taking a 16 multiplied by 16 neighborhood window by taking the feature point as a center in the rotated image, wherein each cell represents one pixel in the neighborhood window of the feature point; uniformly dividing a 16 x 16 rectangular window into 16 sub-regions, increasing the weight value of a neighborhood close to the feature point by adopting a Gaussian blur method, and reducing the weight value of a neighborhood far away from the feature point; then, calculating gradient histograms of 8 directions in each region to obtain a feature vector of the feature point descriptor, wherein the vector is a 128-dimensional vector with 4 multiplied by 8;

thirdly, normalizing the feature vector of the feature point descriptor, and setting S as the feature vector of the feature point descriptor, namely S ═ S (S)₁,s₂,......s₁₂₈) Normalizing S to obtain normalized feature point descriptor

To reduce the effect of large gradient values, isSetting a threshold value of 0.2 ifIf the value of one dimension is greater than 0.2, setting the value to 0.2 and then pairing againNormalization processing is carried out, and finally 128-dimensional normalized feature point descriptors are obtained

1.2.9) pairs of normalized feature point descriptorsClustering to obtain sift characteristic vectors;

in order to obtain fixed-length sift feature vectors, normalized feature point descriptors need to be obtainedClustering is performed because the number of descriptors contained in different pictures is different, which results in thatThe sift characteristic vectors extracted from different pictures are lengthened, so the clustering number is set to be 500, k-means clustering is carried out on the characteristic point descriptors contained in each picture, a fixed-length 500-dimensional clustering result can be obtained from each picture, and the result is the final sift characteristic vector and is used for describing the texture characteristics of the commodity picture;

1.3) extracting the shape characteristics of the commodity picture:

1.3.1) utilizing a Gabor filter to perform sampling filtering on the commodity picture according to the following formula:

wherein,

l₁is the size of the filter; k is a radical of₂Is a positive constant; sigma₁Is the standard deviation of the gaussian function;is scale l₁The total number of directions of the lower part,

1.3.2) convolving the commodity picture with a Gabor filter to obtain a filtered commodity picture:

dividing the filtered commodity picture into 4 multiplied by 4 grids, taking an average value in each grid, and finally putting the average values obtained in grids of all directions and scales in a vector, wherein the vector is a final universal search tree feature vector and is used for describing the shape features of the commodity picture;

thus, the extraction of the color feature, the texture feature and the shape feature of the commodity picture is completed.

And 2, designing a semantic space.

Referring to FIG. 2, the specific implementation of this step is as follows

2.1) constructing a similarity matrix:

the invention uses the multi-core learning technology to construct the similarity matrix among the commodity pictures, namely three characteristics of color characteristics, texture characteristics and shape characteristics are extracted from the commodity pictures, and when the similarity among the commodity pictures is calculated, the similarity among the three characteristics and the commodity pictures needs to be weighted and fused to obtain the similarity among the weighted and fused commodity pictures, and then the similarity matrix is constructed according to the similarity among the weighted and fused commodity pictures, and the method comprises the following implementation steps:

2.1.1) calculating the similarity between the commodity pictures:

selecting a Gaussian kernel as the kernel function, the kernel function K (x)_i,x_j) Is represented as follows:

wherein x is_iFeature vector, x, representing the ith product picture_jThe characteristic vector of the jth commodity picture is represented, i is more than or equal to 1, j is more than or equal to 1, i is not equal to j, and sigma is₂Denotes the standard deviation, | x_i-x_j||²Represents a vector (x)_i-x_j) 2 norm of, kernel function K (x)_i,x_j) Representing the similarity between the ith commodity picture and the jth commodity picture if K (x)_i,x_j) The closer to 1, the more similar the two commodity pictures are;

2.1.2) degree of similarity K (x)_i,x_j) Normalization is performed to select kernel function K (x)_i,x_j) All values are [0,1 ]]The normalized similarity K (i, j) is as follows:

2.1.3) fusing the normalized similarity K (i, j) with the weight parameter vector η to obtain the similarity K (i, j, η) between the two weighted and fused commodity pictures:

wherein i represents the ith commodity picture, j represents the jth commodity picture, H₁The number of feature vectors indicating a commercial picture is [ η ] as parameter η₁,η₂,......,η_H]^TIs a vector of weight parameters, parameters η, to be learned_vRepresents the weight parameter vector η ═ η₁,η₂,......,η_H]^TThe v-th parameter, K_v(i, j) represents the similarity between the ith commodity picture and the jth commodity picture under the vth feature;

2.1.4) defining the kernel function K in the ideal state_ideal(i, j) is:

wherein y is_iIndicates the category of the ith product picture, y_jA category representing a jth commodity picture;

2.1.5) define the square loss function:

for the ith commodity picture and the jth commodity picture, the similarity K (i, j, η) between the two commodity pictures needs to be close to K as much as possible_ideal(i, j) accordinglyThe square loss function l (i, j, η) is defined as:

l(i,j,η)＝(K(i,j,η)-K_ideal(i,j))²，

the smaller the value of l (i, j, η), the closer the similarity K (i, j, η) is to K_ideal(i,j)；

2.1.6) defining an optimization calculation formula to obtain a weight parameter vector η to minimize the loss function l (i, j, η), wherein the specific calculation formula is as follows:

wherein q represents the total number of the commodity pictures, t_ijThe weight of the loss function representing the ith and jth product pictures, λ is a compromise parameter,representing a regularization term;

the above equation finally yields a weight parameter vector η, which is added with a regularization termThe formula is a typical quadratic programming problem, can be directly solved by using a common convex optimization software package, and adopts a data set which is a large-scale image set ImageNet of Google in order to ensure that the obtained weight parameter vector η is more accurate;

2.1.7) defining a similarity matrix W based on the obtained weight parameter vector η as:

2.2) setting parameters:

defining the tag data as (u)_i,d_j,t_k) U is a user set, D is a commodity picture set, T is a label set, U_iRepresents the ith user in U, d_jRepresents the jth commodity picture, t, in D_kDenotes the kth tag in T, tag data (u)_i,d_j,t_k) Representing user u_iUse of the tag t_kTo mark picture d_j；

2.3) constructing a weighted bipartite graph of the relation between the user and the label in the label data:

2.3.1) define the following first matrix:

wherein u is_iRepresents the ith user in U, d_kDenotes the kth product picture in D, t_jRepresents the jth label in T, and B represents the set of all label data;

2.3.2) for the first matrixAnd normalizing to obtain a normalized first matrix as follows:

then R is^utThe weighted bipartite graph between the users and the labels is formed by structure matrixing;

2.4) constructing a weighted bipartite graph for the relation between the labels and the commodity pictures in the label data:

2.4.1) defining the second matrix

Wherein,u_kdenotes the kth user in U, d_jRepresents the jth commodity picture, t, in D_iRepresenting the ith label in T, and B representing the set of all label data;

2.4.2) for the second matrixAnd normalizing to obtain a normalized second matrix as follows:

R^tdweighted bipartite graph between labels and commodity pictures in structural matrixing

2.5) establishing an unauthorized bipartite graph for the relation between the user and the commodity picture in the tag data

A third matrix R is defined as^ud：

Wherein u is_iRepresents the ith user in U, d_jRepresents the jth commodity picture, t, in D_kRepresenting the kth label in T, and B representing the set of all label data;

R^udthe method comprises the following steps of (1) forming an unweighted bipartite graph between users and commodity pictures in a structural matrixing manner;

2.6) constructing a neighbor relation graph for the commodity picture set:

defining a fourth matrix as follows:

wherein d is_iRepresenting the ith product picture in D, D_jTo representJ-th commodity picture in D, sim (D)_i,d_j) Representing the similarity between the ith commodity picture and the jth commodity picture;

W_ija neighbor relation graph of the structural matrixing commodity picture set;

2.7) constructing semantic spaces

In this step, in order to learn an optimal semantic space, a graph-based subspace learning idea is fused, that is, for two strongly correlated objects, they should be very close in the semantic space;

2.7.1) consider the simplest case, the semantic space being one-dimensional, i.e. when k is 1, the following vector and loss function are defined:

define a vector of | U | × 1: f ═ f₁,f₂,...f_|U|}，

Define a vector of | T | × 1: g ═ g₁,g₂,...g_|T|}，

Define a vector of | D | × 1: p ═ p₁,p₂,...p_|D|}，

The loss function is defined as:

wherein f is_iE f, representing the coordinate of the ith user in a one-dimensional space; g_iE g, representing the coordinate of the ith label in a one-dimensional space; g_jE g, representing the coordinate of the jth label in a one-dimensional space; p is a radical of_iE.g. p, representing the coordinate of the ith commodity picture in one dimension; p is a radical of_jThe parameter α, gamma and η are four scaling factors with different values and satisfy 0 < α, gamma, η <1, α + β + gamma + η as 1;

loss function as described aboveThe first term on the right of the equation indicates if user u_iFrequently used labels t_jThey should be mapped very close to each other in space, i.e., f_iAnd g_jThe squared difference of (c) is to be as small as possible; the second item represents if many users use the tag t_iDe-labeling picture d_j，t_iAnd d_jShould also be very close, i.e. g_iAnd p_jThe difference is not large; the third term represents each user u_iAll should be in contact with user u_iMarked picture d_jVery close, i.e. f_iAnd p_jThe difference is not large; the last item is a smooth item of the internal structure of the commodity picture, namely for two very similar commodity pictures d_iAnd d_jThe influence of the above four terms on the loss function is determined by parameters α, gamma, η;

2.7.2) to solve the above-mentioned loss function Q (f, g, p), it is necessary to define seven diagonal matrices, D respectively^ut,D^tu,D^td,D^dt,D^ud,D^duAnd D, wherein:

D^utis a diagonal matrix of size | Ux | U |, D^utIs the matrix R^utThe sum of the elements of row i of (1);

D^udis a diagonal matrix of size | Ux | U |, D^udIs the matrix R^udThe sum of the elements of row i of (1);

D^tuis a diagonal matrix of size | T | × | T |, D^tuIs the matrix R^utThe sum of the ith column element of (1);

D^tdis a diagonal matrix of size | T | × | T |, D^tdIs the matrix R^tdThe sum of the elements of row i of (1);

D^duis a diagonal matrix of size | D | × | D |, D^duIs the matrix R^udThe sum of the ith column element of (1);

D^dtis a diagonal matrix of size | D | × | D |, D^dtIs the matrix R^tdThe sum of the ith column element of (1);

d is a diagonal matrix with size | D | × | D |, the (i, i) -th element in D is the matrix W_ijThe sum of the elements of row i of (1);

the first term of the loss function Q (f, g, p) can be rewritten as:

the second term of the loss function Q (f, g, p) is also rewritten as:

the third term of the loss function Q (f, g, p) is also rewritten as:

the fourth term for Q (f, g, p) can be rewritten as:

wherein the matrix L ═ D-W_ijIs a Laplace matrix;

2.7.3) the loss function Q (f, g, p) is represented as a matrix according to equations <1> to <5 >:

Q₁(f,g,p)＝α(f^TD^utf+g^TD^tug-2f^TR^utg)

+β(g^TD^tdg+p^TD^dtp-2g^TR^tdp)

+γ(f^TD^udf+p^TD^dup-2f^TR^udp)

+ηp^TLp

＝f^T(αD^ut+γD^ud)f+g^T(αD^tu+βD^td)g<6>

+p^T(βD^dt+γD^du+L)p

-2αf^TR^utg-2βg^TR^tdp-2γf^TR^udp

2.7.4) to eliminate the effect of any scaling factors α, γ, η, Q is added₁Normalized (f, g, p) to obtain normalized

Formula (II)<7>Is equivalent to f^Tf+g^Tg+p^TMinimizing loss function Q under constraint of p-1₁(f,g,p)；

2.7.5) defines an augmented vector h ═ f^Tg^Tp^T]^TSimplified form<7>；

Due to f^TR^utg，g^TR^tdp and f^TR^udp is a scalar, the transpose of the scalar is equal to the scalar itself, so that the normalizedCan be abbreviated as:

where e is a unit vector of the vector,for a semi-positive definite matrix, the following is defined:

2.7.6) maximizing the global variance of the target subspace;

formula (II)<8>Denotes that h is required to minimize the loss function Q (f, g, p)^Th maximization, h maximization^Th is a better method to maximize the global variance of the target subspace, and the specific method is as follows:

for a random variable x, the variance is:

var(x)＝∫(x-μ)²dP(x),μ＝∫xdP(x)<9>

where dP (x) represents the probability of x being observed;

in the discrete case, the probability of observing a node on the graph can be estimated by the degree of the node, so the global variance of f, g, and p is:

matrix arrayIs a diagonal matrix defined as follows:

2.7.7) according to formula<8>And formula<10>Will lose a functionIs rewritten as

2.7.8) in practical applications, in order to better capture the relationship between objects, it is often necessary to obtain a semantic space in k dimension (k > 1), and therefore, it is necessary to extend the above calculation process to the k dimension case, where k > 1, defines the following vectors, matrices and loss functions:

defining a matrix F ═ k of | U | ×₁f₂......f_k]，

Defining a matrix G ═ k of | T | ×₁g₂......g_k]，

Defining a matrix P ═ P | D × k₁p₂......p_k]，

Definition matrix H₃＝[h₁h₂......h_k]Wherein h is_i＝[f_i ^Tg_i ^Tp_i ^T]^T，

For each dimension i e {1, 1_i,g_i,p_i) And maximize (f)_i ^Tf_i+g_i ^Tg_i+p_i ^Tp_i) Thus, a loss function in k-dimension is definedThe following were used:

wherein f is_ie.F represents the coordinates of all users in the user set U in the ith dimension, g_iE G represents the coordinates of all the labels in the label set T in the ith dimension, p_iE.g. P represents the coordinates of all the commodity pictures in the commodity picture set D in the ith dimension; tr (-) represents the trace of the matrix;

2.7.9) like the case where k is 1, it is also necessary to maximize the global variance of the target subspace when k > 1, so in the case where k > 1, the equation <12> is rewritten as:

solution formula<13>Obtaining a semi-positive definite matrixAnd diagonal matrixThen, according to Rayleigh theory, solvingAnd removing the eigenvectors with equal components from the eigenvectors corresponding to the first k smallest eigenvalues, wherein lambda is the eigenvalue, and the matrix formed by the remaining eigenvectors is the optimal semantic space M.

And 3, calculating the Euclidean distance between the user and the commodity picture.

Giving a target user, and calculating the Euclidean distance between the target user and the commodity picture through the corresponding row vector of the target user and the commodity picture in the optimal semantic space M:

using n-dimensional vectors A₁＝(x₁₁,x₁₂,...x_1k...x_1n) Indicates that the user is at the mostCorresponding row vectors in the preferred semantic space M, where x_1kIs represented by A₁The k value is more than or equal to 1 and less than or equal to n,

using n-dimensional vectors A₂＝(x₂₁,x₂₂,...x_2k...x_2n) Representing the corresponding line vector of the commodity picture in the optimal semantic space M, wherein x_2kIs represented by A₂The k-th value of the number k,

calculating the Euclidean distance d between the user and the commodity picture₁₂Comprises the following steps:

and sequencing all the commodity pictures according to the obtained Euclidean distance in the order from near to far, wherein the larger the Euclidean distance is, the more irrelevant the user is to the commodity pictures, and returning the commodity recommendation corresponding to the first N commodity pictures closest to the user, wherein the range of N is (1, + ∞), so that the personalized commodity recommendation work for the user is completed.

The above description is only one specific example of the present invention and should not be construed as limiting the scope of the invention; it should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the inventive concept, which falls within the scope of the present invention.

Claims

1. A commodity recommendation method based on images and social labels is characterized by comprising the following steps:

(2) designing an optimal semantic space:

2b) defining the tag data as (u)_i,d_j,t_k) U is a user set, D is a commodity picture set, T is a label set, U_iRepresents the ith user in U, d_jRepresents the jth commodity picture, t, in D_kDenotes the kth tag in T, tag data (u)_i,d_j,t_k) Representing user u_iUse of the tag t_kTo mark picture d_j；

2. The method according to claim 1, wherein the learning of the weight relationship between the three features in the commodity picture by using the multi-kernel learning algorithm in step 2a) is to learn the weight parameter vector between the color feature, the texture feature and the shape feature of the commodity picture by using the following kernel functions:

wherein i represents the ith commodity picture, j represents the jth commodity picture, H₁The number of feature vectors indicating a commercial picture is [ η ] as parameter η₁,η₂,......,η_H]^TIs the weight parameter to be learnedVector, parameter η_vRepresents the weight parameter vector η ═ η₁,η₂,......,η_H]^TThe v-th parameter, K_v(i, j) represents the similarity between the two product pictures under the v-th feature, and K (i, j, η) represents the similarity between the two final product pictures.

3. The method of claim 1, wherein the step 2c) of constructing a weighted bipartite graph of the association between the user and the tags is performed by:

2c1) the following matrix is defined:

2c2) for the above matrixNormalization is carried out to obtain a normalized matrix:

R^uta weighted bipartite graph between users and tags matrixed to the structure.

4. The method according to claim 1, wherein the step 2c) of constructing the weighted bipartite graph for the association between the labels and the commodity pictures is performed as follows:

2c3) define the following matrix

Wherein u is_kIndicating the k-th use in UDoor, d_jRepresents the jth commodity picture, t, in D_iRepresenting the ith label in T, and B representing the set of all label data;

2c4) for the above matrixNormalization is carried out to obtain a normalized matrix:

R^tdthe method is a weighted bipartite graph between the labels and the commodity pictures which are structured in a matrix mode.

5. The method according to claim 1, wherein the non-weighted bipartite graph is constructed in step 2c) for the association between the user and the commodity picture, and the following matrix is defined:

R^udthe method is an unweighted bipartite graph between users and commodity pictures which are structured in a matrix mode.

6. The method according to claim 1, wherein the constructing of the neighborhood relationship graph for the commodity picture set in step 2c) is performed by defining a matrix as follows:

wherein d is_iRepresenting the ith product picture in D, D_jRepresenting the jth commodity picture in the D;

W_ijproximity between sets of pictures of a commodity matrixed for structureAnd (4) a neighborhood relationship graph.

7. The method according to claim 1, wherein the three data of the user, the label and the commodity picture are mapped into the same space in the step 2d), and the following steps are performed:

2d1) considering the semantic space as one-dimensional, a vector f of | U | × 1 is defined, a vector g of | T | × 1 is defined, a vector p of | D | × 1 is defined, and the following loss function is defined:

wherein f is_iRepresents user u_iCoordinates in one dimension, g_iRepresents a label t_iCoordinates in one dimension, g_jRepresents a label t_jCoordinate in one dimension, p_jCommodity picture d_jCoordinates in one dimension, R^utWeighted bipartite graph, R, between users and labels that is structured into a matrix^tdIs a weighted bipartite graph, R, between structured matrixed labels and commodity pictures^udIs an unweighted bipartite graph, W, between users and commodity pictures in a structural matrixing_ijFor the neighbor relation graph among the structure matrixed commercial picture sets, parameters α, γ, η satisfy α + β + γ + η as 1, 0 < α, γ, η < 1;

2d2) optimizing and solving the loss function, expanding the solving process to the situation of k dimension, and finally obtaining a semi-positive definite matrixAnd a diagonal matrix

Wherein D is^utIs a diagonal matrix of size | Ux | U |, D^utIs the matrix R^utThe sum of the elements of row i of (1); d^udIs a diagonal matrix of size | Ux | U |, D^udIs the matrix R^udThe sum of the elements of row i of (1); d^tuIs a diagonal matrix of size | T | × | T |, D^tuIs the matrix R^utThe sum of the ith column element of (1); d^tdIs a diagonal matrix of size | T | × | T |, D^tdIs the matrix R^tdThe sum of the elements of row i of (1); d^duIs a diagonal matrix of size | D | × | D |, D^duIs the matrix R^udThe sum of the ith column element of (1); d^dtIs a diagonal matrix of size | D | × | D |, D^dtIs the matrix R^tdThe sum of the ith column element of (1); d is a diagonal matrix with size | D | × | D |, the (i, i) -th element in D is the matrix W_ijThe element of row i.

2d3) Obtaining a semi-positive definite matrixAnd diagonal matrixThen, according to Rayleigh theory, solvingAnd removing the eigenvectors with equal components from the eigenvectors corresponding to the first k smallest eigenvalues, wherein a matrix formed by the remaining eigenvectors is the optimal semantic space.