CN108717654A - A kind of more electric business intersection recommendation method based on cluster feature migration - Google Patents

A kind of more electric business intersection recommendation method based on cluster feature migration Download PDF

Info

Publication number
CN108717654A
CN108717654A CN201810470713.2A CN201810470713A CN108717654A CN 108717654 A CN108717654 A CN 108717654A CN 201810470713 A CN201810470713 A CN 201810470713A CN 108717654 A CN108717654 A CN 108717654A
Authority
CN
China
Prior art keywords
matrix
user
cluster
item
feature
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810470713.2A
Other languages
Chinese (zh)
Other versions
CN108717654B (en
Inventor
吴骏
方贺贺
张怡
杜云涛
王崇骏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University
Original Assignee
Nanjing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University filed Critical Nanjing University
Priority to CN201810470713.2A priority Critical patent/CN108717654B/en
Publication of CN108717654A publication Critical patent/CN108717654A/en
Application granted granted Critical
Publication of CN108717654B publication Critical patent/CN108717654B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of more electric business based on cluster feature migration to intersect recommendation method, includes the following steps 1) rating matrix construction phase:A acquires each electric quotient data;Noise is removed in b data cleansings;C builds rating matrix;D terminates;2) auxiliary domain learns the stage:A obtains rating matrix;B extracts user/item characteristic matrix;C is to user/item characteristic Matrix Cluster;D calculates average score;E constructs cluster feature matrix;F repeats above step to terminating for each auxiliary electric business;3) aiming field learns the stage:A obtains target electric business rating matrix;B migrates cluster feature, completes matrix decomposition.C reconstructs target electric business rating matrix;D generates recommendation list;E terminates.The present invention provides a kind of new resolving ideas using transfer learning technology for Deta sparseness, cold start-up and diversity existing for electric business commending system and accuracy awkward predicament problem.

Description

Multi-provider cross recommendation method based on clustering feature migration
Technical Field
The invention relates to a multi-provider cross recommendation method, which solves the problem that an e-provider recommendation system is low in recommendation accuracy under the conditions of extremely sparse data and cold start.
Background
With the continuous expansion of the scale of the e-commerce website, the problem of information overload becomes more and more serious, and a very potential method for solving the problem is a personalized recommendation system. Such as the well-known e-commerce platform Amazon, recommends other products to the user that may be of interest using behavior records such as clicks, browses, favorites, and shopping carts that reflect the user's purchasing interest. According to the preference of each user, the intelligent content recommendation of thousands of people and thousands of faces is carried out, so that key indexes such as user activity, stay time, payment rate, retention rate and the like can be effectively improved, and huge values are created for the society and enterprises. However, the rapid increase of the number of users and commodities brings about a plurality of troubles such as data sparsity, cold start, diversity and accuracy difficulty and the like to the traditional e-commerce recommendation system.
Currently, most e-commerce recommendation systems are performed in a single domain. The internet is an open environment, almost every user cannot generate data in only one field, the user can shop in Taobao, Amazon and Jingdong at the same time, and can listen to songs on Internet music, QQ music and dog music at the same time. The single field recommendation cannot effectively share internet resources, so that information is relatively blocked, and an information island is easily formed.
The cross-domain recommendation aims to extract knowledge from other fields containing rich data through information sharing and complementation between domains, provides help for recommendation of a target domain, can relieve the problems of sparsity and cold start of data on one hand, and can also give consideration to diversity and accuracy on the other hand, so that the cross-domain recommendation becomes a research hotspot in the field of recommendation systems. The invention provides a new solution for the problems of the e-commerce recommendation system by applying the transfer learning to the e-commerce recommendation from the consideration of cross-domain recommendation technology.
Disclosure of Invention
The purpose of the invention is as follows: the invention aims to solve the technical problems that in consideration of the challenges of data sparsity, cold start, diversity and accuracy dilemma and the like of a traditional e-commerce recommendation system, a multi-e-commerce cross recommendation method based on clustering feature migration is provided by introducing a migration learning idea: firstly, extracting a user/item feature matrix from each auxiliary e-commerce; then, clustering is carried out on the users/projects, and the average scores of the user clusters on the project clusters are calculated to form clustering characteristics which are used as domain knowledge and transmitted to the target e-commerce; and finally, migrating the domain knowledge of each auxiliary e-commerce to the target e-commerce in a weighting mode to help the target e-commerce to reconstruct a user-item scoring matrix, thereby completing final recommendation.
The technical scheme is as follows: in order to achieve the purpose, the invention adopts the technical scheme that:
a multi-provider cross recommendation method based on cluster feature migration comprises the following steps:
1) and (3) a scoring matrix construction stage:
1) a, collecting user historical behavior data of each E-commerce website;
1) b, cleaning and denoising the historical behavior data of the user;
the data in the steps 1) -b are cleaned to remove repeated data and missing data, and the noise removal is to delete the data with few user behavior records;
1) c, respectively constructing a user-item scoring matrix of each E-commerce website by comprehensively using behavior data capable of reflecting the purchasing interest of the user;
the step 1) -c of constructing the user-item scoring matrix refers to replacing the user name and the item name with the row number and the column number of the matrix, and converting the behavior data into specific numerical values; the behavior data is data reflecting clicking, browsing, collecting and purchasing behaviors of the purchasing interest of the user;
1) -d ends;
2) and (3) auxiliary domain learning stage:
2) -a acquisition of auxiliary e-commerceUser-item scoring matrix R ofz,z∈{1,2,…,Z};
2) B implementing ALS algorithm from user-item scoring matrix RzUser feature matrix M with D dimension extracted from the user feature matrixzAnd item feature matrix Nz
The ALS algorithm in the steps 2) -b specifically comprises the following steps:
step 2) -b-1) randomly initializing an item feature matrix N by using the value in (0,1)z
Step 2) -b-2) fixing the project feature matrix NzUpdating each user feature vector M one by one according to the following formulai.
Wherein N isuiA matrix of eigenvectors representing the items scored by the ith user, nuiThe score of the ith user is shown, I is an identity matrix of DxD, lambda represents the step length, T represents the iteration number, I represents a matrix MzLine number of, Mi.The user feature vector representing the ith user, i.e. the matrix MzTo (1) ai line;
step 2) -b-3), fixing the user characteristic matrix MzUpdating each item feature vector N one by one according to the following formulaj.
Wherein M ismjA matrix of eigenvectors representing users scoring the jth item, nmjThe number of the j-th item to be scored is shown, I is an identity matrix of DxD, and j is a matrix NzLine number of, Nj.Item feature vector representing the jth item, i.e. matrix NzRow j of (1);
step 2) -b-4), repeating the steps 2) -b-2) and 2) -b-3) for T times until the end;
2) c applying K-means algorithm to user feature matrix MzAnd item feature matrix NzClustering is carried out to obtain kzIndividual user clustering and lzClustering the items;
the K-means clustering algorithm in the steps 2) -c comprises the following specific processes:
step 2) -c-1) randomly selecting K data as an initial clustering center, wherein K is predetermined;
step 2) -c-2) assigning each row of data to its nearest cluster according to the Euclidean distance formula as follows:
where dis (a, b) represents the Euclidean distance of data a and data b, Xa,dFor the value of data a on the d-th attribute, Xa,dIs the value of data b on the d-th attribute;
step 2) -c-3) recalculating the cluster center value of each cluster;
steps 2) -c-4) repeating steps 2) -c-2) and 2) -c-3) T times until the end;
2) -d calculating the average score p of each user cluster over the project clusterskl
The formula for calculating the average score of each user cluster to the project cluster in the steps 2) -d is as follows:
wherein p isklRepresents the average score, r, of the kth user cluster over the l item clusteru,vRepresents the rating of the item v by the user u,represents a clusterThe number of users is increased, and the number of users,represents a clusterThe number of middle items.
2) E constructing a clustering feature matrix P of the auxiliary e-commercezCluster feature matrix PzWherein the element is pkl
2) -f for each auxiliary e-commerceZ belongs to {1,2, …, Z }, and the steps are repeated until the end;
3) and a target domain learning stage:
3) -a obtaining a target e-commerceUser-item scoring matrix R ofT
3) B migration clustering feature matrix PzHelping the user-item scoring matrix RTCompleting matrix decomposition to obtain parameter Uz、VzAnd αz
The specific process of matrix decomposition described in steps 3) -b is as follows:
3) -b-1) an objective equation defining an objective domain matrix decomposition, the formula being as follows:
wherein, Uz、VzAnd αzParameters, U, to be solved for this objective equationzRepresenting a source domain to which a target domain user belongsWhich user in (b) is clustered, VzRepresenting a source domain to which a target domain item belongsWhich item in(s) is clustered, αzRepresenting a source domainA parameter of the degree of migration is,kzas an auxiliary domainNumber of user clusters, lzAs an auxiliary domainNumber of item clusters, W represents RTOf a marking matrix, matrix1 represents the full "1" matrix, the symbol ° represents the multiplication of the elements between the matrices, Uz1=1,Vz1-1 ensures that each user and item only belongs to one cluster feature, i.e. only one element in each row is 1, and the rest are 0;
3) -b-2) random initialization VzEnsuring that only one element in each row is 1 and the rest are 0;
3) -b-3) order
3) -b-4) per user uiAuxiliary domain to which a possible belongsUser cluster has kzConsidering Z auxiliary domain knowledge together, the combined situation is k1×k2×…×kzSelecting a combination mode to minimize the following formula, namely, selecting the combination which can predict the target score most to find the corresponding auxiliary domain cluster [ U ] of the target user by checking different combinations of user clusters in all auxiliary domainsz]i
Wherein,
3) -b-5) order UzIth row of (1)zColumn is 1, and the rest are 0;
3) b-6) for RTRepeat 3) -b-4) and 3) -b-5) for each row i);
3) b-7) Each item viAuxiliary domain to which a possible belongsThe item cluster has lzConsidering multiple auxiliary domain knowledge, the combination condition is l1×l2×…×lzSelecting a combination mode to minimize the following formula, namely, selecting the combination which can predict the target score most to find the auxiliary domain cluster [ V ] to which the target domain item belongs by checking different combinations of the item clusters in all the auxiliary domainsz]i
3) B-8) order VzIth row of (1)zColumn is 1, and the rest are 0;
3) b-9) for RTRepeating steps 3) -b-7) and 3) -b-8) for each column i);
3) -b-10) update vectorThe formula is as follows:
wherein,w is RTThe tag matrix of (2);
3) -b-11) repeating steps 3) -b-4) to 3) -b-10) T times until the end;
3) c, reconstructing a user-item scoring matrix of the target e-commerce to obtain a reconstruction matrix
The formula of the project-score matrix of the reconstructed target electric business user in the steps 3) to c is as follows:
wherein W represents RT1 represents a matrix whose matrix elements are all values 1.
3) D determining the number N of the commodities to be recommended according to specific requirements, and reconstructing a matrixFind user uiRecommending the top N commodities with the highest scores;
3) -e ends.
Compared with the prior art, the invention has the following beneficial effects:
the invention provides a new solution for various troubles of data sparsity, cold start, diversity and accuracy and the like of a traditional e-commerce recommendation system, and provides a multi-e-commerce cross recommendation method based on cluster feature migration.
Drawings
Fig. 1 is a flowchart of a multi-provider cross recommendation method based on cluster feature migration.
FIG. 2 is a user-item scoring matrix conversion chart.
FIG. 3 is a flow chart of extracting a user/project feature matrix using ALS algorithm;
FIG. 4 is a flow chart for obtaining user/item clusters using the K-means algorithm.
Fig. 5 is a flow chart of a target e-commerce migration assisted e-commerce clustering feature to assist matrix decomposition.
Detailed Description
The present invention is further illustrated by the following description in conjunction with the accompanying drawings and the specific embodiments, it is to be understood that these examples are given solely for the purpose of illustration and are not intended as a definition of the limits of the invention, since various equivalent modifications will occur to those skilled in the art upon reading the present invention and fall within the limits of the appended claims.
Fig. 1 is a flowchart of a multi-provider cross recommendation method based on cluster feature migration according to an embodiment of the present invention. The specific steps are described as follows:
step 0 is the starting state of the present invention;
in a scoring matrix construction stage (step 1-3), step 1, collecting user historical behavior data of a plurality of e-commerce;
step 2, removing repeated data and missing data from the user historical behavior data and deleting data with few user behavior records;
step 3, comprehensively using behavior data capable of reflecting the purchasing interest of the user, and constructing a user-item scoring matrix of each E-commerce website by using the behavior data preprocessed in the step 2;
in the auxiliary domain learning phase (steps 4-8), step 4 is to acquire each auxiliary e-commerce separatelyUser-item scoring matrix R ofz,z∈{1,2,…,Z};
Step 5 is implemented in each auxiliary domain separatelyALS algorithm from RzUser feature matrix M with D dimension extracted from the user feature matrixzAnd item feature matrix Nz
Step 6, respectively implementing a K-means algorithm to the user feature matrix M in each auxiliary fieldzAnd item feature matrix NzClustering is carried out to obtain kzIndividual user clustering and lzClustering the items;
step 7, respectively calculating the average score p of each user cluster to the project cluster in each auxiliary fieldijAdding the scores of each user in the ith user cluster to the items in the jth item cluster, and dividing the sum by the product of the number of the users in the ith user cluster and the number of the items in the jth item cluster;
the average scoring formula for calculating the item cluster of each user cluster is as follows:
wherein p isklRepresents the average score, r, of the kth user cluster over the l item clusteru,vRepresents the rating of the item v by the user u,represents a clusterThe number of users is increased, and the number of users,represents a clusterThe number of middle items.
Step 8 is to construct a clustering feature matrix P of each auxiliary e-commercezWherein the matrix element is the average score p obtained in step 7kl
In the target domain learning phase (steps 9-12), step 9 is to acquire the target e-commerceUser item scoring matrix RT
Step 10 is to cluster a plurality of cluster features PzMigrating to a target domain, and accordingly helping a target e-commerce to complete matrix decomposition to obtain a parameter Uz、VzAnd αz
Step 11 is to obtain the parameter U according to step 9z、VzAnd αzReconstructing a target domain matrix of the formula Wherein R isTFor the purpose of electronic commerceW is RTThe tag matrix of (2);
step 12, determining the number N of the commodities to be recommended according to the specific requirements, and reconstructing the matrixFind user uiRecommending the top N commodities with the highest scores;
step 13 is the end state.
As shown in fig. 2, which is a detailed description of step 3 in fig. 1, there are various interaction behaviors between the user and the goods in the e-commerce recommendation field, such as browsing, clicking, adding a shopping cart, purchasing, etc. These behaviors are in fact implicit behavior data that can well represent user preferences. The data are comprehensively considered, and the data are converted into a user-item scoring matrix according to the preference degree of the user for the goods, which is conveyed by each behavior.
Fig. 3 is a detailed description of step 5 in fig. 1.
Step 14 is the start state;
step 15 is to initialize matrix N randomly with values between (0,1)z
Step 16 is to fix the matrix NzThe matrix M is updated row by row according to the following formula:
wherein N isuiA matrix of eigenvectors representing the items scored by the ith user, nuiThe score of the ith user is shown, I is an identity matrix of DxD, lambda represents the step length, T represents the iteration number, I represents a matrix MzLine number of, Mi.The user feature vector representing the ith user, i.e. the matrix MzRow i of (1);
step 17 is to fix the matrix MzThe matrix N is updated row by row according to the following formula:
wherein M ismjA matrix of eigenvectors representing users scoring the jth item, nmjThe number of the j-th item to be scored is shown, I is an identity matrix of DxD, and j is a matrix NzLine number of, Nj.Item feature vector representing the jth item, i.e. matrix NzRow j of (1);
step 18, judging whether the iteration is carried out for T times, if not, turning to step 15, and if so, turning to step 18;
step 19 is the end state.
FIG. 4 is a detailed description of the K-means algorithm in step 6.
Step 20 is the start state;
step 21 is to determine the number of clusters K, when clustering the user feature matrix of the z-th auxiliary domain, K is KzWhen clustering the item feature matrix of the z-th auxiliary domain, K is lz
Step 22, randomly selecting K data as an initial clustering center;
step 23 is to assign each row of data points to the cluster closest to it according to the euclidean distance, the formula is as follows:
wherein Xa,dFor the value of data a on the d-th attribute, Xb,dIs the value of data b on the d-th attribute;
step 24, recalculating new center values for each cluster based on the data points assigned to each cluster;
step 25, judging whether the iteration is carried out for T times, if not, turning to step 22, and if so, turning to step 25;
step 26 is the end state.
Fig. 5 is a detailed description of step 10 in fig. 1.
Step 27 is the start state;
step 28 is to initialize Z matrices V randomlyzEnsuring that only one element in each row is 1 and the rest are 0;
step 29 is to let the Z parameters characterizing the migration degree
Step 30 is to find the user uiWhich belongs to the z-th auxiliary domainA user cluster jzI.e. by examining different combinations of user clusters in all source domains, total k1×k2×…×kzUnder the condition, selecting the combination capable of predicting the target score to find the corresponding auxiliary domain cluster [ U ] of the target userz]iI.e. selecting a combinationLet the following equation take the minimum value:
wherein is RTA user-item scoring matrix for the target e-business,
step 31 is to make UzIth row of (1)zColumn 1, the rest 0, for each user u in the target domainiRepeating steps 30 and 31;
step 32 is to find item viWhich item cluster j belongs to the z-th auxiliary domainzI.e. by examining different combinations of clusters of items in all auxiliary domains, total1×l2×…×lzUnder the condition, selecting the combination of the most predictive target scores to find the corresponding auxiliary domain cluster [ U ] of the target itemz]iI.e. selecting a combinationLet the following equation take the minimum value:
step 33 is to let VzIth row of (1)zColumn 1, the rest 0, for each item v of the target domainiRepetition ofSteps 32 and 33 are performed;
step 34 is to update the vectorThe formula is as follows:
whereinW is RTThe tag matrix of (2);
step 35, judging whether the iteration is performed for T times, if not, turning to step 29, and if so, turning to step 35;
step 36 is an end state.
The method adopts a transfer learning technology, namely, the clustering characteristics are extracted from a plurality of auxiliary domains and are transferred to the target domain by different weights as knowledge to help the target e-commerce to reconstruct a user-project scoring matrix, thereby completing final recommendation. The negative migration problem caused by destructive information is reduced by adopting a migration learning technology and introducing parameters representing the migration degree, and experiments are carried out on real E-commerce website data, so that the method can effectively solve the problems of data sparsity, cold start, diversity and accuracy existing in the traditional E-commerce recommendation system, and improve the recommendation performance.
In conclusion, the multi-provider cross recommendation method based on cluster feature migration provides a new solution for the dilemma of data sparsity, cold start, diversity and accuracy in the e-provider recommendation system by using the migration learning technology.
The above description is only of the preferred embodiments of the present invention, and it should be noted that: it will be apparent to those skilled in the art that various modifications and adaptations can be made without departing from the principles of the invention and these are intended to be within the scope of the invention.

Claims (7)

1. A multi-provider cross recommendation method based on cluster feature migration is characterized by comprising the following steps:
step 1, a scoring matrix construction stage: collecting and preprocessing user historical behavior data of each e-commerce website, comprehensively using behavior data capable of reflecting user purchasing interest, and respectively constructing a user-item scoring matrix of each e-commerce website;
step 2, auxiliary domain learning stage: obtaining auxiliary e-commerceUser-item scoring matrix R ofzZ ∈ {1,2, …, Z }; implementing ALS algorithm from user-item scoring matrix RzUser feature matrix M with D dimension extracted from the user feature matrixzAnd item feature matrix Nz(ii) a Implementing K-means algorithm to respectively carry out user feature matrix MzAnd item feature matrix NzClustering is carried out to obtain kzIndividual user clustering and lzClustering the items; calculating the average score p of each user cluster to the project clusterkl(ii) a Constructing clustering characteristic matrix P of auxiliary E-commercezCluster feature matrix PzWherein the element is pkl
Step 3, target domain learning stage: obtaining target e-commerceUser-item scoring matrix R ofT(ii) a Migration clustering feature matrix PzHelping the user-item scoring matrix RTCompleting matrix decomposition to obtain parameter Uz、VzAnd αz(ii) a Reconstructing a user-item scoring matrix of the target e-commerce to obtain a reconstruction matrixThen, according to specific requirements, determining the number N of commodities to be recommended, and reconstructing a matrixFind user uiRecommending the top N commodities with the highest scores; the formula of the project-score matrix of the reconstructed target electric commercial user is as follows:
wherein W represents RT1 represents a matrix whose matrix elements are all values 1.
2. The multi-provider cross recommendation method based on cluster feature migration according to claim 1, wherein: the ALS algorithm in the step 2 specifically comprises the following steps:
step 2) -b-1) randomly initializing an item feature matrix N by using the value in (0,1)z
Step 2) -b-2) fixing the project feature matrix NzUpdating each user feature vector M one by one according to the following formulai.
Wherein N isuiA matrix of eigenvectors representing the items scored by the ith user, nuiThe score of the ith user is shown, I is an identity matrix of DxD, lambda represents the step length, T represents the iteration number, I represents a matrix MzLine number of, Mi.The user feature vector representing the ith user, i.e. the matrix MzRow i of (1);
step 2) -b-3), fixing the user characteristic matrix MzUpdating each item feature vector N one by one according to the following formulaj.
Wherein M ismjA matrix of eigenvectors representing users scoring the jth item, nmjThe number of the j-th item to be scored is shown, I is an identity matrix of DxD, and j is a matrix NzLine number of, Nj.Item feature vector representing the jth item, i.e. matrix NzRow j of (1);
step 2) -b-4), repeating the steps 2) -b-2) and 2) -b-3) for T times until finishing.
3. The multi-provider cross recommendation method based on cluster feature migration according to claim 4, wherein: the K-means clustering algorithm in the step 2 comprises the following specific processes:
step 2) -c-1) randomly selecting K data as an initial clustering center, wherein K is predetermined;
step 2) -c-2) assigning each row of data to its nearest cluster according to the Euclidean distance formula as follows:
where dis (a, b) represents the Euclidean distance of data a and data b, Xa,dFor the value of data a on the d-th attribute, Xb,dIs the value of data b on the d-th attribute;
step 2) -c-3) recalculating the cluster center value of each cluster;
steps 2) -c-4) repeat steps 2) -c-2) and steps 2) -c-3) T times until the end.
4. The multi-provider cross recommendation method based on cluster feature migration according to claim 5, wherein: the formula for calculating the average score of each user cluster to the project cluster in step 2 is as follows:
wherein p isklRepresents the average score, r, of the kth user cluster over the l item clusteru,vRepresents the rating of the item v by the user u,represents a clusterThe number of users is increased, and the number of users,represents a clusterThe number of middle items.
5. The multi-provider cross recommendation method based on cluster feature migration according to claim 6, wherein: the specific process of matrix decomposition in step 3 is as follows:
3) -b-1) an objective equation defining an objective domain matrix decomposition, the formula being as follows:
s.t.Uz1=1,Vz1=1
wherein, Uz、VzAnd αzParameters, U, to be solved for this objective equationzRepresenting a source domain to which a target domain user belongsWhich user in (b) is clustered, VzRepresenting the source domain to which the target domain item belongsWhich item in(s) is clustered, αzRepresenting a source domainA parameter of the degree of migration is,kzas an auxiliary domainNumber of user clusters, lzAs an auxiliary domainNumber of item clusters, W represents RTThe matrix 1 represents the full '1' matrix, the symbol DEG represents the multiplication of corresponding elements between the matrices, Uz1=1,Vz1-1 ensures that each user and item belongs to only one cluster feature, i.e. there is only one element per rowThe element is 1, and the rest is 0;
3) -b-2) random initialization VzEnsuring that only one element in each row is 1 and the rest are 0;
3) -b-3) order
3) -b-4) per user uiAuxiliary domain to which a possible belongsUser cluster has kzConsidering Z auxiliary domain knowledge together, the combined situation is k1×k2×…×kzSelecting a combination mode to minimize the following formula, namely, selecting the combination which can predict the target score most to find the corresponding auxiliary domain cluster [ U ] of the target user by checking different combinations of user clusters in all auxiliary domainsz]i
Wherein,
3) -b-5) order UzIth row of (1)zColumn is 1, and the rest are 0;
3) b-6) for RTRepeat 3) -b-4) and 3) -b-5) for each row i);
3) b-7) Each item viAuxiliary domain to which a possible belongsThe item cluster has lzConsidering multiple auxiliary domain knowledge, the combination condition is l1×l2×…×lzSelecting one combination from the combinations minimizes the following formula, namely, finding the target domain by checking different combinations of the item clusters in all the auxiliary domains and selecting the combination with the most predictable target scoreItem belonging to corresponding auxiliary domain cluster [ V ]z]i
3) B-8) order VzIth row of (1)zColumn is 1, and the rest are 0;
3) b-9) for RTRepeating steps 3) -b-7) and 3) -b-8) for each column i);
3) -b-10) update vectorThe formula is as follows:
wherein,w is RTThe tag matrix of (2);
3) -b-11) repeating steps 3) -b-4) to 3) -b-10) T times until the end.
6. The multi-provider cross recommendation method based on cluster feature migration according to claim 1, wherein: and (2) preprocessing operation data cleaning and denoising in the step 1, wherein the data cleaning is to remove repeated data and missing data, and the denoising is to delete data with few user behavior records.
7. The multi-provider cross recommendation method based on cluster feature migration according to claim 1, wherein: the step 1 of constructing the user-item scoring matrix refers to replacing the user name and the item name with the row number and the column number of the matrix, and converting behavior data into specific numerical values; the behavior data is data reflecting the behaviors of clicking, browsing, collecting and purchasing of the purchasing interest of the user.
CN201810470713.2A 2018-05-17 2018-05-17 Multi-provider cross recommendation method based on clustering feature migration Active CN108717654B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810470713.2A CN108717654B (en) 2018-05-17 2018-05-17 Multi-provider cross recommendation method based on clustering feature migration

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810470713.2A CN108717654B (en) 2018-05-17 2018-05-17 Multi-provider cross recommendation method based on clustering feature migration

Publications (2)

Publication Number Publication Date
CN108717654A true CN108717654A (en) 2018-10-30
CN108717654B CN108717654B (en) 2022-03-25

Family

ID=63900134

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810470713.2A Active CN108717654B (en) 2018-05-17 2018-05-17 Multi-provider cross recommendation method based on clustering feature migration

Country Status (1)

Country Link
CN (1) CN108717654B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109711925A (en) * 2018-11-23 2019-05-03 西安电子科技大学 Cross-domain recommending data processing method, cross-domain recommender system with multiple auxiliary domains
CN110070535A (en) * 2019-04-23 2019-07-30 东北大学 A kind of retinal vascular images dividing method of Case-based Reasoning transfer learning
CN110516165A (en) * 2019-08-28 2019-11-29 安徽农业大学 A kind of cross-cutting recommended method of hybrid neural networks based on text UGC
CN110955775A (en) * 2019-11-11 2020-04-03 南通大学 Drawing book recommendation method based on implicit inquiry
CN112364937A (en) * 2020-11-30 2021-02-12 腾讯科技(深圳)有限公司 User category determination method and device, recommended content determination method and electronic equipment
CN112669083A (en) * 2020-12-30 2021-04-16 杭州趣链科技有限公司 Commodity recommendation method and device and electronic equipment
WO2024114034A1 (en) * 2022-11-29 2024-06-06 腾讯科技(深圳)有限公司 Content recommendation method and apparatus, device, medium, and program product

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106339502A (en) * 2016-09-18 2017-01-18 电子科技大学 Modeling recommendation method based on user behavior data fragmentation cluster
CN106485537A (en) * 2016-09-07 2017-03-08 北京邮电大学 A kind of cross-cutting Method of Commodity Recommendation based on the latent layer factor and device
CN107273438A (en) * 2017-05-24 2017-10-20 深圳大学 A kind of recommendation method, device, equipment and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106485537A (en) * 2016-09-07 2017-03-08 北京邮电大学 A kind of cross-cutting Method of Commodity Recommendation based on the latent layer factor and device
CN106339502A (en) * 2016-09-18 2017-01-18 电子科技大学 Modeling recommendation method based on user behavior data fragmentation cluster
CN107273438A (en) * 2017-05-24 2017-10-20 深圳大学 A kind of recommendation method, device, equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李林峰: "基于知识迁移的跨领域推荐算法研究", 《CNKI优秀硕士学位论文全文数据库》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109711925A (en) * 2018-11-23 2019-05-03 西安电子科技大学 Cross-domain recommending data processing method, cross-domain recommender system with multiple auxiliary domains
CN110070535A (en) * 2019-04-23 2019-07-30 东北大学 A kind of retinal vascular images dividing method of Case-based Reasoning transfer learning
CN110516165A (en) * 2019-08-28 2019-11-29 安徽农业大学 A kind of cross-cutting recommended method of hybrid neural networks based on text UGC
CN110516165B (en) * 2019-08-28 2022-09-06 安徽农业大学 Hybrid neural network cross-domain recommendation method based on text UGC
CN110955775A (en) * 2019-11-11 2020-04-03 南通大学 Drawing book recommendation method based on implicit inquiry
CN112364937A (en) * 2020-11-30 2021-02-12 腾讯科技(深圳)有限公司 User category determination method and device, recommended content determination method and electronic equipment
CN112364937B (en) * 2020-11-30 2021-12-14 腾讯科技(深圳)有限公司 User category determination method and device, recommended content determination method and electronic equipment
CN112669083A (en) * 2020-12-30 2021-04-16 杭州趣链科技有限公司 Commodity recommendation method and device and electronic equipment
WO2024114034A1 (en) * 2022-11-29 2024-06-06 腾讯科技(深圳)有限公司 Content recommendation method and apparatus, device, medium, and program product

Also Published As

Publication number Publication date
CN108717654B (en) 2022-03-25

Similar Documents

Publication Publication Date Title
CN108717654B (en) Multi-provider cross recommendation method based on clustering feature migration
Mao et al. Multiobjective e-commerce recommendations based on hypergraph ranking
Li et al. Exploiting explicit and implicit feedback for personalized ranking
Changchien et al. Mining association rules procedure to support on-line recommendation by customers and products fragmentation
Zhu et al. Online purchase decisions for tourism e-commerce
Gabel et al. Product choice with large assortments: A scalable deep-learning model
CN102902691B (en) Recommend method and system
CN109783738B (en) Multi-similarity-based hybrid collaborative filtering recommendation method for double-pole-limit learning machine
CN106447463A (en) Commodity recommendation method based on Markov decision-making process model
CN110717098A (en) Meta-path-based context-aware user modeling method and sequence recommendation method
CN114065048B (en) Article recommendation method based on multi-different composition graph neural network
CN104391883B (en) A kind of online advertisement audient's sort method based on transfer learning
CN113763095B (en) Information recommendation method and device and model training method and device
CN102789462A (en) Project recommendation method and system
CN115082147A (en) Sequence recommendation method and device based on hypergraph neural network
EP4202725A1 (en) Joint personalized search and recommendation with hypergraph convolutional networks
CN112288471A (en) Advertisement click rate prediction method based on user historical behavior sequence
CN107958406A (en) Inquire about acquisition methods, device and the terminal of data
CN104899241A (en) Related data generating apparatus and related data generating method
CN116957691B (en) Cross-platform intelligent advertisement putting method and system for commodities of e-commerce merchants
CN115982463A (en) Resource recommendation method, device, equipment and storage medium
CN117455621A (en) Personalized recommendation method and device, storage medium and computer equipment
Kao et al. Prediction of remaining time on site for e‐commerce users: A SOM and long short‐term memory study
CN112232388A (en) ELM-RFE-based shopping intention key factor identification method
Ifada et al. Do-rank: DCG optimization for learning-to-rank in tag-based item recommendation systems

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant