CN111008334B - Top-K recommendation method and system based on local pairwise ordering and global decision fusion - Google Patents

Top-K recommendation method and system based on local pairwise ordering and global decision fusion Download PDF

Info

Publication number
CN111008334B
CN111008334B CN201911232316.2A CN201911232316A CN111008334B CN 111008334 B CN111008334 B CN 111008334B CN 201911232316 A CN201911232316 A CN 201911232316A CN 111008334 B CN111008334 B CN 111008334B
Authority
CN
China
Prior art keywords
user
article
item
subgroup
users
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201911232316.2A
Other languages
Chinese (zh)
Other versions
CN111008334A (en
Inventor
王邦
杨雪娇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huazhong University of Science and Technology
Original Assignee
Huazhong University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huazhong University of Science and Technology filed Critical Huazhong University of Science and Technology
Priority to CN201911232316.2A priority Critical patent/CN111008334B/en
Publication of CN111008334A publication Critical patent/CN111008334A/en
Application granted granted Critical
Publication of CN111008334B publication Critical patent/CN111008334B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A10/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE at coastal zones; at river basins
    • Y02A10/40Controlling or monitoring, e.g. of flood or hurricane; Forecasting, e.g. risk assessment or mapping

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a Top-K recommendation method and system based on local pairwise sorting and global decision fusion, and belongs to the field of personalized recommendation. The method comprises the following steps: constructing a user isomorphic drawing and an article isomorphic drawing according to the user-article bipartite drawing; selecting a user anchor point from the user composition and selecting an article anchor point from the article composition; obtaining a user subgroup corresponding to each user anchor point according to the correlation degree with the user anchor points, and obtaining an article subgroup corresponding to each article anchor point according to the correlation degree with the article anchor points; pairing the user subgroup and the article subgroup to obtain A user-article subgroups; for each user-item subgroup, training by using a Bayesian personalized sorting algorithm with paired difference perception to obtain a prediction score of each item by the user and obtain a local recommendation list of the user in the user-item subgroup; and fusing local recommendation lists of the user in different user-article subgroups to generate a global recommendation list of the user, so as to obtain a more accurate recommendation list.

Description

Top-K recommendation method and system based on local pairwise ordering and global decision fusion
Technical Field
The invention belongs to the field of personalized recommendation, and particularly relates to a Top-K recommendation method and system based on local pairwise sorting and global decision fusion.
Background
With the advent of the web2.0 era and the great increase of network bandwidth, various social network platforms begin to appear, and fragmented information begins to flood the lives of people. In order to solve the information overload problem, the personalized recommendation system increasingly shows important value. For example, in the e-commerce field, a recommendation system constructs an interest model of a user according to historical behavior information of the user, calculates the user's likeness to an item that they have not purchased, and then recommends an item that the user may like.
A recommendation method aiming at generating K item lists that a user may like is called a Top-K recommendation method, and a common Top-K recommendation method includes two types, namely a single-point model and a pair model. The optimization goal of the single-point model is to enable the prediction score of a user for a certain article to approach the real score of the user for the article, so that the articles which are not too much of the user are sorted according to the prediction score to generate a recommendation list. The pair-wise model learns the sequential relationship between a pair of items. Bayesian personalized ranking is one of the most common pairwise models, which predicts this pairwise preference by picking a positive sample i from items purchased by a user u and a negative sample j from items not purchased by the user at a time: user u scores item i higher than item j.
However, these algorithms are all modeled for the entire data, with an implicit assumption: i.e., all users (or all items) use a uniform feature space. This assumption of global feature space tends to make the learned features too coarse to be applicable to all users (or items). And the recommended items are often dominated by the most significant interests of the user, and it is difficult to capture different aspects of the user interests or item attributes, so that the generated recommendation list is not accurate enough.
Disclosure of Invention
Aiming at the technical problem that the accuracy of a Top-K recommendation list of a recommendation system in the prior art is limited, the invention provides a Top-K recommendation method and a Top-K recommendation system based on local pairwise sorting and global decision fusion, and aims to obtain a more accurate recommendation list.
To achieve the above object, according to a first aspect of the present invention, there is provided a Top-K recommendation method based on local pairwise ordering and global decision fusion, the method comprising the steps of:
s1, constructing a user isomorphic graph and an article isomorphic graph according to a user-article bipartite graph;
s2, selecting A user anchor points from the user composition, and selecting A article anchor points from the article composition;
s3, obtaining a user subgroup corresponding to each user anchor point according to the correlation degree of the user and the user anchor point, and obtaining an article subgroup corresponding to each article anchor point according to the correlation degree of the article and the article anchor point;
s4, pairing a user subgroup corresponding to the A user anchor points and an article subgroup corresponding to the A article anchor points to obtain an A user-article subgroup;
s5, for each user-article subgroup, training by using a Bayesian personalized sorting algorithm with paired difference perception to obtain a prediction score of each article by the user, and obtaining the top K articles with the largest score as a local recommendation list of the user in the user-article subgroup;
and S6, fusing the local recommendation lists of the users in different user-article subgroups according to the contribution of different subgroups to the global and the sequence of the articles in the local recommendation lists, and generating the global recommendation list of the users.
Specifically, step S1 includes the steps of:
s11, all M users in the training set form a set U, all N articles form a set V, the sets U and V form a point set of a user-article bipartite graph, the set U forms a point set of a user with a same composition, and the set V forms a point set of an article with a same composition;
s12, if the user U belongs to the U and the score r of the item V belongs to the V exists in the training set uv Then, there is a side connecting user u and item v in the bipartite graph, and the weight of the side connecting is r uv
S13, if two users in the bipartite graph have commonly connected articles, in the user same graph, a connecting edge exists between the two users, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure BDA0002302990700000031
wherein the content of the first and second substances,
Figure BDA0002302990700000032
and &>
Figure BDA0002302990700000033
Representing user u 1 And u 2 A scored item set, based on the number of items in the item set>
Figure BDA0002302990700000034
Representing user u 1 Scoring v;
s14, if two articles in the bipartite graph have users connected together, in the article same graph, a connecting edge exists between the two articles, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure BDA0002302990700000035
wherein the content of the first and second substances,
Figure BDA0002302990700000036
and &>
Figure BDA0002302990700000037
Representing an article v 1 And v 2 A superseded user set, based on the status of the user, or based on the status of the user>
Figure BDA0002302990700000038
Representing user u to item v 1 Scoring of (2).
Specifically, step S3 includes the steps of:
s31, each user anchor point is taken as a restart node respectively, random walk with restart is carried out, and a user convergence probability matrix is obtained
Figure BDA0002302990700000039
After random walk with restart is carried out by respectively taking each article anchor point as a restart node, an article convergence probability matrix can be obtained>
Figure BDA00023029907000000310
S32, for each user u, adding C U The u-th line of (1) is arranged according to a descending order, the front rho multiplied by A user anchor points in the arrangement are taken, andthe user is assigned to these user anchors and for each item v, C is assigned V The line v of (2) is arranged according to a descending order, the front rho multiplied by A article anchor points in the arrangement are taken, and the article is distributed to the article anchor points;
s33, for user anchor point u a All the users assigned to him constitute a subgroup of users
Figure BDA00023029907000000311
For item anchor points v a All items allocated to him constitute an item subgroup->
Figure BDA00023029907000000312
Wherein M is the number of users in the training set, N is the number of articles in the training set, and rho is a subgroup scale control parameter.
Specifically, in step S5, the user is found in the user-item subgroup
Figure BDA0002302990700000041
Local recommendation list in (1):
s51, constructing preference pairs based on the observed data, wherein the preference pairs of all the users form a training set
Figure BDA0002302990700000042
S52, for each preference pair (u, i, j), calculating the positive degree PL of a positive sample and the negative degree NL of a negative sample, and further calculating the gap value G of the positive sample and the negative sample;
Figure BDA0002302990700000043
Figure BDA0002302990700000044
Figure BDA0002302990700000045
s53, using the difference value G of the positive and negative samples as weight, and carrying out gradient descent on the training set
Figure BDA0002302990700000046
Training to obtain a user factor matrix U a Item factor matrix V a And an item offset vector B a The optimized loss function is as follows:
Figure BDA0002302990700000047
Figure BDA00023029907000000419
Figure BDA0002302990700000048
Figure BDA0002302990700000049
s54, for the articles which are not excessively scored by the user u, scoring according to the prediction
Figure BDA00023029907000000410
In the user-item subgroup, the first K items are taken as the user u>
Figure BDA00023029907000000411
Is selected based on the local recommendation list->
Figure BDA00023029907000000412
Wherein r is ui Representing the user u's score for item i,
Figure BDA00023029907000000413
is indicated in a subgroup->
Figure BDA00023029907000000414
In which an excessive number of users is awarded to item j, and>
Figure BDA00023029907000000415
b is a translational bias, superscript a denotes the corresponding subgroup->
Figure BDA00023029907000000416
U a And V a Is a user-item subgroup->
Figure BDA00023029907000000417
The user factor matrix and the item factor matrix in (1), B a Is a user-item subgroup->
Figure BDA00023029907000000418
The item bias vector in (1), λ is a regularization coefficient, and θ represents the set of all parameters to be trained, i.e., U a ,V a ,B a And | | represents the matrix norm.
Specifically, step S6 includes the steps of:
s61, defining an item v in a local list
Figure BDA0002302990700000051
Sorting score of +>
Figure BDA0002302990700000052
Figure BDA0002302990700000053
S62, defining the object v in subgroup
Figure BDA0002302990700000054
Characteristic weight ω of u,v
Figure BDA0002302990700000055
/>
S63, according to the object v
Figure BDA0002302990700000056
Sorting score of +>
Figure BDA0002302990700000057
And subgroup +>
Figure BDA0002302990700000058
Is based on the characteristic weight->
Figure BDA0002302990700000059
Calculating subgroup +>
Figure BDA00023029907000000510
A local score for each item v by the user u;
Figure BDA00023029907000000511
s64, obtaining the global score of the item v according to the local score of the item v in the local recommendation list generated by each subgroup for the user u;
Figure BDA00023029907000000512
s65, arranging the global scores of the user u on the items according to a descending order, and taking the largest K items as a global recommendation list L of the user u u
Wherein, order (·) indicates that the object v is in
Figure BDA00023029907000000513
Rank of (1), N u And &>
Figure BDA00023029907000000514
Respectively indicate the number and usage of all articles hit by user uU is in subgroup>
Figure BDA00023029907000000515
And the number of articles to be punched, M v And &>
Figure BDA00023029907000000516
Respectively indicates the total number of users who have overflowed the item v and the subgroup->
Figure BDA00023029907000000517
The article v is given an excessive number of users.
To achieve the above object, according to a second aspect of the present invention, there is provided a Top-K recommendation system based on local pairwise ordering and global decision fusion, the system comprising:
the same composition construction module is used for constructing a user same composition and an article same composition according to the user-article bipartite graph;
the anchor point selection module is used for selecting A user anchor points from the user composition and selecting A article anchor points from the article composition;
the subgroup acquisition module is used for acquiring a user subgroup corresponding to each user anchor point according to the correlation degree of the user and the user anchor point, and acquiring an article subgroup corresponding to each article anchor point according to the correlation degree of the article and the article anchor point;
the subgroup matching module is used for matching the user subgroup corresponding to the A user anchor points with the article subgroup corresponding to the A article anchor points to obtain A user-article subgroup;
the local recommendation list acquisition module is used for training each user-article subgroup by using a Bayesian personalized sorting algorithm with pairwise difference perception to obtain a prediction score of each article by the user, and acquiring the first K articles with the largest score as a local recommendation list of the user in the user-article subgroup;
and the global recommendation list acquisition module is used for fusing the local recommendation lists of the users in different user-item subgroups according to the contribution degrees of different subgroups to the global and the sequence of the items in the local recommendation lists to generate the global recommendation list of the user.
Specifically, the isomorphic model construction module obtains the user isomorphic model and the object isomorphic model by the following method:
s11, all M users in the training set form a set U, all N articles form a set V, the sets U and V form a point set of a user-article bipartite graph, the set U forms a point set of a user with a same composition, and the set V forms a point set of an article with a same composition;
s12, if the user U belongs to the U and the score r of the item V belongs to the V exists in the training set uv Then, there is a side connecting user u and item v in the bipartite graph, and the weight of the side connecting is r uv
S13, if two users in the bipartite graph have articles connected together, in the same graph of the users, a connecting edge exists between the two users, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure BDA0002302990700000061
wherein the content of the first and second substances,
Figure BDA0002302990700000062
and &>
Figure BDA0002302990700000063
Representing user u 1 And u 2 A scored item set, based on the number of items in the item set>
Figure BDA0002302990700000064
Representing user u 1 Scoring v;
s14, if two articles in the bipartite graph have users connected together, in the article same graph, a connecting edge exists between the two articles, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure BDA0002302990700000071
wherein the content of the first and second substances,
Figure BDA0002302990700000072
and &>
Figure BDA0002302990700000073
Representing an article v 1 And v 2 A superseded user set, based on the status of the user, or based on the status of the user>
Figure BDA0002302990700000074
Representing user u to item v 1 Scoring of (4).
Specifically, the subgroup acquiring module obtains the article subgroup corresponding to each article anchor point by the following method:
s31, each user anchor point is taken as a restart node respectively, random walk with restart is carried out, and a user convergence probability matrix is obtained
Figure BDA0002302990700000075
After random walk with restart is carried out by respectively taking each article anchor point as a restart node, an article convergence probability matrix can be obtained>
Figure BDA0002302990700000076
S32, for each user u, comparing C U The u-th line of (b) is arranged according to a descending order, the first rho multiplied by A user anchor points in the arrangement are taken, the user is allocated to the user anchor points, and for each article v, C is used V The line v of (1) is arranged according to a descending order, the front rho multiplied by A article anchor points in the arrangement are taken, and the article is distributed to the article anchor points;
s33, anchoring points u for users a All the users assigned to him constitute a subgroup of users
Figure BDA0002302990700000077
For an item anchor v a All items allocated to him constitute an item subgroup->
Figure BDA0002302990700000078
Wherein M is the number of users in the training set, N is the number of articles in the training set, and rho is a subgroup scale control parameter.
Specifically, the local recommendation list obtaining module obtains the user's user-item subgroup in the following manner
Figure BDA0002302990700000079
Local recommendation list in (1):
s51, constructing preference pairs based on the observed data, wherein the preference pairs of all the users form a training set
Figure BDA00023029907000000710
S52, for each preference pair (u, i, j), calculating the positive degree PL of a positive sample and the negative degree NL of a negative sample, and further calculating the difference value G of the positive sample and the negative sample;
Figure BDA00023029907000000711
Figure BDA00023029907000000712
Figure BDA0002302990700000081
s53, using the difference value G of the positive and negative samples as weight, and carrying out gradient descent on the training set
Figure BDA0002302990700000082
Training to obtain a user factor matrix U a Item factor matrix V a And an item offset vector B a The optimized loss function is as follows: />
Figure BDA0002302990700000083
Figure BDA0002302990700000084
Figure BDA0002302990700000085
Figure BDA0002302990700000086
S54, for the articles which are not excessively scored by the user u, scoring according to the prediction
Figure BDA0002302990700000087
In order from large to small, the first K items are taken as the user u in the user-item subgroup->
Figure BDA0002302990700000088
Is selected based on the local recommendation list->
Figure BDA0002302990700000089
Wherein r is ui Indicating the scoring of item i by user u,
Figure BDA00023029907000000810
is indicated in a subgroup->
Figure BDA00023029907000000811
An excessive number of users is granted to item j, and>
Figure BDA00023029907000000812
b is a translational bias, superscript a denotes the corresponding subgroup->
Figure BDA00023029907000000813
U a And V a Is a user-item subgroup->
Figure BDA00023029907000000814
The user factor matrix and the item factor matrix in (1), B a Is a user-item subgroup->
Figure BDA00023029907000000815
The item bias vector in (1), λ is a regularization coefficient, and θ represents the set of all parameters to be trained, i.e., U a ,V a ,B a And | | represents the matrix norm.
Specifically, the global recommendation list obtaining module generates the global recommendation list of the user by the following method:
s61, defining an item v in a local list
Figure BDA00023029907000000816
Is greater than the rank score pick>
Figure BDA00023029907000000817
Figure BDA00023029907000000818
S62, defining the object v in subgroup
Figure BDA00023029907000000819
Characteristic weight ω of u,v
Figure BDA00023029907000000820
S63, according to the object v
Figure BDA00023029907000000821
Sorting score of +>
Figure BDA00023029907000000822
And subgroup->
Figure BDA00023029907000000823
Is a characteristic ofWeight +>
Figure BDA00023029907000000824
Calculating a subgroup ≤>
Figure BDA0002302990700000091
A local score for each item v by user u;
Figure BDA0002302990700000092
s64, obtaining the global score of the item v according to the local score of the item v in the local recommendation list generated by each subgroup for the user u;
Figure BDA0002302990700000093
s65, arranging the global scores of the user u on the items according to a descending order, and taking the maximum K items as a global recommendation list L of the user u u
Wherein, order (·) indicates that the object v is in
Figure BDA0002302990700000094
Rank of (1), N u And &>
Figure BDA0002302990700000095
Respectively represents the total number of items which are over hit by the user u and the sub-group->
Figure BDA0002302990700000096
Number of articles hit excessively in middle, M v And &>
Figure BDA0002302990700000097
Respectively indicates the total number of users who have overflowed the item v and the subgroup->
Figure BDA0002302990700000098
The article v is given an excessive number of users.
Generally, compared with the prior art, the technical scheme conceived by the invention has the following beneficial effects:
(1) The invention uses a Bayes sorting algorithm for sensing the paired difference to learn the preference sequence between the object pairs, and distinguishes the relative strength of the preference of different object pairs by calculating the difference between positive and negative samples. And the training is guided by taking the weight as the training weight, so that the final recommendation list is more accurate.
(2) According to the invention, through training in subgroups and global fusion, the users and the articles are assumed to belong to a plurality of different feature spaces, so that different aspects of user interests and article attributes can be captured from different perspectives, the problem that the learned features are too rough under the assumption of the global feature space is avoided, and the final recommendation list is more accurate.
(3) The invention adopts a global decision fusion method, and simultaneously considers the contribution of different subgroups to the global and the ordering of the articles in the local recommendation list in the fusion process, so that the fusion result is more accurate. The global decision fusion only needs to consider the local recommendation list, and does not need to consider all articles, so that the fusion efficiency is higher.
(4) The invention adopts the user same composition and the object same composition to respectively carry out random walk, and the space complexity of the two same compositions is less than that of the user-object bipartite graph. By constructing a homogeneous graph, the graph connectivity is higher than the user-item bipartite graph, and is more suitable for the following random walks.
Drawings
FIG. 1 is a flowchart of a Top-K recommendation method based on local pairwise ordering and global decision fusion according to an embodiment of the present invention;
fig. 2 is a schematic diagram that splits a user-article bipartite graph into a user isomorphic graph and an article isomorphic graph according to an embodiment of the present invention;
fig. 3 is a schematic diagram of fusing local recommendation lists into a global recommendation list according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and do not limit the invention. In addition, the technical features involved in the respective embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
The overall concept of the invention is that the user-article bipartite graph is divided into a user identical composition graph and an article identical composition graph, random walk is firstly carried out on the two identical composition graphs to obtain a plurality of anchor points respectively, then the random walk is restarted from the anchor points to generate a plurality of user subgroups and article subgroups. And generating a user-item subgroup by pairing the user subgroup and the item subgroup. In each subgroup, training is carried out by adopting a gap perception Bayes sorting algorithm, namely, a positive sample and a negative sample pair are sampled for each user, and the gap value of the positive sample and the negative sample is calculated to be used as training weight. The optimization objective is to have positive samples score more than negative samples. And generating a local recommendation list for each user according to the prediction scores obtained after training. And finally, fusing the local recommendation lists of different subgroups according to the contribution of different subgroups to the global situation and the sequence of the articles in the local recommendation list to generate a final recommendation list.
As shown in FIG. 1, the present invention provides a Top-K recommendation method based on local pairwise ordering and global decision fusion, which comprises the following steps:
s1, constructing a user isomorphic graph and an article isomorphic graph according to a user-article bipartite graph;
s2, selecting A user anchor points from the user composition, and selecting A article anchor points from the article composition;
s3, obtaining a user subgroup corresponding to each user anchor point according to the correlation degree of the user and the user anchor point, and obtaining an article subgroup corresponding to each article anchor point according to the correlation degree of the article and the article anchor point;
s4, pairing a user subgroup corresponding to the A user anchor points and an article subgroup corresponding to the A article anchor points to obtain an A user-article subgroup;
s5, for each user-article subgroup, training by using a Bayesian personalized sorting algorithm with paired difference perception to obtain a prediction score of each article by the user, and obtaining the top K articles with the largest score as a local recommendation list of the user in the user-article subgroup;
and S6, fusing the local recommendation lists of the users in different user-article subgroups according to the contribution of different subgroups to the global and the sequence of the articles in the local recommendation lists, and generating the global recommendation list of the users.
Step S1, constructing a user composition and an article composition according to the user-article bipartite graph.
As shown in fig. 2, step S1 includes the steps of:
s11, all M users in the training set form a set U, all N articles form a set V, the sets U and V form a point set of a user-article bipartite graph, the set U forms a point set of a user with a picture composition, and the set V forms a point set of an article with a picture composition.
S12, if the user U belongs to the U and the score r of the item V belongs to the V exists in the training set uv Then, there is a side connecting user u and item v in the bipartite graph, and the weight of the side connecting is r uv
S13, if two users in the bipartite graph have articles connected together, in the same graph of the users, a connecting edge exists between the two users, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure BDA0002302990700000121
wherein the content of the first and second substances,
Figure BDA0002302990700000122
and &>
Figure BDA0002302990700000123
Representing user u 1 And u 2 A scored item set, based on the number of items in the item set>
Figure BDA0002302990700000124
Representing user u 1 Scoring v.
S14, if two articles in the bipartite graph have users connected together, in the article same graph, a connecting edge exists between the two articles, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure BDA0002302990700000125
wherein the content of the first and second substances,
Figure BDA0002302990700000126
and &>
Figure BDA0002302990700000127
Representing an article v 1 And v 2 A superseded user set, based on the status of the user, or based on the status of the user>
Figure BDA0002302990700000128
Representing user u to item v 1 Scoring of (4).
And S2, selecting A user anchor points from the user composition, and selecting A article anchor points from the article composition.
The step S2 includes the steps of:
s21, constructing a user transfer probability matrix P according to the user composition U Constructing an article transition probability matrix P from the article isomorphism map V
S211, obtaining the weighted adjacency matrix of the user identical composition picture according to the user identical composition picture
Figure BDA0002302990700000129
Based on the item isomorphism, a weighted adjacency matrix which obtains the item isomorphism>
Figure BDA00023029907000001210
In correspondence with the figure 2 of the drawings,
Figure BDA00023029907000001211
each element in the matrix represents a userAnd the weight of connecting edges of corresponding two user nodes in the same composition. />
Figure BDA00023029907000001212
Each element in the matrix represents the weight of the joint edge of the corresponding two article nodes in the article and the composition.
S212, respectively adding A U And A V Normalizing by columns to obtain a user transfer probability matrix
Figure BDA00023029907000001213
And an item transfer probability matrix>
Figure BDA00023029907000001214
P U The ith column of the matrix represents the transition probability, P, of the ith user transitioning to each user V The ith column of the matrix represents the transition probability for the ith item to transition to the respective item. For example, from the second user, there is a probability of 0.5/4.3 of transferring to the first item and a probability of 3.8/4.3 of transferring to the third item.
S22, random walk is conducted on the user composition picture, A users with the maximum node convergence probability after the walk are selected as A user anchor points, random walk is conducted on the article composition picture, and A articles with the maximum node convergence probability after the walk are selected as A article anchor points.
The random walk algorithm is to give each node a random initial value to let the nodes randomly walk in the graph, so that after iteration, each node will obtain a convergence probability which reflects the importance degree of each node in the graph.
From random walk on the user's composition, the probability vector u of the user is initialized randomly first (0) Then, random walk is performed by iteratively calculating the following formula:
Figure BDA0002302990700000131
wherein u is (t+1) Denotes the use after the t +1 th iterationAnd (3) probability vectors of users, wherein M represents the number of users, and alpha is random access probability, which indicates that each user has the probability of alpha in each iteration to randomly select one access from M users. A is typically small, e.g., 0.2. The random access probability ensures that the node randomly selects the node of the next hop with a certain probability every time the node walks, thereby avoiding the walking from falling into circulation.
When the difference of the probability distribution of the nodes which are walked twice | | | u (t+1) -u (t) When | is less than a threshold e (e is typically 1 e-8), the random walk can be considered to have converged. At this time, each node in the graph has its own convergence probability, and a users with the maximum probability during convergence are selected as user anchor points.
Similarly, random walk is performed from the same composition of the articles, and the probability vector v of the articles is initialized randomly (0) Then, the random walk is performed by iteratively calculating the following formula:
Figure BDA0002302990700000132
wherein v is (t+1) The probability vector of the item after the t +1 th iteration is represented, and N represents the number of the item. When the difference of the probability distribution of the nodes which are walked twice | | | v (t+1) -v (t) When | is less than the threshold e, the random walk is considered to have converged. And selecting the A articles with the maximum probability in convergence as the article anchor points. The value range of A in the invention is 5 < A < min (M, N), and 50 is generally adopted.
And S3, obtaining a user subgroup corresponding to each user anchor point according to the correlation degree of the user and the user anchor point, and obtaining an article subgroup corresponding to each article anchor point according to the correlation degree of the article and the article anchor point.
Preferably, the degree of correlation between the user and the user anchor point/the degree of correlation between the article and the article anchor point is calculated by adopting a restart random walk algorithm, the restart random walk algorithm starts to walk from a certain node as a starting point, only the probability of a randomly walks in each iteration, and the probability of 1-a directly returns to the initial node. After iteration, each node also obtains a convergence probability which reflects the correlation between each node and the initial node.
With user anchor u a For restarting nodes, randomly walking from the same composition of the user, and initializing a user probability vector u by using one-hot coding (0) I.e. when u (0) Dimension i = u a While u is (0) (i) =1, otherwise u (0) (i)=0,i∈[1,M]. The random walk with restart is then performed by iteratively calculating the following formula:
u (t+1) =(1-β)·P U ·u (t) +β·r U
wherein u is (t+1) Representing the probability vector of the user after the t +1 iteration, wherein beta is the restart probability, generally 0.5, and representing that only the probability of 1-beta continues to travel along the current path in each iteration, and the probability of beta directly returns to the restart node u a 。r U Is a restart vector when i = u a When r is U (i) =1, otherwise, r U (i)=0。
When the difference between the node probability distributions of the two walks is less than a threshold (typically 1 e-8), the random walk can be considered to have converged. The size of the convergence probability of each node reflects the degree of correlation between each node and a restart node, namely, the anchor point u of the user and the user with the higher convergence probability is considered a The tighter the relationship of (a). After random walk with restart is carried out by respectively taking each user anchor point as a restart node, a user convergence probability matrix can be obtained
Figure BDA0002302990700000141
C U Is denoted by user anchor u a For convergence vectors of users when restarting nodes, C U The u-th row of (a) represents the convergence probability vector of user u when different user anchor points are used as restart nodes.
For each user U ∈ U, C is added U The u line of the user anchor points is arranged according to a descending order, the front rho multiplied by A user anchor points in the arrangement are taken, the user is allocated to the user anchor points, wherein rho is a subgroup scale control parameter, and rho is more than 0.5 and less than 1; for user anchor u a All are divided intoThe users assigned to him form a user subgroup
Figure BDA0002302990700000151
In the same way, the following is the item v a To restart the node, a random walk is performed from the commodity isomorphism map. Initializing an item probability vector v using one-hot encoding (0) When v is (0) Dimension i = v a When, v (0) (i) =1, otherwise, v (0) (i)=0,i∈[1,N]Then, the random walk with restart is performed by iteratively calculating the following formula:
v (t+1) =(1-β)·P V ·v (t) +β·r V
wherein v is (t+1) Respectively representing the probability vectors of the articles after the t +1 th iteration, wherein beta is the restart probability, and representing that only the probability of 1-beta continues to travel along the current path in each iteration, and the probability of beta is directly returned to a restart node v a 。r V Is a restart vector when i = v a When r is V (i) =1, otherwise, r V (i)=0。
When the difference between the node probability distributions of the two walks is less than a threshold (typically 1 e-8), the random walk can be considered to have converged. The size of the convergence probability of each node reflects the degree of correlation between each node and the restart node, namely, the article and the article anchor point v with the larger convergence probability are considered a The tighter the relationship of (a). After random walk with restart is carried out by taking each article anchor point as a restart node respectively, an article convergence probability matrix can be obtained
Figure BDA0002302990700000152
C V Is denoted by item anchor point v a For converging vectors of articles when restarting nodes, C V Row v of (a) represents the converged probability vector of item v when the different anchor points are used as restart nodes. />
For each item V ∈ V, C is added V In descending order, the first rho x A article anchor points in the arrangement are taken and the article is assigned to the article anchor pointsPoints, wherein rho is a subgroup scale control parameter; for item anchor points v a All the items allocated to him constitute a subgroup of items
Figure BDA0002302990700000153
And S4, pairing the user subgroup corresponding to the A user anchor points and the article subgroup corresponding to the A article anchor points to obtain A user-article subgroup.
Each time randomly drawing a user anchor u from the user anchor set without putting back a Randomly drawing an item anchor v from the set of item anchors without being put back a Pairing them into an anchor pair (u) a ,v a ) And until the user anchor point set and the commodity anchor point set become empty sets. For each anchor point pair (u) a ,v a ) Their corresponding user subgroups
Figure BDA0002302990700000161
And a subgroup of items +>
Figure BDA0002302990700000162
Are also paired, constituting a user-item subgroup +>
Figure BDA0002302990700000163
And S5, for each user-item subgroup, training by using a Bayes personalized sorting algorithm with paired difference perception to obtain the prediction score of each item by the user, and obtaining the top K items with the largest score as a local recommendation list of the user in the user-item subgroup.
For each user-item subgroup
Figure BDA0002302990700000164
Obtaining a local recommendation list of the user in the user-item subgroup by:
s51, constructing preference pairs based on the observed data, wherein the preference pairs of all the users form a training set
Figure BDA0002302990700000165
For each user-item subgroup
Figure BDA0002302990700000166
In the method, a user u is randomly selected from a training set each time, an article is randomly selected from articles which are excessively hit (behavior is generated) by the user as a positive sample i, an article is randomly selected from articles which are not excessively hit (behavior is not generated) by the user as a negative sample j, and the user and the positive and negative samples form a triple (u, i, j) required by a Bayesian personalized sorting algorithm for pair-wise difference perception during training; and repeating the operation until the number of the triples is equal to the number of the samples scored by the user on the articles in the original training set.
And S52, calculating the positive degree PL of the positive sample and the negative degree NL of the negative sample of each preference pair (u, i, j), and further calculating the gap value G of the positive sample and the negative sample.
Figure BDA0002302990700000167
Figure BDA0002302990700000168
Figure BDA0002302990700000169
Wherein r is ui Indicating the scoring of item i by user u,
Figure BDA00023029907000001610
expressed in subgroups +>
Figure BDA00023029907000001611
In which an excessive number of users is awarded to item j, and>
Figure BDA00023029907000001612
b is a translational bias, superscript a denotes the corresponding subgroup->
Figure BDA00023029907000001613
S53, taking the difference value G of the positive and negative samples as weight, and performing gradient descent on the training set
Figure BDA00023029907000001614
Training to obtain a user factor matrix U a Item factor matrix V a And an item offset vector B a
The optimized loss function is:
Figure BDA0002302990700000171
Figure BDA0002302990700000172
Figure BDA0002302990700000173
Figure BDA0002302990700000174
wherein, U a And V a Is a user-item subgroup
Figure BDA0002302990700000175
The user factor matrix and the item factor matrix in (1), B a Is an article bias vector, lambda is a regularization coefficient, generally takes 0.01, theta represents all parameter sets to be trained, namely U a ,V a ,B a . And | | represents the matrix norm. The optimization objective is to have positive samples score more than negative samples. U at the end of training a ,V a ,B a It is the user factor matrix to be obtained by the present inventionAn item factor matrix and an item bias vector.
S54, for the articles which are not excessively scored by the user u, scoring according to the prediction
Figure BDA0002302990700000176
In order from large to small, the first K items are taken as the user u in the user-item subgroup->
Figure BDA0002302990700000177
Is selected based on the local recommendation list->
Figure BDA0002302990700000178
Sub-population predicted by the model at this time
Figure BDA0002302990700000179
The user u marks an item v as £>
Figure BDA00023029907000001710
The length of the local recommendation list is equal to the length K of the global recommendation list, which is typically 10. In the case of user u in a user-item subgroup +, as shown in FIG. 3>
Figure BDA00023029907000001711
Is selected based on the local recommendation list->
Figure BDA00023029907000001712
The ranking of each item in the local recommendation list can reflect how well the item is liked by user u in this subgroup.
By calculating the preference difference of the positive and negative samples and using the preference difference as weight guide training, the preference strengths of different positive and negative sample pairs can be distinguished, so that the model can learn more detailed preference relation among articles.
And S6, fusing the local recommendation lists of the users in different user-item subgroups according to the contribution of different subgroups to the global and the sequence of the items in the local recommendation lists, and generating the global recommendation list of the user.
Each subgroup contains different members and uses different feature spaces, so that each subgroup has its own characteristics, and thus the different subgroups contribute different degrees to the global recommendation list.
S61, defining an item v in a local list
Figure BDA00023029907000001713
Is greater than the rank score pick>
Figure BDA00023029907000001714
Figure BDA0002302990700000181
Wherein, order (·) indicates that the object v is in
Figure BDA0002302990700000182
Of (2). />
Figure BDA0002302990700000183
Stating v is on the list->
Figure BDA0002302990700000184
Is arranged at the head and is used for keeping the position of the blood vessel at the middle part>
Figure BDA0002302990700000185
Indicating that it is at the end.
S62, defining the object v in subgroup
Figure BDA0002302990700000186
Characteristic weight ω of u,v
Figure BDA0002302990700000187
Wherein N is u And
Figure BDA0002302990700000188
respectively represents the total number of items which are over hit by the user u and the sub-group->
Figure BDA0002302990700000189
The number of the excessive articles is counted. M v And &>
Figure BDA00023029907000001810
Respectively indicates the total number of users who have overflowed the item v and the subgroup->
Figure BDA00023029907000001811
The article v is given an excessive number of users.
The implication behind this formula is: user u is in subgroup
Figure BDA00023029907000001812
The number of over-hit items reflects his local recommendation list->
Figure BDA00023029907000001813
The degree of contribution to the final recommendation list. In a similar manner, in subgroup->
Figure BDA00023029907000001814
The number of users in v that score an item v also affects how much v's local ranking contributes to its global ranking.
S63, according to the object v
Figure BDA00023029907000001815
Is greater than the rank score pick>
Figure BDA00023029907000001816
And subgroup +>
Figure BDA00023029907000001817
Characteristic weight ω of u,v Counting a subgroup->
Figure BDA00023029907000001818
Of users u for each item vAnd (6) local scoring.
Figure BDA00023029907000001819
And S64, obtaining the global score of the item v according to the local score of the item v in the local recommendation list generated by each subgroup for the user u.
Figure BDA00023029907000001820
S65, arranging the global scores of the user u on all the articles according to a descending order, and taking the largest K articles as a global recommendation list L of the user u u
The global decision fusion method considers the ordering of the articles in the local recommendation list and the contribution degree of different subgroups to the global situation, so that the fusion result is more accurate. In addition, by means of decision fusion, only the local recommendation list in each subgroup needs to be considered, and all articles in each subgroup do not need to be considered, so that the fusion efficiency is higher.
In order to verify the prediction effect of the prediction method provided by the invention, a Movielens-100K data set is selected as a research object, the quality of a recommendation list of the method, a recommendation method based on restart random walk, a recommendation method based on Bayes personalized ranking of pairwise ranking and a collaborative filtering ranking recommendation method based on a neural network is compared, and the comparison result is shown in table 1, wherein the method 1 is the recommendation method based on restart random walk, the method 2 is the recommendation method based on Bayes personalized ranking of pairwise ranking, the method 3 is the collaborative filtering recommendation method based on the neural network, and the method 4 is the method provided by the invention.
Figure BDA0002302990700000191
TABLE 1
The comparison result shows that the Top-K recommendation method based on local pairwise sorting and global decision fusion provided by the invention has obviously improved evaluation indexes of MAP and NDCG compared with the prior Top-K recommendation method. The method is characterized in that a Bayesian sorting algorithm for sensing pairwise differences is used, so that the preference strength between different article pairs is distinguished, and a model can learn more detailed preference relationship between articles. By means of training in subgroups and global fusion, users and articles are assumed to belong to a plurality of different feature spaces, so that different aspects of user interests and article attributes can be captured from different perspectives, the problem that features learned under the assumption of the global feature spaces are too rough is avoided, and a final recommendation list is more accurate.
It will be understood by those skilled in the art that the foregoing is only an exemplary embodiment of the present invention, and is not intended to limit the invention to the particular forms disclosed, since various modifications, substitutions and improvements within the spirit and scope of the invention are possible and within the scope of the appended claims.

Claims (10)

1. The Top-K recommendation method based on local pairwise ordering and global decision fusion is characterized by comprising the following steps of:
s1, constructing a user isomorphic graph and an article isomorphic graph according to a user-article bipartite graph;
s2, selecting A user anchor points from the user composition, and selecting A article anchor points from the article composition;
s3, obtaining a user subgroup corresponding to each user anchor point according to the correlation degree of the user and the user anchor point, and obtaining an article subgroup corresponding to each article anchor point according to the correlation degree of the article and the article anchor point;
s4, pairing a user subgroup corresponding to the A user anchor points and an article subgroup corresponding to the A article anchor points to obtain an A user-article subgroup;
s5, for each user-article subgroup, training by using a Bayesian personalized sorting algorithm with paired difference perception to obtain a prediction score of each article by the user, and obtaining the top K articles with the largest score as a local recommendation list of the user in the user-article subgroup;
and S6, fusing the local recommendation lists of the users in different user-article subgroups according to the contribution of different subgroups to the global and the sequence of the articles in the local recommendation lists, and generating the global recommendation list of the users.
2. The method of claim 1, wherein step S1 comprises the steps of:
s11, all M users in the training set form a set U, all N articles form a set V, the sets U and V form a point set of a user-article bipartite graph, the set U forms a point set of a user composition graph, and the set V forms a point set of an article composition graph;
s12, if the scoring r of the user U belonging to the U to the item V belonging to the V exists in the training set uv Then, there is a side connecting user u and item v in the bipartite graph, and the weight of the side connecting is r uv
S13, if two users in the bipartite graph have commonly connected articles, in the user same graph, a connecting edge exists between the two users, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure FDA0004086860780000021
wherein the content of the first and second substances,
Figure FDA0004086860780000022
and &>
Figure FDA0004086860780000023
Representing user u 1 And u 2 A scored item set, based on the number of items in the item set>
Figure FDA0004086860780000024
Representing user u 1 Scoring v;
s14, if two articles in the bipartite graph have users connected together, in the article same graph, a connecting edge exists between the two articles, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure FDA0004086860780000025
wherein, the first and the second end of the pipe are connected with each other,
Figure FDA0004086860780000026
and &>
Figure FDA0004086860780000027
Representing an article v 1 And v 2 A superseded user set, based on the status of the user, or based on the status of the user>
Figure FDA0004086860780000028
Representing user u to item v 1 The score of (2) is given to the user,
Figure FDA0004086860780000029
representing user u to item v 2 Scoring of (2).
3. The method of claim 1, wherein step S3 comprises the steps of:
s31, each user anchor point is taken as a restart node respectively, random walk with restart is carried out, and a user convergence probability matrix is obtained
Figure FDA00040868607800000210
After random walk with restart is carried out by taking each article anchor point as a restart node respectively, an article convergence probability matrix can be obtained>
Figure FDA00040868607800000211
S32, for each user u, adding C U The u-th line of (b) is arranged according to a descending order, the first rho multiplied by A user anchor points in the arrangement are taken, the user is allocated to the user anchor points, and for each article v, C is used V The v-th line of (1) is arranged according to descending order, and the front rho multiplied by A in the arrangement is takenItem anchors to which the item is assigned;
s33, for user anchor point u a All the users assigned to him constitute a subgroup of users
Figure FDA00040868607800000212
For an item anchor v a All items allocated to him constitute an item subgroup +>
Figure FDA00040868607800000213
Wherein M is the number of users in the training set, N is the number of articles in the training set, and rho is a subgroup scale control parameter.
4. The method of claim 1, wherein in step S5, the user' S presence in a user-item subgroup is obtained by
Figure FDA00040868607800000214
Local recommendation list in (1):
s51, constructing preference pairs based on the observed data, wherein the preference pairs of all the users form a training set
Figure FDA00040868607800000215
S52, for each preference pair (u, i, j), calculating the positive degree PL of a positive sample and the negative degree NL of a negative sample, and further calculating the gap value G of the positive sample and the negative sample;
Figure FDA0004086860780000031
Figure FDA0004086860780000032
Figure FDA0004086860780000033
s53, taking the difference value G of the positive and negative samples as weight, and performing gradient descent on the training set
Figure FDA0004086860780000034
Training to obtain a user factor matrix U a Item factor matrix V a And an item offset vector B a The optimized loss function is as follows:
Figure FDA0004086860780000035
Figure FDA0004086860780000036
Figure FDA0004086860780000037
Figure FDA0004086860780000038
s54, for the articles which are not excessively scored by the user u, scoring according to the prediction
Figure FDA0004086860780000039
In order from large to small, the first K items are taken as the user u in the user-item subgroup->
Figure FDA00040868607800000310
Is selected based on the local recommendation list->
Figure FDA00040868607800000311
Wherein r is ui Representing the user u's score for item i,
Figure FDA00040868607800000312
expressed in subgroups +>
Figure FDA00040868607800000313
In which an excessive number of users is awarded to item j, and>
Figure FDA00040868607800000314
b is a translational bias, superscript a denotes the corresponding subgroup->
Figure FDA00040868607800000315
U a And V a Is a sub-group of users-goods
Figure FDA00040868607800000316
The user factor matrix and the item factor matrix in (1), B a Is a user-item subgroup->
Figure FDA00040868607800000317
The item bias vector in (1), λ is a regularization coefficient, and θ represents the set of all parameters to be trained, i.e., U a ,V a ,B a And | | represents the matrix norm.
5. The method of claim 1, wherein step S6 comprises the steps of:
s61, defining an item v in a local list
Figure FDA00040868607800000318
Is greater than the rank score pick>
Figure FDA00040868607800000319
/>
Figure FDA00040868607800000320
S62, defining the object v in subgroup
Figure FDA0004086860780000041
Characteristic weight ω of u,v
Figure FDA0004086860780000042
S63, according to the object v
Figure FDA0004086860780000043
Sorting score of +>
Figure FDA0004086860780000044
And subgroup->
Figure FDA0004086860780000045
Is based on the characteristic weight->
Figure FDA0004086860780000046
Calculating subgroup +>
Figure FDA0004086860780000047
A local score for each item v by user u;
Figure FDA0004086860780000048
s64, obtaining the global score of the item v according to the local score of the item v in the local recommendation list generated by each subgroup for the user u;
Figure FDA0004086860780000049
s65, arranging the global scores of the user u on the articles according to a descending order, and taking the maximum scoreK items as a global recommendation list L for user u u
Wherein, order (·) indicates that the object v is in
Figure FDA00040868607800000410
Rank of (1), N u And &>
Figure FDA00040868607800000411
Respectively represents the total number of items which are already marked by the user u and the subgroup of the user u>
Figure FDA00040868607800000412
Number of articles hit too far in, M v And &>
Figure FDA00040868607800000413
Respectively representing the total number of users who have given an item v an excessive number and in a subgroup->
Figure FDA00040868607800000414
The article v is given an excessive number of users.
6. The Top-K recommendation system based on local pairwise ordering and global decision fusion is characterized by comprising the following steps:
the same composition construction module is used for constructing a user same composition and an article same composition according to the user-article bipartite graph;
the anchor point selection module is used for selecting A user anchor points from the user composition and selecting A article anchor points from the article composition;
the subgroup acquisition module is used for acquiring a user subgroup corresponding to each user anchor point according to the correlation degree of the user and the user anchor point, and acquiring an article subgroup corresponding to each article anchor point according to the correlation degree of the article and the article anchor point;
the subgroup matching module is used for matching the user subgroups corresponding to the A user anchor points with the article subgroups corresponding to the A article anchor points to obtain A user-article subgroups;
the local recommendation list acquisition module is used for training each user-article subgroup by using a Bayesian personalized sorting algorithm with pairwise difference perception to obtain a prediction score of each article by the user, and acquiring the first K articles with the largest score as a local recommendation list of the user in the user-article subgroup;
and the global recommendation list acquisition module is used for fusing the local recommendation lists of the users in different user-item subgroups according to the contribution degrees of different subgroups to the global and the sequence of the items in the local recommendation lists to generate the global recommendation list of the user.
7. The system of claim 6, wherein the isomorphic construction module derives the user isomorphism and the item isomorphism by:
s11, all M users in the training set form a set U, all N articles form a set V, the sets U and V form a point set of a user-article bipartite graph, the set U forms a point set of a user with a same composition, and the set V forms a point set of an article with a same composition;
s12, if the user U belongs to the U and the score r of the item V belongs to the V exists in the training set uv Then, there is a side connecting user u and item v in the bipartite graph, and the weight of the side connecting is r uv
S13, if two users in the bipartite graph have commonly connected articles, in the user same graph, a connecting edge exists between the two users, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure FDA0004086860780000051
wherein, the first and the second end of the pipe are connected with each other,
Figure FDA0004086860780000052
and &>
Figure FDA0004086860780000053
For indicatingHuu (household) 1 And u 2 A scored item set, based on the number of items in the item set>
Figure FDA0004086860780000054
Representing user u 1 Marks v, and>
Figure FDA0004086860780000055
representing user u 2 Scoring the item v;
s14, if two articles in the bipartite graph have users connected together, in the article same graph, a connecting edge exists between the two articles, otherwise, the connecting edge does not exist, and the weight of the connecting edge is as follows:
Figure FDA0004086860780000056
wherein, the first and the second end of the pipe are connected with each other,
Figure FDA0004086860780000057
and &>
Figure FDA0004086860780000058
Representing an article v 1 And v 2 Scored user sets, <' > based on a predetermined number of user groups>
Figure FDA0004086860780000059
Representing user u to item v 1 Scoring of (4).
8. The system of claim 6, wherein the subgroup acquisition module obtains the subgroup of items corresponding to each item anchor by:
s31, each user anchor point is taken as a restart node respectively, random walk with restart is carried out, and a user convergence probability matrix is obtained
Figure FDA0004086860780000061
Respectively taking each article anchor point as a restart nodeAfter a random walk with restart, an article convergence probability matrix may be obtained>
Figure FDA0004086860780000062
S32, for each user u, adding C U The u-th line of (b) is arranged according to a descending order, the first rho multiplied by A user anchor points in the arrangement are taken, the user is allocated to the user anchor points, and for each article v, C is used V The line v of (2) is arranged according to a descending order, the front rho multiplied by A article anchor points in the arrangement are taken, and the article is distributed to the article anchor points;
s33, for user anchor point u a All users assigned to him constitute a subgroup of users
Figure FDA0004086860780000063
For an item anchor v a All items allocated to him constitute an item subgroup +>
Figure FDA0004086860780000064
Wherein M is the number of users in the training set, N is the number of articles in the training set, and rho is a subgroup scale control parameter.
9. The system of claim 6, wherein the local recommendation list acquisition module obtains the user's presence in a user-item subgroup by
Figure FDA0004086860780000065
Local recommendation list in (1):
s51, constructing preference pairs based on the observed data, wherein the preference pairs of all the users form a training set
Figure FDA0004086860780000066
S52, for each preference pair (u, i, j), calculating the positive degree PL of a positive sample and the negative degree NL of a negative sample, and further calculating the gap value G of the positive sample and the negative sample;
Figure FDA0004086860780000067
/>
Figure FDA0004086860780000068
Figure FDA0004086860780000069
s53, taking the difference value G of the positive and negative samples as weight, and performing gradient descent on the training set
Figure FDA00040868607800000610
Training to obtain a user factor matrix U a Item factor matrix V a And an item offset vector B a The optimized loss function is as follows:
Figure FDA0004086860780000071
Figure FDA0004086860780000072
Figure FDA0004086860780000073
Figure FDA0004086860780000074
s54, for the articles which are not excessively scored by the user u, scoring according to the prediction
Figure FDA0004086860780000075
In order from large to small, the first K items are taken as the user u in the user-item subgroup->
Figure FDA0004086860780000076
In a local recommendation list &>
Figure FDA0004086860780000077
Wherein r is ui Representing the user u's score for item i,
Figure FDA0004086860780000078
is indicated in a subgroup->
Figure FDA0004086860780000079
An excessive number of users is granted to item j, and>
Figure FDA00040868607800000710
b is a translational bias, superscript a denotes the corresponding subgroup->
Figure FDA00040868607800000711
U a And V a Is a user-item subgroup
Figure FDA00040868607800000712
The user factor matrix and the item factor matrix in (1), B a Is a user-item subgroup->
Figure FDA00040868607800000713
The item bias vector in (1), λ is a regularization coefficient, and θ represents the set of all parameters to be trained, i.e., U a ,V a ,B a And | | represents the matrix norm.
10. The system of claim 6, wherein the global recommendation list acquisition module generates the global recommendation list for the user by:
s61, defining an object v in a local list
Figure FDA00040868607800000714
Is greater than the rank score pick>
Figure FDA00040868607800000715
Figure FDA00040868607800000716
S62, defining the object v in subgroup
Figure FDA00040868607800000717
Characteristic weight ω of u,v
Figure FDA00040868607800000718
S63, according to the object v
Figure FDA00040868607800000719
Sorting score of +>
Figure FDA00040868607800000720
And subgroup->
Figure FDA00040868607800000721
Characteristic weight of +>
Figure FDA00040868607800000722
Calculating subgroup +>
Figure FDA00040868607800000723
A local score for each item v by the user u;
Figure FDA00040868607800000724
s64, obtaining a global score of the item v according to the local score of the item v in the local recommendation list generated by each subgroup for the user u;
Figure FDA0004086860780000081
s65, arranging the global scores of the user u on the items according to a descending order, and taking the maximum K items as a global recommendation list L of the user u u
Wherein, order (·) indicates that the object v is in
Figure FDA0004086860780000082
Rank of (1), N u And &>
Figure FDA0004086860780000083
Respectively represents the total number of items which are over hit by the user u and the sub-group->
Figure FDA0004086860780000084
Number of articles hit excessively in middle, M v And &>
Figure FDA0004086860780000085
Respectively indicates the total number of users who have overflowed the item v and the subgroup->
Figure FDA0004086860780000086
The article v is given an excessive number of users. />
CN201911232316.2A 2019-12-04 2019-12-04 Top-K recommendation method and system based on local pairwise ordering and global decision fusion Active CN111008334B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911232316.2A CN111008334B (en) 2019-12-04 2019-12-04 Top-K recommendation method and system based on local pairwise ordering and global decision fusion

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911232316.2A CN111008334B (en) 2019-12-04 2019-12-04 Top-K recommendation method and system based on local pairwise ordering and global decision fusion

Publications (2)

Publication Number Publication Date
CN111008334A CN111008334A (en) 2020-04-14
CN111008334B true CN111008334B (en) 2023-04-18

Family

ID=70115465

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911232316.2A Active CN111008334B (en) 2019-12-04 2019-12-04 Top-K recommendation method and system based on local pairwise ordering and global decision fusion

Country Status (1)

Country Link
CN (1) CN111008334B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112464098B (en) * 2020-12-05 2024-02-02 北京工业大学 Recommendation system item prediction method based on similarity pairwise ranking
CN112950324B (en) * 2021-03-15 2022-06-03 重庆邮电大学 Knowledge graph assisted pairwise sorting personalized merchant recommendation method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108363804A (en) * 2018-03-01 2018-08-03 浙江工业大学 Partial model Weighted Fusion Top-N films based on user clustering recommend method
CN108763362A (en) * 2018-05-17 2018-11-06 浙江工业大学 Method is recommended to the partial model Weighted Fusion Top-N films of selection based on random anchor point
CN109636509A (en) * 2018-11-20 2019-04-16 华中科技大学 A kind of score in predicting method based on non symmetrical distance building submatrix
CN110322053A (en) * 2019-06-13 2019-10-11 华中科技大学 A kind of score in predicting method constructing local matrix based on figure random walk

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150120722A1 (en) * 2013-10-31 2015-04-30 Telefonica Digital Espana, S.L.U. Method and system for providing multimedia content recommendations
US10679274B2 (en) * 2016-09-16 2020-06-09 Conduent Business Services, Llc Method and system for data processing to recommend list of physical stores in real-time

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108363804A (en) * 2018-03-01 2018-08-03 浙江工业大学 Partial model Weighted Fusion Top-N films based on user clustering recommend method
CN108763362A (en) * 2018-05-17 2018-11-06 浙江工业大学 Method is recommended to the partial model Weighted Fusion Top-N films of selection based on random anchor point
CN109636509A (en) * 2018-11-20 2019-04-16 华中科技大学 A kind of score in predicting method based on non symmetrical distance building submatrix
CN110322053A (en) * 2019-06-13 2019-10-11 华中科技大学 A kind of score in predicting method constructing local matrix based on figure random walk

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Binbin Hu.Local and global information fusion for top-n recommendation in heterogeneous information network.《In Proceedings of the 27th ACM International Conference on Information and Knowledge Management (CIKM '18)》.2018,全文. *
张俐.一种自适应局部和全局融合的协同过滤算法.《科学技术与工程》.2019,(第16期),全文. *
王科强.基于矩阵分解的个性化推荐系统.《中国博士学位论文全文数据库-信息科技辑》.2017,全文. *
赵震宇.基于深度学习和海云协同的推荐方法研究.《中国博士学位论文全文数据库-信息科技辑》.2019,全文. *

Also Published As

Publication number Publication date
CN111008334A (en) 2020-04-14

Similar Documents

Publication Publication Date Title
Pham et al. A general graph-based model for recommendation in event-based social networks
CN110555050B (en) Heterogeneous network node representation learning method based on meta-path
CN103971161B (en) Hybrid recommendation method based on Cauchy distribution quantum-behaved particle swarm optimization
CN111160954B (en) Recommendation method facing group object based on graph convolution network model
CN111008334B (en) Top-K recommendation method and system based on local pairwise ordering and global decision fusion
CN105354260B (en) The Mobile solution of a kind of mosaic society&#39;s network and item characteristic recommends method
CN111310045A (en) Network-embedded movie recommendation method based on meta-path
CN109815402A (en) Collaborative Filtering Recommendation Algorithm based on user characteristics
CN110322053B (en) Scoring prediction method for constructing local matrix based on graph random walk
CN107203590B (en) Personalized movie recommendation method based on improved NSGA-II
KR101356948B1 (en) User interest inference method and system in sns using topics on social activities with neighbors
CN107403390B (en) Friend recommendation method integrating Bayesian reasoning and random walk on graph
CN112948625A (en) Film recommendation method based on attribute heterogeneous information network embedding
CN110032682A (en) A kind of information recommendation list generation method, device and equipment
CN108628967A (en) A kind of e-learning group partition method generating network similarity based on study
CN114265986A (en) Information pushing method and system fusing knowledge graph structure and path semantics
CN110209946A (en) Based on social and community Products Show method, system and storage medium
Huang et al. Ponas: Progressive one-shot neural architecture search for very efficient deployment
CN114997476A (en) Commodity prediction method fusing commodity incidence relation
CN111191076A (en) Movie recommendation method based on prospect theory and multi-target evolution
CN112364245B (en) Top-K movie recommendation method based on heterogeneous information network embedding
CN105069140B (en) Recommendation method based on high-order user preference
CN108133426B (en) Social network link recommendation method
CN110825978B (en) Multitask collaborative filtering method based on neighbor user feature sharing
CN113849725A (en) Socialized recommendation method and system based on graph attention confrontation network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant