CN113407834A - Knowledge graph-assisted user multi-dimensional interest extraction method - Google Patents

Knowledge graph-assisted user multi-dimensional interest extraction method Download PDF

Info

Publication number
CN113407834A
CN113407834A CN202110674480.XA CN202110674480A CN113407834A CN 113407834 A CN113407834 A CN 113407834A CN 202110674480 A CN202110674480 A CN 202110674480A CN 113407834 A CN113407834 A CN 113407834A
Authority
CN
China
Prior art keywords
user
interest
knowledge graph
articles
entities
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110674480.XA
Other languages
Chinese (zh)
Inventor
毋立芳
杨然
简萌
张宸林
郭菁菁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN202110674480.XA priority Critical patent/CN113407834A/en
Publication of CN113407834A publication Critical patent/CN113407834A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Marketing (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Evolutionary Computation (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Artificial Intelligence (AREA)
  • Animal Behavior & Ethology (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a knowledge graph-assisted user multi-dimensional interest extraction method. The accurate user interest expression is one of key points for improving recommendation performance, most of the existing recommendation algorithms adopt a single and mixed vector to express the user interest, the user interest cannot be expressed from multiple dimensions, and the method for uniformly modeling the user interest ignores entanglement in a hidden vector, is easy to obtain suboptimal user interest expression and lacks certain interpretability. However, due to the sparsity problem of the interaction data, it is very difficult to learn the multi-dimensional interest expression of the user in the historical interaction data, and the related research is less at present. The invention designs a knowledge graph-assisted user multi-dimensional interest extraction network, can obtain multi-dimensional user interest expression and simultaneously provides certain interpretability. We verified the validity of the method on music and book data sets.

Description

Knowledge graph-assisted user multi-dimensional interest extraction method
Technical Field
The invention relates to a collaborative filtering technology, in particular to a collaborative filtering algorithm based on articles; a representation learning method based on a knowledge graph is specifically a Trans R algorithm. And also relates to data mining and deep learning technologies such as user interest modeling.
Background
With the development of multimedia technology and the popularization of handheld mobile electronic devices, people can acquire information on the network at any time and any place, the information amount generated by the internet in one day can not be seen by one person for a whole life, and in the face of the increasingly serious information overload problem, a recommendation system is produced at the discretion of the people, so that the recommendation system has the function of establishing more efficient connection between users and information, and a great amount of time and cost are saved. In 1994, a news recommendation system was designed by the Group Lens research Group of the computer system in the university of minnesota in the city of double cities, and the concept of Collaborative Filtering (CF) was proposed for the first time. Since birth, CF has not only been intensively studied in academia but also has been practically deployed in industry. The method mainly analyzes the historical interaction records of the user and the articles to predict the next possible interaction articles of the user, and fully utilizes collective intelligence to find out the content which can best meet the user interest. Item-based collaborative filtering (ICF for short) based on articles is recommended according to similarity among the articles, and the ICF considers that personalized interests of users are embodied in historical articles interacted by the users, and can be used as an important component of user interest modeling.
Knowledge map (KG) is a new concept proposed by Google in 2012. From an academic point of view, we can give a definition of knowledge-graph: a "knowledge graph is essentially a knowledge base of Semantic networks (Semantic networks). Knowledge graph is an important ring of cognitive intelligence, and has a series of applications in the fields of retrieval recommendation and the like, and knowledge-enabled intelligent recommendation becomes the mainstream of future recommendation. The knowledge graph is introduced into the recommendation system as auxiliary information, so that the problems of sparsity and cold start existing in the traditional recommendation system can be effectively solved, and a certain interpretability is provided for a final recommendation result. In order to mine the key features of data on KG, a knowledge-graph mapping (KGE) algorithm is required to encode information that can adequately characterize the original data as low-dimensional embedded features. The KGE algorithm can be generally divided into two categories, a translation distance model and a semantic matching model. The translation distance model measures the reasonableness of the fact through the distance between two entities by utilizing a distance-based scoring function, such as Trans E, Trans H, Trans R, Trans D and the like; the semantic matching model utilizes a similarity-based scoring function. Semantic matching models measure the trustworthiness of facts, such as Dist Multi and RESCAL, by matching the underlying semantics of the entities and the relationships contained in the vector space representation.
In the research of the personalized recommendation algorithm, user interest modeling is a constant topic. How to express, calculate and update user interests has become an important research topic. The user's interests tend to follow a hierarchical pattern, from higher level attributes (e.g., genre, director, actors) to specific lower level attributes (a specific movie), in the case of a movie recommendation scenario. Previous models regarding user interest extraction, such as DIN and DIEN methods in ali, apparently neglect this hierarchical model and only extract user interest from the item level. The user historical behavior information is considered to be important information, the recommendation effect can be effectively improved by extracting the user interest from the information, and the multi-dimensional interest expression of the user can be extracted from the historical interaction behavior of the user with the aid of an external information knowledge graph.
In summary, the recommended user interest expression has not achieved the optimal effect, most of research uses a uniform and mixed hidden vector to express the user, the expression of the user multi-dimensional interest is not considered, the performance improvement is limited, the multi-dimensional expression of the user interest is difficult to obtain due to the sparsity of interactive data and the problem of cold start, how to define multiple dimensions is a troublesome problem, and the interpretability is lacking. The invention provides a knowledge graph-assisted method for extracting multi-dimensional interest of a user.
Disclosure of Invention
Most of the existing recommendation algorithms adopt a single and mixed vector to express user interests, the user interests cannot be expressed from multiple dimensions, and the method for uniformly modeling the user interests ignores entanglement in hidden vectors, so that suboptimal user interest expression is easily obtained, and meanwhile, certain interpretability is lacked. Aiming at the requirements of the current recommendation system, the invention aims to explore a method for reasonably extracting the multi-dimensional interests of a user from a user historical interaction record, and obtains the expression of the user interests in different attribute spaces of an article with the assistance of an external information knowledge base, as shown in figure 1.
In order to express user interest from multiple dimensions, the invention discloses a knowledge graph assistance-based user multi-dimensional interest extraction method. The overall frame is shown in fig. 2. The data sets used in this method are both from academia and industry public data sets (Amazon-book: books, Last-FM: music). The method comprises three modules which are an input layer, a multi-dimensional interest extraction layer and a maximum interest response layer respectively. Inputting historical interactive articles and articles to be recommended of a user, wherein the used knowledge graph represents that a learning method is Trans R; the multi-dimensional interest extraction layer realizes the extraction of multi-dimensional interest of the user by aggregating different entities under different connection relations, namely different user interest expressions exist in different article attribute spaces, and the essence of the multi-dimensional interest extraction layer lies in clustering after articles are mapped to different semantic spaces; the maximum interest response layer firstly maps the item to be recommended to different item attribute spaces, then performs inner product with the corresponding interest of the user in the space, and selects the maximum value as the final prediction score.
The invention contents of each main module of the method are as follows:
1. input layer
The input layer is divided into two parts, firstly, historical articles interacted by a user are obtained in a data set, and the historical articles and target articles to be recommended are used as original input of a model; then, all the articles in the data set are in one-to-one correspondence with the entities in the knowledge graph, and the representation learning method Trans R is used for learning, namely, the entities are mapped into different relation spaces for comparison, as shown in FIG. 3. The triples (h, r, t) can be formed only when h + r ═ t is satisfied in the same relationship space, and the essence is that the entities (i.e., articles) in the knowledge graph are a complex of multiple attributes, different relationships concern different attributes of the entities, and two entities that are similar in the a relationship space may not be similar in the B relationship space. As shown in particular in figure 3.
2. Multi-dimensional interest extraction layer
The multi-dimensional interest extraction layer can obtain the interest expression of the user in different item attribute spaces. Firstly, according to the input user history interactive objects, the corresponding entities of the objects in the knowledge graph are found. Taking a movie as an example, the used knowledge graph does not contain users, and different movie entities are connected according to the relationship of the same director, the same actor and the like to form triplets of movie 1-director-movie 2, movie 3-actor-movie 4 and the like, as shown in fig. 4. After obtaining the historical interacted objects of the user and their respective knowledge connection entities, we extract the entities which are the same director as the interacted entities of the user, and it is worth emphasizing that since the historical objects of the user are not one in most cases, the obtained set of director entities does not have only one director but a plurality of directors, so that the set of entities expresses the interest of the user in the attribute of the director of the movie, and is not a specific director. Similarly, the entity interacted with the user is the same actor, so that the user is interested in the attribute of the actor of the movie, and the like. In the method, firstly, how many relationships are shared in the knowledge graph, that is, how many attributes of the kind of articles (movies) are, in each attribute, there is a corresponding expression of user interest. In addition, since the characteristics of Trans R are that even the same entity has different expressions in different relationship spaces, for example, the same movie, in the director relationship space, we focus on its director attribute, and in the actor relationship space, focus on the actor attribute, so that the characteristic expressions of the same movie in different relationship spaces are different.
3. Maximum interest response layer
The maximum interest response layer can finally obtain the interaction prediction score of the user on the article. After the multi-dimensional interest expression of the user is obtained, the method firstly maps the object to be recommended to different object attribute spaces by using different relation matrixes, then performs inner product with the corresponding interest of the user in the space, selects the maximum value as the final prediction score, and has the essence that the user likes a certain attribute of the object particularly, so that the object is recommended to the user reasonably. And after the maximum interest response layer outputs the interactive prediction scores of all the articles to be recommended of the user, all the articles are sorted in a descending order according to the scores, the top N articles are generated into a list to be recommended to the user, and the recommendation is completed.
Drawings
FIG. 1 is a schematic diagram illustrating user interest expression in different attribute spaces of an article;
FIG. 2 is a general block diagram of a method for extracting multi-dimensional user interests based on knowledge graph assistance in the present invention;
FIG. 3 is a schematic diagram of the Trans R algorithm;
FIG. 4 is a schematic view of a knowledge graph;
the specific implementation mode is as follows:
the invention discloses a knowledge graph-assisted personalized recommendation method for extracting multi-dimensional interests of a user. The method comprises the following concrete implementation steps:
the method comprises the following steps: data preprocessing and training set test set division: the data preprocessing is divided into two parts, one part is to select a proper public data set, sequence numbers are arranged for all users and articles, and the other part is to record user-article interaction records in a hidden feedback mode: if the interaction between the user and the object is in a scoring form, the label of the scored object can be marked as 1, otherwise the label of the scored object is 0. And then all the articles are in one-to-one correspondence with the entities in the knowledge graph. And finally, dividing a training set and a test set, wherein in the method, a knowledge graph represents that two tasks of learning and recommendation prediction are jointly trained, and for the recommendation prediction task, the proportion of positive samples in the training set and the test set is 4:1, through experimental verification, the knowledge graph shows that the performance is best when the ratio of positive and negative samples in a training set of the learning task and the recommendation and prediction task is 1: 1.
Step two: model input and Trans R training:
the input to the model is a set of historical item entities that have been interacted with by a user
Figure BDA0003120513170000051
Wherein
Figure BDA0003120513170000052
Is a historical item set interacted by a user and a target item to be recommended
Figure BDA0003120513170000053
And the articles are in one-to-one correspondence with the entities in the knowledge graph. First, to represent the knowledge of structuring, the present invention uses an undirected graph G ═ (V, R), where V ═ V1,v2...vNRepresents the set of entities in the knowledge-graph, N represents the total number of entities, R ═ R1,r2...rMAnd M represents the total number of the relations. In the Trans R algorithm, for each triplet (v)h,r,vt),vhAnd vtIs two connected head and tail entities, r represents the type of relationship between the two entities. Firstly, v ishAnd vtMapping to a particular space vh,vt∈RkK is the dimension of the mapped vector, the value is 64, and | h | calculation2≤1,||t||2≤1,r∈RdD is the dimension of the mapped vector, the value is 128, and | r | | y calculation2Less than or equal to 1. Then setting a conversion matrix M for the current relationshipr∈Rk·dWhich can convert an entity to a corresponding gateIn the system space, as shown in fig. 3. The method specifically comprises the following steps:
Figure BDA0003120513170000054
Figure BDA0003120513170000055
the scoring function for this one triplet is defined as:
Figure BDA0003120513170000056
step three: extracting the multi-dimensional interest of the user: the function of this layer is to obtain the expression of interest of the user in different item attribute spaces: i isu=fextractor(VuG ═ V, R)). Assuming that the used knowledge graph is complete, the relationship in the knowledge graph is the division of the attribute space of the class of articles, taking a movie as an example, and the relationship of a director, an actor and the like corresponds to the attribute division space of the entity of the movie, so that the number of relationships in the knowledge graph can be considered to be the same number of attribute spaces of the class of articles under the current cognition. Taking a history entity interacted by a user as a central point, sequentially acquiring interest expressions of the user in different attribute spaces of the article according to different relations, wherein a specific calculation formula is as follows:
Figure BDA0003120513170000061
Figure BDA0003120513170000062
......
Figure BDA0003120513170000063
and has r1≠r2≠...≠rM,r1,r2...rMe.R, in the method of the invention, only the information of hop1 in the knowledge graph is used without expandingThe large information aggregation range is considered that when the user interest is expressed, certain noise is introduced at the same time when high-order information is introduced, and the performance is further influenced; in addition, in the traditional method, the historical items interacted by the user are usually directly embodied by the user interests, and in the invention, when the interest expressions of the user in different attribute spaces of the items are obtained, the historical interaction items are also calculated in the interest expressions according to different relations.
Step four: maximum interest response obtains prediction score and recommendation list: after the interest expression of the user on different attribute spaces of the object is obtained, in order to obtain the most accurate interactive prediction score, the object to be recommended is respectively mapped to the space with the same multidimensional interest of the user for comparison, and the specific calculation formula is as follows:
Figure BDA0003120513170000064
wherein
Figure BDA0003120513170000065
Is the entity characteristic expression of the item to be recommended in the knowledge graph,
Figure BDA0003120513170000066
is a mapping matrix of any relation, and M is the number of relations in the knowledge graph. After the expressions of the to-be-recommended articles in different attribute spaces are obtained, inner products are respectively made with the corresponding interest characteristics of the user in the space, the maximum value is selected as the final prediction score, and the calculation formula is as follows:
Figure BDA0003120513170000071
wherein
Figure BDA0003120513170000072
Is an activation function. The nature of the maximum interest response layer is that the user likes a certain attribute of the article particularly, so there is reason to giveThis item is recommended to the user. And after the maximum interest response layer outputs the interactive prediction scores of all the articles to be recommended of the user, all the articles are sorted in a descending order according to the scores, and the top N articles are generated into a list to be recommended to the user.
Step five: optimization method and loss function: the loss function in the invention comprises three parts, wherein the first part is the loss function of the interactive prediction part. Two learning strategies commonly used in the recommendation system are a point (pointwise) and a pair (pair) optimization method, and the invention selects the point. The point method is widely applied to numerous recommendation algorithms and achieves excellent effects. It transforms the recommendation problem into a two-class task, minimizing the following objective function:
Figure BDA0003120513170000073
where δ (-) is a sigmoid function, controlling the prediction score between 0 and 1, R+Is a positive sample set, i.e. a set labeled 1, R-Is a negative sample set, i.e., a set labeled 0.
The second part is a loss function of a knowledge graph representing learning Trans R training part, wherein edge loss Margin loss is selected, the input sample pair is a positive sample set S and a negative sample set S' which are acquired from a training set, the difference between scores of positive and negative samples is larger than a threshold value gamma, and the target function is as follows:
Figure BDA0003120513170000074
where max (x, y) represents the maximum of x and y obtained and γ represents the threshold, set to 1.0. The third part is a regularization loss function for preventing model overfitting, and comprises two parts of parameters of interactive prediction and knowledge graph representation learning Trans R training, which are defined as follows:
Lreg=LKG_reg+LRec_reg=λ(||θKG||2+||θRec||2)
hyper-parametric lambda control L2The strength of regularization was set to 10-7,θKGAnd thetaRecModel parameters to prevent overfitting in separate table knowledge-graph training and cross-prediction, where θKGAn initialization vector representing the entities and relationships,
θRecincluding initialization vectors for the user and the item.
The loss function of the invention consists of the three parts, and the joint training is as follows: l ═ LRec+LKG+Lre
Step six: and (3) verifying the validity of the method: after model training is completed, in order to verify the effectiveness of the method, the method is carried out in a public data set Amazon-book: book, Last-FM: experiments were performed musically. After the interactive prediction scores of the target users for the items to be recommended are obtained, the Top 20, 40,60,80 and 100 items with the highest scores are selected for each target user to form a Top-N personalized recommendation list. Evaluation indexes used in the experiment are Recall (correct Rate for all positive samples), Precision (correct Rate for Top-N list), Hit Rate (check probability of hitting any positive sample in Top-N list), and Normalized dispersed relationship Gain (NDCG focuses on the position where a positive sample appears in the Top-N recommendation list, and NDCG increases the further forward the position). Table 1 shows the performance of the method of the invention on two public data sets.
Table one: experimental performance display of the method on Amazon-book and Last-FM data sets
Figure BDA0003120513170000081
Figure BDA0003120513170000091

Claims (7)

1. A method for extracting multi-dimensional user interest based on knowledge graph assistance is characterized by comprising the following steps: the method comprises the following steps:
s1, preprocessing data: obtaining a suitable data set comprising a user ID, an item ID, and a tag of a user-item interaction; acquiring a knowledge graph file, enabling the ID of the article to correspond to the entities in the knowledge graph one by one, and connecting different entities by using a triple of a head entity-relation-tail entity; the proportion of positive samples in the training set and the test set of the recommendation and prediction part is 4:1, and the knowledge graph shows that the proportion of the positive samples and the negative samples in the training set of the learning task and the recommendation and prediction task is 1: 1;
s2, model input and Trans R training: acquiring an entity set corresponding to the historical article interacted by the user on the knowledge graph in the data set, and taking the entity set and the target article to be recommended as the original input of the model; using a representation learning method Trans R to learn the knowledge structure, namely mapping the entity to different relation spaces for comparison;
s3, extracting multi-dimensional interest of the user: according to the entity set interacted with the user history, extracting the entities interacted with the user from the entities connected by a relationship to obtain the interest expression of the user in the item attribute (relationship) space; the method comprises the following steps of (1) acquiring the common relation of a knowledge graph, namely the attributes of the articles, extracting corresponding user interest expressions in each attribute, and finally obtaining the multi-dimensional interest expressions of the users;
s4, obtaining the prediction score according to the maximum interest response: after the multi-dimensional interest expression of the user is obtained, firstly, mapping the to-be-recommended articles to different article attribute spaces by using different relation matrixes, then, performing inner product with the corresponding interest of the user in the space, and selecting the maximum value as a final prediction score;
s5, optimization method and loss function: the recommended prediction part adopts cross entropy loss log loss, the Trans R algorithm part adopts edge loss Margin loss, and the two parts of initialization parameters are normalized by L2, so that a total loss function is formed by the three parts of initialization parameters;
s6, generating a recommendation list: and after the predicted user-item interaction scores are obtained, all the items are sorted in a descending order according to the scores for each user, and the top N items are generated into a list to be recommended to the user.
2. The method for extracting the multi-dimensional interest of the user based on knowledge graph assistance as claimed in claim 1, wherein the user-item interaction record is recorded in a hidden feedback manner: if the interaction of the data set is in a scoring form, the scored article with the score being greater than or equal to 4 is scored as a label of 1 and is scored as a positive sample, and the rest is 0 and is scored as a negative sample; for each user, the positive sample ratio of the training set to the test set is 4:1, and the positive sample ratio of the training set to the test set is 1: 1.
3. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
the input to the model is a set of historical item entities that have been interacted with by a user
Figure FDA0003120513160000021
Wherein
Figure FDA0003120513160000022
Is a historical item set interacted by a user and a target item to be recommended
Figure FDA0003120513160000023
The articles are in one-to-one correspondence with the entities in the knowledge graph; first, to represent the structured knowledge, an undirected graph G ═ (V, R) is used, where V ═ V1,v2...vNRepresents the set of entities in the knowledge-graph, N represents the total number of entities, R ═ R1,r2...rMExpressing a relation set in the knowledge graph, and expressing the total number of relations by M; in the Trans R algorithm, for each triplet (v)h,r,vt),vhAnd vtIs two connected head and tail entities, r represents the relationship type between the two entities; firstly, v ishAnd vtMapping to a particular space vh,vt∈RkK is the dimension of the mapped vector, the value is 64, and | h | calculation2≤1,||t||2≤1,r∈RdD is the dimension of the mapped vector, the value is 128, and | r | | y calculation2Less than or equal to 1; then setting a conversion matrix M for the current relationshipr∈Rk·dIt can convert the entity into the corresponding relationship space, specifically:
Figure FDA0003120513160000024
the scoring function for this one triplet is defined as:
Figure FDA0003120513160000025
4. the method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
the multi-dimensional interest extraction layer of the user obtains the interest expression of the user in different article attribute spaces: i isu=fextractor(VuG ═ V, R)); assuming that the used knowledge graph is complete, the relationship in the knowledge graph is the division of the attribute space of the class of articles, and the number of relationships in the knowledge graph can be regarded as that the class of articles have the same number of attribute spaces under the current cognition; taking a history entity interacted by a user as a central point, sequentially acquiring interest expressions of the user in different attribute spaces of the article according to different relations, wherein a specific calculation formula is as follows:
Figure FDA0003120513160000031
Figure FDA0003120513160000032
......
Figure FDA0003120513160000033
and has r1≠r2≠...≠rM,r1,r2...rMThe e belongs to R, only the information of hop1 in the knowledge graph is used, the information aggregation range is not expanded, and when the interest expression of the user in different attribute spaces of the article is obtained, the historical interactive article is also calculated in the interest expression according to different relations.
5. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
after the interest expressions of the users on different attribute spaces of the articles are obtained, in order to obtain the most accurate interactive prediction score, the articles to be recommended are respectively mapped to the spaces with the same multidimensional interest of the users for comparison, and the specific calculation formula is as follows:
Figure FDA0003120513160000034
wherein
Figure FDA0003120513160000035
Is the entity characteristic expression of the item to be recommended in the knowledge graph,
Figure FDA0003120513160000036
is a mapping matrix of any relation, and M is the number of the relations in the knowledge graph; after the expressions of the to-be-recommended articles in different attribute spaces are obtained, inner products are respectively made with the corresponding interest characteristics of the user in the space, the maximum value is selected as the final prediction score, and the calculation formula is as follows:
Figure FDA0003120513160000037
wherein
Figure FDA0003120513160000038
Is an activation function.
6. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
the loss function comprises three parts, wherein the first part is the loss function of the interactive prediction part; the point method converts the recommendation problem into a two-class task, minimizing the following objective function:
Figure FDA0003120513160000039
where δ (-) is a sigmoid function, controlling the prediction score between 0 and 1, R+Is a positive sample set, i.e. a set labeled 1, R-Is a negative sample set, i.e., a set labeled 0;
the second part is a loss function of a knowledge graph representing learning Trans R training part, wherein edge loss Margin loss is selected, the input sample pair is a positive sample set S and a negative sample set S' which are acquired from a training set, the difference between scores of positive and negative samples is larger than a threshold value gamma, and the target function is as follows:
Figure FDA0003120513160000041
where max (x, y) represents the maximum of x and y obtained, γ represents the threshold, set to 1.0;
the third part is a regularization loss function for preventing model overfitting, and comprises two parts of parameters of interactive prediction and knowledge graph representation learning Trans R training, which are defined as follows:
Lreg=LKG_reg+LRec_reg=λ(||θKG||2+||θRec||2)
hyper-parametric lambda control L2The strength of regularization was set to 10-7,θKGAnd thetaRecModel parameters to prevent overfitting in separate table knowledge-graph training and cross-prediction, where θKGInitialization vectors, θ, representing entities and relationshipsRecAn initialization vector comprising a user and an item;
the loss function consists of the above three parts, joint training: l ═ LRec+LKG+Lreg
7. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
and after the maximum interest response layer outputs the interactive prediction scores of all the articles to be recommended of the user, for each user, performing descending sorting on all the articles according to the scores, and generating a list of the first N articles to be recommended to the user.
CN202110674480.XA 2021-06-18 2021-06-18 Knowledge graph-assisted user multi-dimensional interest extraction method Pending CN113407834A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110674480.XA CN113407834A (en) 2021-06-18 2021-06-18 Knowledge graph-assisted user multi-dimensional interest extraction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110674480.XA CN113407834A (en) 2021-06-18 2021-06-18 Knowledge graph-assisted user multi-dimensional interest extraction method

Publications (1)

Publication Number Publication Date
CN113407834A true CN113407834A (en) 2021-09-17

Family

ID=77684880

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110674480.XA Pending CN113407834A (en) 2021-06-18 2021-06-18 Knowledge graph-assisted user multi-dimensional interest extraction method

Country Status (1)

Country Link
CN (1) CN113407834A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113822776A (en) * 2021-09-29 2021-12-21 中国平安财产保险股份有限公司 Course recommendation method, device, equipment and storage medium
CN113923006A (en) * 2021-09-30 2022-01-11 北京淇瑀信息科技有限公司 Equipment data authentication method and device and electronic equipment
CN114254194A (en) * 2021-12-20 2022-03-29 北京工业大学 Attention-based multi-interest sequence recommendation method and system
CN114861059A (en) * 2022-05-18 2022-08-05 北京百度网讯科技有限公司 Resource recommendation method and device, electronic equipment and storage medium
CN114936328A (en) * 2022-04-22 2022-08-23 北京工业大学 Graph convolution interest decoupling method based on Transformer model
CN116628179A (en) * 2023-05-30 2023-08-22 道有道科技集团股份公司 User operation data visualization and man-machine interaction recommendation method
CN117117923A (en) * 2023-10-19 2023-11-24 深圳市百酷新能源有限公司 Big data-based energy storage control grid-connected management method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110955834A (en) * 2019-11-27 2020-04-03 西北工业大学 Knowledge graph driven personalized accurate recommendation method
CN111523029A (en) * 2020-04-20 2020-08-11 浙江大学 Personalized recommendation method based on knowledge graph representation learning
US20210027178A1 (en) * 2019-07-26 2021-01-28 Ricoh Company, Ltd. Recommendation method and recommendation apparatus based on deep reinforcement learning, and non-transitory computer-readable recording medium
CN112328908A (en) * 2020-11-11 2021-02-05 北京工业大学 Personalized recommendation method based on collaborative filtering
CN112765486A (en) * 2021-01-22 2021-05-07 重庆邮电大学 Knowledge graph fused attention mechanism movie recommendation method
CN112800334A (en) * 2021-02-04 2021-05-14 河海大学 Collaborative filtering recommendation method and device based on knowledge graph and deep learning

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210027178A1 (en) * 2019-07-26 2021-01-28 Ricoh Company, Ltd. Recommendation method and recommendation apparatus based on deep reinforcement learning, and non-transitory computer-readable recording medium
CN110955834A (en) * 2019-11-27 2020-04-03 西北工业大学 Knowledge graph driven personalized accurate recommendation method
CN111523029A (en) * 2020-04-20 2020-08-11 浙江大学 Personalized recommendation method based on knowledge graph representation learning
CN112328908A (en) * 2020-11-11 2021-02-05 北京工业大学 Personalized recommendation method based on collaborative filtering
CN112765486A (en) * 2021-01-22 2021-05-07 重庆邮电大学 Knowledge graph fused attention mechanism movie recommendation method
CN112800334A (en) * 2021-02-04 2021-05-14 河海大学 Collaborative filtering recommendation method and device based on knowledge graph and deep learning

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
侯位昭 等: "融合知识图谱及贝叶斯网络的智能推荐方法", 中国电子科学研究院学报, no. 05, 20 May 2020 (2020-05-20) *
杨晋吉 等: "一种知识图谱的排序学习个性化推荐算法", 小型微型计算机系统, no. 11, 15 November 2018 (2018-11-15) *
许鹏程 等: "数据驱动下数字图书馆用户画像模型构建", 图书情报工作, 20 February 2019 (2019-02-20) *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113822776A (en) * 2021-09-29 2021-12-21 中国平安财产保险股份有限公司 Course recommendation method, device, equipment and storage medium
CN113822776B (en) * 2021-09-29 2023-11-03 中国平安财产保险股份有限公司 Course recommendation method, device, equipment and storage medium
CN113923006A (en) * 2021-09-30 2022-01-11 北京淇瑀信息科技有限公司 Equipment data authentication method and device and electronic equipment
CN113923006B (en) * 2021-09-30 2024-02-02 北京淇瑀信息科技有限公司 Equipment data authentication method and device and electronic equipment
CN114254194A (en) * 2021-12-20 2022-03-29 北京工业大学 Attention-based multi-interest sequence recommendation method and system
CN114254194B (en) * 2021-12-20 2024-04-26 北京工业大学 Attention-based multi-interest sequence recommendation method and system
CN114936328A (en) * 2022-04-22 2022-08-23 北京工业大学 Graph convolution interest decoupling method based on Transformer model
CN114861059A (en) * 2022-05-18 2022-08-05 北京百度网讯科技有限公司 Resource recommendation method and device, electronic equipment and storage medium
CN116628179A (en) * 2023-05-30 2023-08-22 道有道科技集团股份公司 User operation data visualization and man-machine interaction recommendation method
CN116628179B (en) * 2023-05-30 2023-12-22 道有道科技集团股份公司 User operation data visualization and man-machine interaction recommendation method
CN117117923A (en) * 2023-10-19 2023-11-24 深圳市百酷新能源有限公司 Big data-based energy storage control grid-connected management method and system
CN117117923B (en) * 2023-10-19 2024-04-05 深圳市百酷新能源有限公司 Big data-based energy storage control grid-connected management method and system

Similar Documents

Publication Publication Date Title
CN113407834A (en) Knowledge graph-assisted user multi-dimensional interest extraction method
CN110750656B (en) Multimedia detection method based on knowledge graph
Joy et al. Review and classification of content recommenders in E-learning environment
Nie et al. Data-driven answer selection in community QA systems
CN109947987B (en) Cross collaborative filtering recommendation method
CN113158023B (en) Public digital life accurate classification service method based on mixed recommendation algorithm
CN110110225B (en) Online education recommendation model based on user behavior data analysis and construction method
CN108897791B (en) Image retrieval method based on depth convolution characteristics and semantic similarity measurement
CN111125540A (en) Recommendation method integrating knowledge graph representation learning and bias matrix decomposition
Tran A survey of machine learning and data mining techniques used in multimedia system
CN112328832B (en) Movie recommendation method integrating labels and knowledge graph
CN112328908B (en) Personalized recommendation method based on collaborative filtering
Venkatraman et al. MapReduce neural network framework for efficient content based image retrieval from large datasets in the cloud
Wang et al. An enhanced multi-modal recommendation based on alternate training with knowledge graph representation
Ji et al. Semantic softmax loss for zero-shot learning
de Ves et al. Modeling user preferences in content-based image retrieval: A novel attempt to bridge the semantic gap
CN116578729B (en) Content search method, apparatus, electronic device, storage medium, and program product
Zhu A book recommendation algorithm based on collaborative filtering
CN116484067A (en) Target object matching method and device and computer equipment
Guo et al. Getting more from one attractive scene: Venue retrieval in micro-videos
CN115757897A (en) Intelligent culture resource recommendation method based on knowledge graph convolution network
Loor et al. Handling subjective information through augmented (fuzzy) computation
Wang et al. Training-free indexing refinement for visual media via multi-semantics
Ma et al. Learning resource recommendation via knowledge graphs and learning style clustering
Beibei Automatic Integration Algorithm of Vocal Performance Learning Materials Based on Multidimensional Association Rules

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination