CN113407834A - Knowledge graph-assisted user multi-dimensional interest extraction method - Google Patents
Knowledge graph-assisted user multi-dimensional interest extraction method Download PDFInfo
- Publication number
- CN113407834A CN113407834A CN202110674480.XA CN202110674480A CN113407834A CN 113407834 A CN113407834 A CN 113407834A CN 202110674480 A CN202110674480 A CN 202110674480A CN 113407834 A CN113407834 A CN 113407834A
- Authority
- CN
- China
- Prior art keywords
- user
- interest
- knowledge graph
- articles
- entities
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000605 extraction Methods 0.000 title claims abstract description 13
- 238000000034 method Methods 0.000 claims abstract description 43
- 230000014509 gene expression Effects 0.000 claims abstract description 35
- 230000003993 interaction Effects 0.000 claims abstract description 14
- 239000013598 vector Substances 0.000 claims abstract description 14
- 230000006870 function Effects 0.000 claims description 27
- 238000012549 training Methods 0.000 claims description 21
- 230000002452 interceptive effect Effects 0.000 claims description 14
- 238000004364 calculation method Methods 0.000 claims description 10
- 230000004044 response Effects 0.000 claims description 10
- 238000013507 mapping Methods 0.000 claims description 7
- 238000012360 testing method Methods 0.000 claims description 6
- 239000011159 matrix material Substances 0.000 claims description 4
- 238000005457 optimization Methods 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 claims description 3
- 101100339496 Caenorhabditis elegans hop-1 gene Proteins 0.000 claims description 2
- 230000004913 activation Effects 0.000 claims description 2
- 230000002776 aggregation Effects 0.000 claims description 2
- 238000004220 aggregation Methods 0.000 claims description 2
- 238000006243 chemical reaction Methods 0.000 claims description 2
- 230000019771 cognition Effects 0.000 claims description 2
- 238000011160 research Methods 0.000 abstract description 5
- 238000001914 filtration Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000006399 behavior Effects 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- RPNUMPOLZDHAAY-UHFFFAOYSA-N Diethylenetriamine Chemical compound NCCNCCN RPNUMPOLZDHAAY-UHFFFAOYSA-N 0.000 description 1
- 230000004931 aggregating effect Effects 0.000 description 1
- 230000001149 cognitive effect Effects 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0631—Item recommendations
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- General Engineering & Computer Science (AREA)
- Life Sciences & Earth Sciences (AREA)
- Finance (AREA)
- Accounting & Taxation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Marketing (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Development Economics (AREA)
- Economics (AREA)
- Evolutionary Computation (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Artificial Intelligence (AREA)
- Animal Behavior & Ethology (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a knowledge graph-assisted user multi-dimensional interest extraction method. The accurate user interest expression is one of key points for improving recommendation performance, most of the existing recommendation algorithms adopt a single and mixed vector to express the user interest, the user interest cannot be expressed from multiple dimensions, and the method for uniformly modeling the user interest ignores entanglement in a hidden vector, is easy to obtain suboptimal user interest expression and lacks certain interpretability. However, due to the sparsity problem of the interaction data, it is very difficult to learn the multi-dimensional interest expression of the user in the historical interaction data, and the related research is less at present. The invention designs a knowledge graph-assisted user multi-dimensional interest extraction network, can obtain multi-dimensional user interest expression and simultaneously provides certain interpretability. We verified the validity of the method on music and book data sets.
Description
Technical Field
The invention relates to a collaborative filtering technology, in particular to a collaborative filtering algorithm based on articles; a representation learning method based on a knowledge graph is specifically a Trans R algorithm. And also relates to data mining and deep learning technologies such as user interest modeling.
Background
With the development of multimedia technology and the popularization of handheld mobile electronic devices, people can acquire information on the network at any time and any place, the information amount generated by the internet in one day can not be seen by one person for a whole life, and in the face of the increasingly serious information overload problem, a recommendation system is produced at the discretion of the people, so that the recommendation system has the function of establishing more efficient connection between users and information, and a great amount of time and cost are saved. In 1994, a news recommendation system was designed by the Group Lens research Group of the computer system in the university of minnesota in the city of double cities, and the concept of Collaborative Filtering (CF) was proposed for the first time. Since birth, CF has not only been intensively studied in academia but also has been practically deployed in industry. The method mainly analyzes the historical interaction records of the user and the articles to predict the next possible interaction articles of the user, and fully utilizes collective intelligence to find out the content which can best meet the user interest. Item-based collaborative filtering (ICF for short) based on articles is recommended according to similarity among the articles, and the ICF considers that personalized interests of users are embodied in historical articles interacted by the users, and can be used as an important component of user interest modeling.
Knowledge map (KG) is a new concept proposed by Google in 2012. From an academic point of view, we can give a definition of knowledge-graph: a "knowledge graph is essentially a knowledge base of Semantic networks (Semantic networks). Knowledge graph is an important ring of cognitive intelligence, and has a series of applications in the fields of retrieval recommendation and the like, and knowledge-enabled intelligent recommendation becomes the mainstream of future recommendation. The knowledge graph is introduced into the recommendation system as auxiliary information, so that the problems of sparsity and cold start existing in the traditional recommendation system can be effectively solved, and a certain interpretability is provided for a final recommendation result. In order to mine the key features of data on KG, a knowledge-graph mapping (KGE) algorithm is required to encode information that can adequately characterize the original data as low-dimensional embedded features. The KGE algorithm can be generally divided into two categories, a translation distance model and a semantic matching model. The translation distance model measures the reasonableness of the fact through the distance between two entities by utilizing a distance-based scoring function, such as Trans E, Trans H, Trans R, Trans D and the like; the semantic matching model utilizes a similarity-based scoring function. Semantic matching models measure the trustworthiness of facts, such as Dist Multi and RESCAL, by matching the underlying semantics of the entities and the relationships contained in the vector space representation.
In the research of the personalized recommendation algorithm, user interest modeling is a constant topic. How to express, calculate and update user interests has become an important research topic. The user's interests tend to follow a hierarchical pattern, from higher level attributes (e.g., genre, director, actors) to specific lower level attributes (a specific movie), in the case of a movie recommendation scenario. Previous models regarding user interest extraction, such as DIN and DIEN methods in ali, apparently neglect this hierarchical model and only extract user interest from the item level. The user historical behavior information is considered to be important information, the recommendation effect can be effectively improved by extracting the user interest from the information, and the multi-dimensional interest expression of the user can be extracted from the historical interaction behavior of the user with the aid of an external information knowledge graph.
In summary, the recommended user interest expression has not achieved the optimal effect, most of research uses a uniform and mixed hidden vector to express the user, the expression of the user multi-dimensional interest is not considered, the performance improvement is limited, the multi-dimensional expression of the user interest is difficult to obtain due to the sparsity of interactive data and the problem of cold start, how to define multiple dimensions is a troublesome problem, and the interpretability is lacking. The invention provides a knowledge graph-assisted method for extracting multi-dimensional interest of a user.
Disclosure of Invention
Most of the existing recommendation algorithms adopt a single and mixed vector to express user interests, the user interests cannot be expressed from multiple dimensions, and the method for uniformly modeling the user interests ignores entanglement in hidden vectors, so that suboptimal user interest expression is easily obtained, and meanwhile, certain interpretability is lacked. Aiming at the requirements of the current recommendation system, the invention aims to explore a method for reasonably extracting the multi-dimensional interests of a user from a user historical interaction record, and obtains the expression of the user interests in different attribute spaces of an article with the assistance of an external information knowledge base, as shown in figure 1.
In order to express user interest from multiple dimensions, the invention discloses a knowledge graph assistance-based user multi-dimensional interest extraction method. The overall frame is shown in fig. 2. The data sets used in this method are both from academia and industry public data sets (Amazon-book: books, Last-FM: music). The method comprises three modules which are an input layer, a multi-dimensional interest extraction layer and a maximum interest response layer respectively. Inputting historical interactive articles and articles to be recommended of a user, wherein the used knowledge graph represents that a learning method is Trans R; the multi-dimensional interest extraction layer realizes the extraction of multi-dimensional interest of the user by aggregating different entities under different connection relations, namely different user interest expressions exist in different article attribute spaces, and the essence of the multi-dimensional interest extraction layer lies in clustering after articles are mapped to different semantic spaces; the maximum interest response layer firstly maps the item to be recommended to different item attribute spaces, then performs inner product with the corresponding interest of the user in the space, and selects the maximum value as the final prediction score.
The invention contents of each main module of the method are as follows:
1. input layer
The input layer is divided into two parts, firstly, historical articles interacted by a user are obtained in a data set, and the historical articles and target articles to be recommended are used as original input of a model; then, all the articles in the data set are in one-to-one correspondence with the entities in the knowledge graph, and the representation learning method Trans R is used for learning, namely, the entities are mapped into different relation spaces for comparison, as shown in FIG. 3. The triples (h, r, t) can be formed only when h + r ═ t is satisfied in the same relationship space, and the essence is that the entities (i.e., articles) in the knowledge graph are a complex of multiple attributes, different relationships concern different attributes of the entities, and two entities that are similar in the a relationship space may not be similar in the B relationship space. As shown in particular in figure 3.
2. Multi-dimensional interest extraction layer
The multi-dimensional interest extraction layer can obtain the interest expression of the user in different item attribute spaces. Firstly, according to the input user history interactive objects, the corresponding entities of the objects in the knowledge graph are found. Taking a movie as an example, the used knowledge graph does not contain users, and different movie entities are connected according to the relationship of the same director, the same actor and the like to form triplets of movie 1-director-movie 2, movie 3-actor-movie 4 and the like, as shown in fig. 4. After obtaining the historical interacted objects of the user and their respective knowledge connection entities, we extract the entities which are the same director as the interacted entities of the user, and it is worth emphasizing that since the historical objects of the user are not one in most cases, the obtained set of director entities does not have only one director but a plurality of directors, so that the set of entities expresses the interest of the user in the attribute of the director of the movie, and is not a specific director. Similarly, the entity interacted with the user is the same actor, so that the user is interested in the attribute of the actor of the movie, and the like. In the method, firstly, how many relationships are shared in the knowledge graph, that is, how many attributes of the kind of articles (movies) are, in each attribute, there is a corresponding expression of user interest. In addition, since the characteristics of Trans R are that even the same entity has different expressions in different relationship spaces, for example, the same movie, in the director relationship space, we focus on its director attribute, and in the actor relationship space, focus on the actor attribute, so that the characteristic expressions of the same movie in different relationship spaces are different.
3. Maximum interest response layer
The maximum interest response layer can finally obtain the interaction prediction score of the user on the article. After the multi-dimensional interest expression of the user is obtained, the method firstly maps the object to be recommended to different object attribute spaces by using different relation matrixes, then performs inner product with the corresponding interest of the user in the space, selects the maximum value as the final prediction score, and has the essence that the user likes a certain attribute of the object particularly, so that the object is recommended to the user reasonably. And after the maximum interest response layer outputs the interactive prediction scores of all the articles to be recommended of the user, all the articles are sorted in a descending order according to the scores, the top N articles are generated into a list to be recommended to the user, and the recommendation is completed.
Drawings
FIG. 1 is a schematic diagram illustrating user interest expression in different attribute spaces of an article;
FIG. 2 is a general block diagram of a method for extracting multi-dimensional user interests based on knowledge graph assistance in the present invention;
FIG. 3 is a schematic diagram of the Trans R algorithm;
FIG. 4 is a schematic view of a knowledge graph;
the specific implementation mode is as follows:
the invention discloses a knowledge graph-assisted personalized recommendation method for extracting multi-dimensional interests of a user. The method comprises the following concrete implementation steps:
the method comprises the following steps: data preprocessing and training set test set division: the data preprocessing is divided into two parts, one part is to select a proper public data set, sequence numbers are arranged for all users and articles, and the other part is to record user-article interaction records in a hidden feedback mode: if the interaction between the user and the object is in a scoring form, the label of the scored object can be marked as 1, otherwise the label of the scored object is 0. And then all the articles are in one-to-one correspondence with the entities in the knowledge graph. And finally, dividing a training set and a test set, wherein in the method, a knowledge graph represents that two tasks of learning and recommendation prediction are jointly trained, and for the recommendation prediction task, the proportion of positive samples in the training set and the test set is 4:1, through experimental verification, the knowledge graph shows that the performance is best when the ratio of positive and negative samples in a training set of the learning task and the recommendation and prediction task is 1: 1.
Step two: model input and Trans R training:
the input to the model is a set of historical item entities that have been interacted with by a userWhereinIs a historical item set interacted by a user and a target item to be recommendedAnd the articles are in one-to-one correspondence with the entities in the knowledge graph. First, to represent the knowledge of structuring, the present invention uses an undirected graph G ═ (V, R), where V ═ V1,v2...vNRepresents the set of entities in the knowledge-graph, N represents the total number of entities, R ═ R1,r2...rMAnd M represents the total number of the relations. In the Trans R algorithm, for each triplet (v)h,r,vt),vhAnd vtIs two connected head and tail entities, r represents the type of relationship between the two entities. Firstly, v ishAnd vtMapping to a particular space vh,vt∈RkK is the dimension of the mapped vector, the value is 64, and | h | calculation2≤1,||t||2≤1,r∈RdD is the dimension of the mapped vector, the value is 128, and | r | | y calculation2Less than or equal to 1. Then setting a conversion matrix M for the current relationshipr∈Rk·dWhich can convert an entity to a corresponding gateIn the system space, as shown in fig. 3. The method specifically comprises the following steps: the scoring function for this one triplet is defined as:
step three: extracting the multi-dimensional interest of the user: the function of this layer is to obtain the expression of interest of the user in different item attribute spaces: i isu=fextractor(VuG ═ V, R)). Assuming that the used knowledge graph is complete, the relationship in the knowledge graph is the division of the attribute space of the class of articles, taking a movie as an example, and the relationship of a director, an actor and the like corresponds to the attribute division space of the entity of the movie, so that the number of relationships in the knowledge graph can be considered to be the same number of attribute spaces of the class of articles under the current cognition. Taking a history entity interacted by a user as a central point, sequentially acquiring interest expressions of the user in different attribute spaces of the article according to different relations, wherein a specific calculation formula is as follows:
......
and has r1≠r2≠...≠rM,r1,r2...rMe.R, in the method of the invention, only the information of hop1 in the knowledge graph is used without expandingThe large information aggregation range is considered that when the user interest is expressed, certain noise is introduced at the same time when high-order information is introduced, and the performance is further influenced; in addition, in the traditional method, the historical items interacted by the user are usually directly embodied by the user interests, and in the invention, when the interest expressions of the user in different attribute spaces of the items are obtained, the historical interaction items are also calculated in the interest expressions according to different relations.
Step four: maximum interest response obtains prediction score and recommendation list: after the interest expression of the user on different attribute spaces of the object is obtained, in order to obtain the most accurate interactive prediction score, the object to be recommended is respectively mapped to the space with the same multidimensional interest of the user for comparison, and the specific calculation formula is as follows:
whereinIs the entity characteristic expression of the item to be recommended in the knowledge graph,is a mapping matrix of any relation, and M is the number of relations in the knowledge graph. After the expressions of the to-be-recommended articles in different attribute spaces are obtained, inner products are respectively made with the corresponding interest characteristics of the user in the space, the maximum value is selected as the final prediction score, and the calculation formula is as follows:
whereinIs an activation function. The nature of the maximum interest response layer is that the user likes a certain attribute of the article particularly, so there is reason to giveThis item is recommended to the user. And after the maximum interest response layer outputs the interactive prediction scores of all the articles to be recommended of the user, all the articles are sorted in a descending order according to the scores, and the top N articles are generated into a list to be recommended to the user.
Step five: optimization method and loss function: the loss function in the invention comprises three parts, wherein the first part is the loss function of the interactive prediction part. Two learning strategies commonly used in the recommendation system are a point (pointwise) and a pair (pair) optimization method, and the invention selects the point. The point method is widely applied to numerous recommendation algorithms and achieves excellent effects. It transforms the recommendation problem into a two-class task, minimizing the following objective function:
where δ (-) is a sigmoid function, controlling the prediction score between 0 and 1, R+Is a positive sample set, i.e. a set labeled 1, R-Is a negative sample set, i.e., a set labeled 0.
The second part is a loss function of a knowledge graph representing learning Trans R training part, wherein edge loss Margin loss is selected, the input sample pair is a positive sample set S and a negative sample set S' which are acquired from a training set, the difference between scores of positive and negative samples is larger than a threshold value gamma, and the target function is as follows:
where max (x, y) represents the maximum of x and y obtained and γ represents the threshold, set to 1.0. The third part is a regularization loss function for preventing model overfitting, and comprises two parts of parameters of interactive prediction and knowledge graph representation learning Trans R training, which are defined as follows:
Lreg=LKG_reg+LRec_reg=λ(||θKG||2+||θRec||2)
hyper-parametric lambda control L2The strength of regularization was set to 10-7,θKGAnd thetaRecModel parameters to prevent overfitting in separate table knowledge-graph training and cross-prediction, where θKGAn initialization vector representing the entities and relationships,
θRecincluding initialization vectors for the user and the item.
The loss function of the invention consists of the three parts, and the joint training is as follows: l ═ LRec+LKG+Lre
Step six: and (3) verifying the validity of the method: after model training is completed, in order to verify the effectiveness of the method, the method is carried out in a public data set Amazon-book: book, Last-FM: experiments were performed musically. After the interactive prediction scores of the target users for the items to be recommended are obtained, the Top 20, 40,60,80 and 100 items with the highest scores are selected for each target user to form a Top-N personalized recommendation list. Evaluation indexes used in the experiment are Recall (correct Rate for all positive samples), Precision (correct Rate for Top-N list), Hit Rate (check probability of hitting any positive sample in Top-N list), and Normalized dispersed relationship Gain (NDCG focuses on the position where a positive sample appears in the Top-N recommendation list, and NDCG increases the further forward the position). Table 1 shows the performance of the method of the invention on two public data sets.
Table one: experimental performance display of the method on Amazon-book and Last-FM data sets
Claims (7)
1. A method for extracting multi-dimensional user interest based on knowledge graph assistance is characterized by comprising the following steps: the method comprises the following steps:
s1, preprocessing data: obtaining a suitable data set comprising a user ID, an item ID, and a tag of a user-item interaction; acquiring a knowledge graph file, enabling the ID of the article to correspond to the entities in the knowledge graph one by one, and connecting different entities by using a triple of a head entity-relation-tail entity; the proportion of positive samples in the training set and the test set of the recommendation and prediction part is 4:1, and the knowledge graph shows that the proportion of the positive samples and the negative samples in the training set of the learning task and the recommendation and prediction task is 1: 1;
s2, model input and Trans R training: acquiring an entity set corresponding to the historical article interacted by the user on the knowledge graph in the data set, and taking the entity set and the target article to be recommended as the original input of the model; using a representation learning method Trans R to learn the knowledge structure, namely mapping the entity to different relation spaces for comparison;
s3, extracting multi-dimensional interest of the user: according to the entity set interacted with the user history, extracting the entities interacted with the user from the entities connected by a relationship to obtain the interest expression of the user in the item attribute (relationship) space; the method comprises the following steps of (1) acquiring the common relation of a knowledge graph, namely the attributes of the articles, extracting corresponding user interest expressions in each attribute, and finally obtaining the multi-dimensional interest expressions of the users;
s4, obtaining the prediction score according to the maximum interest response: after the multi-dimensional interest expression of the user is obtained, firstly, mapping the to-be-recommended articles to different article attribute spaces by using different relation matrixes, then, performing inner product with the corresponding interest of the user in the space, and selecting the maximum value as a final prediction score;
s5, optimization method and loss function: the recommended prediction part adopts cross entropy loss log loss, the Trans R algorithm part adopts edge loss Margin loss, and the two parts of initialization parameters are normalized by L2, so that a total loss function is formed by the three parts of initialization parameters;
s6, generating a recommendation list: and after the predicted user-item interaction scores are obtained, all the items are sorted in a descending order according to the scores for each user, and the top N items are generated into a list to be recommended to the user.
2. The method for extracting the multi-dimensional interest of the user based on knowledge graph assistance as claimed in claim 1, wherein the user-item interaction record is recorded in a hidden feedback manner: if the interaction of the data set is in a scoring form, the scored article with the score being greater than or equal to 4 is scored as a label of 1 and is scored as a positive sample, and the rest is 0 and is scored as a negative sample; for each user, the positive sample ratio of the training set to the test set is 4:1, and the positive sample ratio of the training set to the test set is 1: 1.
3. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
the input to the model is a set of historical item entities that have been interacted with by a userWhereinIs a historical item set interacted by a user and a target item to be recommendedThe articles are in one-to-one correspondence with the entities in the knowledge graph; first, to represent the structured knowledge, an undirected graph G ═ (V, R) is used, where V ═ V1,v2...vNRepresents the set of entities in the knowledge-graph, N represents the total number of entities, R ═ R1,r2...rMExpressing a relation set in the knowledge graph, and expressing the total number of relations by M; in the Trans R algorithm, for each triplet (v)h,r,vt),vhAnd vtIs two connected head and tail entities, r represents the relationship type between the two entities; firstly, v ishAnd vtMapping to a particular space vh,vt∈RkK is the dimension of the mapped vector, the value is 64, and | h | calculation2≤1,||t||2≤1,r∈RdD is the dimension of the mapped vector, the value is 128, and | r | | y calculation2Less than or equal to 1; then setting a conversion matrix M for the current relationshipr∈Rk·dIt can convert the entity into the corresponding relationship space, specifically:the scoring function for this one triplet is defined as:
4. the method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
the multi-dimensional interest extraction layer of the user obtains the interest expression of the user in different article attribute spaces: i isu=fextractor(VuG ═ V, R)); assuming that the used knowledge graph is complete, the relationship in the knowledge graph is the division of the attribute space of the class of articles, and the number of relationships in the knowledge graph can be regarded as that the class of articles have the same number of attribute spaces under the current cognition; taking a history entity interacted by a user as a central point, sequentially acquiring interest expressions of the user in different attribute spaces of the article according to different relations, wherein a specific calculation formula is as follows:
......
and has r1≠r2≠...≠rM,r1,r2...rMThe e belongs to R, only the information of hop1 in the knowledge graph is used, the information aggregation range is not expanded, and when the interest expression of the user in different attribute spaces of the article is obtained, the historical interactive article is also calculated in the interest expression according to different relations.
5. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
after the interest expressions of the users on different attribute spaces of the articles are obtained, in order to obtain the most accurate interactive prediction score, the articles to be recommended are respectively mapped to the spaces with the same multidimensional interest of the users for comparison, and the specific calculation formula is as follows:
whereinIs the entity characteristic expression of the item to be recommended in the knowledge graph,is a mapping matrix of any relation, and M is the number of the relations in the knowledge graph; after the expressions of the to-be-recommended articles in different attribute spaces are obtained, inner products are respectively made with the corresponding interest characteristics of the user in the space, the maximum value is selected as the final prediction score, and the calculation formula is as follows:
6. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
the loss function comprises three parts, wherein the first part is the loss function of the interactive prediction part; the point method converts the recommendation problem into a two-class task, minimizing the following objective function:
where δ (-) is a sigmoid function, controlling the prediction score between 0 and 1, R+Is a positive sample set, i.e. a set labeled 1, R-Is a negative sample set, i.e., a set labeled 0;
the second part is a loss function of a knowledge graph representing learning Trans R training part, wherein edge loss Margin loss is selected, the input sample pair is a positive sample set S and a negative sample set S' which are acquired from a training set, the difference between scores of positive and negative samples is larger than a threshold value gamma, and the target function is as follows:
where max (x, y) represents the maximum of x and y obtained, γ represents the threshold, set to 1.0;
the third part is a regularization loss function for preventing model overfitting, and comprises two parts of parameters of interactive prediction and knowledge graph representation learning Trans R training, which are defined as follows:
Lreg=LKG_reg+LRec_reg=λ(||θKG||2+||θRec||2)
hyper-parametric lambda control L2The strength of regularization was set to 10-7,θKGAnd thetaRecModel parameters to prevent overfitting in separate table knowledge-graph training and cross-prediction, where θKGInitialization vectors, θ, representing entities and relationshipsRecAn initialization vector comprising a user and an item;
the loss function consists of the above three parts, joint training: l ═ LRec+LKG+Lreg。
7. The method for extracting multi-dimensional interest of users based on knowledge graph assistance as claimed in claim 1,
and after the maximum interest response layer outputs the interactive prediction scores of all the articles to be recommended of the user, for each user, performing descending sorting on all the articles according to the scores, and generating a list of the first N articles to be recommended to the user.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110674480.XA CN113407834A (en) | 2021-06-18 | 2021-06-18 | Knowledge graph-assisted user multi-dimensional interest extraction method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110674480.XA CN113407834A (en) | 2021-06-18 | 2021-06-18 | Knowledge graph-assisted user multi-dimensional interest extraction method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113407834A true CN113407834A (en) | 2021-09-17 |
Family
ID=77684880
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110674480.XA Pending CN113407834A (en) | 2021-06-18 | 2021-06-18 | Knowledge graph-assisted user multi-dimensional interest extraction method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113407834A (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113822776A (en) * | 2021-09-29 | 2021-12-21 | 中国平安财产保险股份有限公司 | Course recommendation method, device, equipment and storage medium |
CN113923006A (en) * | 2021-09-30 | 2022-01-11 | 北京淇瑀信息科技有限公司 | Equipment data authentication method and device and electronic equipment |
CN114254194A (en) * | 2021-12-20 | 2022-03-29 | 北京工业大学 | Attention-based multi-interest sequence recommendation method and system |
CN114861059A (en) * | 2022-05-18 | 2022-08-05 | 北京百度网讯科技有限公司 | Resource recommendation method and device, electronic equipment and storage medium |
CN114936328A (en) * | 2022-04-22 | 2022-08-23 | 北京工业大学 | Graph convolution interest decoupling method based on Transformer model |
CN116628179A (en) * | 2023-05-30 | 2023-08-22 | 道有道科技集团股份公司 | User operation data visualization and man-machine interaction recommendation method |
CN117117923A (en) * | 2023-10-19 | 2023-11-24 | 深圳市百酷新能源有限公司 | Big data-based energy storage control grid-connected management method and system |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110955834A (en) * | 2019-11-27 | 2020-04-03 | 西北工业大学 | Knowledge graph driven personalized accurate recommendation method |
CN111523029A (en) * | 2020-04-20 | 2020-08-11 | 浙江大学 | Personalized recommendation method based on knowledge graph representation learning |
US20210027178A1 (en) * | 2019-07-26 | 2021-01-28 | Ricoh Company, Ltd. | Recommendation method and recommendation apparatus based on deep reinforcement learning, and non-transitory computer-readable recording medium |
CN112328908A (en) * | 2020-11-11 | 2021-02-05 | 北京工业大学 | Personalized recommendation method based on collaborative filtering |
CN112765486A (en) * | 2021-01-22 | 2021-05-07 | 重庆邮电大学 | Knowledge graph fused attention mechanism movie recommendation method |
CN112800334A (en) * | 2021-02-04 | 2021-05-14 | 河海大学 | Collaborative filtering recommendation method and device based on knowledge graph and deep learning |
-
2021
- 2021-06-18 CN CN202110674480.XA patent/CN113407834A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20210027178A1 (en) * | 2019-07-26 | 2021-01-28 | Ricoh Company, Ltd. | Recommendation method and recommendation apparatus based on deep reinforcement learning, and non-transitory computer-readable recording medium |
CN110955834A (en) * | 2019-11-27 | 2020-04-03 | 西北工业大学 | Knowledge graph driven personalized accurate recommendation method |
CN111523029A (en) * | 2020-04-20 | 2020-08-11 | 浙江大学 | Personalized recommendation method based on knowledge graph representation learning |
CN112328908A (en) * | 2020-11-11 | 2021-02-05 | 北京工业大学 | Personalized recommendation method based on collaborative filtering |
CN112765486A (en) * | 2021-01-22 | 2021-05-07 | 重庆邮电大学 | Knowledge graph fused attention mechanism movie recommendation method |
CN112800334A (en) * | 2021-02-04 | 2021-05-14 | 河海大学 | Collaborative filtering recommendation method and device based on knowledge graph and deep learning |
Non-Patent Citations (3)
Title |
---|
侯位昭 等: "融合知识图谱及贝叶斯网络的智能推荐方法", 中国电子科学研究院学报, no. 05, 20 May 2020 (2020-05-20) * |
杨晋吉 等: "一种知识图谱的排序学习个性化推荐算法", 小型微型计算机系统, no. 11, 15 November 2018 (2018-11-15) * |
许鹏程 等: "数据驱动下数字图书馆用户画像模型构建", 图书情报工作, 20 February 2019 (2019-02-20) * |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113822776A (en) * | 2021-09-29 | 2021-12-21 | 中国平安财产保险股份有限公司 | Course recommendation method, device, equipment and storage medium |
CN113822776B (en) * | 2021-09-29 | 2023-11-03 | 中国平安财产保险股份有限公司 | Course recommendation method, device, equipment and storage medium |
CN113923006A (en) * | 2021-09-30 | 2022-01-11 | 北京淇瑀信息科技有限公司 | Equipment data authentication method and device and electronic equipment |
CN113923006B (en) * | 2021-09-30 | 2024-02-02 | 北京淇瑀信息科技有限公司 | Equipment data authentication method and device and electronic equipment |
CN114254194A (en) * | 2021-12-20 | 2022-03-29 | 北京工业大学 | Attention-based multi-interest sequence recommendation method and system |
CN114254194B (en) * | 2021-12-20 | 2024-04-26 | 北京工业大学 | Attention-based multi-interest sequence recommendation method and system |
CN114936328A (en) * | 2022-04-22 | 2022-08-23 | 北京工业大学 | Graph convolution interest decoupling method based on Transformer model |
CN114861059A (en) * | 2022-05-18 | 2022-08-05 | 北京百度网讯科技有限公司 | Resource recommendation method and device, electronic equipment and storage medium |
CN116628179A (en) * | 2023-05-30 | 2023-08-22 | 道有道科技集团股份公司 | User operation data visualization and man-machine interaction recommendation method |
CN116628179B (en) * | 2023-05-30 | 2023-12-22 | 道有道科技集团股份公司 | User operation data visualization and man-machine interaction recommendation method |
CN117117923A (en) * | 2023-10-19 | 2023-11-24 | 深圳市百酷新能源有限公司 | Big data-based energy storage control grid-connected management method and system |
CN117117923B (en) * | 2023-10-19 | 2024-04-05 | 深圳市百酷新能源有限公司 | Big data-based energy storage control grid-connected management method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113407834A (en) | Knowledge graph-assisted user multi-dimensional interest extraction method | |
CN110750656B (en) | Multimedia detection method based on knowledge graph | |
Joy et al. | Review and classification of content recommenders in E-learning environment | |
Nie et al. | Data-driven answer selection in community QA systems | |
CN109947987B (en) | Cross collaborative filtering recommendation method | |
CN113158023B (en) | Public digital life accurate classification service method based on mixed recommendation algorithm | |
CN110110225B (en) | Online education recommendation model based on user behavior data analysis and construction method | |
CN108897791B (en) | Image retrieval method based on depth convolution characteristics and semantic similarity measurement | |
CN111125540A (en) | Recommendation method integrating knowledge graph representation learning and bias matrix decomposition | |
Tran | A survey of machine learning and data mining techniques used in multimedia system | |
CN112328832B (en) | Movie recommendation method integrating labels and knowledge graph | |
CN112328908B (en) | Personalized recommendation method based on collaborative filtering | |
Venkatraman et al. | MapReduce neural network framework for efficient content based image retrieval from large datasets in the cloud | |
Wang et al. | An enhanced multi-modal recommendation based on alternate training with knowledge graph representation | |
Ji et al. | Semantic softmax loss for zero-shot learning | |
de Ves et al. | Modeling user preferences in content-based image retrieval: A novel attempt to bridge the semantic gap | |
CN116578729B (en) | Content search method, apparatus, electronic device, storage medium, and program product | |
Zhu | A book recommendation algorithm based on collaborative filtering | |
CN116484067A (en) | Target object matching method and device and computer equipment | |
Guo et al. | Getting more from one attractive scene: Venue retrieval in micro-videos | |
CN115757897A (en) | Intelligent culture resource recommendation method based on knowledge graph convolution network | |
Loor et al. | Handling subjective information through augmented (fuzzy) computation | |
Wang et al. | Training-free indexing refinement for visual media via multi-semantics | |
Ma et al. | Learning resource recommendation via knowledge graphs and learning style clustering | |
Beibei | Automatic Integration Algorithm of Vocal Performance Learning Materials Based on Multidimensional Association Rules |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |