CN107679242A - Merge the label recommendation method that multiple information sources Coupling Tensor is decomposed - Google Patents

Merge the label recommendation method that multiple information sources Coupling Tensor is decomposed Download PDF

Info

Publication number
CN107679242A
CN107679242A CN201711040886.2A CN201711040886A CN107679242A CN 107679242 A CN107679242 A CN 107679242A CN 201711040886 A CN201711040886 A CN 201711040886A CN 107679242 A CN107679242 A CN 107679242A
Authority
CN
China
Prior art keywords
label
labels
similarity
matrix
tag
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711040886.2A
Other languages
Chinese (zh)
Other versions
CN107679242B (en
Inventor
杨忆
韩立新
刘元珍
勾智楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hohai University HHU
Original Assignee
Hohai University HHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hohai University HHU filed Critical Hohai University HHU
Priority to CN201711040886.2A priority Critical patent/CN107679242B/en
Publication of CN107679242A publication Critical patent/CN107679242A/en
Application granted granted Critical
Publication of CN107679242B publication Critical patent/CN107679242B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

The invention discloses a kind of label recommendation method for merging multiple information sources Coupling Tensor and decomposing, first, decomposed simultaneously in the tensor CP of label resources user triplet information construction, label is with the addition of to participate in combining decomposition with three resource, label and user auxiliary information matrixes with label, label, and when building label and label similarity matrix, the present invention considers the Semantic Similarity of cooccurrence relation and label in WordNet between label simultaneously, and similarity measurement between final label is used as using the linear Integrated of two kinds of similitudes.Secondly, after the loss function of Construct question, parameter optimization is carried out to object function with ADMM algorithms.Finally according to decompose completion prediction tensor more accurately to(User, resource)To recommending Top N labels.The present invention has merged the same Heterogeneous Information of label, applied to having versatility on each socialization labeling system.

Description

Label recommendation method fusing multi-information source coupling tensor decomposition
Technical Field
The invention relates to a label recommendation method fusing multi-information source coupling tensor decomposition, and belongs to the technical field of computer network labeling.
Background
With the increasing development of Web2.0 websites, information on the Web is rapidly growing at an alarming rate, and the information growth rate far exceeds the processing capacity of people. Recommendation systems have now played an increasingly important role in the efficient processing of information. Social annotation systems, which are typical applications of recommendation systems, such as last. Fm for sharing music, flicker for sharing pictures, and dericious for sharing bookmarks, are rapidly developed, and in these social annotation systems, users actively generate tags and identify, manage, and discover information resources through the tags. The recommendation of the label is a research hotspot of the current labeling system, and aims to reduce the burden of a user and help the user to select a proper label to finish labeling operation. Unlike the traditional recommendation system which only processes user-resource (user-item), the social annotation system needs to process three dimensions of user-tag-resource (user-tag-item), so that the model which only considers binary relation is no longer suitable for the social annotation system containing ternary relation of user, resource and tag. Meanwhile, tensor models have become popular methods for studying potential association between high-order data, so more and more students begin studying label recommendation based on tensor decomposition models. However, the existing tensor decomposition method still faces the problems of extreme sparseness of data, missing values and overfitting in the label recommendation process.
In view of the above three problems, the existing label recommendation method based on tensor decomposition only utilizes isomorphic auxiliary information between labels, and does not consider utilizing heterogeneous auxiliary information between label-resources and label-users.
Disclosure of Invention
The purpose is as follows: with the popularization of the application of a social labeling system, the invention provides the label recommendation method integrating the multi-information source coupling tensor decomposition, in order to overcome the defect that the conventional label recommendation method based on the tensor decomposition only utilizes isomorphic information between labels and improve the accuracy of label recommendation.
The technical scheme is as follows: in order to solve the technical problems, the technical scheme adopted by the invention is as follows:
a label recommendation method fusing multi-information source coupling tensor decomposition comprises the following specific steps:
step1, constructing a label similarity matrix B based on linear integration of two similarity measures of label co-occurrence and semantic correlation;
step2: constructing a Tag-resource, tag-Item matrix C;
step3: constructing a label-resource, tag-User matrix D;
step4: constructing a model and optimizing a parameter algorithm;
step5: and (5) recommending the label.
Preferably, step1 comprises the following steps:
step1.1 calculates the tag co-occurrence similarity:
let t be i And t j Two tags in the data set of the tag similarity matrix B are used, and the method for measuring the co-occurrence similarity between the two tags is shown in formula 1:
|t i ∩t j | denotes t i And t j Number of commonly labeled resources, | t i ∪t j | denotes t i And t j The sum of the annotated resources;
step1.2 computes tag semantic similarity:
calculating the semantic similarity of the labels according to the semantic similarity of the labels in WordNet, and collecting the data set by using two labels t i And t j Semantic similarity is shown in equation 2:
wherein LCS is t i And t j The minimum common super concept of (c), depth (LCS) is the number of nodes from LCS to the classification root; n1 is from t i Number of nodes on the path to LCS, N2 being from t j The number of nodes on the path to the LCS;
step1.3 integrates similarity and semantics as the final similarity:
step1.1 and Step1.2 respectively acquire the mutual correlation between the co-occurrence similarity of the labels and the semantic hierarchy of the labels, and in order to complement the two ways, the similarity calculation between the two labels is combined as t i And t j Final similarity:
B i,j =γ×cooccurrence_B i,j +(1-γ)×senmantic_B i,j ,γ∈[0,1] (3)
step1.4 tag map Laplacian as the regularization term:
the assumption of label graph regularization is that if label i and label j are similar, then the implicit eigen factor matrix U of the labels mined by the tensor decomposition process (1) Implicit feature row vectors for Medium tag i and tag jAndwill be very close;
wherein, B i,j Calculating final similarity values of the label i and the label j in Step 1.3;implicit characteristic factor matrix U for labels (1) The ith row and the d th column elements,implicit characteristic factor matrix U representing labels (1) The whole D column, L is called a graph Laplacian matrix, L = D-B, D is a diagonal matrix, the ith value of the diagonal element of D is the element sum of the similarity matrix B corresponding to the ith row, namely D ii =∑ j B i,j (ii) a And tr () denotes the trace of the matrix.
Preferably, step2 comprises the following steps:
all labels in a training data set of the social annotation system are regarded as a document set, and the labels t h Annotating a resource v j The weight of (A) is:
where Num (h, j) represents the tag t h The number of times of occurrence in the whole label set, M is the total number of resources in the system, d hj Indicating the number of resources tagged with the label.
Preferably, step3 comprises the following steps:
and (4) regarding all labels in the training data set of the social annotation system as a document set, and then, regarding the labels t k Is user u i The weights used are:
where Num (k, i) represents the label t k The number of times of concentrated appearance of the whole label, N is the total number of users in the system, W ki Indicating the number of users tagged with the label.
Preferably, step4 comprises the following steps:
step4.1: the formalized label recommendation is the basic CP tensor decomposition model:
set U (1) ,U (2) ,U (3) Respectively representing implicit characteristic factor matrixes corresponding to a Tag (Tag), a resource (Item) and a User (User); whereink is the rank of tensor, and k < min (| Tag |, | Item |, | User |), the label recommendation model based on CP tensor decomposition is defined as the following constrained optimization problem:
wherein,the method includes constructing an initial tensor by using a training data set of a socialized marking system, wherein lambda is a regularization parameter, | | | calting F Is L 2 The number of the norm is calculated,for the Tikhonov regularization term, prevent the target expression from overfitting and provide a unique solution,representative table and watchNon-negative indicator tensor of the same size ifWith the observed annotated behavior value, thenOtherwise equal to Representing an observed annotated behavior value;
step4.2: and (3) integrating auxiliary information of the Step1-Step3 structures as regularization items and adding the regularization items to the model constructed in the Step4.1, and converting the model in the Step4.1 to solve the following constraint optimization problem:
wherein alpha controls the weight of the auxiliary information of the similarity between the labels participating in the decomposition, and beta controls the weight of the label-resource matrix C and the label-user matrix D as the weight of the auxiliary information participating in the decomposition;
because the formula (8) has no closed solution, the invention uses ADMM, alternating Direction Method of Multipliers algorithm to optimize the objective function solution; the objective function formula (8) is written in the form of a partial lagrangian function as formula (9):
wherein Z (n) ≧ 0,n =1,2,3 is a temporary variable introduced for solving equation (9), Y (n) (n =1,2, 3) is a Lagrangian multiplier matrix, η is a penalty parameter,<*,*&gt is inner product operation;
step 4.3: ADMM optimization target expression formula (9):
the invention adopts an iteration scheme to solve the formula (9) and updates the formula respectivelyThen subsequently update { Y (1) ,Y (2) ,Y (3) The method comprises the following specific steps:
(1) Updating Z (1) ,Z (2) ,Z (3)
To update Z (1) ,Z (2) ,Z (3) Only the following optimization problem needs to be solved, as shown in equation (10):
writing Z, Y, U in the form of a column vector, such as Z = [ ] ]Z 1 ...Z m ],Z i Is the ith column of Z, thus Z i ,Y i ,U i Are all column vectors; z (n) Equations (11) and (12) are effectively updated by solving the optimization problem:
obtaining by solution (11)
Obtaining by solution (12)
Wherein I is an identity matrix;
(2) Updating U (1) ,U (2) ,U (3)
To update U (1) ,U (2) ,U (3) Equation (9) is rewritten into equation (15):
wherein E (n) =(U (N) ⊙...U (n+1) ⊙U (n-1) ⊙LU (1) ) T | N=3 Filament as Khatri-Rao product, X (n) Is the tensorThe mode-n expansion matrix of (1); then get the information about U (n) The update formula of (c) is as follows:
(3) Updating
WhereinIs thatIs equal to
(4) Updating Y (n)
(5) Updating eta:
updating eta in an adaptive mode to accelerate the optimization algorithm, wherein eta is updated according to an equation (21):
η t+1 =min(ρη tmax ) (21)。
preferably, step5 comprises the following steps:
through the joint division of formula (9)After the solution, the compensated tensor can be obtained The middle element represents a quadruple (t, v, u, p), and p represents the possibility that a user u tags a resource v with t; the label can be recommended according to the weight corresponding to (u, v); and if N labels are to be recommended when the user u labels the resource v, selecting the labels corresponding to the N weights from high to low.
Has the advantages that: the label recommendation method fusing multi-information source coupling tensor decomposition, provided by the invention, comprises the steps of firstly constructing three auxiliary information matrixes of labels and labels, labels and resources and labels and users, optimizing parameters of a training model by using an ADMM algorithm after a problem model is constructed, calculating implicit eigenvectors of the labels, the users and the resources by using a coupling tensor-matrix decomposition method, and recommending the labels for the labeled users. The algorithm can effectively utilize the isomorphic and heterogeneous semantic information of the label, and effectively relieves the problems of data sparseness, missing values and overfitting; the method has universality and is suitable for being applied to various social labeling systems.
Drawings
FIG. 1 is a schematic diagram of a coupling tensor-matrix decomposition model;
FIG. 2 is a framework diagram of a tag recommendation algorithm;
FIG. 3 is an ADMM optimization model parameter algorithm diagram.
Detailed Description
The present invention will be further described with reference to the accompanying drawings.
As shown in fig. 1-2, a label recommendation method fusing multi-information source coupling tensor decomposition includes the following specific steps:
step1, constructing a label similarity matrix B based on linear integration of label co-occurrence and semantic correlation similarity measurement;
if two resources are labeled with similar labels, the two resources are likely to have similar implicit eigenvectors, so the coupling tensor-matrix decomposition process can be regularized by label information.
Step1.1 calculates the tag co-occurrence similarity:
let t i And t j Two tags in the data set of the tag similarity matrix B are used, and the method for measuring the co-occurrence similarity between the two tags is shown in formula 1:
|t i ∩t j i denotes t i And t j Number of commonly labeled resources, | t i ∪t j I denotes t i And t j The sum of the annotated resources;
step1.2 computes tag semantic similarity:
calculating the semantic similarity of the labels according to the semantic similarity of the labels in WordNet, and collecting two labels t in the data set i And t j Semantic similarity is shown in equation 2:
wherein LCS is t i And t j The depth (LCS) is the number of nodes from the LCS to the classification root; n1 is from t i Number of nodes on the path to LCS, N2 being from t j The number of nodes on the path to the LCS;
step1.3 integrates similarity and semantics as the final similarity:
step1.1 and step1.2 obtain mutual correlation between the co-occurrence similarity of the labels and the semantic hierarchy of the labels respectively, and in order to complement the two ways, the similarity calculation between the two labels is combined as t i And t j Final similarity:
B i,j =γ×cooccurrence_B i,j +(1-γ)×senmantic_B i,j ,γ∈[0,1] (3)
step1.4 tag map Laplacian as the regularization term:
the main assumption for regularization of the label graph is that if label i and label j are similar, then the implicit eigen factor matrix U of the labels mined by the tensor decomposition process (1) Implicit feature row vectors for middle label i and label jAndbut also very close.
Wherein, B i,j Calculating final similarity values of the label i and the label j in Step 1.3;implicit characteristic factor matrix U for labels (1) The ith row and the d th column elements,implicit characteristic factor matrix U representing a label (1) The whole D column, L is called a Laplacian matrix, L = D-B, D is a diagonal matrix, the ith value of the diagonal elements of D is the sum of the elements of the ith row corresponding to the similarity matrix B, namely D ii =∑ j B i,j (ii) a And tr () denotes the trace of the matrix.
Step2: constructing a Tag-resource (Tag-Item) matrix C:
all labels in the training data set of the socialized annotation system are regarded as textArchives, labels t h Annotating a resource v j The weight of (A) is:
where Num (h, j) represents the tag t h The number of times of occurrence in the whole label set, M is the total number of resources in the system, d hj Representing the number of resources labeled by the label;
step3: constructing a Tag-resource (Tag-User) matrix D:
and (4) regarding all labels in the training data set of the social annotation system as a document set, and then, regarding the labels t k By user u i The weights used are:
where Num (k, i) represents the tag t k The number of times of concentrated appearance of the whole label, N is the total number of users in the system, W ki Representing the number of users labeled by the label;
step4: model construction and parameter optimization algorithm:
the Step4.1 formalized label recommendation is the basic CP tensor decomposition model:
set U (1) ,U (2) ,U (3) Respectively, the implicit characteristic factor matrixes corresponding to the label (Tag), the resource (Item) and the User (User). Whereink is rank of tensor, k < min (| Tag |, | Item |, | User |), then the Tag recommendation model based on CP tensor decomposition can be defined as the following constrained optimization problem:
wherein,the method comprises constructing an initial tensor by utilizing a training data set of a socialized labeling system, wherein lambda is a regularization parameter, | | | | computation F Is L 2 The norm of the number of the first-order-of-arrival,for the Tikhonov regularization term, prevent the target expression from overfitting and provide a unique solution,representative table and watchTensor of non-negative indicator of the same size, ifWith observed annotated behavior valuesOtherwise equal to Representing an observed annotated behavior value;
step4.2 integrates the auxiliary information of the Step1-Step3 structure as a regularization item and adds the regularization item to the model constructed in the Step4.1, and the model in the Step4.1 is transformed to solve the following constraint optimization problem:
wherein alpha controls the weight of the auxiliary information participating in the decomposition of the similarity between the labels, and beta controls the weight of the label-resource matrix C and the label-user matrix D as the weight of the auxiliary information participating in the decomposition.
Since equation (8) has no closed-form solution. The invention uses ADMM (Alternating Direction Method of Multipliers) algorithm to optimize the objective function solution. The objective function (8) is written in the form of a partial Lagrangian function as shown in equation (9):
wherein Z (n) ≧ 0,n =1,2,3 is a temporary variable introduced for the resolution of formula (9), Y (n) (n =1,2, 3) is a Lagrangian multiplier matrix, η is a penalty parameter,<*,*&gt, is the inner product operation.
As shown in fig. 3, step 4.3: ADMM optimizes the target expression:
the invention adopts an iterative scheme to solve the formula (9) and respectively updatesThen subsequently update { Y (1) ,Y (2) ,Y (3) The method comprises the following specific steps:
(1) Updating Z (1) ,Z (2) ,Z (3)
To update Z (1) ,Z (2) ,Z (3) Only the following optimization problem needs to be solved, as shown in equation (10):
writing Z, Y, U into column vector form, e.g. Z = [ Z ] 1 ...Z m ],Z i Is the ith column of Z, thus Z i ,Y i ,U i Are all column vectors. Z is a linear or branched member (n) Can be efficiently updated by solving the optimization problems (11) and (12):
obtaining by solution (11)
Obtaining by solution (12)
Where I is the identity matrix.
(2) Updating U (1) ,U (2) ,U (3) .
To update U (1) ,U (2) ,U (3) Equation (9) is rewritten into equation (15):
wherein E (n) =(U (N) ⊙...U (n+1) ⊙U (n-1) ⊙LU (1) ) T | N=3 Filament as Khatri-Rao product, X (n) Is the tensorThe mode-n expansion matrix. Then get the information about U (n) The update formula of (2) is as follows:
(3) Updating
WhereinIs thatIs equal to
(4) Updating Y (n)
(5) Updating eta
The eta acceleration optimization algorithm can be updated adaptively, and eta is updated according to the formula (21).
η t+1 =min(ρη tmax ) (21)
Step5 tag recommendation
After the joint decomposition of the formula (9), the compensated tensor can be obtained The middle element represents a quadruple (t, v, u, p), and p represents the possibility that a user u tags a resource v with t; the labels can be recommended according to the corresponding weight of (u, v); and if N labels are to be recommended when the user u labels the resource v, selecting the labels corresponding to the N weights from high to low.
The above description is only of the preferred embodiments of the present invention, and it should be noted that: it will be apparent to those skilled in the art that various modifications and adaptations can be made without departing from the principles of the invention and these are intended to be within the scope of the invention.

Claims (6)

1. A label recommendation method fusing multi-information source coupling tensor decomposition is characterized by comprising the following steps: the method comprises the following specific steps:
step1, constructing a label similarity matrix B based on linear integration of two similarity measures of label co-occurrence and semantic correlation;
step2: constructing a Tag-resource, tag-Item matrix C;
step3: constructing a label-resource, tag-User matrix D;
step4: constructing a model and optimizing a parameter algorithm;
step5: and (4) recommending the label.
2. The label recommendation method fusing multi-information source coupling tensor decomposition according to claim 1, wherein: the Step1 comprises the following steps:
step1.1 calculate tag co-occurrence similarity:
let t be i And t j Two tags in the data set of the tag similarity matrix B are used, and the method for measuring the co-occurrence similarity between the two tags is shown in formula 1:
|t i ∩t j i denotes t i And t j Number of commonly-labeled resources, | t i ∪t j | denotes t i And t j The sum of the annotated resources;
step1.2 computes tag semantic similarity:
calculating the semantic similarity of the labels according to the semantic similarity of the labels in WordNet, and collecting the data set by using two labels t i And t j Semantic faciesSimilarity is shown in formula 2:
wherein LCS is t i And t j The depth (LCS) is the number of nodes from the LCS to the classification root; n1 is from t i Number of nodes on the path to LCS, N2 being from t j The number of nodes on the path to the LCS;
step1.3 integrates similarity and semantics as the final similarity:
step1.1 and Step1.2 respectively acquire the mutual correlation between the co-occurrence similarity of the labels and the semantic hierarchy of the labels, and in order to complement the two ways, the similarity calculation between the two labels is combined as t i And t j Final similarity:
B i,j =γ×cooccurrence_B i,j +(1-γ)×senmantic_B i,j ,γ∈[0,1] (3)
the Step1.4 label graph Laplacian is used as a regularization term:
the assumption of label graph regularization is that if label i and label j are similar, then the implicit eigen factor matrix U of the labels mined by the tensor decomposition process (1) Implicit feature row vectors for middle label i and label jAndwill be very close;
wherein, B i,j Calculating final similarity values of the label i and the label j in Step 1.3;is a labelImplicit characteristic factor matrix U of (1) The ith row and the d th column elements,implicit characteristic factor matrix U representing labels (1) The whole D column, L is called a graph Laplacian matrix, L = D-B, D is a diagonal matrix, the ith value of the diagonal element of D is the element sum of the similarity matrix B corresponding to the ith row, namely D ii =∑ j B i,j (ii) a And tr () denotes the trace of the matrix.
3. The label recommendation method for merging multiple information source coupling tensor decompositions as recited in claim 1, wherein: the Step2 comprises the following steps:
all labels in a training data set of the social annotation system are regarded as a document set, and the labels t h Annotating a resource v j The weight of (A) is:
where Num (h, j) represents the tag t h The number of times of occurrence in the whole label set, M is the total number of resources in the system, d hj Indicating the number of resources tagged with the label.
4. The label recommendation method for merging multiple information source coupling tensor decompositions as recited in claim 1, wherein: the Step3 comprises the following steps:
and (4) regarding all labels in the training data set of the social annotation system as a document set, and then, regarding the labels t k Is user u i The weights used are:
where Num (k, i) represents the tag t k The number of times of concentrated appearance of the whole label, N is the total number of users in the system, W ki Indicating the labelling byThe number of the households.
5. The label recommendation method fusing multi-information source coupling tensor decomposition according to claim 1, wherein: the Step4 comprises the following steps:
step4.1: the formalized label recommendation is the basic CP tensor decomposition model:
set U (1) ,U (2) ,U (3) Respectively representing implicit characteristic factor matrixes corresponding to a Tag (Tag), a resource (Item) and a User (User); whereink is the rank of tensor, and k < min (| Tag |, | Item |, | User |), the label recommendation model based on CP tensor decomposition is defined as the following constrained optimization problem:
wherein,the method includes constructing an initial tensor by using a training data set of a socialized marking system, wherein lambda is a regularization parameter, | | | calting F Is L 2 The norm of the number of the first-order-of-arrival,for the Tikhonov regularization term, prevent the target expression from overfitting and provide a unique solution,representative table and watchThe same applies toTensor of non-negative indicator of size, ifWith the observed annotated behavior value, thenOtherwise equal to Representing observed annotated behavior values;
step4.2: and (3) integrating auxiliary information of the Step1-Step3 structures as regularization items and adding the regularization items to the model constructed in the Step4.1, and converting the model in the Step4.1 to solve the following constraint optimization problem:
wherein alpha controls the weight of the auxiliary information of the similarity between the labels participating in the decomposition, and beta controls the weight of the label-resource matrix C and the label-user matrix D as the auxiliary information participating in the decomposition;
because the formula (8) has no closed solution, the invention uses ADMM, alternating Direction Method of Multipliers algorithm to optimize the objective function solution; the objective function formula (8) is written in the form of a partial lagrangian function as formula (9):
wherein Z (n) ≧ 0,n =1,2,3 is a temporary variable introduced for the resolution of formula (9), Y (n) (n =1,2, 3) is a Lagrangian multiplier matrix, η is a penalty parameter,<*,*&gt is inner product operation;
step 4.3: ADMM optimization target expression formula (9):
the invention adopts an iteration scheme to solve the formula (9) and updates the formula respectivelyThen subsequently update { Y (1) ,Y (2) ,Y (3) The method comprises the following specific steps:
(1) Updating Z (1) ,Z (2) ,Z (3)
To update Z (1) ,Z (2) ,Z (3) Only the following optimization problem needs to be solved, as shown in formula (10):
writing Z, Y, U into column vector form, e.g. Z = [ Z ] 1 ...Z m ],Z i Is the ith column of Z, thus Z i ,Y i ,U i Are all column vectors; z (n) Equations (11) and (12) are effectively updated by solving the optimization problem:
obtaining by solution (11)
Obtaining by solution (12)
Wherein I is an identity matrix;
(2) Updating U (1) ,U (2) ,U (3)
To update U (1) ,U (2) ,U (3) Equation (9) is rewritten to equation (15):
wherein E (n) =(U (N) ⊙...U (n+1) ⊙U (n-1) ⊙LU (1) ) T | N=3 Filament as Khatri-Rao product, X (n) Is the tensorThe mode-n expansion matrix of (2); then get the information about U (n) The update formula of (2) is as follows:
(3) Updating
WhereinIs thatIs equal to
(4) Updating Y (n)
(5) Updating eta:
updating eta in an adaptive mode to accelerate the optimization algorithm, wherein eta is updated according to an equation (21):
η t+1 =min(ρη tmax ) (21)。
6. the label recommendation method fusing multi-information source coupling tensor decomposition according to claim 1, wherein: the Step5 comprises the following steps:
after the joint decomposition of the formula (9), the compensated tensor is obtained The middle element represents a quadruple (t, v, u, p), and p represents the possibility that a user u tags a resource v with t; the label is recommended according to the weight corresponding to the (u, v); and if N labels are to be recommended when the user u labels the resource v, selecting the labels corresponding to the N weights from high to low.
CN201711040886.2A 2017-10-30 2017-10-30 Merge the label recommendation method that multiple information sources Coupling Tensor is decomposed Expired - Fee Related CN107679242B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711040886.2A CN107679242B (en) 2017-10-30 2017-10-30 Merge the label recommendation method that multiple information sources Coupling Tensor is decomposed

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711040886.2A CN107679242B (en) 2017-10-30 2017-10-30 Merge the label recommendation method that multiple information sources Coupling Tensor is decomposed

Publications (2)

Publication Number Publication Date
CN107679242A true CN107679242A (en) 2018-02-09
CN107679242B CN107679242B (en) 2018-07-27

Family

ID=61142512

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711040886.2A Expired - Fee Related CN107679242B (en) 2017-10-30 2017-10-30 Merge the label recommendation method that multiple information sources Coupling Tensor is decomposed

Country Status (1)

Country Link
CN (1) CN107679242B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108874998A (en) * 2018-06-14 2018-11-23 华东师范大学 A kind of dialog mode music recommended method indicated based on composite character vector
CN109345496A (en) * 2018-09-11 2019-02-15 中国科学院长春光学精密机械与物理研究所 A kind of image interfusion method and device of total variation and structure tensor
CN109360607A (en) * 2018-10-16 2019-02-19 武汉大学 A kind of the network evolution analysis method and device of dynamic gene regulation net
CN110097112A (en) * 2019-04-26 2019-08-06 大连理工大学 A kind of figure learning model based on reconstruct image
CN110209905A (en) * 2018-12-20 2019-09-06 腾讯科技(深圳)有限公司 Label recommendation method, device and readable medium
CN110362780A (en) * 2019-07-17 2019-10-22 北京航空航天大学 A kind of big data tensor canonical decomposition calculation method based on Shen prestige many-core processor
CN110555137A (en) * 2018-03-31 2019-12-10 华为技术有限公司 Label filling method and device
CN111931041A (en) * 2020-07-03 2020-11-13 武汉卓尔数字传媒科技有限公司 Label recommendation method and device, electronic equipment and storage medium
CN112307352A (en) * 2020-11-26 2021-02-02 腾讯科技(深圳)有限公司 Content recommendation method, system, device and storage medium
CN113051463A (en) * 2019-12-26 2021-06-29 中移物联网有限公司 Project pushing method and system
CN113796874A (en) * 2021-11-19 2021-12-17 北京大学 Multi-element electroencephalogram data identification and analysis method based on tensor decomposition model
CN117273287A (en) * 2023-11-23 2023-12-22 山东科技大学 Tensor-matrix coupling-based subway passenger flow prediction method
CN117874628A (en) * 2024-02-02 2024-04-12 北京大学 Aircraft overrun identification method based on multisource sensing information fusion

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100070851A1 (en) * 2008-09-17 2010-03-18 International Business Machines Corporation Method and system for providing suggested tags associated with a target web page for manipulation by a user
US20120265772A1 (en) * 2009-03-03 2012-10-18 Microsoft Corporation Media tag recommendation technologies
CN103729360A (en) * 2012-10-12 2014-04-16 腾讯科技(深圳)有限公司 Interest label recommendation method and system
CN105426550A (en) * 2015-12-28 2016-03-23 Tcl集团股份有限公司 Collaborative filtering tag recommendation method and system based on user quality model
CN105740430A (en) * 2016-01-29 2016-07-06 大连理工大学 Personalized recommendation method with socialization information fused
CN106599037A (en) * 2016-11-04 2017-04-26 焦点科技股份有限公司 Recommendation method based on label semantic normalization
CN106682151A (en) * 2016-12-23 2017-05-17 长沙理工大学 Personalized educational-resource recommendation method and system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100070851A1 (en) * 2008-09-17 2010-03-18 International Business Machines Corporation Method and system for providing suggested tags associated with a target web page for manipulation by a user
US20120265772A1 (en) * 2009-03-03 2012-10-18 Microsoft Corporation Media tag recommendation technologies
CN103729360A (en) * 2012-10-12 2014-04-16 腾讯科技(深圳)有限公司 Interest label recommendation method and system
CN105426550A (en) * 2015-12-28 2016-03-23 Tcl集团股份有限公司 Collaborative filtering tag recommendation method and system based on user quality model
CN105740430A (en) * 2016-01-29 2016-07-06 大连理工大学 Personalized recommendation method with socialization information fused
CN106599037A (en) * 2016-11-04 2017-04-26 焦点科技股份有限公司 Recommendation method based on label semantic normalization
CN106682151A (en) * 2016-12-23 2017-05-17 长沙理工大学 Personalized educational-resource recommendation method and system

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110555137A (en) * 2018-03-31 2019-12-10 华为技术有限公司 Label filling method and device
CN108874998A (en) * 2018-06-14 2018-11-23 华东师范大学 A kind of dialog mode music recommended method indicated based on composite character vector
CN108874998B (en) * 2018-06-14 2021-10-19 华东师范大学 Conversational music recommendation method based on mixed feature vector representation
CN109345496B (en) * 2018-09-11 2021-05-14 中国科学院长春光学精密机械与物理研究所 Image fusion method and device for total variation and structure tensor
CN109345496A (en) * 2018-09-11 2019-02-15 中国科学院长春光学精密机械与物理研究所 A kind of image interfusion method and device of total variation and structure tensor
CN109360607A (en) * 2018-10-16 2019-02-19 武汉大学 A kind of the network evolution analysis method and device of dynamic gene regulation net
CN109360607B (en) * 2018-10-16 2021-05-04 武汉大学 Network evolution analysis method and device of dynamic gene regulation and control network
CN110209905A (en) * 2018-12-20 2019-09-06 腾讯科技(深圳)有限公司 Label recommendation method, device and readable medium
US11734362B2 (en) 2018-12-20 2023-08-22 Tencent Technology (Shenzhen) Company Limited Tag recommending method and apparatus, computer device, and readable medium
CN110097112A (en) * 2019-04-26 2019-08-06 大连理工大学 A kind of figure learning model based on reconstruct image
CN110097112B (en) * 2019-04-26 2021-03-26 大连理工大学 Graph learning model based on reconstruction graph
CN110362780A (en) * 2019-07-17 2019-10-22 北京航空航天大学 A kind of big data tensor canonical decomposition calculation method based on Shen prestige many-core processor
CN113051463A (en) * 2019-12-26 2021-06-29 中移物联网有限公司 Project pushing method and system
CN111931041A (en) * 2020-07-03 2020-11-13 武汉卓尔数字传媒科技有限公司 Label recommendation method and device, electronic equipment and storage medium
CN112307352A (en) * 2020-11-26 2021-02-02 腾讯科技(深圳)有限公司 Content recommendation method, system, device and storage medium
CN112307352B (en) * 2020-11-26 2023-05-26 腾讯科技(深圳)有限公司 Content recommendation method, system, device and storage medium
CN113796874A (en) * 2021-11-19 2021-12-17 北京大学 Multi-element electroencephalogram data identification and analysis method based on tensor decomposition model
CN113796874B (en) * 2021-11-19 2022-01-28 北京大学 Multi-element electroencephalogram data identification and analysis method based on tensor decomposition model
CN117273287A (en) * 2023-11-23 2023-12-22 山东科技大学 Tensor-matrix coupling-based subway passenger flow prediction method
CN117273287B (en) * 2023-11-23 2024-03-08 山东科技大学 Tensor-matrix coupling-based subway passenger flow prediction method
CN117874628A (en) * 2024-02-02 2024-04-12 北京大学 Aircraft overrun identification method based on multisource sensing information fusion

Also Published As

Publication number Publication date
CN107679242B (en) 2018-07-27

Similar Documents

Publication Publication Date Title
CN107679242B (en) Merge the label recommendation method that multiple information sources Coupling Tensor is decomposed
WO2023097929A1 (en) Knowledge graph recommendation method and system based on improved kgat model
US11281969B1 (en) Artificial intelligence system combining state space models and neural networks for time series forecasting
CN110674323B (en) Unsupervised cross-modal Hash retrieval method and system based on virtual label regression
CN111724867B (en) Molecular property measurement method, molecular property measurement device, electronic apparatus, and storage medium
Chen et al. General functional matrix factorization using gradient boosting
CN114329232A (en) User portrait construction method and system based on scientific research network
CN109034853B (en) Method, device, medium and electronic equipment for searching similar users based on seed users
CN116644755B (en) Multi-task learning-based few-sample named entity recognition method, device and medium
CN111461157A (en) Self-learning-based cross-modal Hash retrieval method
Ke et al. TabNN: A universal neural network solution for tabular data
He et al. A general cross-domain recommendation framework via Bayesian neural network
Lu et al. Learning from multi-view multi-way data via structural factorization machines
CN112883289A (en) PMF recommendation method based on social trust and tag semantic similarity
CN114281950B (en) Data retrieval method and system based on multi-graph weighted fusion
CN113850616A (en) Customer life cycle value prediction method based on depth map neural network
Mi et al. Fast multi-view subspace clustering with balance anchors guidance
Zhang et al. A novel collaborative filtering algorithm and its application for recommendations in e-commerce
Lu et al. Research on adaptive algorithm recommendation system based on parallel data mining platform
CN108664607A (en) A kind of power telecom network quality of data method for improving based on transfer learning
Du et al. Multi-view semi-supervised web image classification via co-graph
CN111259176A (en) Cross-modal Hash retrieval method based on matrix decomposition and integrated with supervision information
CN117131222A (en) Semi-automatic labeling method and device based on open world large model
Sheng et al. HTDA: Hierarchical time-based directional attention network for sequential user behavior modeling
Fan et al. Model-based clustering based on variational learning of hierarchical infinite beta-liouville mixture models

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180727

Termination date: 20211030

CF01 Termination of patent right due to non-payment of annual fee