CN115080861A

CN115080861A - Neural collaborative filtering bidirectional recommendation method based on migration head and tail knowledge

Info

Publication number: CN115080861A
Application number: CN202210852889.0A
Authority: CN
Inventors: 韩悦; 夏彬; 骆冰清
Original assignee: Nanjing University of Posts and Telecommunications
Current assignee: Nanjing University of Posts and Telecommunications
Priority date: 2022-07-20
Filing date: 2022-07-20
Publication date: 2022-09-20

Abstract

The invention provides a neural collaborative filtering bidirectional recommendation method based on migration head-tail knowledge. By constructing a bidirectional nearest neighbor feature matrix, combining the bilateral nearest neighbor features of the object and the subject with the features of the object and the subject corresponding to the neural collaborative filtering embedding layer, continuously inputting the features into a neural network for training, and fully mining local feature information between the bidirectional object and the subject. The change of parameters of the neural collaborative filtering network in the process of learning object-subject and object-subject head items from a few-sample model to a many-sample model through a full connection layer F () network is migrated and applied to long-tail items, and the recommendation of bidirectional long-tail items is improved.

Description

Neural collaborative filtering bidirectional recommendation method based on migration head and tail knowledge

Technical Field

The invention belongs to the field of recommendation systems, and particularly relates to a neural collaborative filtering bidirectional recommendation method based on migration head and tail knowledge.

Background

The recommendation system is an information filtering system, finds the interests and hobbies of the users by analyzing the historical behavior data of the users and the characteristics of each user, and recommends the articles of interest for the users. The traditional recommendation system recommends articles to a user only by meeting the one-way preference of the user, such as shopping, singing, and the like, while the two-way recommendation system needs to consider the two-way preference relationship. In daily life, more and more scenes are needed for bidirectional recommendation, such as an online dating platform, a job hunting and recruitment service, a social network and the like. Successful two-way recommendation means that the recommended subject attracts both the target object, which in turn attracts the recommended subject. Bidirectional recommendations must also take into account other factors not present in traditional recommendation systems, such as the consequences of recommending a popular user to an unwelcome user, etc.

Building a recommendation system is challenging. One of the most notable problems is the ubiquitous long tail distribution in user-item interactions: for items, a small percentage of popular items receive a large portion of user feedback, while most items have little user feedback. Therefore, training the recommendation model on a data set with such a long-tailed distribution of items will easily cycle through "richest" and "worse". Compared with the popular commodity recommendation, the long-tail recommendation has the following advantages: 1) the market competitiveness of popular goods is strong and thus the profit is very limited. However, the long-tailed project contains a relatively large marginal profit; 2) recommending long-tailed products will surprise the user and improve customer loyalty and satisfaction; 3) for two-way recommendations, the head items are recommended too much, and in the case of a comparatively small number of head items, the number of recommendations that produce satisfaction in both directions is greatly reduced. Therefore, solving the problem of long-tailed item distribution of bidirectional recommendation is very important.

In the current two-way recommendation, the processing normalization of the two-way scoring data set is aggregated according to a unified standard, and the difference sampling aggregation is not carried out according to the popularity of a subject and an object in two-way interaction. The traditional neural collaborative filtering network for recommendation utilizes the information of user number ID and article number ID, adopts a single hot coding mode, and does not fully excavate the two-way local relationship between the user and the article; furthermore, some recent studies addressing long tail distributions may employ resampling to rebalance the data set, with the reweighting assigning higher costs to the tail terms in the loss function. However, these recommendations focus only on the end items, and do not consider the connection to the head item (popular item) -which contains rich user feedback information and migratable context information related to the end items-e.g., of the same type, while few people are concerned with long-ended item recommendations in a two-way recommendation scenario.

Disclosure of Invention

Based on the technical problems, the invention provides a neural collaborative filtering bidirectional recommendation method based on the knowledge of head and tail migration.

The idea of the invention for realizing the above purpose is as follows: the bilateral interaction popularity of the object and the subject is classified and sampled through the bilateral interaction popularity of the two-way scoring data, the scoring data is normalized, the condition that the long-tail project is expressed and the phenomenon of the long tail of the aggregated one-way scoring data is more serious is prevented, the bilateral nearest neighbor characteristics of the object and the subject are combined with the object and the subject characteristics corresponding to the nerve collaborative filtering embedding layer through constructing a two-way nearest neighbor characteristic matrix, the object and the subject characteristics are continuously input into a neural network for training, and the local characteristic information between the two-way object and the subject is fully mined. The change of parameters of the neural collaborative filtering network in the process of learning object-subject and object-subject head items from a few-sample model to a many-sample model through a full connection layer F () network is migrated and applied to long-tail items, and the recommendation of bidirectional long-tail items is improved.

In order to achieve the purpose, the method comprises the following specific implementation steps:

and collecting object-subject and subject-object bidirectional grading data, and performing normalization pretreatment.

And respectively calculating similarity matrixes of the object-object and subject-object scoring data to obtain two object similarity matrixes and two subject similarity matrixes.

And respectively carrying out matrix decomposition training on the object-subject and the subject-object scoring data to obtain a corresponding object characteristic matrix and a corresponding subject characteristic matrix. And according to a plurality of similarity matrixes obtained from the bidirectional data, forming an object nearest neighbor characteristic matrix by using vectors corresponding to each object nearest neighbor in the two object characteristic matrixes, and forming a subject nearest neighbor characteristic matrix by using vectors corresponding to each subject nearest neighbor in the two subject characteristic matrixes.

Respectively calculating the popularity of the object-subject and the popularity of the object-subject in the object-subject scoring data, sampling according to different popularity of the object-subject and a certain rule, and giving different weights to the two-way scoring data to form new object-subject and object-subject scoring data.

And dividing the new object-subject scoring data into a few sample data sets and a multi-sample data set for multiple times, inputting a neural collaborative filtering recommendation network to obtain respective objects, embedding features of the objects, splicing with the corresponding nearest neighbor feature vectors, and continuing inputting the objects into the neural network for training. And dividing the new subject-object scoring data into a few sample data sets and a multi-sample data set for multiple times, inputting a neural collaborative filtering recommendation network to obtain respective subject and object embedding characteristics, splicing with the corresponding nearest neighbor characteristic vector, and continuing inputting the characteristic vector into the neural network for training.

Learning the neural collaborative filtering recommends a change in network parameters from object-subject low-sample to multi-sample dataset training using a fully connected layer as a mapping network for F (). Learning the neural collaborative filtering recommends a change in network parameters from subject-object low-sample to multi-sample dataset training using a fully connected layer as a mapping network for F ().

And generating the scores of the object to the subject and the scores of the subject to the object by using the trained neural collaborative filtering recommendation network, aggregating the two-way scores, and evaluating the performance of the model.

Further, the normalizing preprocessing the bidirectional scoring data includes:

normalizing the object-subject score data and mapping the normalized object-subject score data to [0,1]]The formula used is as follows:

wherein rating _max ，rating _min The score maximum and minimum values, respectively.

Normalizing the subject-object score data, and mapping the subject-object score data to [0,1]]The formula used is as follows:

wherein rating _max ，rating _min Maximum and minimum score values, respectively.

Further, the calculating a similarity matrix of the object-subject score data includes:

and forming a scoring matrix of the object to the object by using the scoring data of the object and the object, wherein the row and column of the scoring matrix are the object number ID and the host number ID, and the row number and column number of the scoring matrix are respectively the total number of the object and the host.

And calculating the similarity among all objects by utilizing a cosine similarity formula to form an object similarity matrix, and then calculating the similarity among all the subjects to form a subject similarity matrix.

The calculating of the similarity matrix of the subject-object scoring data comprises:

and forming a subject-object scoring matrix by the subject-object scoring data, wherein the line of the scoring matrix is subject number ID, the column of the scoring matrix is object number ID, and the row number and the column number of the scoring matrix are respectively the total number of the subject and the objects.

And calculating the similarity among all the objects by utilizing a cosine similarity formula to form an object similarity matrix, and calculating the similarity among all the subjects to form a subject similarity matrix.

Further, the obtaining an object and subject feature matrix includes:

performing matrix decomposition on the object-subject scoring matrix, initializing an object feature matrix and a subject feature matrix, and enabling the objectMultiplying the feature matrix and the main feature matrix to form a reconstruction training matrix, and utilizing a reconstruction loss formula and a reconstruction loss formula

Calculating a reconstruction error of a reconstruction matrix, wherein Y _i,u Denotes the score of the ith object on the u th subject, I _i Indicates the characteristics of the ith object, U _u Representing features of the u-th body, T being a transpose operation, λ being a regularization parameter, | |) _F Is the F norm. And updating the object characteristic matrix and the subject characteristic matrix by gradient descent, and continuously iterating until convergence to obtain the implicit vectors of each object and subject.

Performing matrix decomposition on the subject-object scoring matrix, initializing a subject characteristic matrix and an object characteristic matrix, multiplying the subject characteristic matrix and the object characteristic matrix to form a reconstruction training matrix, and utilizing a reconstruction loss formula and a reconstruction loss formula by utilizing a reconstruction loss formula

Calculating a reconstruction error of a reconstruction matrix, wherein Y _u,i Represents the score of the ith subject on the ith object, U is a subject feature matrix, U _u Representing the characteristics of the u-th subject, I being a guest feature matrix, I _i Representing features of the ith object, T being a transpose operation, λ being a regularization parameter, | |) _F Updating a subject feature matrix and an object feature matrix by gradient descent for the F norm, and continuously iterating until convergence to obtain implicit feature vectors of each subject and object;

and respectively selecting nearest neighbors of each object from an object similarity matrix obtained from the object-object scoring data and an object similarity matrix obtained from the object-object scoring data, and splicing implicit vectors of the nearest neighbors of each object in the corresponding object feature matrix to form an object nearest neighbor feature matrix.

And respectively selecting nearest neighbors of each subject from a subject similarity matrix obtained from the subject-object score data and a subject similarity matrix obtained from the object-subject score data, and splicing implicit vectors of the nearest neighbors of each subject in the corresponding subject feature matrix to form a subject nearest neighbor feature matrix.

Further, the obtaining of new object-to-object score, object-to-object data, comprises:

calculating the popularity of each subject of the subject-subject scoring data and the popularity of each object of the subject-object scoring data, wherein the calculation formula is as follows:

popularity of the object: ppk _i ＝ln(1+|N _i |)

The popularity of the subject: ppz _u ＝ln(1+|N _u |)

Wherein N is _i Number of subjects for which the object is preferred, N _u The number of objects preferred by the subject.

Ppz obtained from object-host scoring data _u Regarding subjects with k values larger than or equal to k as head subjects, regarding subjects with k values smaller than k as long-tail subjects, and regarding the preferred subjects of each subject in the subject-object scoring data as ppk _i Sorting from big to small, selecting the first k object interaction data of the long-tail subject in the subject-object scoring data, and selecting the front of the head subject in the subject-object scoring data

And (3) updating the object-subject score by weighted combination of individual object interaction data, wherein the calculation formula is as follows:

wherein,

scoring preference of an object for a subject, y _ui And scoring the preference of the subject to the object, wherein alpha is a weight coefficient. Thus a new guest-host scoring matrix is obtained.

Calculating the popularity of each object of the object-object scoring data and the popularity of each object of the object-object scoring data, wherein the calculation formula is as follows:

popularity of the object: ppk _i ＝ln(1+|N _i |)

The popularity of the subject: ppz _u ＝ln(1+|N _u |)

Wherein N is _i Number of subjects for which the object is preferred, N _u Is the number of objects preferred by the subject;

ppk derived from subject-object scoring data _i Regarding the object with the value of k larger than or equal to the head object and the object with the value of k smaller than the head object as the long tail object, and regarding the object with the preference of each object in the object-object scoring data as the ppz _u Sorting from big to small, selecting the first k host interactive data of the long-tail object in the object-host scoring data, and selecting the front of the head object in the object-host scoring data

The individual subject interaction data is weighted and combined to update the subject-object score, and the calculation formula is as follows:

wherein,

scoring preference of subject for object, y _iu Scoring the preference of the object to the subject, wherein alpha is a weight coefficient; thus a new host-guest scoring matrix is obtained.

Further, the constructing a few-sample data set and a multiple-sample data set includes:

multi-sample object-host data set

Low sample object-host data sets including all object feedback from head and tail subjects

Full interaction recording by Long-tailed principal and Each head principalT records of volume random sampling.

Further, the training process of the neural collaborative filtering recommendation network on the subject-object data set includes:

given a target object i and a subject u, the neural collaborative filtering recommendation method may be expressed as:

wherein,

representing the predicted score between object i and subject u; θ represents a model parameter; g denotes a mapping function.

The object-host data set is used as original input and is converted into an input vector which can be directly processed by a model after being subjected to independent thermal coding, then a linear embedding layer is utilized to convert a high-dimensional and sparse input vector into a low-dimensional and dense expression vector so as to obtain embedding vectors of an object i and a host u, and an embedding integration layer is utilized to integrate the embedding vectors and the corresponding nearest neighbor vectors to form a final expression vector v of the object and the host _i And v _u 。

For the matrix decomposition MF part, v is _i And v _u Performing element product to obtain matrix decomposition output vector

For the multi-layer perceptron MLP part, v is _i And v _u Spliced to obtain an input vector v _iu Then v is further determined _iu Inputting the data into a multilayer perceptron to learn an interaction function between an object and a subject to obtain an output vector of the multilayer perceptron

By

Output vector combining two parts of matrix decomposition and multilayer perceptronSplicing, inputting the spliced objects into a full connection layer to obtain a prediction score between an object i and a subject u, wherein h is a weight vector of an output layer; b represents the bias term of the output layer; σ (-) is a Sigmoid function. By using

As a function of the loss, wherein

θ is a model parameter, and preferences for subject-object are also trained as above.

The training process of the neural collaborative filtering recommendation network on the subject-object data set comprises the following steps:

given a target subject u and an object i, the neural collaborative filtering recommendation method is expressed as:

wherein,

representing a prediction score between subject u and object i; θ represents a model parameter; g represents a mapping function;

using the subject-object data set as an original input, converting the original input into an input vector which can be directly processed by a model after unique hot coding, converting a high-dimensional and sparse input vector into a low-dimensional and dense expression vector by using a linear embedding layer so as to obtain embedding vectors of a subject u and an object i, and integrating the embedding vectors and the corresponding two types of expression vectors of nearest neighbor vectors by using an embedding integration layer to form a final expression vector v of the subject and the object _u And v _i ；

For the matrix decomposition MF part, v is _u And v _i Performing element product to obtain matrix decomposition output vector

For the multi-layer perceptron MLP part, v is _u And v _i Spliced to obtain an input vector v _ui Then v is further determined _ui Inputting the data into a multilayer perceptron to learn an interaction function between an object and a subject to obtain an output vector of the multilayer perceptron

By

Splicing output vectors of two parts of matrix decomposition and a multilayer sensor, and inputting the output vectors into a full connection layer to obtain a prediction score between a subject u and an object i, wherein h is a weight vector of an output layer; b represents the bias term of the output layer; σ (-) is a Sigmoid function, using

As a function of the loss, wherein

And theta is a model parameter.

Further, the obtaining model parameter changes by using F () includes:

using a full-connection layer as a mapping network of F () to learn the last layer parameter of the object-subject neural collaborative filtering recommendation network, and using the formula

And solving the parameters of the multi-sample model.

Then using the loss function

Learning the parameter change from the few sample model to the many sample model, wherein F (theta; omega) uses the parameter omega to learn the meta-mapping from the few sample model parameters to the many sample model parameters, the input of which is the neural collaborative filtering recommendation model

The trained parameters θ, λ are regularization parameters that balance the two terms. II F (theta; omega) -theta ^* ‖ ² The predicted multi-sample model parameters (via F (theta; omega)) and the multi-sample model parameters theta ^* The distance between them is minimized.

Based on training data sets

Training a neural collaborative filtering recommendation network to ensure

High recommendation performance for medium data, while helping to learn F (θ; ω).

Using a full-connection layer as a mapping network of F () to learn the parameters of the last layer of the subject-object neural collaborative filtering recommendation network, and using the formula

And solving the parameters of the multi-sample model.

Then using the loss function

Learning the parameter variation from a few sample model to a many sample model,

a sample data set is few for the subject-object,

a subject-object multi-sample dataset.

Further, the obtaining the bidirectional score includes:

obtaining a guest-host score: select a series of t _j Satisfy t ₁ <t ₂ <t ₃ …<t _m Constructed as described

Data set, simultaneous learning of a series of F _1,2 (·)…F _j,j+1 (·)…F _m-1,m (. wherein F) _j,j+1 Is learning from

Data set to

Model parameter mapping of data sets by minimization

Thus obtaining the product. The final scoring data is given by the following formula,

wherein

θ ^* For the original multi-sample parameter, omega _j,j+1 For full connection layer learning

To

Parameter of model variation, θ _j Is composed of

Data set model parameters.

Obtaining a subject-object score: select a series of t _j Satisfy t ₁ <t ₂ <t ₃ …<t _m Constructed as described

Data set, simultaneous learning of a series of F _1,2 (·)…F _j,j+1 (·)…F _m-1,m (. wherein F) _j,j+1 (. is learning from

Data set to

Model parameter mapping of data sets by minimization

Thus obtaining the compound. The final scoring data is given by the following formula,

wherein

To

Parameter of model variation, θ _j Is composed of

Data set model parameters.

And aggregating the two-way scores through arithmetic mean values, matching with a test set, and verifying the accuracy, recall rate and f1 value of the model.

The invention has the beneficial effects that: according to the method, classification sampling is carried out according to the bilateral interactive popularity of the object and the subject of the two-way scoring data, and after the scoring data are normalized, certain two-way preference is blended into the one-way preference data, so that the condition that the long tail phenomenon of the aggregated one-way scoring data is more serious is prevented while the long tail item is represented.

According to the invention, by constructing a bidirectional nearest neighbor feature matrix, the two-way obtained nearest neighbor features are combined with the object and subject features of the neural collaborative filtering embedding layer and input into the fully-connected neural network for training, so that the subject preference and the subject preference information from the subject-subject scoring data can be further fused on the basis of learning the subject-subject scoring data, and the local feature information between the object and the subject can be fully mined.

The method provided by the invention has the advantages that the change of the parameters of the neural collaborative filtering network is migrated and applied to the long-tailed object/subject by using the full connection layer F () network to learn the change of the parameters of the object-subject and the subject-object head object/subject from the few-sample model to the multiple-sample model, so that the long-tailed object/subject with few-sample interaction is enriched, and the recommendation of the bidirectional long-tailed project is improved.

Drawings

Fig. 1 is a schematic flow chart of a neural collaborative filtering bidirectional recommendation method based on head-to-tail knowledge migration according to an embodiment of the present invention.

Fig. 2 is a method model of a neural collaborative filtering bidirectional recommendation method based on migrating head and tail knowledge according to an embodiment of the present invention.

Detailed Description

It is easily understood that according to the technical solution of the present invention, a person skilled in the art can propose various alternative structures and implementation ways without changing the spirit of the present invention. Therefore, the following detailed description and the accompanying drawings are merely illustrative of the technical aspects of the present invention, and should not be construed as all of the present invention or as limitations or limitations on the technical aspects of the present invention.

Fig. 2 is a method model provided by an embodiment of the present invention, and the flow of the present invention is illustrated by using a shallow, linear matrix decomposition and a deep, non-linear multi-layer sensor as an implementation manner, and the specific contents are as follows: given a target object i and a subject u, the object-subject recommendation method proposed by the present invention can be expressed as:

wherein,

representing the predicted score between object i and subject u; θ represents a model parameter; g denotes a mapping function. Given a target subject u and an object i, the subject-object recommendation method proposed by the present invention can be expressed as:

wherein,

representing a prediction score between subject u and object i; θ represents a model parameter; g denotes a mapping function.

According to an embodiment of the invention, a neural collaborative filtering bidirectional recommendation method based on head and tail knowledge migration is shown in combination with fig. 1, and includes the following steps:

s101, acquiring object-subject and subject-object two-way grading data, and performing normalization pretreatment.

Specifically, the step S101 is as follows:

s1011, firstly, carrying out normalization processing on the object-subject scoring data, and mapping the object-subject scoring data to [0,1], wherein the formula is as follows:

among them, rating _max ，rating _min Maximum and minimum score values, respectively.

S1012, normalizing the subject-object score data, and mapping the subject-object score data to [0,1], wherein the formula is as follows:

wherein, rating _max ，rating _min Maximum and minimum score values, respectively.

S102, similarity matrixes of the object-subject and subject-object grading data are calculated respectively, and two object similarity matrixes and two subject similarity matrixes are obtained.

Specifically, the step S102 is as follows:

s1021, forming a scoring matrix of the objects to the objects from scoring data of the objects and the objects, wherein the row and column of the scoring matrix are object number IDs and are object number IDs, the row number and column number of the scoring matrix are the objects respectively, the total number of the objects, the expression vector of the object i is the scoring of all the objects in the ith row, and the expression vector of the object u is the scoring of all the objects in the u th column;

and S1022, calculating the similarity among all the objects by utilizing a cosine similarity formula to form an object similarity matrix, and calculating the similarity among all the subjects to form a subject similarity matrix.

S1023, forming a scoring matrix of the subject to the object from the scoring data of the subject and the object, wherein the row of the scoring matrix is a subject number ID, the column is an object number ID, the row number and the column number of the scoring matrix are respectively the subject, the total number of the object, the expression vector of the subject u is the score of all the objects in the u-th row, and the expression vector of the object i is the score of all the subjects in the i-th column for the subject;

and S1024, calculating the similarity among all objects by utilizing a cosine similarity formula to form an object similarity matrix, and calculating the similarity among all subjects to form a subject similarity matrix.

And S103, respectively carrying out matrix decomposition training on the object-subject and the subject-object scoring data to obtain a corresponding object characteristic matrix and a corresponding subject characteristic matrix. And forming an object nearest neighbor feature matrix by using vectors corresponding to each object nearest neighbor in the two object feature matrices according to a plurality of similarity matrices obtained by the bidirectional data, and forming a subject nearest neighbor feature matrix by using vectors corresponding to each subject nearest neighbor in the two subject feature matrices.

If only object-subject and subject-object scores are input into the recommendation network, local feature information between the objects and the subjects cannot be sufficiently mined, so that object preferences obtained by learning object-subject score data and subject-object preference information are further fused on the basis of the subject-object preference by combining the neighbor matrices of the objects and the subjects and learning the neighbor matrices of the subject-object score data, and the subject-object preference information are enriched.

Specifically, the step S103 is as follows:

s1031, performing matrix decomposition on the object-subject scoring matrix, initializing an object feature matrix I (m f) and a subject feature matrix U (n f), wherein m and n are the numbers of objects and subjects respectively, and f is the dimension of an implicit vector, multiplying the object feature matrix and the subject feature matrix to form a reconstruction training matrix, and utilizing a reconstruction loss formula

Calculating a reconstruction error of a reconstruction matrix, wherein Y _i,u Denotes the score of the ith object on the u th subject, I _i Indicates the characteristics of the ith object, U _u Representing features of the u-th body, T being a transpose operation, λ being a regularization parameter, | |) _F Is the F norm. And updating the object characteristic matrix and the subject characteristic matrix by gradient descent, and repeating the steps for continuous iteration until convergence to obtain the implicit vectors of each object and subject.

S1032, performing matrix decomposition on the subject-object scoring matrix, initializing a subject feature matrix U (n f) and an object feature matrix I (m f), wherein m and n are the number of objects and subjects respectively, and f is the dimension of an implicit vector, multiplying the subject feature matrix and the object feature matrix to form a reconstruction training matrix, and utilizing a reconstruction loss formula to utilize the reconstruction loss formula

Calculating a reconstruction error of a reconstruction matrix, wherein Y _u,i Represents the score of the ith subject on the ith object, U is a subject feature matrix, U _u Representing the characteristics of the u-th subject, I is a guest characteristic matrix, I _i Representing features of the ith object, T being a transpose operation, λ being a regularization parameter, | | _F Updated by gradient descent for F normAnd continuously iterating the subject characteristic matrix and the object characteristic matrix until convergence is achieved, so as to obtain the implicit characteristic vectors of each subject and each object.

And S1033, respectively selecting nearest neighbors of each object from the object similarity matrix obtained from the object-subject scoring data and the object similarity matrix obtained from the subject-object scoring data, and splicing the implicit vectors of the nearest neighbors of each object in the corresponding object feature matrix to form an object nearest neighbor feature matrix.

S1034, selecting nearest neighbors of each subject from the subject similarity matrix obtained from the object-subject scoring data and the subject similarity matrix obtained from the subject-object scoring data, and splicing the implicit vectors of the nearest neighbors of each subject in the corresponding subject characteristic matrix to form a subject nearest neighbor characteristic matrix.

S104, respectively calculating popularity of the object-subject and subject-object scoring data, sampling according to different popularity of the object-subject and a certain rule, and giving different weights to the two-way scoring data to combine new object-subject and subject-object scoring data.

When one-way preference is trained, for example, the preference of an object to a subject, in order to add some factors of two-way selection to better realize two-way recommendation matching and enrich the representation of long tail data, the subject-object scoring data is sampled according to a certain rule and then fused with the object-subject scoring, in order to prevent the phenomenon that the one-way scoring data is more serious in long tail after aggregation, different strategies are sampled on the head and long tail items, and then the two-way scoring is given different weights for weighted fusion.

Specifically, the step S104 is as follows:

s1041, calculating each subject popularity of the subject-subject scoring data and each subject popularity of the subject-subject scoring data, wherein the calculation formula is as follows:

popularity of objects: ppk _i ＝ln(1+|N _i |)

The popularity of the subject: ppz _u ＝ln(1+|N _u |)

Wherein N is _i Number of subjects for which the object is preferred, N _u The number of objects preferred for the subject.

S1042, obtaining ppz from object-host scoring data _u Regarding subjects with k values larger than or equal to k as head subjects, regarding subjects with k values smaller than k as long-tail subjects, and regarding the preferred subjects of each subject in the subject-object scoring data as ppk _i Sorting from big to small, selecting front k object interactive data of a long-tail object in the object-object scoring data, and selecting front of a head object in the object-object scoring data

Personal object interaction data

S1043, updating the object-subject score by weighted combination, wherein the calculation formula is as follows:

wherein,

S1044, calculating the popularity of each object in the object-object scoring data and the popularity of each object in the object-object scoring data, wherein the calculation formula is as follows:

popularity of the object: ppk _i ＝ln(1+|N _i |)

The popularity of the subject: ppz _u ＝ln(1+|N _u |)

s1045, obtaining ppk from subject-object scoring data _i Regarding the object with the value of k larger than or equal to the head object and the object with the value of k smaller than the head object as the long tail object, and regarding the object with the preference of each object in the object-object scoring data as the ppz _u Arranged from large to smallSelecting the first k host interactive data of the long-tail object in the object-host scoring data, and selecting the front of the head object in the object-host scoring data

Individual subjects interact with the data.

S1046, updating the subject-object score by weighted combination to obtain a new subject-object score matrix, wherein the calculation formula is as follows:

wherein,

scoring preference of the subject for the object, y _iu Scoring the preference of the object to the subject, wherein alpha is a weight coefficient; thus, a new host-guest scoring matrix is obtained.

And S105, dividing the new object-subject scoring data into a few-sample data set and a multi-sample data set for multiple times, inputting the data into a neural collaborative filtering network to obtain respective objects, embedding features into the subjects, splicing the features with nearest neighbor feature vectors, and inputting the features into a multilayer perceptron. And dividing the new subject-object scoring data into a few sample data sets and a multi-sample data set for multiple times, inputting the data into a neural collaborative filtering network to obtain respective subject and object embedding characteristics, splicing the characteristics with nearest neighbor characteristic vectors, and inputting the characteristics into a multilayer perceptron.

The lack of data points results in insufficient representation of long-tailed items, while we have quite rich information and enough items with data points at the head. It is proposed to explore the links between models learned through only a few interaction examples and model parameters learned through enough interaction examples for the same project. For example, given an item with user feedback, F () learning implicitly adds model parameter changes in similar user processes that provide feedback for the same item, allowing us to improve representation quality even when there is insufficient data. Therefore, a small sample data set and a multi-sample data set need to be constructed to learn the application of parameter changes to the long tail data.

Specifically, the step S105 is as follows:

s1051, a multi-sample object-host data set

Sample-less object-object data set including all object feedback from head and tail subjects

The method is characterized by comprising all interaction records of a long-tail main body and t records randomly sampled by each head main body.

S1052, using the object-host data set as original input, converting the data set into an input vector which can be directly processed by a model after unique hot coding, then converting a high-dimensional and sparse input vector into a low-dimensional and dense expression vector by using a linear embedding layer so as to obtain embedding vectors of an object i and a host u, and integrating the embedding vectors and corresponding nearest neighbor vectors by using an embedding integration layer to form a final expression vector v of the object and the host _i And v _u 。

S1053, decomposing v for MF part of matrix in the neural collaborative filtering network _i And v _u Performing element product to obtain matrix decomposition output vector

E.g. v _i ＝[a ₁ ，a ₂ ,…a _f ] ^T ，v _u ＝[b ₁ ，b ₂ ,…b _f ] ^T Then, then

For the multi-layer perceptron MLP part, v is _i And v _u Spliced to obtain an input vector v _iu E.g. v _i ＝[a ₁ ，a ₂ ,…a _f ] ^T ，v _u ＝[b ₁ ，b ₂ ,…b _f ] ^T Then v is _iu ＝[a ₁ ，a ₂ ,…a _f ，b ₁ ，b ₂ ,…b _f ] ^T . Then v is converted into _iu Inputting the data into a multilayer perceptron to learn an interaction function between an object and a subject to obtain an output vector of the multilayer perceptron

Using the following formula

…

And S1054, designing the multilayer sensor, and following a common tower-shaped structure. Specifically, the number of implicit elements in the next layer is half that of the previous layer. For example, if the number of layers L of the multilayer perceptron is 3 and the predictor dL is 64, the network structure is 256 → 128 → 64 and the embedding dimension is 64. In general, a multilayer sensor using three layers has been able to achieve very good results.

S1055, preparing a composition from

Splicing output vectors of two parts of matrix decomposition and a multilayer sensor, and inputting the output vectors into a full connection layer to obtain a prediction score between an object i and a subject u, wherein h is a weight vector of an output layer; b represents the bias term of the output layer; σ (-) is a Sigmoid function. By using

As a function of the loss, wherein

Theta is a model parameter, and the optimization method is an Adam algorithm. It can adaptively adjust the size of its learning rate for different parameters.

S1056, multiple sample subject-object data set

Low sample subject-object data sets including all subject feedback from head and tail objects

The method is characterized by comprising all interaction records of long-tail objects and t records of random sampling of each head object.

S1057, using the subject-object data set as original input, converting the original input into an input vector capable of being directly processed by a model after being subjected to one-hot encoding, then converting a high-dimensional and sparse input vector into a low-dimensional and dense expression vector by using a linear embedding layer so as to obtain embedding vectors of a subject u and an object i, and integrating the embedding vectors and two types of expression vectors of corresponding nearest neighbor vectors by using an embedding integration layer to form a final expression vector v of the subject and the object _u And v _i 。

S1058, decomposing v for MF part of matrix in the neural collaborative filtering network _u And v _i Performing element product to obtain matrix decomposition output vector

For the multi-layer perceptron MLP part, v is _u And v _i Spliced to obtain an input vector v _ui E.g. v _i ＝[a ₁ ，a ₂ ,…a _f ] ^T ，v _u ＝[b ₁ ，b ₂ ,…b _f ] ^T Then v is _ui ＝[b ₁ ，b ₂ ,…b _f ，a ₁ ，a ₂ ,…a _f ] ^T . Then v is measured _ui Inputting the data into a multilayer perceptron to learn an interaction function between an object and a subject to obtain an output vector of the multilayer perceptron

Using the following formula

…

S1059, designing the multilayer perceptron, and following the common tower structure. Specifically, the number of implicit elements in the next layer is half that of the previous layer. For example, if the number of layers L of the multi-layer sensor is 3 and the predictor dL is 64, the network structure is 256 → 128 → 64, and the embedding dimension is 64. In general, a multilayer sensor using three layers has been able to achieve very good results.

S10510, preparation of

Splicing output vectors of two parts of matrix decomposition and a multilayer sensor, and inputting the output vectors into a full connection layer to obtain a prediction score between a subject u and an object i, wherein h is a weight vector of an output layer; b represents the bias term of the output layer; σ (-) is a Sigmoid function. By using

As a function of the loss, wherein

S106, using a full connection layer as a mapping network of F (), and learning the training change of parameters of the neural collaborative filtering network from object-subject few samples to multi-sample data sets; using a fully connected layer as a mapping network for F (), learning the variation of the neural collaborative filtering network parameters from subject-object few-sample to multi-sample dataset training.

Specifically, the step S106 is as follows:

s1061, using a full connection layer as a mapping network of F () to learn the last layer parameter of the object-subject neural collaborative filtering recommendation network, and using the formula

And solving the parameters of the multi-sample model.

S1062, then using the loss function

The trained parameters θ, λ are regularization parameters that balance the two terms. II F (theta; omega) -theta ^* ‖ ² The predicted multi-sample model parameters (via F (theta; omega)) and the multi-sample model parameters theta ^* The distance between them is minimized. It ensures that F (theta; omega) learning prediction multi-sample model parameters can be well fitted

The data samples in (1); second item

Based onTraining data set

Training a neural collaborative filtering recommendation network to ensure

High recommendation performance for medium data, while helping to learn F (θ; ω). Therefore, training the few-sample model and the loss function of the F (theta; omega) simultaneously provides high-quality neural collaborative filtering recommendation and improves the accuracy of F (theta; omega) model mapping.

S1063, using a full connection layer as a mapping network of F () to learn the parameters of the last layer of the subject-object neural collaborative filtering recommendation network, and using the formula

And solving the parameters of the multi-sample model.

S1064, then using the loss function

a sample data set is few for the subject-object,

a subject-object multi-sample dataset.

And S107, generating an object-subject score and a subject-to-object score by using the trained recommendation network, aggregating two-way scores, and evaluating the performance of the model.

In order to make the migration learning between projects smoother, the invention further changes the size of t to construct different data sets and recursively learns the dynamic trajectory of the model.

Specifically, the step S107 is as follows:

s1071, obtaining an object-subject score: select a series of t _j Satisfy t ₁ <t ₂ <t ₃ …<t _m Constructed as described

Data set to

Model parameter mapping of data sets by minimization

wherein

To

Parameter of model variation, θ _j Is composed of

Data set model parameters.

S1072, obtaining subject-object score: select a series of t _j Satisfy t ₁ <t ₂ <t ₃ …<t _m Constructed as described

Data set to

Model parameter mapping of data sets by minimization

wherein

To

Parameter of model variation, θ _j Is composed of

Data set model parameters.

The two-way scores are aggregated by arithmetic mean, matched with a test set, and verified for model accuracy, recall and f1 values.

The above description is only an embodiment of the present invention, and is not intended to limit the present invention. Various modifications and alterations to this invention will become apparent to those skilled in the art. Any modification, equivalent replacement, improvement or the like made within the spirit and principle of the present invention should be included in the scope of the claims of the present invention.

Claims

1. A neural collaborative filtering bidirectional recommendation method based on migration head and tail knowledge is characterized by comprising the following steps: the neural collaborative filtering bidirectional recommendation method comprises the following steps:

step 1: acquiring object-subject and subject-object two-way scoring data, and performing normalization pretreatment;

step 2: respectively calculating similarity matrixes of the object-object and subject-object scoring data to obtain two object similarity matrixes and two subject similarity matrixes;

and step 3: respectively carrying out matrix decomposition training on the object-subject and subject-object scoring data in the step 1 to obtain corresponding object feature matrices and corresponding subject feature matrices, forming an object nearest neighbor feature matrix by using vectors corresponding to each object nearest neighbor in the two object feature matrices according to the two object similarity matrices and the two subject similarity matrices obtained in the step 2, and forming a subject nearest neighbor feature matrix by using vectors corresponding to each subject nearest neighbor in the two subject feature matrices;

and 4, step 4: respectively calculating popularity of the object-subject and popularity of the object-subject in the object-subject scoring data, sampling according to different popularity of the object-subject, and giving different weights to the two-way scoring data to combine new object-subject scoring data and new subject-object scoring data;

and 5: dividing the new object-subject scoring data obtained in the step 4 into a few sample data sets and a multiple sample data set for multiple times, inputting the data into a neural collaborative filtering recommendation network to obtain respective objects, embedding features into the subjects, splicing the subject embedding features with the corresponding nearest neighbor feature vectors, and continuously inputting training parameters in the neural network; dividing the new subject-object scoring data obtained in the step 4 into a few-sample data set and a multi-sample data set for multiple times, inputting a neural collaborative filtering recommendation network to obtain respective subject and object embedding characteristics, splicing with the corresponding nearest neighbor characteristic vector, and continuously inputting training parameters in the neural network;

step 6: using a full connection layer as a mapping network of F (), learning neural collaborative filtering and recommending the change of network parameters from object-subject few-sample to multi-sample data set training; using a full connection layer as a mapping network of F (), learning neural collaborative filtering and recommending the change of network parameters from subject-object few-sample to multi-sample data set training;

and 7: and generating the subject-to-subject score and the subject-to-object score by using the trained neural collaborative filtering recommendation network, aggregating the two-way scores, and evaluating the performance of the model.

2. The neural collaborative filtering bidirectional recommendation method based on the migration head-tail knowledge according to claim 1, characterized in that: the normalization preprocessing of the collected bidirectional scoring data in the step 1 specifically comprises the following steps:

wherein rating _max ，rating _min Score maximum and minimum values, respectively;

normalizing the subject-object score data and mapping the subject-object score data to [0,1]]The formula used is as follows:

3. The neural collaborative filtering bidirectional recommendation method based on the migration head-tail knowledge according to claim 1, characterized in that: in the step (2), the first step is that,

calculating a similarity matrix of the object-subject scoring data, comprising: forming a scoring matrix of the objects to the objects by the scoring data of the objects and the objects, wherein the row of the scoring matrix is an object number ID, the column of the scoring matrix is a object number ID, the row number and the column number of the scoring matrix are respectively the total number of the object objects, calculating the similarity among all the objects by utilizing a cosine similarity formula to form an object similarity matrix, and then calculating the similarity among all the objects to form a main body similarity matrix;

calculating a similarity matrix of subject-object scoring data, comprising: and forming a scoring matrix of the subject to the object by the subject-object scoring data, wherein the row of the scoring matrix is a subject number ID, the column of the scoring matrix is an object number ID, the row number and the column number of the scoring matrix are respectively the total number of the subject and the object, calculating the similarity between all subjects by using a cosine similarity formula to form a subject similarity matrix, and then calculating the similarity between all objects to form an object similarity matrix.

4. The neural collaborative filtering bidirectional recommendation method based on the migration head-tail knowledge according to claim 1, characterized in that: in the step 3, the step of the method is that,

the acquisition of the object characteristic matrix and the subject characteristic matrix in the object-subject scoring matrix specifically comprises the following steps: performing matrix decomposition on the object-subject scoring matrix, initializing an object characteristic matrix and a subject characteristic matrix, multiplying the object characteristic matrix and the subject characteristic matrix to form a reconstruction training matrix, and utilizing a reconstruction loss formula and a reconstruction loss formula by utilizing a reconstruction loss formula

Calculating a reconstruction error of a reconstruction matrix, wherein Y _i,u Represents the score of the ith object on the u-th subject, I is an object feature matrix, I _i Represents the characteristics of the ith object, U is a subject characteristic matrix, U _u Representing features of the u-th body, T being a transpose operation, λ being a regularization parameter, | |) _F Updating the object feature matrix and the subject feature matrix by gradient descent for F norm, and continuously iterating until convergence to obtain implicit feature vectors of each object and subject;

subject in the subject-object scoring matrixThe acquisition of the feature matrix and the guest feature matrix is specifically as follows: performing matrix decomposition on the subject-object scoring matrix, initializing a subject characteristic matrix and an object characteristic matrix, multiplying the subject characteristic matrix and the object characteristic matrix to form a reconstruction training matrix, and utilizing a reconstruction loss formula and a reconstruction loss formula

the acquisition of the object nearest neighbor feature matrix specifically comprises the following steps: respectively selecting nearest neighbors of each object from an object similarity matrix obtained from object-object scoring data and an object similarity matrix obtained from the object-object scoring data, and splicing implicit vectors of the nearest neighbors of each object in corresponding object feature matrices to form an object nearest neighbor feature matrix;

the main body nearest neighbor feature matrix is obtained specifically as follows: and respectively selecting nearest neighbors of each subject from a subject similarity matrix obtained from the subject-subject scoring data and a subject similarity matrix obtained from the subject-subject scoring data, and splicing implicit vectors of the nearest neighbors of each subject in corresponding subject feature matrices to form a subject nearest neighbor feature matrix.

5. The neural collaborative filtering bidirectional recommendation method based on the head-tail knowledge migration according to claim 4, characterized in that: in the step 4:

obtaining new object-subject scoring data, comprising the steps of:

popularity of the object: ppk _i ＝ln(1+|N _i |)

The popularity of the subject: ppz _u ＝ln(1+|N _u |)

subject popularity ppz from object-subject scoring data _u Regarding subjects with k values larger than or equal to k as head subjects and subjects with k values smaller than k as long-tail subjects, and regarding the subjects preferred by each subject in the subject-object scoring data as object popularity ppk _i Sorting from big to small, selecting the first k object interaction data of the long-tail subject in the subject-object scoring data, and selecting the front of the head subject in the subject-object scoring data

And (3) updating the object-subject score by weighted combination of individual object interaction data to obtain a new object-subject score matrix, wherein the calculation formula is as follows:

wherein,

scoring preference of an object for a subject, y _ui Scoring the preference of the subject to the object, wherein alpha is a weight coefficient;

obtaining new subject-object scoring data, comprising the steps of:

popularity of the object: ppk _i ＝ln(1+|N _i |)

The popularity of the subject: ppz _u ＝ln(1+|N _u |)

Wherein, N _i Number of subjects for which the object is preferred, N _u Is the number of objects preferred by the subject;

object popularity ppk obtained from subject-object scoring data _i Regarding the objects with the k value larger than or equal to the k value as head objects, regarding the objects with the k value smaller than the k value as long-tail objects, and regarding the objects with the preference of each object in the object-object scoring data as the popularity ppz of the object _u Sorting from big to small, selecting the first k host interactive data of the long-tail object in the object-host scoring data, and selecting the front of the head object in the object-host scoring data

And (3) carrying out weighted combination on the interactive data of each subject to update the subject-object score to obtain a new subject-object score matrix, wherein the calculation formula is as follows:

wherein,

scoring preference of subject for object, y _iu And alpha is a weight coefficient.

6. The neural collaborative filtering bidirectional recommendation method based on the head-tail knowledge migration according to claim 1, characterized in that: in the step 5, the step of processing the image,

the composition of the object-subject few-sample data set and the multi-sample data set comprises the following steps: multi-sample object-host data set

The system consists of all interactive records of a long tail main body and t records of random sampling of each head main body;

the composition of the subject-object few sample data set and the multi-sample data set comprises: multi-sample subject-object data set

7. The neural collaborative filtering bidirectional recommendation method based on the migration head-tail knowledge according to claim 1, characterized in that: in the step 5, the process of training the object-host data set by the neural collaborative filtering recommendation network includes:

step 5-1-1: given a target object i and a subject u, the neural collaborative filtering recommendation method is expressed as:

wherein,

representing the predicted score between object i and subject u; θ represents a model parameter; g represents a mapping function;

step 5-1-2: the object-host data set is used as original input and is converted into an input vector which can be directly processed by a model after being subjected to independent thermal coding, then a linear embedding layer is utilized to convert a high-dimensional and sparse input vector into a low-dimensional and dense expression vector so as to obtain embedding vectors of an object i and a host u, and an embedding integration layer is utilized to integrate the embedding vectors and the corresponding nearest neighbor vectors to form a final expression vector v of the object and the host _i And v _u ；

Step 5-1-3: for the matrix decomposition MF part, v is _i And v _u Performing element product to obtain matrix decomposition output vector

For the multi-layer perceptron MLP part, v is _i And v _u Are spliced together to obtain an input vector v _iu Then v is further determined _iu Inputting the data into a multilayer perceptron to learn an interaction function between an object and a subject to obtain an output vector of the multilayer perceptron

Step 5-1-4: by

Splicing output vectors of two parts of matrix decomposition and a multilayer sensor, and inputting the output vectors into a full connection layer to obtain a prediction score between an object i and a subject u, wherein h is a weight vector of an output layer; b represents the bias term of the output layer; σ (-) is a Sigmoid function, using

As a function of the loss, wherein

Theta is a model parameter.

8. The neural collaborative filtering bidirectional recommendation method based on the migration head-tail knowledge according to claim 1, characterized in that: the process of training the subject-object data set by the neural collaborative filtering recommendation network comprises the following steps:

step 5-2-1: given a target subject u and an object i, the neural collaborative filtering recommendation method is expressed as:

wherein,

step 5-2-2: using the subject-object data set as an original input, converting the original input into an input vector which can be directly processed by a model after unique hot coding, converting a high-dimensional and sparse input vector into a low-dimensional and dense expression vector by using a linear embedding layer so as to obtain embedding vectors of a subject u and an object i, and integrating the embedding vectors and the corresponding two types of expression vectors of nearest neighbor vectors by using an embedding integration layer to form a final expression vector v of the subject and the object _u And v _i ；

Step 5-2-3: for the matrix decomposition MF part, v is _u And v _i Performing element product to obtain matrix decomposition output vector

Step 5-2-4: by

As a function of the loss, wherein

And theta is a model parameter.

9. The neural collaborative filtering bidirectional recommendation method based on the migration head-tail knowledge according to claim 1, characterized in that: in the step 6, obtaining parameter changes of the neural collaborative filtering model in a training process of a data set from a few samples to a many samples by using F (), including:

step 6-1: using a full-connection layer as a mapping network of F () to learn the last layer parameter of the object-subject neural collaborative filtering recommendation network, and using the formula

For a multi-sample dataset, argmin is a minimization loss function, thus obtaining a multi-sample model parameter theta ^* ；

Step 6-2: using loss functions

Learning the variation of parameters from an object-subject few-sample model to a multiple-sample model, wherein,

a small sample data set for object-subject,

a multi-sample dataset for the object-subject; f (theta;ω) learning a meta-map from the few-sample model parameters to the many-sample model parameters using the parameter ω, whose input is a neural collaborative filtering recommendation model

The trained parameter theta, lambda is a regularization parameter for balancing the two terms, | F (theta; omega) -theta ^* ‖ ² Passing the predicted multi-sample model parameters through F (theta; omega) and the multi-sample model parameters theta ^* The distance between the two is minimized and,

based on training data sets

Training the neural collaborative filtering recommendation network to ensure

High recommendation performance of data while helping to learn F (theta; omega);

step 6-3: using a full-connection layer as a mapping network of F () to learn the parameters of the last layer of the subject-object neural collaborative filtering recommendation network, and using the formula

Obtaining a multi-sample model parameter;

step 6-4: then using the loss function

Learning the parameter change from a subject-object few-sample model to a multi-sample model,

a sample data set is few for the subject-object,

few sample data sets for subject-object.

10. The neural collaborative filtering bidirectional recommendation method based on the head-tail knowledge migration according to claim 1, characterized in that: the step 7 specifically includes:

step 7-1: obtaining a guest-host score: select a series of t _j Satisfy t ₁ <t ₂ <t ₃ …<t _m Constructed of

Data set to

Model parameter mapping of data sets by minimization

The final scoring data is obtained by the following formula,

wherein

To

Parameter of model variation, θ _j Is composed of

Data set neural collaborative filtering model parameters;

step 7-2: obtaining a subject-object score: select a series of t _j Satisfy t ₁ <t ₂ <t ₃ …<t _m Constructed as described

Data set to

Model parameter mapping of data sets by minimization

The final scoring data is obtained by the following formula,

wherein

θ ^* For the original multi-sample parameter, omega _j,j+1 For full-link layer learning

To

Parameter of model variation, θ _j Is composed of

Data set neural collaborative filtering model parameters；

And 7-3: aggregating two-way scores: the two-way scores are aggregated by arithmetic mean, matched with a test set, and verified for model accuracy, recall and f1 values.