CN115809374A

CN115809374A - Method, system, device and storage medium for correcting mainstream deviation of recommendation system

Info

Publication number: CN115809374A
Application number: CN202310104256.6A
Authority: CN
Inventors: 张海仙; 张宽易; 谢敏; 张懿; 谌祖港; 黄粱可汗; 李欣洋; 尚文一; 尹腾; 杨雨奇
Original assignee: Sichuan University
Current assignee: Sichuan University
Priority date: 2023-02-13
Filing date: 2023-02-13
Publication date: 2023-03-17
Anticipated expiration: 2043-02-13
Also published as: CN115809374B

Abstract

The invention discloses a method, a system, equipment and a storage medium for correcting mainstream deviation of a recommendation system, belongs to mainstream deviation correction of the recommendation system in the field of data recommendation, and aims to solve the technical problems of low overall recommendation accuracy and low recommendation fairness caused by the fact that the mainstream deviation is not fully considered by the recommendation system in the prior art. According to the method, the mainstream score of the user is calculated and weighted, then the interaction data of the user after weighting is used as the training data of model training, through the method, the mainstream degree of the user can be considered in the reconstruction of the model, and the phenomenon of excessively recommending popular goods can not occur, so that the influence of mainstream deviation on a recommendation system can be effectively reduced, the effect of the recommendation system on a wider user group is improved, higher fairness is realized, the overall recommendation accuracy of the recommendation system is improved, and the recommendation fairness of the recommendation system is higher.

Description

Method, system, device and storage medium for correcting mainstream deviation of recommendation system

Technical Field

The invention belongs to the technical field of data recommendation, relates to the field of mainstream deviation correction of a recommendation system, and particularly relates to a method, a system, equipment and a storage medium for correcting mainstream deviation of the recommendation system.

Background

With the expansion of the scale of the internet and information systems, the amount of information generated by the internet and information systems also shows an explosive growth situation, and the difficulty of a user in retrieving the content desired by the user from mass information is greatly improved. At present, various websites use a recommendation system in the background, the recommendation system calculates candidate products most suitable for being recommended to a user according to the access characteristics of the user, and then the candidate products are displayed to the user for selection. As a technology capable of effectively solving information overload, the recommendation system can filter out the most interesting part of the user from massive contents according to personalized requirements of different users, so that the recommendation system is widely applied to the fields of e-commerce, video and audio entertainment, accurate advertisement delivery and the like. The collaborative filtering method is a common method in a recommendation system, and can be divided into user-based collaborative filtering and article-based collaborative filtering according to different targets. The collaborative filtering based on the users takes the users as the center, and the idea is to recommend articles similar to the users to the target users; item-based collaborative filtering is item-centric, with the idea being to recommend similar items to a target item to a user who likes the current item. However, due to the influence of the user crowd effect, the platform display mechanism, the quality difference of the articles, and other factors, a phenomenon that the interaction of many users is concentrated on a small part of the articles easily occurs. The traditional collaborative filtering method is easy to learn the deviation during the training process, so that the recommendation of a small part of popular items is prone to be realized, and the recommendation of a large part of items is difficult to obtain the recommendation opportunity, so that the recommendation result cannot reflect the real preference of the user. This causes that the recommendation effect received by the mainstream users who like to pursue the hot spot is often very good, while the recommendation effect received by the rest of the broader user groups is not satisfactory, and the received recommendation effect of different users is greatly different due to different mainstream degrees, which is the mainstream deviation phenomenon in the recommendation system.

Most of the existing deviation correcting schemes start from the perspective of articles, namely, the mainstream deviation is indirectly reduced by correcting the popularity deviation of the articles in a recommendation system. The invention patent application with application number CN202110218946.5 discloses a causal reasoning method for correcting popularity deviation of a recommendation system, which comprises the following steps: acquiring a matching score of a user and an article in a current recommendation system; predicting an item score according to the popularity of the item, and predicting a user score according to the preference of the user; and aggregating the matching scores of the user and the articles, the article scores and the user scores, predicting the matching scores of the user and the articles, and removing the influence caused by the popularity deviation to obtain the final matching scores of the user and the articles. The method is a model-independent counterfactual reasoning framework, can be suitable for various recommendation systems, improves the recommendation performance of the recommendation system by eliminating the popularity deviation, and can provide high-quality and accurate personalized recommendation content for users. The method is the same as other article-based collaborative filtering methods, mainly aims at improving the phenomenon that recommendation is concentrated on a small part of popular articles, reduces the influence of the popular articles on the overall model recommendation decision in the training process by adopting modes such as inverse tendency fraction weighting, and the like, and simultaneously gives higher weight to the long-tail articles to increase the recommendation probability of the long-tail articles.

In recent years, there are also methods for removing mainstream deviations from the perspective of users, such as adjusting weights of different users in a training process, training a model separately for user groups with different preferences, and the like, so as to enhance capturing capability of the model for preferences of a specific user group. The invention patent application with the application number of CN201911056270.3 discloses a recommendation list re-ranking method for improving the diversity of a recommendation system. The method is the same as other collaborative filtering based on the user, different requirements of the user on diversity of the recommendation list can be considered, so that the recommended articles are more fit for real feeling of people, grading deviation of different users on the same article is also considered, the diversity is properly improved on the balance of accuracy and diversity, and the influence on the accuracy is small.

The method for correcting the mainstream deviation of the recommendation system can actually expand the recommendation range of the recommendation system, so that the recommendation system can not be limited to a part of popular articles, but can take care of some long-tail articles, and the fairness problem in article recommendation is solved to a certain extent. However, this does not mean that these long-tailed items can be recommended to the appropriate users, but rather reduces the accuracy of the recommendation system if recommended to mainstream users who prefer to pursue hot spots. Therefore, a method of correcting the deviation of popularity alone does not necessarily play a positive role in correcting the deviation of the mainstream. The existing method for directly correcting the mainstream deviation also has a certain problem, and the effect of the part of users is easily damaged by reducing the weight of the mainstream user in the training process, so that the overall accuracy of the recommendation system is reduced; the method for training different models separately for different user groups also has problems, and the division of the user groups, the training of a plurality of models and the consumption during integration make the method difficult to realize in the actual production environment. In addition, the existing method for correcting the deviation of the main stream does not consider the characteristic of the change of the main stream, a group of users belonging to the main stream at present are not necessarily the main stream users in the past, and a group of users not belonging to the main stream in the past can also become the main stream users in the future due to pursuit of hot spots.

Disclosure of Invention

The invention aims to: in order to solve the technical problems of low overall recommendation accuracy and low recommendation fairness caused by the fact that a recommendation system does not fully consider mainstream deviation in the prior art, the invention provides a method, a system, equipment and a storage medium for correcting the mainstream deviation of the recommendation system.

The invention specifically adopts the following technical scheme for realizing the purpose:

a method for correcting mainstream deviation of a recommendation system comprises the following steps:

step S1, data collection and processing

Obtaining user information, article information and user article interaction information in a recommendation system, and respectively constructing user co-occurrence vectors

Co-occurrence vector of article

；

Step S2, calculating the mainstream score

According to co-occurrence vector of articles

Calculating the total interaction times of the articles

(ii) a According to user co-occurrence vectors

Calculating the total number of interactions of the user

(ii) a According to the total number of interactions of the article

Total number of interactions of user

Item categories, computing users

Dynamic mainstream degree score of

(ii) a According to the user

Dynamic mainstream level score of

Calculating the average value of the dynamic mainstream degree scores of all the users to obtain the global dynamic mainstream degree score

And forming global dynamic mainstream degree scores of all article categories into a global dynamic mainstream degree vector

；

S3, constructing a dynamic mainstream degree characteristic model

Constructing a dynamic mainstream degree characteristic model based on a three-layer perceptron MLP model, wherein the first two layers of the dynamic mainstream degree characteristic model use a ReLU function as an activation function, and the last layer of the dynamic mainstream degree characteristic model uses a softmax activation function; with user information vectors

And the global dynamic mainstream degree vector output by the step S2

Spliced and used as input of dynamic mainstream degree characteristic model

Outputting a dynamic mainstream characteristic hidden vector by the dynamic mainstream degree characteristic model;

s4, constructing a collaborative filtering module

Constructing a collaborative filtering module comprising an encoder and a decoder;

the encoder is constructed by adopting a three-layer perceptron MLP model; user interaction data

Input encoder, encoder calculating user interaction data

And each dimension of (a) and generate separately

Mean value of

Variance, forming a mean vector of the user

Sum variance vector

Wherein the two vectors are both in t dimension, and constitute the mean vector of the user

Sum variance vector

Wherein the two vectors are both in t dimension, and then h dimension user interaction hidden vectors are generated by random sampling

；

The decoder is constructed by adopting a four-layer perceptron MLP model, the first three layers of activation functions of the decoder are tanh functions, and the last layer of activation functions of the decoder are softmax functions; step S3, outputting the dynamic mainstream characteristic hidden vector and the user interaction hidden vector output by the encoder

As input to a decoder, the decoder output reconstructs user interaction data

And reconstructing dynamic mainstream feature vectors

Reconstructing dynamic mainstream feature vectors

For completing the reconstruction of the decoder;

step S5, recommendation result generation

Inputting user interaction data currently observed by the user to be predicted according to the collaborative filtering module obtained by the training completion in the step S4

The encoder outputs a user interaction hidden vector

(ii) a Then the user interaction is hidden

And inputting the dynamic mainstream characteristic hidden vector output by the step S3 into a decoder, and outputting reconstructed user interaction data by the decoder.

Further, in step S1, any user is constructed according to the user item interaction information

User co-occurrence vector with all items

Constructing any article according to the user article interaction information

Co-occurrence vector with all articles

；

Wherein,

which represents the total number of users,

the total number of items is indicated and,

the items are shown as being in the form of objects,

is shown as

The number of the articles is one,

representing a user

And articles

The interaction situation of (a) is,

representing an article

And the user

The interaction scenario of (2).

Further, in step S2, according to the article

Co-occurrence vector with article

Calculating the total number of article interactions

；

According to the user

Co-occurrence vector with user

And the total number of times of interaction of the user is calculated,

；

according to the total number of interaction times of the articles

Total number of user interactions

Item categories, computing users

Dynamic mainstream level score of

(ii) a In the case of insensitive number of interactions, for the category

Article of, user

Dynamic mainstream level score of

Calculating according to the formula (1); in the case of sensitive number of interactions, for the category

Article of, user

Dynamic mainstream degree score of

Calculating according to the formula (2);

（1）

（2）

according to the user

Dynamic mainstream degree score of

The calculation formula is as follows:

then, the global dynamic mainstream degree scores of all the article categories are divided

Make up a dimension of

Global dynamic mainstream level vector of

Expressed as:

wherein,

、

all of which represent the time of day,

representing the hyper-parameter (for controlling the logarithmic curve),

representing an article

Belong to the category

，

Which represents the total number of users,

a set of all the users is represented,

representing the total number of categories of items.

Furthermore, a co-occurrence vector weighting process is also performed, and the weighting process is described as: user will be

To belong to the category

Article of (2)

Of the interaction situation

Multiplication by the user

In the category of

Dynamic mainstream degree score of

；

For user co-occurrence vectors

Each item in the list is weighted, and after the weighting is finished, the co-occurrence vector of the whole user is obtained

Normalizing by using a softmax function to obtain user interaction data for inputting into the collaborative filtering module

：

Wherein any of the user interaction data

Has a value range of [0,1 ]]。

Further, in step S3, the user information vector

Expressed as:

wherein,

representing a user

The age information of the person to be treated is quantified,

representing a user

The binary gender information of (1);

input of dynamic mainstream degree characteristic model

Expressed as:

；

wherein,

representing a vector stitching operation.

Further, in step S4, the loss function of the filter module is cooperated

Dividing into reconstructed target losses

Distribution approximation loss

And dynamic mainstream eigenvector approximation loss

Three parts, loss function

The calculation formula of (2) is as follows:

reconstructing object losses

The calculation formula of (2) is as follows:

distribution approximation loss

The calculation formula of (2) is as follows:

dynamic mainstream eigenvector approximation loss

Is calculated byThe formula is as follows:

wherein,

a hidden vector representing the user interaction is shown,

representing the data of the user interaction(s),

representing the posterior distribution of each user data sample,

representing variation distribution

And posterior distribution

In the approximation that the difference between the first and second values,

to represent

To pair

In the expectation that the position of the target is not changed,

in order to be a hyper-parameter,

is a user interaction implicit vector

KL represents the KL divergence,

a distribution is represented a priori, which is,

which represents a vector of the variance (m) of the signal,

which represents the square of the mean vector and,

in order to be a hyper-parameter,

representing the original dynamic mainstream feature vector,

representing the reconstructed dynamic mainstream feature vector.

A system for correcting deviations in a mainstream of a recommendation system, comprising:

the data collection and processing module is used for acquiring the user information, the article information and the user article interaction information in the recommendation system and respectively constructing the user co-occurrence vectors

Co-occurrence vector of article

；

A mainstream score calculation module for calculating a mainstream score based on the co-occurrence vector of the article

Calculating the total interaction times of the articles

(ii) a According to user co-occurrence vectors

Calculating the total number of interactions

(ii) a According to the total number of interaction times of the articles

Total number of user interactions

Item categories, computing users

Dynamic mainstream level score of

(ii) a According to the user

Dynamic mainstream degree score of

And forming the global dynamic mainstream degree scores of all the article categories into a global dynamic mainstream degree vector

；

The dynamic mainstream degree characteristic model building module is used for building a dynamic mainstream degree characteristic model based on a three-layer perceptron MLP model, the ReLU function is used as an activation function in the first two layers of the dynamic mainstream degree characteristic model, and the softmax activation function is used in the last layer of the dynamic mainstream degree characteristic model; with user information vectors

And the global dynamic mainstream degree vector output by the mainstream fraction calculation module

Spliced as input of dynamic mainstream degree characteristic model

the collaborative filtering module construction module is used for constructing a collaborative filtering module comprising an encoder and a decoder;

Input encoder, encoder calculating user interaction data

And each dimension of (1), and separately generate

Mean value of

Variance, forming a mean vector of users

Sum variance vector

；

The decoder is constructed by adopting a four-layer perceptron MLP model, the first three layers of activation functions of the decoder are tanh functions, and the last layer of activation function of the decoder is softmax functions; dynamic mainstream feature hidden vectors output by a dynamic mainstream degree feature model building module and user interaction hidden vectors output by an encoder

As input to the decoder, the decoder outputs reconstructed user interaction data

And reconstructing dynamic mainstream feature vectors

Reconstructing dynamic mainstream feature vectors

For completing the reconstruction of the decoder;

a recommendation result generation module for inputting the user interaction data currently observed by the user to be predicted according to the collaborative filtering module obtained by the collaborative filtering module construction module after training

The encoder outputs a user interaction hidden vector

(ii) a Then the user interaction is hidden

And inputting the dynamic mainstream characteristic hidden vector output by the dynamic mainstream degree characteristic model building module into a decoder, and outputting reconstructed user interaction data by the decoder.

A computer device comprising a memory and a processor, the memory storing a computer program which, when executed by the processor, causes the processor to carry out the steps of the above method.

A computer-readable storage medium, storing a computer program which, when executed by a processor, causes the processor to carry out the steps of the above-mentioned method.

The invention has the following beneficial effects:

1. in the invention, in the process of dynamically correcting the mainstream deviation, the mainstream score of the user is calculated and weighted, and then the weighted user interaction data is used as the training data of model training.

2. In the invention, a collaborative filtering module based on an asymmetric variational self-encoder is constructed, and the capability of capturing and utilizing dynamic mainstream characteristics by a model is enhanced through asymmetric structural design and introduction of dynamic mainstream characteristic vectors.

3. In the invention, two scenes of sensitive interaction times and insensitive interaction times are fully considered, a method for dynamically calculating the mainstream degree score of the user is provided, the mainstream degree score can be used as input data of a variational self-encoder through a weighting normalization process, the influence of mainstream deviation is fully considered, the overall recommendation accuracy of a recommendation system is improved, and the recommendation fairness of the recommendation system is higher.

Drawings

FIG. 1 is a schematic flow diagram of the present invention;

FIG. 2 is a detailed flow diagram of the present invention;

FIG. 3 is a schematic diagram of the structure of the collaborative filtering module according to the present invention.

Detailed Description

Example 1

The embodiment provides a method for correcting a mainstream deviation of a recommendation system, a flow diagram of which is shown in fig. 1, and the method comprises the steps of data collection and processing, mainstream score calculation, dynamic mainstream degree feature model construction, collaborative filtering module construction and recommendation result generation. The detailed flow diagram of the method is shown in fig. 2, and specifically includes:

step S1, data collection and processing

Co-occurrence vector of article

。

Constructing any user according to user article interaction information

User co-occurrence vectors with all items

，

Representing a user

And articles

The interaction scenario of (2). In the scene with insensitive interaction times (only paying attention to whether there is interaction, but not paying attention to the interaction times, such as movie recommendation, book recommendation and the like), if explicit interaction is generated

Otherwise

(ii) a In the case of sensitive interaction times (concerning whether there is an interaction, and also concerning the number of interactions, such as music recommendation, short video recommendation, etc.), if an explicit interaction is generated

Wherein

Representing a user

And articles

The number of interactions; otherwise

。

Constructing any article according to user article interaction information

Co-occurrence vector with all articles

，

Representing an article

And the user

The interaction scenario of (2). In the scene with sensitive interaction times (only paying attention to whether there is interaction, but not paying attention to the interaction times, such as movie recommendation, book recommendation and the like), if explicit interaction is generated

Otherwise, otherwise

. In case of sensitive interaction times (concerning whether there is any interaction, and also concerning the number of interactions, such as music recommendations, short video recommendations, etc.), if an explicit interaction is generated

In which

Representing an article

And the user

The number of interactions; otherwise

。

Wherein,

which represents the total number of users,

the total number of items is indicated and,

to represent an item of material that is,

denotes the first

The number of the articles is increased, and the articles,

representing the user.

The data cleaning is mainly to filter out part of users and articles according to a threshold (for example, to filter out articles with interaction times smaller than a certain threshold), and the purpose of the data cleaning is to remove abnormal data to ensure normal operation of a recommendation process.

Step S2, calculating the mainstream score

The step mainly quantizes the user and the global mainstream degree, so that a dynamic mainstream degree feature vector can be generated in the step S3 conveniently. Since the concept of mainstream level involves both individual users and overall users, the mainstream level scores of individual users and global can be calculated separately here.

According to the article

Co-occurrence vector with article

The total number of interactions of the article can be calculated

. Since the degree of mainstream is a dynamically changing concept, the number of item interactions can be filtered based on time, e.g. only considering the occurrence

Interaction in the time period only takes the release date into consideration

Previous item with total number of item interactions of

Number of interactions of any article therein

Only data within this time period is considered.

According to the user

Co-occurrence vector with user

The total number of interactions of the user can be calculated

. Since the degree of mainstream is a dynamically changing concept, the number of user interactions can be filtered according to time, e.g. only considering the occurrence

And (3) the total interaction times of the user in the time period are as follows:

wherein is arbitrary

Only the interactive data in the time period is considered, and the release date is

The subsequent item interaction data are all set to 0.

According to the idea of collaborative filtering, the degree of user's mainstream depends on whether their interactive items are interacted with by other users. Meanwhile, the method considers that the preference degrees of users to different categories of articles are inconsistent, so the influence of the article category factor is also considered in the calculation process of the mainstream degree score of the user. According to the total number of interaction times of the articles

Total number of interactions of user

Item category, computing user

Dynamic mainstream degree score of

. In the case of insensitive number of interactions, for the category

Article of, user

Dynamic mainstream degree score of

Article of, user

Dynamic mainstream degree score of

Calculating according to the formula (2);

（1）

（2）

wherein,

will be provided with

Within a time period, the user

All items that have interacted add up to the number of interactions of all users. The phenomenon of power law distribution (i.e. small part) due to the interactive recording of real world objectsItems occupy most of the interactions, and most items have little), so the total number of interactions with other users for each item using a logarithmic function

Inhibition, base number

Is a hyper-parameter. Dynamic mainstream score of user in case of insensitive interaction times

Each interaction record is given the same weight. Dynamic mainstream score of user in case of sensitive interaction times

Each interaction record is given a different weight,

representing a user

And articles

This means that the greater the number of user interactions, the greater the weight of the item in the user mainstream level score evaluation.

According to the user

Dynamic mainstream level score of

For categories of

Article of (1), global dynamic mainstream level score

Is the average of the dynamic mainstream level scores of all users, so the global dynamic masterFractional degree of flow

The calculation formula of (c) is:

Make up a dimension of

Global dynamic mainstream level vector of

Expressed as:

wherein,

、

both of which represent the time that it takes,

representing the hyper-parameter (for controlling the logarithmic curve),

representing an article

Belong to the category

，

Which represents the total number of users,

a set of all the users is represented,

representing the total number of categories of items.

And then weighting the co-occurrence vectors of the users according to the obtained mainstream degree scores of the users, namely introducing the mainstream information of the users into the co-occurrence vectors. The weighting process is described as: user will be connected

To belong to the category

Article of

Of the interaction situation

Multiplication by the user

In the category of

Dynamic mainstream degree score of

；

For user co-occurrence vectors

Each item in the list is weighted, and the co-occurrence vector of the whole user is obtained after the weighting is finished

Normalizing by using softmax function to obtain user interaction data for inputting into the collaborative filtering module

：

Wherein any one of the user interaction data

Has a value range of [0,1 ]]。

User interaction data derived therefrom

As well as the input of the encoder followed by the collaborative filtering module.

S3, constructing a dynamic mainstream degree characteristic model

Can be calculated according to the step (2) in

Time slot arbitrary user

With respect to any category of item collections

Main stream degree score of

. For any user

Calculating its mainstream level scores for all the categories of the item set, these mainstream level scores may constitute one

Vector of dimensions, denoted as

The vector may characterize the user

The mainstream degree of all the item categories, the larger the score of the mainstream degree of an item category is, the more the user prefers popular items in the category, and the smaller the score is, the more the user prefers popular items in the category.

In order to enable the collaborative filtering model in step S4 to make full use of the user information and the global information, a dynamic mainstream feature expression based on the user information and the global information needs to be obtained. A dynamic mainstream level feature model is thus constructed here. And constructing a dynamic mainstream degree feature model based on the three-layer perceptron MLP model, wherein the dynamic mainstream degree feature model is used for extracting key features in user information and global information and reducing the original input into a hidden vector with a lower dimensionality. The ReLU function is used as an activation function in the first two layers of the dynamic mainstream degree characteristic model, and the softmax activation function is used in the last layer of the dynamic mainstream degree characteristic model. With user information vectors

And the global dynamic mainstream degree vector output by the step S2

Spliced as input of dynamic mainstream degree characteristic model

。

The user information vector

Expressed as:

wherein,

representing a user

The age information of the person to be treated is quantified,

representing a user

The binary gender information of (1);

inputting the dynamic mainstream degree characteristic model

Expressed as:

；

wherein,

representing vector stitching operations

Input the method

And outputting the dynamic mainstream characteristic hidden vector after dimensionality reduction of the three-layer perceptron of the dynamic mainstream degree characteristic model.

S4, constructing a collaborative filtering module

An asymmetric variational self-encoder (VAE) is used as a main structure of the collaborative filtering model, and the structure is shown in fig. 3. The method adopts an asymmetric variational self-encoder to carry out collaborative filtering, and aims to add extra dynamic mainstream degree information into a hidden layer between an encoder and a decoder, so that the decoder can directly utilize personal information of a user and global mainstream degree during decoding.

The collaborative filtering module comprises an encoder and a decoder;

the encoder is constructed by adopting a three-layer perceptron MLP model. User interaction data

Input encoder, encoder calculating user interaction data

And each dimension of (1), and separately generate

Mean value of

Variance, forming a mean vector of the user

Sum variance vector

Wherein the two vectors are both in t dimension, and then h dimension user interaction hidden vector is generated by random sampling

，

. Since the network cannot perform back propagation due to the random sampling method, the re-parameterization method is adopted to complete the sampling process.

The decoder is constructed by adopting a four-layer perceptron MLP model, the first three layers of activation functions of the decoder are tanh functions, and the last layer of activation functions of the decoder are softmax functions and are used for generating probability distribution

. The dynamic mainstream characteristic hidden vector output by the step S3 and the user interaction hidden vector output by the encoder

And reconstructing dynamic mainstream feature vectors

Reconstructing dynamic mainstream feature vectors

For completing the reconstruction of the decoder.

The variational autoencoder reasoning process is as follows, assuming the user

Corresponding user interaction hidden vector

Compliance

Normal distribution of (c). Assuming user interaction data input to the encoder according to the recommendation system interaction data characteristics

Obey probability of

The likelihood function of the polynomial distribution of (1) is as follows:

wherein,

representing hidden vectors interacted by user

Is determined and is

Probability of individual item interaction

In order to enable the network to learn the parameters by back-propagation, a posteriori distributions for each user data sample must be found

Since this posterior distribution is not easy to find, the variation distribution is used here by means of variation estimation

To approximate

. Suppose that

Satisfy a Gaussian distribution

In which

Is a variance vector

Diagonal covariance matrix of. Then the optimization goal of the network at this time is to optimize the parameter generation mean vector

Sum variance vector

Make variation distribute

And posterior distribution

As similar as possible.

Wherein the reparameterization method operates as follows, assuming noise

Obey normal distribution

User interaction implicit vector

May be represented by a variance vector

Mean vector of

And noise are linearly combined, so that the network can learn. The reparameterization formula is as follows:

wherein,

is the standard deviation;

unlike the standard variational self-encoder network, an asymmetric structure is used to obtain the user interaction implicit vector

Thereafter, the input generated in step S3 is inputted

Is spliced at

And then fed into the decoder. The generating part of the decoder may be divided into reconstructing user interaction data

And reconstructing dynamic mainstream feature vectors

。

In conclusion, loss function of collaborative filtering model based on asymmetric variational self-encoder

Dividing into reconstructed target losses

Distribution approximation loss

And dynamic mainstream eigenvector approximation loss

Three parts, loss function

The calculation formula of (2) is as follows:

reconstructing object loss

The purpose of this is to make the reconstructed user interaction data output by the decoder as identical as possible to the user interaction data input to the encoder, the calculation formula being:

distribution approximation loss

Is to make the variation distributed

As close to a posterior distribution as possible

For measuring the approximation degree of two distributions, the calculation formula is:

in order to convert the original dynamic mainstream feature vector

And reconstructing dynamic mainstream feature vectors

With its negative number as a loss term, with the aim of enabling the reconstruction of dynamic mainstream feature vectors

In the direction of original dynamic main flow feature vector

Approximate losses by dynamic mainstream feature vectors as close as possible

The method can enable a decoder to complete the reconstruction process by using the dynamic mainstream characteristics as much as possible. Dynamic mainstream feature vectorApproximate loss

The calculation formula of (c) is:

wherein,

a hidden vector representing the user interaction is shown,

representing the data of the user interaction(s),

representing the posterior distribution of each user data sample,

representing variation distribution

And posterior distribution

In the approximation that the difference between the first and second values,

to represent

To pair

In the expectation of the above-mentioned method,

is a hyper-parameter, is used for controlling the punishment of the distribution similarity degree to the whole objective function,

is a user interaction implicit vector

Of (2)The degree, KL, indicates the KL divergence,

which represents a distribution a priori, and,

the variance vector is represented by a vector of variances,

which represents the square of the mean vector and,

in order to be a super-parameter,

representing the original dynamic mainstream feature vector,

representing the reconstructed dynamic mainstream feature vector.

Step S5, recommendation result generation

Inputting the user to be predicted according to the collaborative filtering module obtained by the training completion of the step S4

Currently observed user interaction data

Encoder output mean vector

Sum variance vector

Then is represented by the formula

Calculating to obtain a user interaction hidden vector

(ii) a Then the user interaction is hidden

And step S3 outputThe decoder outputs reconstructed user interaction data, the dimensionality of the decoder is n-dimensional and is the same as the quantity of all articles, and the value of each dimensionality is 0,1]In between.

Aiming at the obtained reconstructed user interaction data, firstly, eliminating articles which do not meet the time requirement, and only considering

And the previous time period, then any release time is

And setting the value of the dimension where the subsequent article serial number is positioned as 0. Secondly, removing the objects which have appeared in the historical data and the currently observed interaction situation

And recording the dimension serial numbers with the values not being 0, and setting the dimensions of the reconstructed user interaction data as 0. And finally, sequencing the reconstructed user interaction data from large to small, wherein the dimension serial number of top-N is the user to be predicted

The top N item recommendation lists.

Example 2

The embodiment provides a system for correcting mainstream deviation of a recommendation system, which comprises a data collection and processing module, a mainstream score calculation module, a dynamic mainstream degree feature model construction module, a collaborative filtering module construction module and a recommendation result generation module, wherein the specific content of each module is as follows:

Co-occurrence vector of article

。

Constructing any user according to user article interaction information

User co-occurrence vectors with all items

，

Representing a user

And articles

The interaction scenario of (2). In the scene with insensitive interaction times (only paying attention to whether there is excessive interaction, but not paying attention to the interaction times, such as movie recommendation, book recommendation and the like), if the explicit interaction is generated

Otherwise, otherwise

Wherein

Representing a user

And articles

The number of interactions; otherwise

。

Constructing any article according to user article interaction information

Co-occurrence vector with all articles

，

Representing an article

And the user

Otherwise

In which

Representing an article

And the user

The number of interactions; otherwise

。

Wherein,

which represents the total number of users,

the total number of items is indicated and,

the items are shown as being in the form of objects,

is shown as

The number of the articles is increased, and the articles,

representing the user.

Data cleaning is mainly to filter out part of users and articles according to a threshold (for example, to filter out articles with interaction times smaller than a certain threshold), and the purpose is to clear abnormal data to ensure normal operation of a recommendation process.

And the mainstream score calculating module is mainly used for quantizing the user and the global mainstream degree and facilitating the generation of the dynamic mainstream degree characteristic vector in the dynamic mainstream degree characteristic model building module. Since the concept of mainstream level involves both individual users and overall users, the mainstream level scores of individual users and global can be calculated separately here.

According to the article

Co-occurrence vector with article

The total number of interactions of the article can be calculated

. Since the mainstream degree is a dynamically changing concept, the article interaction times can be filtered according to time, such as only considering occurrence

Interaction in the time period only takes the release date into consideration

Previous article with total number of article interactions of

Number of interactions of any article therein

Only data within this time period is considered.

According to the user

Co-occurrence vector with user

The total number of interactions of the user can be calculated

. Since the degree of main flow is a dynamically changing concept, the number of user interactions can be filtered according to time, e.g. only considering the occurrence

wherein is arbitrary

Only the interactive data in the time period are considered, and the release date is

The subsequent item interaction data are all set to 0.

According to the idea of collaborative filtering, the degree of user's mainstream depends on whether their interactive items are interacted with by other users. Meanwhile, the method considers that the preference degrees of users to different categories of articles are inconsistent, so the influence of the article category factor is also considered in the calculation process of the mainstream degree score of the user. According to the total number of interactions of the article

Total number of interactions of user

Item category, computing user

Dynamic mainstream degree score of

. In the case of insensitive number of interactions, for the category

Article of, user

Dynamic mainstream degree score of

Article of, user

Dynamic mainstream level score of

Calculating according to the formula (2);

（1）

（2）

wherein,

will be provided with

During the period of time, the user can select the time period,user' s

All items that have interacted add up to the number of interactions of all users. Because the interaction records of real-world objects have the phenomenon of power law distribution (namely a small part of the objects occupy most of the interactions, and most of the objects have no interactions), the total number of interactions between each object and other users is determined by using a logarithmic function

Inhibition, base number

Each interaction record is given a different weight,

representing a user

And articles

This means that the greater the number of user interactions, the greater the weight in the evaluation of the degree of item occupancy in the score of the degree of mainstream of the user.

According to the user

Dynamic mainstream degree score of

For the category of

Global dynamic mainstream level score

Is the average of the dynamic mainstream level scores of all users, so the global dynamic mainstream level score

The calculation formula of (c) is:

Make up a dimension of

Global dynamic mainstream level vector of

Expressed as:

wherein,

、

both of which represent the time that it takes,

representing the hyper-parameter (for controlling the logarithmic curve),

representing an article

Belong to the category

，

Which represents the total number of users,

a set of all the users is represented,

representing the total number of item categories.

And then weighting the co-occurrence vectors of the users according to the obtained mainstream degree scores of the users, namely introducing the mainstream information of the users into the co-occurrence vectors. The weighting process is described as: user will be

To belong to the category

Article of

Of the interaction situation

Multiplication by the user

In the category of

Dynamic mainstream degree score of

；

For user co-occurrence vectors

Normalization using softmax function, resulting in input collaborative filteringUser interaction data for modules

：

Wherein any of the user interaction data

Has a value range of [0,1 ]]。

User interaction data obtained here

And the dynamic mainstream degree characteristic model building module is used for building the dynamic mainstream degree characteristic model.

Can be calculated according to the mainstream score calculation module when

Time slot arbitrary user

With respect to any category of item collections

Score of degree of mainstream

. For any user

Calculating its mainstream level scores for the set of all categories of items, which may constitute one mainstream level score

Vector of dimensions, denoted as

The vector may characterize the user

In order to enable the collaborative filtering model of the collaborative filtering module building module to fully utilize the user information and the global information, a dynamic mainstream feature expression based on the user information and the global information needs to be obtained. A dynamic mainstream level feature model is thus constructed here. And constructing a dynamic mainstream degree feature model based on the three-layer perceptron MLP model, wherein the dynamic mainstream degree feature model is used for extracting key features in user information and global information and reducing the original input into a hidden vector with a lower dimensionality. The first two layers of the dynamic mainstream degree characteristic model use a ReLU function as an activation function, and the last layer of the dynamic mainstream degree characteristic model uses a softmax activation function. With user information vectors

Spliced and used as input of dynamic mainstream degree characteristic model

。

The user information vector

Expressed as:

wherein,

representing a user

The age information of the patient is quantified by the age-information-measuring device,

representing a user

The binary gender information of (1);

inputting the dynamic mainstream degree characteristic model

Expressed as:

；

wherein,

representing vector stitching operations

Input device

The collaborative filtering module construction module is used for constructing a collaborative filtering module, and adopts an asymmetric variational self-encoder (VAE) as a main structure of a collaborative filtering model, and the structure of the collaborative filtering module is shown in FIG. 3. The method adopts an asymmetric variational self-encoder to carry out collaborative filtering, and aims to add extra dynamic mainstream degree information into a hidden layer between an encoder and a decoder, so that the decoder can directly utilize personal information of a user and global mainstream degree during decoding.

The collaborative filtering module comprises an encoder and a decoder;

Input encoder, encoder calculating user interaction data

And each dimension of (1), and separately generate

Mean value of

Variance, forming a mean vector of users

Sum variance vector

，

. Dynamic mainstream characteristic hidden vectors output by the dynamic mainstream degree characteristic model building module and user interaction hidden vectors output by the encoder

As input to a decoder, the decoder output reconstructs user interaction data

And reconstructing dynamic mainstream feature vectors

Reconstructing dynamic mainstream feature vectors

For completing the reconstruction of the decoder.

Variational autocoder inference procedures are as follows, assuming a user

Corresponding user interaction hidden vector

Compliance

Is normally distributed. Assuming user interaction data input to the encoder according to the recommendation system interaction data characteristics

Obey probability of

The likelihood function of the polynomial distribution of (1) is as follows:

wherein,

representing hidden vectors interacted by user

Is determined and is

Probability of individual item interaction;

To approximate

. Suppose that

Satisfy the Gaussian distribution

In which

Is a variance vector

Sum variance vector

Make variation distribution

And posterior distribution

As similar as possible.

Wherein the reparameterization method operates as follows, assuming noise

Obey normal distribution

Hidden vector of user interaction

Can be represented by a variance vector

Mean vector of

Linearly combined with noise to obtainThereby enabling the network to learn. The reparameterization formula is as follows:

wherein,

is the standard deviation;

Then, the input generated in the dynamic mainstream degree characteristic model building module is input

Is spliced at

And reconstructing dynamic mainstream feature vectors

。

Dividing into reconstructed target losses

Distribution approximation loss

And dynamic mainstream eigenvector approximation loss

Three parts, loss function

The calculation formula of (c) is:

reconstructing object loss

distribution approximation loss

Is to make the variation distributed

As close to a posterior distribution as possible

For measuring the approximation degree of the two distributions, the calculation formula is:

to convert the original dynamic mainstream feature vector

And reconstructing dynamic mainstream feature vectors

In the direction of and withOriginal dynamic mainstream feature vector

Approximate losses by dynamic mainstream feature vectors as close as possible

The method can enable a decoder to complete the reconstruction process by using the dynamic mainstream characteristics as much as possible. Dynamic mainstream eigenvector approximation loss

The calculation formula of (2) is as follows:

wherein,

a hidden vector representing the user interaction is shown,

representing the data of the user interaction(s),

representing the posterior distribution of each user data sample,

representing variation distribution

And posterior distribution

In the approximation that the difference between the first and second values,

represent

To pair

In the expectation of the above-mentioned method,

is a user interaction implicit vector

KL represents the KL divergence,

which represents a distribution a priori, and,

which represents a vector of the variance (m) of the signal,

which represents the square of the mean vector and,

in order to be a hyper-parameter,

representing the original dynamic mainstream feature vector(s),

representing the reconstructed dynamic mainstream feature vector.

A recommendation result generation module for inputting the user to be predicted according to the collaborative filtering module obtained by the collaborative filtering module construction module after training

Currently observed user interaction data

Encoder output mean vector

Sum variance vector

Then by the formula

Calculating to obtain a user interaction hidden vector

(ii) a Then the user interaction is hidden

Inputting the dynamic mainstream characteristic hidden vector output by the dynamic mainstream degree characteristic model building module into a decoder, outputting the reconstructed user interaction data by the decoder, wherein the dimensionality of the user interaction data is n-dimensional and is the same as the quantity of all articles, and the value of each dimensionality is 0,1]In between.

And the previous time period, then any release time is

And recording the dimension serial numbers with the values not being 0, and setting the dimensions of the reconstructed user interaction data as 0. And finally, sequencing the reconstructed user interaction data from large to small, wherein the dimension serial number with the large top-N is the user to be predicted

The top N item recommendation lists.

Example 3

The present embodiment provides a computer device, which includes a memory and a processor, wherein the memory stores a computer program, and the computer program, when executed by the processor, causes the processor to execute the steps of the above method for correcting a mainstream deviation of a recommendation system.

The computer device may be a desktop computer, a notebook, a palm computer, a cloud server, or other computing devices. The computer equipment can carry out man-machine interaction with a user through a keyboard, a mouse, a remote controller, a touch panel or voice control equipment and the like.

The memory includes at least one type of readable storage medium including a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or D interface display memory, etc.), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a read-only memory (ROM), an electrically erasable programmable read-only memory (EEPROM), a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, etc. In some embodiments, the storage may be an internal storage unit of the computer device, such as a hard disk or a memory of the computer device. In other embodiments, the memory may also be an external storage device of the computer device, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), or the like provided on the computer device. Of course, the memory may also include both internal and external storage units of the computer device. In this embodiment, the memory is usually used for storing an operating system and various types of application software installed on the computer device, for example, program codes of the method for correcting the mainstream deviation of the recommendation system, and the like. In addition, the memory may also be used to temporarily store various types of data that have been output or are to be output.

The processor may be a Central Processing Unit (CPU), controller, microcontroller, microprocessor, or other data Processing chip in some embodiments. The processor is typically used to control the overall operation of the computer device. In this embodiment, the processor is configured to execute the program code stored in the memory or process data, for example, execute the program code of the method for correcting the deviation of the main stream of the recommendation system.

Example 4

The present embodiment provides a computer-readable storage medium, in which a computer program is stored, and the computer program, when executed by a processor, causes the processor to execute the steps of the above method for correcting a deviation of a mainstream of a recommendation system.

Wherein the computer readable storage medium stores an interface display program executable by at least one processor to cause the at least one processor to perform the steps of the method for correcting a recommended system mainstream deviation as described above.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present application or portions contributing to the prior art may be embodied in the form of a software product, where the computer software product is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk), and includes several instructions for enabling a terminal device (which may be a mobile phone, a computer, a server, or a network device, etc.) to execute the method for correcting mainstream deviations of the recommendation system according to the embodiments of the present application.

Claims

1. A method for correcting deviations in a recommendation system mainstream comprising the steps of:

step S1, data collection and processing

Co-occurrence vector of article

；

Step S2, calculating the mainstream score

According to co-occurrence vector of articles

Calculating the total interaction times of the articles

(ii) a According to user co-occurrence vectors

Calculating the total number of interactions

(ii) a According to the total number of interaction times of the articles

Total number of interactions of user

Item categories, computing users

Dynamic mainstream degree score of

(ii) a According to the user

Dynamic mainstream level score of

；

S3, constructing a dynamic mainstream degree characteristic model

MLP model structure based on three-layer perceptronBuilding a dynamic mainstream degree characteristic model, wherein the first two layers of the dynamic mainstream degree characteristic model use a ReLU function as an activation function, and the last layer of the dynamic mainstream degree characteristic model uses a softmax activation function; with user information vectors

And the global dynamic mainstream degree vector output by the step S2

Spliced as input of dynamic mainstream degree characteristic model

s4, constructing a collaborative filtering module

Input encoder, encoder calculating user interaction data

And each dimension of (a) and generate separately

Mean value of

Variance, forming a mean vector of users

Sum variance vector

Wherein the two vectors are both in t dimension and are randomly sampled to generate h dimensionUser interaction hidden vector of

；

The decoder is constructed by adopting a four-layer perceptron MLP model, the first three layers of activation functions of the decoder are tanh functions, and the last layer of activation functions of the decoder are softmax functions; the dynamic mainstream characteristic hidden vector output by the step S3 and the user interaction hidden vector output by the encoder

As input to a decoder, the decoder output reconstructs user interaction data

And reconstructing dynamic mainstream feature vectors

Reconstructing dynamic mainstream feature vectors

For completing the reconstruction of the decoder;

step S5, recommendation result generation

The encoder outputs a user interaction hidden vector

(ii) a Then the user interaction is hidden

And inputting the dynamic mainstream characteristic hidden vector output in the step S3 into a decoder, and outputting reconstructed user interaction data by the decoder.

2. The method of claim 1, wherein the deviation of the mainstream of the recommendation system is correctedThe method is characterized in that: in step S1, any user is constructed according to the user article interaction information

User co-occurrence vector with all items

Constructing any article according to user article interaction information

Co-occurrence vector with all articles

；

Wherein,

which represents the total number of users,

the total number of the items is represented,

the items are shown as being in the form of objects,

denotes the first

The number of the articles is one,

representing a user

And articles

The interaction situation of (a) is,

representing an article

And the user

The interaction scenario of (2).

3. A method for correcting deviations in a recommendation system mainstream according to claim 1, wherein: in step S2, according to the article

Co-occurrence vector with article

Calculating the total number of article interactions

；

According to the user

Co-occurrence vector with user

And the total number of times of interaction of the user is calculated,

；

according to the total number of interactions of the article

Total number of interactions of user

Item category, computing user

Dynamic mainstream degree score of

(ii) a In the case of insensitive number of interactions, for the category

Article of, user

Dynamic mainstream degree score of

Article of, user

Dynamic mainstream degree score of

Calculating according to the formula (2);

（1）

（2）

according to the user

Dynamic mainstream degree score of

The calculation formula is as follows:

Make up a dimension of

Global dynamic mainstream level vector of

Expressed as:

wherein,

、

all of which represent the time of day,

the representation of the hyper-parameter is,

representing an article

Belong to the category

，

Which represents the total number of users,

a set of all the users is represented,

representing the total number of item categories.

4. A method of correcting deviations in a recommendation system mainstream according to claim 3, wherein: and carrying out co-occurrence vector weighting processing, wherein the weighting processing is described as follows: user will be

To belong to the category

Article of

Of the interaction situation

Multiplication by the user

In the category of

Dynamic mainstream degree score of

；

For user co-occurrence vectors

Normalizing by using a softmax function to obtain the user interaction number for inputting the collaborative filtering moduleAccording to

：

Wherein any of the user interaction data

Has a value range of [0,1 ]]。

5. A method for correcting deviations in a recommendation system mainstream according to claim 1, wherein: in step S3, the user information vector

Expressed as:

wherein,

representing a user

The age information of the person to be treated is quantified,

representing a user

The binary gender information of (1);

input of dynamic mainstream degree characteristic model

Expressed as:

；

wherein,

a vector stitching operation is represented.

6. A method for correcting deviations in a recommendation system mainstream according to claim 1, wherein: in step S4, the loss function of the collaborative filtering module

Divided into reconstructed target losses

Distribution approximation loss

And dynamic mainstream eigenvector approximation loss

Three parts, loss function

The calculation formula of (2) is as follows:

reconstructing object losses

The calculation formula of (2) is as follows:

distribution approximation loss

The calculation formula of (2) is as follows:

dynamic mainstream eigenvector approximation loss

The calculation formula of (2) is as follows:

wherein,

a hidden vector representing the user interaction is shown,

representing the data of the user interaction(s),

representing the posterior distribution of each user data sample,

representing variation distribution

And posterior distribution

In the approximation that the difference between the first and second values,

to represent

To pair

In the expectation that the position of the target is not changed,

in order to be a hyper-parameter,

is a user interaction hidden vector

KL represents the KL divergence,

a distribution is represented a priori, which is,

the variance vector is represented by a vector of variances,

which represents the square of the mean vector and,

in order to be a hyper-parameter,

representing the original dynamic mainstream feature vector,

representing the reconstructed dynamic mainstream feature vector.

7. A system for correcting deviations in a recommendation system mainstream comprising:

a data collecting and processing module for obtaining user information, article information and user article interaction information in the recommendation system and respectively constructing user co-occurrence vectors

Co-occurrence vector of article

；

Calculating the total interaction times of the articles

(ii) a According to user co-occurrence vectors

Calculating the total number of interactions of the user

(ii) a According to the total number of interactions of the article

Total number of interactions of user

Item category, computing user

Dynamic mainstream level score of

(ii) a According to the user

Dynamic mainstream degree score of

；

Spliced as input of dynamic mainstream degree characteristic model

Input encoder, encoder calculating user interaction data

And each dimension of (1), and separately generate

Mean value of

Variance, forming a mean vector of users

Sum variance vector

；

The decoder is constructed by adopting a four-layer perceptron MLP model, the first three layers of activation functions of the decoder are tanh functions, and the last layer of activation functions of the decoder are softmax functions; dynamic mainstream characteristic hidden vectors output by the dynamic mainstream degree characteristic model building module and user interaction hidden vectors output by the encoder

And reconstructing dynamic mainstream feature vectors

Reconstructing dynamic mainstream feature vectors

For completing the reconstruction of the decoder;

The encoder outputs a user interaction hidden vector

(ii) a Then the user interaction is hidden

And dynamic mainstream level characteristicsAnd inputting the dynamic mainstream characteristic hidden vector output by the model building module into a decoder, and outputting reconstructed user interaction data by the decoder.

8. A computer device, characterized by: comprising a memory and a processor, the memory storing a computer program which, when executed by the processor, causes the processor to perform the steps of the method according to any one of claims 1 to 6.

9. A computer-readable storage medium, characterized in that: stored with a computer program which, when executed by a processor, causes the processor to carry out the steps of the method according to any one of claims 1 to 6.