CN108829763B

CN108829763B - Deep neural network-based attribute prediction method for film evaluation website users

Info

Publication number: CN108829763B
Application number: CN201810524543.1A
Authority: CN
Inventors: 屈鸿; 刘永胜; 房展; 舒扬; 杨舰; 邓悟; 季江舟; 张晓敏
Original assignee: University of Electronic Science and Technology of China
Current assignee: University of Electronic Science and Technology of China
Priority date: 2018-05-28
Filing date: 2018-05-28
Publication date: 2021-09-07
Anticipated expiration: 2038-05-28
Also published as: CN108829763A

Abstract

The invention discloses an attribute prediction method for film evaluation website users based on a deep neural network, and belongs to the fields of data mining, machine learning and the like. The method carries out matrix decomposition on sparse user scoring data in the film evaluation website, decomposes the user scoring matrix into a user-feature matrix and a film-feature matrix, partially fills the user vector by utilizing the two matrices, and takes the filled user scoring data as the feature vector of the user. Inputting the feature vector data of a user into a fully-connected deep neural network, and training a network model by combining a multi-task learning technology to obtain a label score of each task; and performing decision analysis according to the scores of the task labels to obtain a final classification result. According to the invention, through a deep neural network technology, automatic attribute prediction is carried out on film evaluation website users, the problem of data sparsity is solved, and the classification accuracy and efficiency are higher.

Description

Deep neural network-based attribute prediction method for film evaluation website users

Technical Field

An attribute prediction method of film evaluation website users based on a deep neural network is used for predicting the preference degree of each attribute-people to films according to film evaluation and belongs to the technical fields of data mining, deep learning, matrix decomposition and the like.

Background

The deep learning is originated from an artificial neural network, and an artificial neural network model can be built by means of the thought of modern neurology and simulating the structure and mechanism of biological cerebral neurons. Deep learning is to establish a plurality of hidden layers between an input layer and an output layer of a network to form a complex neural network, and to optimize and adjust parameters between networks by learning sample data without a label or with a label, so as to solve the classification problem after regression.

The essence of deep learning is that a multi-layer neural network model is built to learn useful features from massive data, and sample data is accurately classified or predicted. Different from shallow learning, deep learning has a deeper network structure, and feature representation of training data in an original space is transformed into a new space with stronger expression capacity through layer-by-layer transformation of data features, so that more accurate classification is realized. Feature learning is an important link in deep learning.

The basic idea is as follows: for a system S with an n-layer network: (S)₁,S₂,...S_n) I represents the input of the network, O represents the output of the network, and the specific network structure is:

if the final objective is to equalize the inputs and outputs, then the parameters of each pair of networks are adjusted in order to bring O as close to I as possible, so that the series-level characteristics S of I are obtained₁,S₂,...S_n. The output of each layer in the network serves as the input to the next layer to achieve a hierarchical representation of the input data.

Matrix decomposition is an important technology in the field of personalized recommendation, and the mathematical theoretical basis of matrix decomposition algorithm is the transformation of rows and columns of a matrix. The idea on which matrix decomposition is based is: each user and each article have own unique characteristics, and a user-characteristic matrix and an article-characteristic matrix can be decomposed from a scoring matrix of the user for the items by using a matrix decomposition technology, so that the preference characteristics of the user and the characteristics of each article can be directly obtained from scoring data, and the dimensionality of the characteristic matrix is also obtained.

For example, in a movie website, each viewing user has their own preferences for movies, which can be intuitively understood as the types that the user likes: inspirational, comedy, action, science fiction, etc. The user-attribute matrix represents how much the user likes these potential factors, and the more the user likes this type of movie, the larger the corresponding value in the matrix. Similarly, each movie can also be described by these characteristic factors, so the characteristic-item matrix represents the weight of these factors in each movie, i.e., the genre of the movie. If the movie is for a certain type of material, the larger the corresponding values in the matrix to which those materials are mapped. Thus, the user-property matrix and the property-item matrix are combined to obtain a user-item matrix, and each corresponding value in the matrix represents a degree of a user's liking for a movie.

However, conventional matrix decomposition techniques do not accomplish the decomposition of the matrix because each user scores only a small fraction of the movies, which results in an extremely sparse scoring matrix. The commonly used method is to calculate the prediction error by using the existing score, then adjust the parameters by using a random gradient descent method or a least square method to minimize the error, and finally obtain the prediction score of the user on the movie.

In the traditional algorithms such as classification regression in machine learning, the class or the value of a single attribute is predicted through the characteristics of input samples. Namely, a single-task learning method for obtaining a learning function under a unified model, in which case the samples in the training set obtain only information of a single task. In the practical research and application process of the user attributes, more than one attribute is needed to be predicted, and a certain correlation exists between the attributes. Many researches have proved that some attribute prediction tasks with relatively large relevance are learned at the same time, certain related information is shared among all tasks in the learning process, and the learned model can achieve a better learning effect. Most of the existing multi-task learning methods are realized by using the joint probability distribution of a plurality of categories, and the prediction of samples with a plurality of label quantities and category quantities has certain limitations.

Disclosure of Invention

The invention aims to provide an attribute prediction method of film evaluation website users based on a deep neural network; the problem of the prior art that the prediction effect is poor due to the limitation on the number of the labels or the number of the categories of the tasks and the data sparsity is solved.

The technical scheme adopted by the invention is as follows:

a method for predicting attributes of film evaluation website users based on a deep neural network is characterized by comprising the following steps:

(1) decomposing the historical scoring data of the user into a user-feature matrix and a movie-feature matrix based on a matrix decomposition technology, and densely filling a user vector based on the two matrixes, wherein the user vector scores each movie for the user;

(2) carrying out fully-connected deep neural network model training based on the filled user vector, a plurality of attribute structural characteristics of the sample user and a multi-task learning technology to obtain a trained deep neural network model;

(3) and inputting the user vector of the sample user to be predicted into the trained deep neural network model for prediction to obtain the score of each task attribute of the predicted user, thereby obtaining the final user attribute category.

Further, the specific steps of the step (1) are as follows:

(11) based on a matrix decomposition technology, a matrix R of the historical scoring data of the user is decomposed into a user implicit preference matrix P and an article implicit feedback matrix Q, and the formula is as follows:

R＝P^TQ

wherein P ∈ R^f×mAnd Q ∈ R^f×nRepresenting the mapping of users and articles in a potential factor space, wherein f is the dimensionality of a potential factor, T represents the transposition of a matrix, m is the number of the users, and n is the number of the movies;

(12) filling each user vector in a fixed quantity according to the user implicit preference matrix P and the article implicit feedback matrix Q, wherein the filling position is the position corresponding to the data with the highest value in the score data predicted by each user vector, and the specific formula of the filling position is as follows:

s.t.||D||₂＝Z

where Z is the number of elements that need to be converted to nonzero values, index_jIndicating the position of the jth user that needs to be filled with non-zero value elements, D being the set of data that needs to be filled in each user vector, p_j,iRepresenting the predicted scores of the j users for the i movies.

Further, the step (2) comprises the following specific steps:

(21) the label of the sample user is represented as a multi-classification label of a plurality of tasks which are combined into a structured vector form, and the label of each task can be represented as the following form:

wherein, N (t) is the category number of the t-th task, wherein, the task is the attribute of the user, if the task t belongs to the category corresponding to the l-th label, the corresponding task t belongs to the category corresponding to the l-th label

Is 1, otherwise is 0;

the final structured label of the sample is:

Y＝[Y₁,Y₂,...Y_T]

in the formula, T represents that T task exists, and Y represents a label of the T task;

(22) dividing a user set consisting of the filled user vectors into a training set and a testing set, and initializing a deep neural network model parameter w;

(23) randomly selecting samples from a training set, and calculating values of each unit of a node output layer and an output layer of a deep neural network model intermediate layer after initialization parameters are calculated, wherein the activation function of the intermediate layer neuron adopts ReLU (ReLU), namely a node output function, and the formula is as follows:

f(x)＝max(0,x)

wherein x is the input to the neuron;

the activation function of the output layer is a softmax function, namely a function of the output layer; for each task t, the formula for the activation value of the kth output neuron is:

in the formula, net_k、net_sIs the net output value of the neuron, and N (t) is the class number of the t-th task. (24) Performing cross entropy on the numerical values of the node output and each unit of the output layer of the middle layer obtained in the step (23) and the final structured label of the sample obtained in the step (21), calculating the total loss by using a cross entropy loss function, stopping training if the total loss value is smaller than a set threshold value or the iteration times are larger than a set maximum iteration time, otherwise, turning to the step (25);

(25) and (4) updating the deep neural network model parameters w in the step (22) by using the SGD, and then turning to the step (23).

Further, the specific steps of the step (3) are as follows:

(31) inputting the user vector of the sample user to be predicted into the trained deep neural network model, and calculating the output of the deep neural network model to obtain the output form [ Y [)₁,Y₂,...Y_T]Wherein the classification form of the task t is

Each element y_i ^(t)'A probability score representing the sample user belonging to the category label;

(32) for each task t obtained in step (31), the classification Y of the task_tThe method is a multi-label classification, and the label category corresponding to the element with the highest score is selected as the classification result of the sample user under the task

The method comprises the following specific steps:

combining a plurality of tasks, wherein the final classification result of the sample is as follows:

in summary, due to the adoption of the technical scheme, the invention has the beneficial effects that:

1. in the invention, because the user only interacts with part of the movies, the scoring data has high sparsity, and the scoring data of the user is partially filled by adopting a matrix decomposition technology, so that the feature vector of the user is more complete, and the risk of overfitting is avoided to a certain extent;

2. according to the invention, the deep neural network learning theory is utilized to learn data from a deeper level, the characteristic data of the user is expressed more essentially, and finally the decision analysis is carried out on the attribute classification of the user, so that the classification effect is improved.

3. In the invention, the multi-task learning is carried out on a plurality of attributes of the user at the same time, the correlation among different attributes is fully utilized, and the classification accuracy is improved.

Drawings

FIG. 1 is a general flow diagram of user attribute prediction in accordance with the present invention;

FIG. 2 is a flow chart of user vector filling in the present invention;

FIG. 3 is a diagram illustrating a parameter sharing mechanism in multi-task learning according to the present invention;

FIG. 4 is a schematic diagram of a structured representation of user attributes in multi-task learning according to the present invention;

FIG. 5 is a diagram of the overall network structure of the user attribute prediction based on deep neural network learning in the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

Referring to fig. 1, a method for predicting attributes of users of a film evaluation website based on a deep neural network first extracts historical rating data of the users from the film evaluation website, wherein the historical rating data can be expressed as the likes or interests of the users in various movies. The method comprises the steps of preprocessing historical scoring data of a user by means of technologies such as matrix decomposition and the like to obtain a high-dimensional user characteristic vector, taking the characteristic vector of the user (the user vector, namely the score of the user on all movies) as the input of a deep neural network, automatically learning the characteristics of the user by combining a plurality of attribute structured representations of the user to obtain the label score of each task of the user, and finally performing classification decision on the attributes of the user. The method comprises the following specific steps:

a method for predicting attributes of film evaluation website users based on a deep neural network comprises the following steps:

(1) decomposing the historical scoring data of the user into a user-feature matrix and a movie-feature matrix based on a matrix decomposition technology, densely filling user vectors based on the two matrices, and then standardizing the matrices, wherein the user vectors are the scores of the user on all movies; referring to fig. 2, in the preprocessing process of the user feature data, the historical scoring data of the user on each movie can reflect the interest degree of the user on the movie, and can be used as the feature vector of the user, so as to predict the attribute information of the user. Because of the limited effort of the user, it is not possible to score all movies in the web site, and a movie that is not scored does not necessarily represent a user's disinterest in the movie. Therefore, it is necessary to predict the possible scoring situation of each user for all movies based on the historical scoring data of the user and the scores of other users.

The method comprises the following specific steps:

R＝P^TQ

s.t.||D||₂＝Z

where Z is the number of elements that need to be converted to nonzero values, index_jIndicating the position of the jth user that needs to be filled with non-zero value elements, D being the set of data that needs to be filled in each user vector, p_j,_iRepresenting the predicted scores of the j users for the i movies.

fig. 3 is a parameter sharing mechanism in multi-task learning, in which all tasks share parameters of hidden layers in a network and output layers related to each task are reserved, as shown in fig. 4-5. The hard sharing mechanism reduces the risk of overfitting to some extent. Therefore, the more tasks are learned at the same time, the more tasks the same representation can be captured by the model, so that the overfitting risk on the original task is smaller. The network can adapt to different attributes of users through the training of a many-to-many task model, meanwhile, the generalization capability of extracting features is improved, and the advantages of the network are greatly exerted;

fig. 4 is a structural representation of attributes of a user in multitasking learning, and shows the results of structural representation of three attributes of the user, namely gender, age and occupation.

Fig. 5 is an overall network structure of user attribute prediction based on deep learning, which includes an input layer, two hidden layers and an output layer. The neurons between every two adjacent layers in the network are all connected, and the neurons of all the layers are not connected. The data after standardization processing is directly used as the characteristic vector of a user, high-dimensional user characteristics are input into a hidden layer in a full connection mode, the input characteristic vector of the user is converted into higher-level abstract characteristics through a multi-layer neural network structure, and finally the higher-level abstract characteristics are input into an output layer to serve as a classification result. The input layer neuron number, namely the feature vector dimension of the user, is the total number of the movies in the website, the two hidden layer neuron numbers are 400 and 40 respectively, and the output layer neuron number is related to the total category number of each attribute of the user.

The method comprises the following specific steps:

Is 1, otherwise is 0;

the final structured label of the sample is:

Y＝[Y₁,Y₂,...Y_T]

f(x)＝max(0,x)

wherein x is the input to the neuron;

(3) Inputting the user vector of the sample user to be predicted into the trained deep neural network model for prediction to obtain the scores of each task attribute of the predicted user, so as to obtain the final user attribute category, and performing decision analysis according to the task scores (namely label scores) to obtain the final user attribute category. The method comprises the following specific steps:

(32) for each task t obtained in step (31), the taskClass Y of affairs_tThe method is a multi-label classification, and the label category corresponding to the element with the highest score is selected as the classification result of the sample user under the task

The method comprises the following specific steps:

the grading data of the user to the movie (namely the feature vector of the user) can be abstracted into the interest and hobby vector of the user, and the implicit hobby degrees of the user to the like of praise and collection of the movie can be converted into numerical values to express the features of the user, so that the method can adapt to different operation objects and various data types. Moreover, a plurality of attributes of the user may have some relationship, for example, the gender of the user is related to the occupation of the user, and the age of the user is also related to the occupation of the user, so that in order to better utilize the correlation between the attributes of the user, the invention introduces a multi-task learning method into the user attribute prediction process, and provides a multi-task learning model for learning a plurality of attributes together. In summary, aiming at the problem of sparsity of user data of the film evaluation website, the matrix decomposition technology is used for filling data to obtain relatively dense user feature vectors, the deep neural network model is used for learning and classifying the features of the users, the basic attributes of the users are predicted, and the accuracy of prediction and classification is improved.

The specific implementation is as follows:

(1) the invention is applied to a Movielens1M data set, each attribute and corresponding value of the Movielens data set are shown in Table 1, the number of users of the data set is 6040, and the number of movies is 3952. Firstly, expressing scores of all movies by users in a data set into a user vector form;

table 1 user attributes and corresponding values in Movielens dataset

(2) Based on a matrix decomposition technology, a scoring matrix of a user is decomposed into a user-characteristic matrix P and a movie-characteristic matrix Q with a hiding factor dimension of 35, and the formula is as follows:

R＝P^TQ

wherein P ∈ R^35×6040，Q∈R^35×3952；

(3) Predicting a scoring value of each user vector missing part according to the user implicit preference matrix P and the article implicit feedback matrix Q, and selecting the first 15 data with higher prediction scores to fill corresponding positions;

(4) the attribute tags of the sample user are represented in a structured form of a combination of a plurality of attribute tags, and the specific form is shown in fig. 4.

(5) Randomly selecting 90% of sample users as a training set and 10% of sample users as a testing set;

(6) inputting training samples into the fully-connected deep neural network models of the two hidden layers to carry out weight training adjustment to obtain a trained deep neural network model;

(7) and (4) testing the samples in the test set by using the network weight values obtained by training in the step (6), wherein the prediction accuracy rates of the three attributes of the gender, the age and the occupation of the user are respectively 80.28%, 50.98% and 19.6%.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims

1. A method for predicting attributes of film evaluation website users based on a deep neural network is characterized by comprising the following steps:

(3) inputting the user vector of a sample user to be predicted into the trained deep neural network model for prediction to obtain the score of each task attribute of the predicted user, so as to obtain the final user attribute category;

the specific steps of the step (1) are as follows:

R＝P^TQ

s.t.||D||₂＝Z

where Z is the number of elements that need to be converted to nonzero values, index_jIndicating the position of the jth user that needs to be filled with non-zero value elements, D being the set of data that needs to be filled in each user vector, p_j,iRepresents the predictedScoring of i movies by j users

The step (2) comprises the following specific steps:

Is 1, otherwise is 0;

the final structured label of the sample is:

Y＝[Y₁,Y₂,...Y_T]

f(x)＝max(0,x)

wherein x is the input to the neuron;

in the formula, net_k、net_sIs the net output value of the neuron, and N (t) is the category number of the t task; (24) performing cross entropy on the numerical values of the node output and each unit of the output layer of the middle layer obtained in the step (23) and the final structured label of the sample obtained in the step (21), calculating the total loss by using a cross entropy loss function, stopping training if the total loss value is smaller than a set threshold value or the iteration times are larger than a set maximum iteration time, otherwise, turning to the step (25);

2. The method for predicting the attributes of the users of the film review website based on the deep neural network as claimed in claim 1, wherein the specific steps of the step (3) are as follows:

Each element

A probability score representing the sample user belonging to the category label;

The method comprises the following specific steps: