CN110555132A - Noise reduction self-encoder recommendation method based on attention model - Google Patents
Noise reduction self-encoder recommendation method based on attention model Download PDFInfo
- Publication number
- CN110555132A CN110555132A CN201910742757.0A CN201910742757A CN110555132A CN 110555132 A CN110555132 A CN 110555132A CN 201910742757 A CN201910742757 A CN 201910742757A CN 110555132 A CN110555132 A CN 110555132A
- Authority
- CN
- China
- Prior art keywords
- user
- vector
- movie
- encoder
- matrix
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 18
- 239000011159 matrix material Substances 0.000 claims abstract description 32
- 239000013598 vector Substances 0.000 claims description 60
- 238000012545 processing Methods 0.000 claims description 6
- 238000013527 convolutional neural network Methods 0.000 claims description 4
- 239000013589 supplement Substances 0.000 claims description 2
- 238000013135 deep learning Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 4
- 238000012549 training Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 230000007812 deficiency Effects 0.000 description 1
- 201000009240 nasopharyngitis Diseases 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/735—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
a film recommendation method of a noise reduction self-encoder based on an attention model belongs to the technical field of film recommendation. In the existing recommendation algorithm, the self-encoder recommendation model is easy to realize and widely applied due to high operation speed, but when a scoring matrix is sparse, the recommendation accuracy is greatly reduced, and the attention degree of auxiliary information and the attention degree of a user to watching records are not considered. In order to solve the problems, the method combines an attention model and a noise reduction self-encoder, learns the preference of a user by using the attention model, and integrates the preference of the user into the noise reduction self-encoder to jointly iterate and update parameters, so that the complete score of the user is predicted. The method has obvious improvement on the aspect of the accuracy of the prediction scoring.
Description
Technical Field
The method belongs to the technical field of film recommendation, and particularly designs a film recommendation method combining an attention model and a noise reduction self-encoder.
Background
Movies gradually become an indispensable entertainment mode in our daily life, but in the face of a huge movie resource library, how to help users to quickly find favorite movies becomes a main problem faced by a recommendation system.
The traditional movie recommendation method is usually to recommend movies according to the search popularity of the movies, but the types of the movies favored by users are different, so that the satisfaction degree of a recommendation system is low. In recent years, deep learning has enjoyed great success in various fields, such as computer vision, translation, and the like. Deep learning is also widely applied to a recommendation system at present, and because the deep learning can capture complex relation among data, a recommendation algorithm based on the deep learning greatly exceeds a common algorithm in terms of performance.
The existing deep learning recommendation system is widely applied to a recommendation algorithm based on an auto-encoder, but most of algorithms based on the auto-encoder do not consider auxiliary information and different attention degrees of users to watching records, so that the recommendation accuracy is still not ideal. In response to these technical deficiencies, the present invention addresses these problems with an attention model and self-encoder.
disclosure of Invention
The technical problem solved by the invention is as follows: the conventional noise reduction self-encoder has low accuracy, and ignores the influence of the watching records and the auxiliary information of the film of the user, namely the occupation, the work, the gender and the category of the film on the recommendation system.
Aiming at the problems, the invention provides a noise reduction self-encoder recommendation method based on an attention model, which comprises the following steps:
Step 1, obtaining the user, the movie information and the score of the user on the movie from the online public data set.
and 2, processing the movie title information by using a convolutional neural network.
And 3, processing the residual information of the user and the movie and respectively converting the residual information into a user vector and a movie vector.
And 4, selecting the movies with the scores of the first ten users as a user preference matrix according to the scores of the users.
and 5, inputting the user preference matrix into the attention model, and finally calculating to obtain the preference characteristic vector of the user.
And 6, inputting the user score into a noise reduction self-encoder, adding a user auxiliary information vector and a user preference characteristic vector in a hidden layer of the noise reduction self-encoder, and obtaining a complete user score after 100 iterations.
and 7, recommending the movies according to the prediction scores.
According to the invention, the preference characteristics of the user are extracted by using the attention model and are fused into the noise reduction self-encoder, so that the user characteristics are more accurate, and the performance of the recommendation algorithm is greatly improved.
Drawings
FIG. 1 is a diagram of a user-assisted information extraction model
FIG. 2 is a diagram of a movie auxiliary information extraction model
FIG. 3 is a diagram illustrating an overall model of a noise reduction auto-encoder recommendation method based on an attention model according to the present invention
Detailed Description
The invention is described in further detail below with reference to the attached drawing figures:
The invention provides a noise reduction self-encoder recommendation system method based on an attention model, which comprises the following steps:
step 1, obtaining the user, the movie information and the score of the user on the movie from the online public data set. There were 6040 users, 3952 movies, and 1000209 scores, all of which were integers, with scores ranging from 1 to 5. The user, movie information and user rating form are as follows.
User ID | gender | age | occupation | Zip-code |
1 | F | 1 | 10 | 48067 |
2 | M | 56 | 16 | 70072 |
3 | M | 25 | 15 | 55117 |
… | … | … | … | … |
TABLE 1. user information
TABLE 2 movie information
User ID | Movie ID | Rating | Timestamp |
1 | 1193 | 5 | 978300760 |
1 | 661 | 3 | 978302109 |
1 | 914 | 3 | 978301968 |
… | … | … | … |
TABLE 3 Scoring data
and 2, processing the movie title by using the convolutional neural network. Removing irrelevant information such as year in the title, converting the words into 32-dimensional vectors, setting the length of each movie title to be 15 words, and filling up the words by using empty vectors if the length of each movie title is insufficient. And inputting the title matrix into a convolutional neural network, and extracting the title feature vector.
And 3, processing other auxiliary information of the user and the movie, such as the gender, age, occupation and movie category of the user, as shown in the figures 1 and 2. Respectively converting F and M in the gender of the user into two 16-dimensional vectors; since there are seven age categories, the age is shifted to 7 16-dimensional vectors; the number of the working categories is 21, and the working categories are converted into 21 16-dimensional vectors; the categories of the movies are 18 in total, the movies are converted into 18 32-dimensional vectors, in order to ensure that the matrix size of each movie category is the same, the movie category length is set to be 18, and the empty vectors are used for supplement.
Inputting user id, work and age information into a full-connection layer to obtain a 200-dimensional user information vector; and inputting the movie id, the category and the title into the full-connection layer to obtain a 200-dimensional movie information vector.
and 4, selecting ten movies with the highest scores from the movies with the minimum number of 20 movies scored by the user to obtain corresponding movie information vectors to form a 10 x 200 movie matrix preferred by the user.
Step 5, as shown in fig. 3, inputting the movie matrix preferred by the user into the attention network, wherein the formula is as follows:
I′i=ftanh(W'Ii+b') (1)
Ai=fsoftmax(I′iWA) (2)
Iρ=∑AiIi (3)
Wherein I i is the movie vector of the first ten movie vectors scored by the ith user, W ' is the movie weight matrix and is the normal distribution random matrix with the mean value of 0 and the standard deviation of 0.01, b ' is the movie bias matrix, f tanh is the tanh function, I i ' is the movie feature vector of the user, W A is the movie feature weight matrix and is the normal distribution random matrix with the mean value of 0 and the standard deviation of 0.01, f softmax is the softmax function, A i is the attention weight coefficient of the ith movie vector, and I ρ is the final user preference vector
Step 6, inputting the user score into a noise reduction self-encoder, adding a user auxiliary information vector and a user preference characteristic vector in a hidden layer of the noise reduction self-encoder for joint iteration, wherein the formula is as follows:
X′=X+N (4)
h=ftanh(WX'+b) (5)
h'=h+WsS+Iρ (6)
Y=ftanh(Wzh'+bz) (7)
wherein X is a user scoring vector, N is a noise vector, X 'is a noise scoring vector, W is a weighting matrix for scoring by adding noise, the weighting matrix is a normal distribution random matrix with a mean value of 0 and a standard deviation of 0.01, b is a bias matrix for the scoring vector by adding noise, h is a hidden layer vector, S is a user auxiliary information vector, W s is an auxiliary information vector weighting matrix, the weighting matrix is a normal distribution random matrix with a mean value of 0 and a standard deviation of 0.01, h' is a processed hidden layer vector, W z, b z are a processed weighting matrix and a bias matrix respectively, and Y is a final complete scoring vector.
The cost function is:
L=||X-Y||2 (8)
L is a cost function, i.e., the squared difference of the input score and the output score.
And obtaining the complete score of the user through at least 100 iterations. The parameter optimization method comprises Adam and RMSProp, and the parameter selection range is as follows: the learning rate is 0.00001-0.01, the training step length is 100-500, and the hidden layer node is 100-500. And continuously changing parameter setting, observing the square difference between the input score and the output score, and when the square difference is the minimum value, adopting the parameter optimization method of Adam, wherein the learning rate is 0.00001, the training compensation is 300, and the number of hidden nodes is 200.
And 7, recommending the movies according to the prediction scores.
Experimental simulation
The invention uses a data set of MovieLens 1M, which comprises 6040 users, 3952 movies, and nearly 10 ten thousand scores, wherein each person has at least 20 scores. Part of score data is randomly extracted to be used as a training set for training, and finally the accuracy of the method is higher than that of a common recommendation algorithm. The invention combines the attention model and the noise reduction self-encoder, solves the common cold start problem in the recommendation system, and improves the recommendation capability to a certain extent.
Claims (4)
1. The attention model-based noise reduction self-encoder recommendation method is characterized by comprising the following steps of:
step 1, obtaining a user, movie information and a score of the user for a movie from an online public data set;
Step 2, processing the movie title information by using a convolutional neural network;
Step 3, processing the user and movie information and respectively converting the user and movie information into a user vector and a movie vector;
Step 4, selecting the movies with the top ten scores of each user according to the scores of the users as a user preference matrix;
step 5, inputting the user preference matrix into an attention model, and finally calculating to obtain a preference characteristic vector of the user;
step 6, inputting the user score into a noise reduction self-encoder, adding a user auxiliary information vector and a user preference characteristic vector in a hidden layer of the noise reduction self-encoder, and obtaining a complete user score after at least 100 iterations;
And 7, recommending the movies according to the prediction scores.
2. The attention model noise reduction self-encoder recommendation method according to claim 1, wherein step 3 specifically converts F, M in user gender into two 16-dimensional vectors respectively; since there are seven age categories, the age is shifted to 7 16-dimensional vectors; the number of the working categories is 21, and the working categories are converted into 21 16-dimensional vectors; the categories of the movies are 18 in total, the movies are converted into 18 32-dimensional vectors, in order to ensure that the sizes of the movie category matrixes are the same, the movie category length is set to be 18, and insufficient empty vectors are used for supplement;
inputting user id, work and age information into a full-connection layer to obtain a 200-dimensional user information vector; and inputting the movie id, the category and the title into the full-connection layer to obtain a 200-dimensional movie information vector.
3. the attention model denoising auto-encoder recommendation method according to claim 1, wherein step 5 is to input a user preference movie matrix into the attention network so as to obtain a user preference vector, and the formula is as follows:
I′i=ftanh(W'Ii+b') (1)
Ai=fsoftmax(I′iWA) (2)
Iρ=∑AiIi (3)
Wherein I i is the movie vector of the first ten of the ith user score, W ' is the movie weight matrix, b ' is the movie bias matrix, f tanh is the tanh function, I ' i is the user movie feature vector, W A is the movie feature weight matrix, f softmax is the softmax function, A i is the attention weight coefficient of the ith movie vector, and I ρ is the final user preference vector.
4. The attention model denoising autoencoder recommendation method according to claim 1, wherein step 6 is specifically to input the user score into the denoising autoencoder, and add the user auxiliary information vector and the user preference feature vector in the hidden layer of the denoising autoencoder for joint iteration, and the formula is as follows:
X′=X+N (4)
h=ftanh(WX'+b) (5)
h'=h+WsS+Iρ (6)
Y=ftanh(Wzh'+bz) (7)
Wherein X is a user scoring vector, N is a noise vector, X 'is a noise scoring vector, W is a noise scoring weight matrix, b is a noise scoring vector bias matrix, h is a hidden layer vector, S is a user auxiliary information vector, W s is an auxiliary information vector weight matrix, h' is a processed hidden layer vector, W z and b z are respectively a processed weight matrix and a bias matrix, and Y is a final complete scoring vector;
The cost function is:
L=||X-Y||2 (8)
L is a cost function, namely the square difference of the input score and the output score;
And obtaining the complete score of the user through at least 100 iterations.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910742757.0A CN110555132A (en) | 2019-08-13 | 2019-08-13 | Noise reduction self-encoder recommendation method based on attention model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910742757.0A CN110555132A (en) | 2019-08-13 | 2019-08-13 | Noise reduction self-encoder recommendation method based on attention model |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110555132A true CN110555132A (en) | 2019-12-10 |
Family
ID=68737268
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910742757.0A Pending CN110555132A (en) | 2019-08-13 | 2019-08-13 | Noise reduction self-encoder recommendation method based on attention model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110555132A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111127146A (en) * | 2019-12-19 | 2020-05-08 | 江西财经大学 | Information recommendation method and system based on convolutional neural network and noise reduction self-encoder |
CN112784153A (en) * | 2020-12-31 | 2021-05-11 | 山西大学 | Tourist attraction recommendation method integrating attribute feature attention and heterogeneous type information |
CN113190702A (en) * | 2021-05-08 | 2021-07-30 | 北京百度网讯科技有限公司 | Method and apparatus for generating information |
CN113220936A (en) * | 2021-06-04 | 2021-08-06 | 黑龙江广播电视台 | Intelligent video recommendation method and device based on random matrix coding and simplified convolutional network and storage medium |
CN114065039A (en) * | 2021-11-17 | 2022-02-18 | 重庆邮电大学 | Mean value pooling operation-based self-encoder recommendation method and system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013012205A2 (en) * | 2011-07-15 | 2013-01-24 | Park Yong Goo | Method and system for providing a movie recommendation service |
CN109410080A (en) * | 2018-10-16 | 2019-03-01 | 合肥工业大学 | A kind of social image recommended method based on level attention mechanism |
CN110008377A (en) * | 2019-03-27 | 2019-07-12 | 华南理工大学 | A method of film recommendation is carried out using user property |
CN110008409A (en) * | 2019-04-12 | 2019-07-12 | 苏州市职业大学 | Based on the sequence of recommendation method, device and equipment from attention mechanism |
-
2019
- 2019-08-13 CN CN201910742757.0A patent/CN110555132A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013012205A2 (en) * | 2011-07-15 | 2013-01-24 | Park Yong Goo | Method and system for providing a movie recommendation service |
CN109410080A (en) * | 2018-10-16 | 2019-03-01 | 合肥工业大学 | A kind of social image recommended method based on level attention mechanism |
CN110008377A (en) * | 2019-03-27 | 2019-07-12 | 华南理工大学 | A method of film recommendation is carried out using user property |
CN110008409A (en) * | 2019-04-12 | 2019-07-12 | 苏州市职业大学 | Based on the sequence of recommendation method, device and equipment from attention mechanism |
Non-Patent Citations (1)
Title |
---|
XIN DONG等: "A Hybrid Collaborative Filtering Model with Deep Structure for Recommender Systems", 《PROCEEDINGS OF THE THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE》 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111127146A (en) * | 2019-12-19 | 2020-05-08 | 江西财经大学 | Information recommendation method and system based on convolutional neural network and noise reduction self-encoder |
CN111127146B (en) * | 2019-12-19 | 2023-05-26 | 江西财经大学 | Information recommendation method and system based on convolutional neural network and noise reduction self-encoder |
CN112784153A (en) * | 2020-12-31 | 2021-05-11 | 山西大学 | Tourist attraction recommendation method integrating attribute feature attention and heterogeneous type information |
CN112784153B (en) * | 2020-12-31 | 2022-09-20 | 山西大学 | Tourist attraction recommendation method integrating attribute feature attention and heterogeneous type information |
CN113190702A (en) * | 2021-05-08 | 2021-07-30 | 北京百度网讯科技有限公司 | Method and apparatus for generating information |
CN113190702B (en) * | 2021-05-08 | 2023-07-25 | 北京百度网讯科技有限公司 | Method and device for generating information |
CN113220936A (en) * | 2021-06-04 | 2021-08-06 | 黑龙江广播电视台 | Intelligent video recommendation method and device based on random matrix coding and simplified convolutional network and storage medium |
CN113220936B (en) * | 2021-06-04 | 2023-08-15 | 黑龙江广播电视台 | Video intelligent recommendation method, device and storage medium based on random matrix coding and simplified convolutional network |
CN114065039A (en) * | 2021-11-17 | 2022-02-18 | 重庆邮电大学 | Mean value pooling operation-based self-encoder recommendation method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110555132A (en) | Noise reduction self-encoder recommendation method based on attention model | |
CN110598130B (en) | Movie recommendation method integrating heterogeneous information network and deep learning | |
CN104462383B (en) | A kind of film based on a variety of behavior feedbacks of user recommends method | |
CN107862561A (en) | A kind of method and apparatus that user-interest library is established based on picture attribute extraction | |
CN107944035B (en) | Image recommendation method integrating visual features and user scores | |
CN109947987B (en) | Cross collaborative filtering recommendation method | |
CN109902235B (en) | User preference clustering collaborative filtering recommendation algorithm based on bat optimization | |
CN113807422B (en) | Weighted graph convolutional neural network scoring prediction model integrating multi-feature information | |
CN112612948B (en) | Deep reinforcement learning-based recommendation system construction method | |
CN111753044A (en) | Regularization-based language model for removing social bias and application | |
CN109472282B (en) | Depth image hashing method based on few training samples | |
CN114329222B (en) | Film recommendation method integrating attention mechanism and double-end knowledge graph | |
CN112396492A (en) | Conversation recommendation method based on graph attention network and bidirectional long-short term memory network | |
CN111324807A (en) | Collaborative filtering recommendation method based on trust degree | |
CN112380451A (en) | Favorite content recommendation method based on big data | |
CN113742556A (en) | Multi-mode feature alignment method based on global and local alignment | |
CN110674313B (en) | Method for dynamically updating knowledge graph based on user log | |
CN107026755B (en) | A kind of service recommendation method based on sequence study | |
CN115204967A (en) | Recommendation method integrating implicit feedback of long-term and short-term interest representation of user | |
CN113836393B (en) | Cold start recommendation method based on preference self-adaptive meta-learning | |
CN112883268B (en) | Session recommendation method considering user multiple interests and social influence | |
CN111291193A (en) | Application method of knowledge graph in zero-time learning | |
CN108536666A (en) | A kind of short text information extracting method and device | |
CN113449200B (en) | Article recommendation method and device and computer storage medium | |
CN113342994A (en) | Recommendation system based on non-sampling cooperative knowledge graph network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20191210 |