CN114025205A

CN114025205A - Intelligent recommendation method for home TV video

Info

Publication number: CN114025205A
Application number: CN202111287347.5A
Authority: CN
Inventors: 赵心炜; 侯永宏; 刘传玉; 王辉
Original assignee: Tianjin University
Current assignee: Tianjin University
Priority date: 2021-11-02
Filing date: 2021-11-02
Publication date: 2022-02-08

Abstract

The invention discloses an intelligent recommendation method for family TV videos, which comprises a long-term interest video recommendation list generation step for all family members, wherein the long-term interest video recommendation list generation step is used for generating a recommendation list of all family user interest contents according to long-term film watching records and through a collaborative filtering algorithm based on user clustering; the method comprises the steps of generating a short-term interest video recommendation list of a current TV user, wherein the short-term interest video recommendation list is used for acquiring interest preference of the current user through a recurrent neural network combined with an Attention mechanism according to a short-term viewing record and generating an interest video recommendation list of the current user; the method comprises an optimal user recommendation list generation step, wherein the content recommendation lists in the steps (1) and (2) are optimized and combined to generate a final user recommendation list. The method and the device solve the problem that the recommendation accuracy is poor when the current recommendation technology is oriented to a plurality of users with large interest difference, better solve the problem of TV end information overload in the Internet television era, and finally improve the use experience of TV end users.

Description

Intelligent recommendation method for home TV video

Technical Field

The invention belongs to the field of automatic recommendation of internet video contents, and particularly relates to video content recommendation facing a family TV user group by combining a clustering-based collaborative filtering algorithm and a recurrent neural network algorithm of a kneading Attention mechanism.

Background

The phenomenon of information overload of internet television content is becoming obvious day by day, and the use of an automatic recommendation algorithm is an effective means for solving the problem of information overload; the video content recommendation scenes aimed at by common recommendation algorithms in the market at present are all use scenes with strong privacy, for example, video content recommendation of a user by a mobile app is performed, and the algorithms only need to grasp the watching interest of a single user through data; compared with the above scenario, the scenario of the user at the television end is more complex, and mainly appears as follows:

(1) one television terminal is usually oriented to a plurality of members in one family, and different members in the same family have the same film watching hobbies and different film watching requirements;

(2) a television terminal user can switch the television normally, the watching content and the watching interest recorded by the terminal change greatly in a short time, and the watching data shows the characteristic of rapid and dynamic change along with the time.

The recommendation algorithm with strong privacy is directly applied to the television end, so that the recommendation can be completed, but the recommendation effect is poor; therefore, the invention aims to better solve the problem of information overload of Internet television content and improve the use experience of family users on the television.

Disclosure of Invention

The invention aims to provide an automatic recommendation algorithm more suitable for a home television end to solve the problem of overload of on-demand content information on a home Internet television, so that the television use experience of home users is improved.

The technical scheme of the invention is as follows:

an intelligent recommendation method for home TV videos comprises the following steps:

(1) long-term interest video recommendation list generation for all members of TV-end family

And (4) a recommendation algorithm combining clustering and collaborative filtering according to the historical playing preference content of the user. In the algorithm, on one hand, a clustering algorithm is utilized to mine the long-term preference rule of the family user to the content; on the other hand, a long-term interest content list of all the family members is generated by utilizing a user-based collaborative filtering algorithm. The specific process is as follows:

the first step is as follows: screening and preprocessing behavior data of home users

The main purpose of the process is to remove invalid user behavior data and to retain valid user behavior data, for example, to remove data with too short a viewing time.

The second step is that: building implicit scoring interest model

The main logic of the implicit rating model of the user is to calculate the rating of the user to different contents by the ratio of the viewing time length of the user to the total viewing time length. The main process comprises three steps:

A. extracting main fields in the user viewing content behavior data: the "content viewing time length" and the "content viewing time length ratio".

B. And obtaining the implicit rating of the user to the content according to the characteristic field of the user behavior.

C. And sorting the final calculation result to provide data for a subsequent algorithm.

The third step: classifying different families by using a clustering algorithm;

clustering users by using a K-Means + + algorithm, wherein the process mainly comprises the following three steps:

A. of all data points, the k most distant points are selected as the points for which the cluster center is initialized.

B. And classifying all the points in the data set, and respectively classifying the points to the cluster with the shortest distance from the clustering center.

C. And (4) calculating the centers of all the clusters again, finishing clustering if the positions of the center points are not deviated, and otherwise, repeating the steps until the positions of the center points are not deviated.

The fourth step: generating recommendation lists using collaborative filtering algorithms

The content recommendation based on the collaborative filtering algorithm of the user is completed by utilizing a Pearson similarity calculation method, and the specific steps are as follows:

A. dividing implicit grading data of users according to each user, and calculating intermediate variables;

B. integrating intermediate calculation results with the same content, calculating intermediate variable products among users if the users belong to the same category, and not calculating if the users do not belong to the same category;

C. product summation is carried out on the intermediate variables with the same user type, and the similarity between the users is calculated;

D. according to the similarity between the users, obtaining other users similar to the users;

E. and generating a content recommendation list according to the implicit grading content of the similar users.

(2) Current TV user interest video recommendation list generation

The method mainly utilizes the advantages of a Recurrent Neural Network (RNN) and a variant LSTM thereof on processing time sequences to build a generation model of an interest video recommendation list of a current TV user; meanwhile, considering that the interests of different members of the family may be greatly different and interfere with the result, the part proposes to add an attention (attention) mechanism on the basis of a neural network structure to weaken interference items generated by the viewing of other members and improve the recommendation accuracy. The specific process is as follows:

The second step is that: converting behavior data into data that can be processed by model

The main purpose of the process is to convert the viewing data into data which can be processed by a model, in the process, one-hot coding is firstly carried out on the content, and as the coded data is a high-dimensional sparse vector, the result is processed by an embedding layer, the high-dimensional sparse vector is converted into a low-dimensional dense vector which is beneficial to deep neural network processing, all the viewing behaviors of a user are mapped into vectors with fixed length, and the subsequent network computing processing is facilitated.

The third step: inputting data to LSTM layer for processing

The main purpose of this process is to obtain the relationship that exists between the user's state at time t and the user's viewing behavior.

The fourth step: inputting data into the Attention layer for processing

Since the watching behaviors of all the family are recorded in the playing history behaviors of the family user, namely, the recorded history behaviors can be regarded as a sequence mixed by a plurality of interest topic sampling points. Therefore, the main purpose of introducing the Attention mechanism in the process is to enhance the content with strong correlation with the planning content in the historical viewing record and weaken the content with weak correlation, thereby further predicting the interest degree of the user in the target content.

The fifth step: inputting data into output layer, generating user short-term interest list

The process mainly comprises the steps of outputting a result feature vector of the attribute to a full link layer, calculating result probability through a sigmoid function, and judging whether a current user is interested in target content through short-term behavior data. And finally, generating a current interest recommendation list of the user.

(3) Optimal user recommendation list generation

In the process, the user interest lists generated in the steps (1) and (2) are combined, and a final user recommendation list with priority is generated according to indexes such as watching frequency, popularity and content scores;

the invention has the beneficial effects that:

the method solves the problem of information overload of the Internet television end, builds the on-demand content recommendation algorithm more suitable for the TV end, and can improve the recommendation accuracy by 10% -15% compared with the traditional recommendation algorithm.

The specific implementation mode is as follows:

the present invention is further illustrated by the following specific examples, which are intended to be illustrative, not limiting and are not intended to limit the scope of the invention.

generating long-term interest video recommendation list of all members of TV-end family

The first step is as follows: the method comprises the following steps of screening and preprocessing behavior data of a family user, wherein the specific processing process in the process comprises the following three steps:

(1) the process faces the case that the data has missing values: the repairable data are supplemented manually; directly deleting unrepairable data; and directly carrying out deduplication processing on the repeated data.

(2) The process is faced with significantly erroneous data: storing the corrected data after manual correction; and directly deleting the data which cannot be corrected.

(3) Abnormal value discrimination and processing: in the first two steps, only obvious dirty data are processed, and some data which cannot be removed and corrected by a simple method may exist in real data; and in the face of the data, removing unreasonable data by adopting a statistical analysis method: if the data meet normal distribution, a confidence probability and a confidence interval can be set to select the data, the data are not in the corresponding interval to indicate that the related data exceed the minimum range allowed by the error, and the data are removed; if the data does not satisfy the normal distribution, the data can be determined by using a box plot, wherein the box plot includes 6 data points, the upper quartile is recorded as Q1, and the lower quartile is recorded as Q2, the quartile distance IQR is Q1-Q2, and if the data is smaller than the values of Q1-1.5IQR or Q2+1.5IQR, the data is considered abnormal, and the data can be rejected.

The second step is that: building an implicit scoring interest model, wherein the main logic of the model is to calculate scores of different contents of a user according to the ratio of the watching time length of the user to the total watching time length, and the main process comprises three steps:

extracting main fields in the content viewing behavior data of the user: the "content viewing time length" and the "content viewing time length ratio".

And obtaining the implicit rating of the user to the content according to the characteristic field of the user behavior.

And thirdly, sorting the final calculation result to provide data for a subsequent algorithm.

The third step: classifying different families by using a K-Means + + clustering algorithm, wherein the specific calculation method is as follows:

(1) firstly, randomly selecting a point in a set as the center of a cluster;

(2) calculating the distance between each point in the set and the center of the cluster, wherein a commonly used method is an Euclidean distance method, and assuming that two points x and y exist in the space, the Euclidean distance formula between the two points is as follows:

(3) selecting a new data point as a new cluster center, wherein the selection idea is as follows: (2) the larger the calculated distance is, the higher the probability value of the point as a new cluster center is;

(4) initializing the center points of all clusters by continuously executing the second step and the third step;

(5) and after the central point of each cluster is obtained, calculating by using a K-Means algorithm respectively. The detailed procedure for K-Means is: randomly selecting k values from a data set N as centers of initialization clustering; calculating the distance from all points in the data set N to the central value of each cluster by using the Euclidean distance method in the step (2); dividing the values in the data set N into the cluster with the central point nearest to the values; after all the data are divided, the central position of the cluster is recalculated according to the data condition of the cluster, and the calculation formula is as follows:

in the above formula, x represents the number of clusters in each class of the cluster center kCoordinates of points, Center_kRepresenting the coordinates of the newly calculated cluster center. If the center position of the new cluster is changed, the distances from all the points to the central values of all the clusters are recalculated, and if the center of the new cluster is kept unchanged, the related clustering result is output.

The fourth step: the content recommendation based on the collaborative filtering algorithm of the user is completed by utilizing a Pearson similarity calculation method, and the specific steps are as follows:

(1) dividing implicit grading data of users according to each user, and calculating an intermediate variable Q_uiThe specific calculation formula is as follows:

wherein u and i represent the user and the content of the user with implicit rating, P_uiRefers to the user u's score for content i,

representing the mean of all content implicit scores by user u.

(2) Integrating intermediate calculation results with the same content, and calculating intermediate variable product Q between users if the users belong to the same category_ui×Q_viIf not, not calculating;

(3) and (3) performing product summation on intermediate variables with the same user type to calculate the similarity between users, wherein a specific calculation formula is as follows:

(4) according to the similarity between the users, obtaining other users similar to the users;

(5) and generating a content recommendation list according to the implicit grading content of the similar users.

Introducing the clustered effect:

TABLE 1 comparison of accuracy rates of two recommendation algorithms

TABLE 2 two recommendation algorithm recall comparisons

As shown in the above table, 300 to 1500 user data are respectively input in the experiment, and the two tables respectively represent the accuracy and the recall rate, and the higher the value is, the better the effect is. The experimental result shows that compared with the traditional collaborative filtering algorithm based on the user, the collaborative filtering algorithm based on the user clustering has more excellent recommendation result.

2. Current TV user interest video recommendation list generation

The same as 1, 2, in the first step, the behavior data of the family user is screened and preprocessed, and the specific processing process in the process is divided into the following three steps:

In the second step, the behavior data is converted into data which can be processed by the model

In the process, one-hot coding is firstly carried out on the content, and as the coded data is a high-dimensional sparse vector, the result is processed by an embedding layer, the high-dimensional sparse vector is converted into a low-dimensional dense vector beneficial to deep neural network processing, all the watching behaviors of a user are mapped into a vector with a fixed length, and the subsequent network computing processing is facilitated.

Inputting data into an LSTM layer for processing, wherein the process mainly aims to obtain the relationship between the state of a user at the time t and the watching behavior of the user, and the specific calculation process and formula are as follows:

in the whole calculation process, after input reaches an LSTM structure, information needing to be abandoned is judged through a sigmoid function, and the calculation formula is as follows:

f_t＝σ(W_f·[h_t-1，x_t]+b_f)

in the above formula, f_tRepresenting forgetting gate, W_fIs a weight value h_t-1Representing the state of the hidden layer at the previous moment, x_tRepresenting the input at the current time, b_fRepresents a bias;

then, the storage of the neuron information at the current moment is determined through two parts of operation, the first part determines the updating content through a sigmoid function, and the calculation formula is as follows:

i_t＝σ(W_i·[h_t-1，x_t]+b_i)

in the above formula, i_tRepresents an input gate, W_iIs a weight value h_t-1Representing the state of the hidden layer at the previous moment, x_tRepresenting the input at the current time, b_iRepresents a bias;

the second part creates candidate vectors through tanh function

The calculation formula is as follows:

in the above formula, the first and second carbon atoms are,

is the current candidate vector value, W_CIs a weight value h_t-1Representing the state of the hidden layer at the previous moment, x_tRepresenting the input at the current time, b_CRepresents a bias;

combining the previous calculation to obtain the neuron state C of the present stage_tThe calculation formula is as follows:

in the above formula, C_tIs the present stage of neuronal state, f_tRepresenting forgetting gate, C_t-1Representing the state of the neuron at the previous moment, i_tWhich represents the input gate or gates, respectively,

the current candidate vector value is obtained;

finally, the current stage neuron state C is determined_tBy the tanh function and o_tMultiply to calculate h_tThe calculation formula is as follows:

o_t＝σ(W_o·[h_-1，x_t]+b_o)

h_t＝o_t·tanh(C_t)

in the above formula, o_tRepresents an output gate, W_oRepresents the weight value, h_t-1Representing the state of the hidden layer at the previous moment, x_tRepresenting the input at the current time, b_oRepresents a bias; h is_tBeing the state of the current hidden layer, C_tThe present stage of the neuronal state.

And fourthly, inputting data into an Attention layer for processing, introducing an Attention mechanism to enhance the content with stronger correlation with the planned content in the historical viewing record and weaken the content with weak correlation, thereby further predicting the interest degree of the user in the target content, wherein the calculation formula is as follows:

in the above formula, α_tRepresenting an Attention score reflecting the correlation between the video content of the target and the user's historical interest status at each moment, the greater the correlation, alpha_tThe higher the value of (A); e.g. of the type_aA vector representing that the name of the content watched by the current user passes through the embedding layer, W is a model parameter,

wherein n is_hRepresents a hidden state h_tDimension of, n_aRepresenting the vector dimension after the currently viewed program embedding. Finally, the vector formula output after the Attention mechanism can be expressed as follows:

h′_t＝h_t×α_t

fifthly, inputting data into an output layer to generate a user short-term interest list

The relevant experimental effects are as follows:

TABLE 3 LSTM model accuracy data

TABLE 4 RNN model accuracy data

TABLE 5 LSTM model recall data

TABLE 6 RNN model recall data

TABLE 7 LSTM model f1 data

TABLE 8 RNN model f1 data

As can be seen from the data in tables 3-8, the LSTM model has better effect than the RNN model, and the accuracy of the LSTM model is 5-10% higher than that of the RNN model in terms of accuracy. At the same time, it is clear that: after the LSTM model and the RNN model are added with the Attention mechanism, the accuracy, the recall rate and the f1 value of the LSTM model and the RNN model are all higher than the model corresponding to the model without the Attention mechanism, wherein the quasi-removal rate is about 5% higher, the recall rate is about 4% higher, and the f1 value is about 5% higher.

The invention is not the best known technology.

The above embodiments are merely illustrative of the technical ideas and features of the present invention, and the purpose thereof is to enable those skilled in the art to understand the contents of the present invention and implement the present invention, and not to limit the protection scope of the present invention. All equivalent changes and modifications made according to the spirit of the present invention should be covered within the protection scope of the present invention.

Claims

1. An intelligent recommendation method for home TV videos is characterized by comprising the following steps:

the method comprises the steps of generating a long-term interest video recommendation list of all family members: generating a recommendation list of all user interest contents of a family through a collaborative filtering algorithm based on user clustering;

the method comprises the steps of generating a video recommendation list of the current TV user interest: acquiring interest preference of a current user through a recurrent neural network combined with an Attention mechanism, and generating an interest video recommendation list of the current user;

the method comprises the following steps of generating an optimal user recommendation list: and (3) optimally combining the content recommendation lists in the steps (1) and (2) to generate a final user recommendation list.

2. The intelligent home TV video recommendation method according to claim 1, wherein: the collaborative filtering algorithm based on user clustering comprises the following steps:

(1) screening and preprocessing the behavior data of the family user;

(2) building an implicit grading interest model;

(3) classifying different families by using a clustering algorithm;

(4) and generating a long-term interest video recommendation list of all the family members by using a collaborative filtering algorithm.

3. The intelligent home TV video recommendation method according to claim 2, wherein: and (3) the implicit rating model in the step (2) is used for calculating the rating of the user to different contents according to the ratio of the watching time length of the user to the total watching time length.

4. The intelligent home TV video recommendation method according to claim 2, wherein: the clustering algorithm in the step (3) is a K-Means + + algorithm, and comprises the following steps:

step one, selecting k points with the farthest distance from all the data points as initialization points of the cluster centers.

Step (ii) ofIIAnd classifying all the points in the data set, and respectively classifying the points to the cluster with the shortest distance from the clustering center.

And step three, calculating the centers of all the clusters again, if the positions of the center points are not deviated, finishing clustering, and otherwise, repeating the steps until the positions of the center points are not deviated.

5. The intelligent home TV video recommendation method according to claim 2, wherein: the collaborative filtering algorithm in the step (4) is a Pearson similarity calculation method, and specifically comprises the following steps:

dividing implicit rating data of users according to each user, and calculating intermediate variables;

step (ii) ofIIIntegrating intermediate calculation results with the same content, calculating the intermediate variable product between users if the users belong to the same category, and not calculating if the users do not belong to the same category;

step (ii) ofIIIProduct summation is carried out on the intermediate variables with the same user type, and the similarity between the users is calculated;

step (ii) ofFourthlyAccording to the similarity between the users, obtaining other users similar to the users;

and step five, generating a content recommendation list according to the implicit grading content of the similar users.

6. The intelligent home TV video recommendation method according to claim 5, wherein: the formula for calculating the intermediate variable in the first step is as follows:

wherein u and i represent the user and the content of the user with implicit rating, Q_uiIs an intermediate variable, P_uiRefers to the user u's score for content i,

representing the mean of all content implicit scores by user u.

7. The intelligent home TV video recommendation method according to claim 5, wherein: the third step is as follows:

wherein u and v represent two users, i represents the user with implicit rating, and Q_uiFor user u intermediate variables, Q_viA user v intermediate variable; p_uiRefers to the user u's score, P, for content i_viScoring content i for user v;

represents the mean of all content implicit scores by user u,

representing the mean of all content implicit scores by user v.

8. The intelligent recommendation method for TV video according to claim 1, wherein: the recurrent neural network combined with the Attention mechanism comprises the following steps:

(1) screening and preprocessing the behavior data of the family user;

(2) converting the behavior data into data which can be processed by a model;

(3) obtaining the relation between the state of the user at the time t and the watching behavior of the user by using an LSTM network;

(4) the Attention network is utilized to weaken the interference items of the viewing records of other members, and the interest degree of the user on the target content is predicted more accurately;

(5) and calculating the result probability by using a sigmoid function, and generating a current TV user interest video recommendation list.

9. The intelligent recommendation method for TV video according to claim 8, wherein: the specific calculation flow and formula of the step (3) are as follows:

f_t＝σ(W_f·[h_t-1，x_t]+b_f)