CN112784171B

CN112784171B - Movie recommendation method based on context typicality

Info

Publication number: CN112784171B
Application number: CN202110083203.1A
Authority: CN
Inventors: 张清华; 张金镇; 艾志华; 李新太
Original assignee: Chongqing University of Post and Telecommunications
Current assignee: Chongqing University of Post and Telecommunications
Priority date: 2021-01-21
Filing date: 2021-01-21
Publication date: 2022-11-04
Anticipated expiration: 2041-01-21
Also published as: CN112784171A

Abstract

The invention belongs to the field of computer science and technology, and particularly relates to a movie recommendation method based on context typicality, which comprises the following steps: acquiring user data, and constructing a context typicality matrix according to the user data and the context typicality of each user to different movie types; searching neighbors in the context typicality matrix, and calculating scores of neighbor users according to context attribute values of the neighbors; calculating the importance of the context attribute to each movie type by adopting knowledge granularity according to the grading information of the user data; calculating a multi-context prediction score according to the importance and the single context; recommending the film according to the multi-context prediction score; compared with the traditional collaborative filtering recommendation system, the method has the advantages that the traditional collaborative filtering is replaced by the typical collaborative filtering, and the sparsity of data is effectively relieved.

Description

Movie recommendation method based on context typicality

Technical Field

The invention belongs to the field of computer science and technology, and particularly relates to a movie recommendation method based on context typicality.

Background

With the increasing data of the internet, the conventional Information Retrieval (IR) has been unable to meet the user's demand. In order to be able to determine the interests of a user and to present information of interest to the user, recommendation Systems (recommendation Systems) have become the most promising way for the new era of the internet.

In the field of recommendation systems, only depending on a user-item binary relationship, the recorded data has a sparsity problem in a scoring process of the system, and the depicted preference relationship is not reasonable. Traditional collaborative filtering algorithms measure a user using item scores, capturing user preferences in the low dimension of the item scores. When data is sparse, there are not enough common scoring items among users, which may cause inaccuracy in measuring similarity of users, so that the information recommended to the users by the system is inaccurate.

Disclosure of Invention

In order to solve the above problems in the prior art, the present invention provides a movie recommendation method based on contextual representativeness, the method comprising:

s1: acquiring user data, and collecting the user data to obtain a data set; calculating contextual typicality of the data according to information in the data; the user data comprises movie information, rating information and context attribute values;

s2: constructing a context typicality matrix according to the user data and the context typicality of each user to different movie types;

s3: searching neighbors in the context typicality matrix, and calculating scores of neighbor users according to context attribute values of the neighbors;

s4: predicting the score of the target user on the movie under the context attribute value by adopting the scores of the neighbor users;

s5: calculating the importance of the context attribute to each movie type by adopting knowledge granularity according to the grading information of the user data;

s6: calculating a multi-context prediction score according to the importance and the single context;

s7: the movie is recommended based on the multi-context prediction scores.

Preferably, the specific process of constructing the context user-typicality vector comprises:

step 1: calculating the user typicality of the user to the movie type under the context attribute value;

and 2, step: converting the contextual user-typicality of the user for each movie type into a contextual user-typicality vector; the expression of the context user-typicality vector is:

and 3, step 3: all context user-typicality vectors are constructed as a context-typicality matrix.

Further, the formula for user-typicality of a user to movie types under the context attribute values is:

further, the context typicality matrix is:

preferably, the searching of the context typical matrix comprises calculating cosine similarity between the user data in the context typical matrix and the target user data, and judging the magnitude of all cosine similarity, when the user data with the largest cosine similarity is the neighbor of the context typical matrix; the cosine similarity formula is:

preferably, the formula for predicting scores according to the single context of the neighbor users of the target user for different movies is as follows:

preferably, the formula for calculating the importance of the context attribute to each movie type by using the knowledge granularity is as follows:

preferably, the formula of the multi-context prediction score is:

the invention has the advantages that:

1) Compared with the traditional manual recommendation, the movie recommendation system provided by the invention has higher accuracy and efficiency;

2) Compared with the traditional collaborative filtering recommendation system, the method has the advantages that the typical collaborative filtering is used for replacing the traditional collaborative filtering, so that the data sparsity is effectively relieved;

3) The preference degree of the user to the items under the context is measured by the knowledge granularity, so that the preference degree can be described more accurately;

4) And the context is introduced, the user preference is measured in the context, the actual scene is more suitable, and the recommendation accuracy is improved.

Drawings

FIG. 1 is an overall flow diagram of the present invention;

FIG. 2 is a context-based exemplary matrix according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

A movie recommendation method based on context typicality, as shown in fig. 1, the method comprising:

s1: acquiring user data, and collecting the user data to obtain a data set; calculating contextual typicality of the data from information in the data; the user data comprises movie information, rating information and context attribute values;

s2: constructing a context typicality matrix according to the user data and the context typicality of each user for different movie types;

s7: the movie is recommended according to the multi-context predictive score.

After the user data are obtained, the user data are collected to obtain a user data set; the user data includes movie information, rating information, and context attribute values. The data set of the user is U, U _x Representing the xth user. In a data setComprises an item set O, a context attribute set C and attribute values in the context attribute set, wherein C _h,k A kth attribute value representing an h context attribute. In the raw data set, historical scores for individual movies by individual users are also included.

After the user data is acquired, the types of the movies are classified into different types of groups T, T according to the types of the movies _l Is the ith project group.

And generating a decision information table according to the classification result of the final movie, wherein the content in the decision information table is shown in table 1.

TABLE 1 decision information Table in context recommendation System

There is a lot of context information in the recommendation system, but some of them have a large influence on the decision of the user, and some have little influence on the decision. To determine which contexts are important for different types, the present invention bases its knowledge granularity G (d) _y,t |C _h,k ) To measure the importance of the context to different item types:

wherein d is _y,t Indicating a certain film type t, C to which a film y belongs _h,k A k-th attribute value representing an h-th context attribute,

set of scores, R, representing types of movies in the same single context attribute _i Represents one score and n represents the nth.

Context typicals are built to measure user preferences.

The specific process of constructing the context typicality matrix comprises the following steps:

(1) Calculating the user typicality of the user to the movie type under the context attribute value, wherein the formula is as follows:

wherein

Indicating how well user x scores movie type t,

representing the frequency of scoring of movie type t by user x.

Wherein R is ^Max Is the maximum rating of the score.

Wherein the content of the first and second substances,

indicates the number of times user x scored the movie type under the context attribute value, indicates

The number of times user x scores all movie types under the context attribute value.

(2) Converting the contextual user-typicality of the user for each movie type into a contextual user-typicality vector; the expression of the context user-typicality vector is:

a context-typicality matrix is then constructed from the context-user-typicality vectors.

As shown in fig. 2, the expression of the context typicality matrix is:

wherein m represents the number of users, n represents the number of user groups,

representing user u _x Context attribute value C _h,k The user representative vector of (a).

Searching adjacent cosine similarity between user data in the context typical matrix and target user data, and judging the cosine similarity, wherein the adjacent cosine similarity comprises the cosine similarity between the user data in the context typical matrix and the target user data, and when the user data with the largest cosine similarity is adjacent to the context typical matrix; the cosine similarity formula is:

wherein | · | | represents the modulus of the vector,

a context representative vector of the representation.

After the similarity between the target user and other users is calculated, a prediction score is obtained according to the context typicality of the other users to the movie and the score of the project in the current context. The formula is as follows:

wherein d is _y,t Represents the movie type t, C to which movie y belongs _h,k A k-th attribute value representing an h-th context attribute,

set of scores, R, representing types of movies in the same single context attribute _x,y Represents the rating of movie y by user x, n represents the nth rating,

value C representing the context attribute of a neighbor user i _h,k The history of movie y is scored.

For target user u in multi-context c _x The score prediction for the ith item is calculated as:

wherein x represents the x-th user, C represents the context attribute set where the user is, y represents the y-th movie, C _h,k K-th attribute value, d, representing h-th context attribute _y,t Indicates the movie type, G (d), to which movie y belongs _y,t |C _h,k ) Represents a context attribute value C _h,k For type d of film _y,t The degree of importance of (a) is,

representing the single context prediction score of user x for movie y.

The formula for calculating the importance of the context attribute to each movie type by using the knowledge granularity formula is as follows:

wherein, d _y,t Represents the movie type t, C to which movie y belongs _h,k A k-th attribute value representing an h-th context attribute,

set of scores, R, representing types of movies in the same single context attribute _x,y Representing the rating of movie y by user x and n representing the nth rating.

The formula for the multi-context prediction score is:

where x represents the x-th user, C represents the set of context attributes where the user is located, y represents the y-th movie, C _h,k K-th attribute value, d, representing h-th context attribute _y,t Represents the movie type t, G (d) to which movie y belongs _y,t |C _h,k ) Represents a context attribute value C _h,k For type d of film _y,t The importance of (a) to (b),

representing the single context prediction score for movie y for user x.

The above-mentioned embodiments, which further illustrate the objects, technical solutions and advantages of the present invention, should be understood that the above-mentioned embodiments are only preferred embodiments of the present invention, and should not be construed as limiting the present invention, and any modifications, equivalents, improvements, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A method for recommending movies based on contextual representativeness, the method comprising:

s1: acquiring user data, and collecting the user data to obtain a data set; calculating contextual typicality of the data from information in the data; the user data comprises movie information, rating information and context attribute values; the formula for calculating the context typicality of the data is:

wherein the content of the first and second substances,

indicates the degree of rating, C, of user x for movie type t _h,k A kth attribute value representing an h context attribute, r represents a degree of scoring,

representing the scoring frequency of the user x on the movie type t, and f represents the scoring frequency;

s6: calculating a multi-context prediction score according to the importance and the score of the target user on the movie under the context attribute value;

s7: the movie is recommended according to the multi-context predictive score.

2. The method of claim 1, wherein the step of constructing the context-user-typicality vector comprises:

step 2: converting the contextual user-typicality of the user for each movie type into a contextual user-typicality vector; the expression of the contextual user-typicality vector is:

and step 3: constructing all context user representativeness vectors into a context representativeness matrix;

wherein the content of the first and second substances,

representing a contextual user-typicality vector, C _h,k A kth attribute value representing an h context attribute,

representing user x in context attribute value C _h,k User typicality for the nth movie type.

3. The method of claim 2, wherein the context representativeness matrix is:

value C of a contextual attribute representing user x _h,k The user representative vector of (a).

4. The method of claim 1, wherein the step of searching for neighbors in the context canonical matrix comprises calculating cosine similarity between the user data in the context canonical matrix and the target user data, determining the magnitude of all cosine similarities, and when the user data with the largest cosine similarity is a neighbor of the context canonical matrix; the cosine similarity formula is:

where | l | · | | represents the modulus of the vector,

a context-representative vector of representations.

5. The method of claim 1, wherein the formula for the predictive scoring of a single context of different types of movies by neighboring users is as follows:

wherein

Represents a context attribute value C _h,k User-typicality of next user i to type t to which movie y belongs,

indicating a neighbor user i context attribute value C _h,k The history of movie y is scored.

6. A method as claimed in claim 1, wherein the formula for calculating the importance of the context attribute to each movie type by using the knowledge granularity is as follows:

wherein d is _y,t Represents the movie type t, C to which movie y belongs _h,k K-th attribute representing h-th context attributeProperty value, [ r ]] _dy,t Set of scores, R, representing types of movies in the same single context attribute _x,y Representing the rating of movie y by user x and n representing the nth rating.

7. The method of claim 1, wherein the multi-context prediction score is formulated as:

where x represents the x-th user, C represents the set of context attributes where the user is located, y represents the y-th movie, C _h,k K-th attribute value, d, representing h-th context attribute _y,t Represents the movie type t, G (d) to which movie y belongs _y,t |C _h,k ) Represents a context attribute value C _h,k For type d of film _y,t The degree of importance of (a) is,

representing the single context prediction score of user x for movie y.