CN105426550B

CN105426550B - Collaborative filtering label recommendation method and system based on user quality model

Info

Publication number: CN105426550B
Application number: CN201511018787.5A
Authority: CN
Inventors: 冯研
Original assignee: TCL Corp
Current assignee: TCL Corp
Priority date: 2015-12-28
Filing date: 2015-12-28
Publication date: 2020-02-07
Anticipated expiration: 2035-12-28
Also published as: CN105426550A

Abstract

The invention discloses a collaborative filtering label recommendation method and system based on a user quality model. The method comprises the following steps: refining the label system according to whether the resource and the user already appear in the existing system; mapping the information of the users in the system to a two-dimensional matrix to construct user models, and storing the user models as a user-label two-dimensional matrix; obtaining the model vector of the current user and calculating the similarity between the current user and the neighbor users in the system; calculating the model quality of the neighbor users in the system; generating the optimal recommendation from the model quality of the neighbor users using an improved collaborative filtering recommendation algorithm; and returning the optimal recommendation result to the user interface through the WEB server. The method optimizes how the traditional algorithm selects the best recommending users, improves the accuracy and recall of recommendation, and lets the current label system of the system evolve and be updated. A suitable label source is selected according to whether the user and the resource appear in the system, which addresses the cold-start problem and the single-label-source problem.

Description

Collaborative filtering label recommendation method and system based on user quality model

Technical Field

The invention relates to the technical field of WEB application, in particular to a collaborative filtering tag recommendation method and system based on a user quality model.

Background

As network technology has developed, labels (tags) have become a standard way of organizing information on the internet and are widely used in free classification, a method in which users freely annotate the characteristics of information with labels in their own words. Labels are used to classify, organize and retrieve text, picture, video and audio resources and to search for and share information; they are an information organization tool specific to the internet environment. In the past few years, tagging systems in which users create and share metadata have been explored and applied on the internet, and websites such as Flickrtll, del.

The category terms of a traditional classification system often lack popularity and relevance and are relatively outdated, so even professionals find it difficult to obtain relevant information and the expected results by searching with them. The metadata used in a traditional classification structure is also costly: defining and classifying it consumes a large amount of professionals' time and energy. In a label system, the system hands the complex task of defining metadata to its users, and label definition becomes the collective behavior of users toward resources. Compared with a traditional fixed hierarchical classification system, a label system is therefore more compact, adapts better to users, and is more in line with current trends. Label classification also displays and highlights the key points of a search. Labels differ from ordinary keywords: a keyword search finds only articles whose content contains the keyword, whereas a label may be a word that does not appear in the text, so searching by label can also find articles that do not contain the keyword, which widens the scope of the search.

Although tags have clear advantages for information retrieval and web page navigation, they must first be defined by people, and manual tag definition is time-consuming and tedious. To free people from this work and allow free classification to be applied more widely, a tag recommendation service is urgently needed. Such a service recommends potential tags that may interest the user for the user to choose from, making tag definition more convenient and faster.

Label recommendation is an emerging field that accompanies the spread of network technology, but overall the following problems exist:

1. Outdated label space. The recommended labels come from a fixed label system. As time passes and the data volume keeps growing, labels suitable for new resources, which the original label system lacks, need to be added; but a fixed label system cannot evolve over time, so recommendation quality inevitably declines.

2. Cold start. Users, labels and resources are the three major elements of a label recommendation system, and whether each of them already appears in the system should be fully considered during recommendation. Most existing label recommendation systems, however, extract information only from existing user models and resource models and ignore the data mining problem that arises when the system faces a new user or a new resource.

3. Single label source. Resource content, user history labels (also called user interest labels) and resource history labels are the three main label sources for label recommendation, and each has its own advantages and disadvantages. Most existing label recommendation systems focus on only one of them and do not combine multiple label sources.

Accordingly, the prior art is yet to be improved and developed.

Disclosure of Invention

In view of the defects of the prior art, the invention aims to provide a collaborative filtering tag recommendation method and system based on a user quality model, so as to solve the problems of an outdated tag space, cold start and a single tag source found in collaborative filtering recommendation algorithms and most existing tag recommendation algorithms.

The technical scheme of the invention is as follows:

a collaborative filtering label recommendation method based on a user quality model comprises the following steps:

A. detecting user input information, acquiring a training set in a label classification information database, extracting all labels in the training set to form a label system of the existing system, and perfecting the label system according to resources and the condition of a user in the existing system;

B. mapping information of users in the system to a two-dimensional matrix to construct a user model, and storing the user model in a user-label two-dimensional matrix form;

C. obtaining a model vector of a current user, and calculating the similarity between the current user and a neighbor user in a system;

D. calculating the model quality of neighbor users in the system;

E. generating optimal recommendation according to the model quality of neighbor users in the system and an improved collaborative filtering recommendation algorithm;

F. returning the optimal recommendation result to the user interface through the WEB server.

The collaborative filtering label recommendation method based on the user quality model, wherein the step A specifically includes:

a1, detecting user input information, acquiring a training set in a tag classification information database, and extracting all tags in the training set to form a tag system C { t1, t2, …, tn } of the existing system S;

a2, judging whether resource R_i and user U_i appear in the existing system S;

a3, if R_i∉S, i.e. the resource has not appeared in the existing system, extracting the X resource title keywords of resource R_i with the highest weight and adding them to the system label system C;

a4, if U_i∉S and R_i∈S, i.e. the resource appears in the system and the user does not, extracting the Y most frequently used labels of resource R_i and the X resource title keywords with the highest weight and adding them to the system label system C;

a5, if U_i∈S and R_i∈S, i.e. both the user and the resource appear in the system, adopting the history label information.

The collaborative filtering label recommendation method based on the user quality model, wherein the step B specifically includes:

b1, mapping the information of K users in the system to a two-dimensional matrix to construct a user model, and storing the mapping result in a user-label characteristic matrix;

b2, each row vector VU_k = (w(T₁); w(T₂); …; w(T_i); …; w(T_n)) in the matrix represents the model of one user, where T_i represents the i-th tag related to user U_k and w(T_i) represents the weight of tag T_i in the vector VU_k,

wherein, in the weight formula, tf(T_i, U_k) represents the number of times tag T_i is used by user U_k, N represents the total number of system tags, and the remaining term indicates the number of users who have used tag T_i at least once.

The collaborative filtering label recommendation method based on the user quality model, wherein the step C specifically includes: obtaining the model vector of the current user, and calculating the similarity sim(prof_u, prof_v) between the current user and a neighbor user in the system,

wherein prof_u and prof_v are the user model vectors of the current user u and the neighbor user v, respectively.

The collaborative filtering label recommendation method based on the user quality model, wherein the step D specifically includes: calculating the model quality Q_u(v) of the neighbor users in the system,

wherein k_i is the i-th tag of user v, and the formula uses four quantities: the normalized value of the number of users of k_i, the average user similarity of k_i, the word frequency of k_i, and w(l, k_i), the specificity of k_i. The model quality of a neighbor user is the average tag quality of that neighbor user.

The collaborative filtering label recommendation method based on the user quality model, wherein the optimal recommendation result of the improved collaborative filtering recommendation algorithm in the step E is denoted T(u, r), and in its calculation formula:

δ(v, r, t) := 1 if (v, r, t) ∈ U×R×T else 0,

where N_u is the set of the k nearest neighbor users of the current user u, T(u, r) is the optimal recommendation result of the algorithm, sim(prof_u, prof_v) is the similarity between the current user u and the neighbor user v, and δ(v, r, t) indicates whether the triple (v, r, t) ∈ U×R×T is recorded in the system, i.e. whether user v has a label definition relation to resource r.

A collaborative filtering tag recommendation system based on a user quality model, wherein the system comprises:

a label system perfecting module, used for detecting information input by a user, acquiring a training set in the label classification information database, extracting all labels in the training set to form the label system of the existing system, and perfecting the label system according to whether the resource and the user appear in the existing system;

the user model building module is used for mapping the information of the users in the system to a two-dimensional matrix to build a user model and storing the user model in a user-label two-dimensional matrix form;

the similarity calculation module is used for acquiring a model vector of the current user and calculating the similarity between the current user and a neighbor user in the system;

the model quality calculation module is used for calculating the model quality of the neighbor users in the system;

the optimal recommendation generation module is used for generating optimal recommendations according to the model quality of neighbor users in the system and an improved collaborative filtering recommendation algorithm;

and the result feedback module is used for returning the optimal recommendation result to the user interface through the WEB server.

The collaborative filtering label recommendation system based on the user quality model, wherein the label system perfecting module specifically comprises:

a label system forming unit, used for detecting user input information, acquiring a training set in the label classification information database, and extracting all labels in the training set to form the label system C { t1, t2, …, tn } of the existing system S;

a judging unit, used for judging whether resource R_i and user U_i appear in the existing system S;

a first processing unit, used for, if R_i∉S, i.e. the resource has not appeared in the existing system, extracting the X resource title keywords of resource R_i with the highest weight and adding them to the system label system C;

a second processing unit, used for, if U_i∉S and R_i∈S, i.e. the resource appears in the system and the user does not, extracting the Y most frequently used labels of resource R_i and the X resource title keywords with the highest weight and adding them to the system label system C;

a third processing unit, used for, if U_i∈S and R_i∈S, i.e. both the user and the resource appear in the system, adopting the history label information.

The collaborative filtering label recommendation system based on the user quality model is characterized in that the user model construction module specifically comprises:

the storage unit is used for mapping the information of K users in the system to a two-dimensional matrix to construct a user model, and storing the mapping result in a user-label characteristic matrix;

a user model building unit, in which each row vector VU_k = (w(T₁); w(T₂); …; w(T_i); …; w(T_n)) in the matrix represents the model of one user, where T_i represents the i-th tag related to user U_k, w(T_i) represents the weight of tag T_i in the vector VU_k, and the weight formula includes a term indicating the number of users who have used tag T_i at least once.

The collaborative filtering label recommendation system based on the user quality model, wherein the similarity calculation module is specifically used for: obtaining the model vector of the current user, and calculating the similarity sim(prof_u, prof_v) between the current user and a neighbor user in the system.

The invention provides a collaborative filtering label recommendation method and system based on a user quality model, wherein a user model quality judgment theory is applied to the traditional collaborative filtering label recommendation, the optimal recommended user selection process in the traditional algorithm is optimized, the recommendation accuracy and recall rate are further improved, the system can realize the evolution and the update of a label system, and the problem of label space obsolescence is solved; meanwhile, the advantages of various label sources are analyzed, and a proper label source is selected according to the appearance conditions of users and resources in the system, so that the problem of cold start and the problem of single label source are solved.

Drawings

Fig. 1 is a flowchart of a collaborative filtering label recommendation method based on a user quality model according to a preferred embodiment of the present invention.

Fig. 2 is a schematic diagram of a specific application embodiment of the collaborative filtering label recommendation method based on the user quality model according to the present invention.

FIG. 3 is a functional block diagram of a preferred embodiment of the collaborative filtering tag recommendation system based on a user quality model according to the present invention.

Detailed Description

In order to make the objects, technical solutions and effects of the present invention clearer, the present invention is described in further detail below. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.

Traditional collaborative filtering recommendation systems based on the nearest-neighbor set have been applied widely and successfully, so solving the tag recommendation problem in the same way is a natural choice. A tag recommendation system has its own particularity, however: it contains no ratings, and tags take the place of ratings.

The label system is generally composed of three elements of a user, a resource and a label, the user can define the label for the resource in the system, the type of the resource is determined by the type of the system, and a label recommendation system can be composed of the following 4 parts:

1. The set U of all users in the system

2. The set R of all resources in the system

3. The set T of all tags in the system

4. A relation function

The relation function expresses the set of labels that a user u defines for a resource r, where u∈U and r∈R. For a given user u∈U and resource r∈R, a scored label set T(u, r) is generated, and the top n labels with the highest scores in the recommendation set are input into the system.

Similar to the collaborative filtering algorithm, the tag recommendation system also maps the user information to two-dimensional matrices for storage. The mapping yields two user model matrices: a user-resource matrix of size K×M, denoted X, and a user-label matrix of size K×L, denoted Y, where K := |U|, M := |R| and L := |T|. No rating information is recorded in the collaborative filtering label system; only the user-resource associations and the user-label associations are recorded, encoded in the binary matrices X ∈ {0,1}^(K×M) and Y ∈ {0,1}^(K×L). For example, X_{k,m} = 1 means that the k-th user is associated with the m-th resource, and X_{k,m} = 0 means that there is no association. Similarly, Y_{k,l} = 1 indicates that the k-th user and the l-th tag are related, and Y_{k,l} = 0 indicates no relation.
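As a minimal sketch of this mapping, the following Python builds the two binary matrices from a list of (user, resource, tag) assignments. The function name `build_binary_matrices` and the sample data are illustrative assumptions and do not come from the patent text.

```python
def build_binary_matrices(assignments):
    """Build the binary user-resource matrix X and user-tag matrix Y.

    assignments: a list of (user, resource, tag) triples.
    Rows and columns are indexed by the sorted user, resource and tag names.
    """
    users = sorted({u for u, _, _ in assignments})
    resources = sorted({r for _, r, _ in assignments})
    tags = sorted({t for _, _, t in assignments})
    u_idx = {u: i for i, u in enumerate(users)}
    r_idx = {r: i for i, r in enumerate(resources)}
    t_idx = {t: i for i, t in enumerate(tags)}

    X = [[0] * len(resources) for _ in users]  # K x M
    Y = [[0] * len(tags) for _ in users]       # K x L
    for u, r, t in assignments:
        X[u_idx[u]][r_idx[r]] = 1  # user u is associated with resource r
        Y[u_idx[u]][t_idx[t]] = 1  # user u is associated with tag t
    return users, resources, tags, X, Y


# Hypothetical sample data, for illustration only.
sample = [("u1", "r1", "news"), ("u1", "r2", "sport"), ("u2", "r1", "news")]
users, resources, tags, X, Y = build_binary_matrices(sample)
```

Nested lists keep the sketch dependency-free; a real system with many users and tags would more plausibly use a sparse matrix representation.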

For a given user u and resource r, the algorithm first finds the users who have defined labels for resource r, and then uses the similarity formula of user-based collaborative filtering to calculate the similarity between the current user and each of these users, which yields the neighbor user set of the current user. (The neighbor users depend on the model used for the similarity calculation, because the similarity may be computed on either of two matrix models: the user-resource matrix or the user-label matrix.) The labels of the neighbor users are then scored for recommendation according to the similarity between each neighbor user and the current user, so a label shared by several neighbor users receives a higher recommendation score.

In a label recommendation system with a user set of U, a label set of T and a resource set of R, the collaborative filtering label recommendation algorithm is as follows:

δ(v, r, t) := 1 if (v, r, t) ∈ U×R×T else 0

In the above formula, N_u is the set of the k nearest neighbor users of the current user u, T(u, r) is the recommendation result of the algorithm, sim(prof_u, prof_v) is the similarity between the current user u and the neighbor user v, and δ(v, r, t) indicates whether the triple (v, r, t) ∈ U×R×T is recorded in the system, i.e. whether user v has a label definition relation to resource r. The operator := represents dynamic assignment: each time a parameter value on the right side of the formula changes, the value on the left side is automatically overwritten.
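The scoring formula itself is not reproduced in this text. The sketch below therefore assumes the commonly used form in which the score of a label t is the sum, over the neighbor users v in N_u, of sim(prof_u, prof_v)·δ(v, r, t), and the top n labels are returned. The function name `recommend_tags` and the sample data are hypothetical.

```python
from collections import defaultdict


def recommend_tags(neighbors, assignments, resource, top_n=3):
    """Score labels for `resource` from the neighbors' label definitions.

    neighbors: dict mapping neighbor user v -> sim(prof_u, prof_v).
    assignments: set of (user, resource, tag) triples recorded in the system.
    ASSUMPTION: score(t) = sum over neighbors of sim * delta(v, r, t).
    """
    scores = defaultdict(float)
    for v, r, t in assignments:
        if r == resource and v in neighbors:  # delta(v, r, t) = 1
            scores[t] += neighbors[v]
    # Highest score first; ties are broken alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [t for t, _ in ranked[:top_n]]


# Hypothetical data: v3 is not a neighbor, so its label "c" is ignored.
neighbors = {"v1": 0.9, "v2": 0.5}
assignments = {("v1", "r", "a"), ("v2", "r", "a"), ("v2", "r", "b"), ("v3", "r", "c")}
top = recommend_tags(neighbors, assignments, "r", top_n=2)
```

Label "a" is shared by both neighbors and so accumulates both similarities, which is the "shared labels score higher" behavior described in the text.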

In a label recommendation system, the model of a user u∈U is usually represented by P_l(u) = ∪_{l∈R} D(u, l), where D(u, l) represents the label set defined by user u for resource l. The user model thus describes the set of labels the user has defined in the system, so the quality of the labels directly determines the quality of the user model. A label is a keyword defined by the user according to personal interests and the resource content, so a good label is both personalized and specific: it matches the user's vocabulary habits, describes the resource well, and reflects the user's interests.

If user u defines a tag k_i for resource l, the quality of tag k_i can be measured by parameters such as the number of users, the user similarity, the word frequency and the specificity of the tag.

The number of users of tag k_i is the number of users who use k_i to define resource l. The larger the number of users of k_i, the higher its quality. The number of users of k_i can be expressed as |u_{l,k_i}|, where u_{l,k_i} is the set of all users who use tag k_i to define resource l. Normalizing it by the total number of system users N_all gives |u_{l,k_i}| / N_all.

The average user similarity of tag k_i is the average similarity among the users who use k_i to define resource l. The greater the average user similarity, the higher the quality of tag k_i. In the average user similarity formula, u_{l,k_i} denotes the set of all users who use tag k_i to define resource l, |u_{l,k_i}| denotes the number of users in that user group, u_simx and u_simy denote any two different users in the group, and sim(u_simx, u_simy) denotes the similarity of the two users' models, which can be obtained as the cosine of the angle between the feature vectors of their user models.
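The exact averaging formula is not reproduced in this text. As a hedged sketch, the Python below assumes the average is taken over all unordered pairs of distinct users in the group and that a group with fewer than two users gets 0.0; both choices, and the function names, are assumptions.

```python
import math
from itertools import combinations


def cosine(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def average_user_similarity(vectors):
    """Mean pairwise cosine similarity over the users' model vectors.

    ASSUMPTION: averages over unordered pairs of distinct users;
    returns 0.0 when there are fewer than two users.
    """
    pairs = list(combinations(vectors, 2))
    if not pairs:
        return 0.0
    return sum(cosine(a, b) for a, b in pairs) / len(pairs)


# Three hypothetical users of tag k_i; two have identical models.
group = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
avg_sim = average_user_similarity(group)
```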

The word frequency of tag k_i is defined as the number of times k_i is used to define resource l, as a proportion of the number of times resource l is defined by all tags. The higher the word frequency of k_i, the higher its quality. The word frequency of k_i is thus the ratio of the number of times tag k_i is used to define resource l to N_l, the total number of times resource l is defined by all tags.
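The normalized user count and the word frequency described above can both be derived from the recorded label assignments. The following sketch assumes assignments are stored as (user, resource, tag) triples, one per use; the function name `user_count_and_frequency` and the sample data are hypothetical.

```python
def user_count_and_frequency(assignments, resource, tag, n_all):
    """Return (normalized user count, word frequency) of `tag` on `resource`.

    assignments: list of (user, resource, tag) triples, one per use.
    n_all: total number of users in the system (N_all).
    """
    on_resource = [(u, t) for u, r, t in assignments if r == resource]
    users_of_tag = {u for u, t in on_resource if t == tag}
    tag_uses = sum(1 for _, t in on_resource if t == tag)

    norm_users = len(users_of_tag) / n_all  # |u_{l,k_i}| / N_all
    # Uses of the tag on this resource divided by all tag uses on it (N_l).
    word_freq = tag_uses / len(on_resource) if on_resource else 0.0
    return norm_users, word_freq


# Hypothetical data: resource "l" is defined three times, twice with tag "a".
data = [("u1", "l", "a"), ("u2", "l", "a"), ("u2", "l", "b"), ("u3", "m", "a")]
norm_users, word_freq = user_count_and_frequency(data, "l", "a", n_all=4)
```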

The specificity of tag k_i is an important index of how well k_i characterizes resource l; it reflects how widely k_i is used to define different resources. The higher the specificity, the better the tag quality. The tag specificity can be calculated with the TF-IDF algorithm, whose formula uses three quantities: the frequency with which tag k_i is used to define resource l, the total number N of all resources, and the number of resources that have been defined by tag k_i at least once.

The higher the overall quality of the user label is, the higher the quality of the user model is, and the user model quality reflects the accuracy and the advisability of the user label definition behavior, that is, the higher the quality of the user model of a user is, the more suitable the label defined by the user is for recommendation.

The traditional collaborative filtering label recommendation algorithm seeks label recommendations from neighbor users but considers only the similarity between the user models of the neighbor users and the current user, ignoring the quality of the neighbor users' models, so its recommendation quality is low. A label recommendation algorithm based on user model quality can therefore be adopted.

The invention provides a flow chart of a preferred embodiment of a collaborative filtering label recommendation method based on a user quality model, as shown in fig. 1, the method comprises the following steps:

step S100, detecting user input information, acquiring a training set in a label classification information database, extracting all labels in the training set to form a label system of the existing system, and perfecting the label system according to resources and the condition of the user in the existing system.

In specific implementation, a training set is selected from the tag classification information database, all tags in the training set are extracted to form the tag system C { t1, t2, …, tn } of the existing system S, and the tag system is then perfected according to whether the resource and the user appear in the existing system. Further, user input information is detected and a test set is obtained from the label classification information database, the test set being a sample of the labels in that database. Once the label system C { t1, t2, …, tn } has been formed, the test set is used to check whether it is complete: if all labels in the test set are in the current label system, the label system is judged to be complete; if some labels in the test set are not in the current label system, it is judged to be incomplete and is further improved. Specifically, the training set may be reselected to improve the label system, or the test-set labels that do not appear in the label system may be added to it.
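The completeness check described here amounts to a set comparison. The sketch below shows one way to express it, using the second improvement option (adding the missing test-set labels); the function names and sample labels are illustrative assumptions.

```python
def check_tag_system(tag_system, test_tags):
    """Return (is_complete, missing), where `missing` holds the test-set
    labels that are absent from the current label system C."""
    missing = set(test_tags) - set(tag_system)
    return not missing, missing


def complete_tag_system(tag_system, test_tags):
    """Improve C by adding the test-set labels it lacks."""
    return set(tag_system) | set(test_tags)


# Hypothetical label system and test-set sample.
C = {"news", "sport", "music"}
test_set = {"news", "travel"}
is_complete, missing = check_tag_system(C, test_set)
C_improved = complete_tag_system(C, test_set)
```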

In specific implementation, the step S100 specifically includes:

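step S101, detecting user input information, acquiring a training set in a label classification information database, and extracting all labels in the training set to form a label system C { t1, t2, …, tn } of the existing system S;

step S102, judging whether resource R_i and user U_i appear in the existing system S;

step S103, if R_i∉S, i.e. the resource has not appeared in the existing system, extracting the X resource title keywords of resource R_i with the highest weight and adding them to the system label system C;

step S104, if U_i∉S and R_i∈S, i.e. the resource appears in the system and the user does not, extracting the Y most frequently used labels of resource R_i and the X resource title keywords with the highest weight and adding them to the system label system C;

step S105, if U_i∈S and R_i∈S, i.e. both the user and the resource appear in the system, adopting the history label information.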

In specific implementation, resource R is analyzed_iAnd user U_iSituation occurring in the existing system S, user U_iAnd resource R_iThe following 4 cases can occur:

(1)

completely in a cold start situation, new user, new resource;

(2)

the user appears in the system, and the resource does not appear;

(3)

resources appear in the system, and users do not appear;

(4)U_i∈S and R_iboth the S-user and the resource are present in the system.

The label perfection measures for different situations are as follows:

When case (1) or (2) occurs, the X resource title keywords of resource R_i with the highest weight, for example { key1, key2, key3 }, are extracted and added to the system label system C, namely C ← { key1, key2, key3 };

when case (3) occurs, the Y most popular labels of resource R_i and the X resource title keywords with the highest weight are extracted and added to the system label system C;

when case (4) occurs, the history tag information is adopted.

In specific implementation, X may be preset, preferably to 3, and Y may also be preset, preferably to 2.
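The case analysis above can be sketched as a small dispatch function. The function name `refine_tag_system`, its parameters and the sample inputs are hypothetical; the keyword list is assumed to be pre-sorted by descending weight and the label list by descending use frequency, and how those rankings are produced is outside this sketch.

```python
def refine_tag_system(tag_system, user_known, resource_known,
                      title_keywords, popular_tags, x=3, y=2):
    """Return the label system C after the case-specific additions.

    title_keywords: title keywords of R_i, highest weight first.
    popular_tags: existing labels of R_i, most frequently used first.
    x, y: the preset X and Y (3 and 2 are the preferred values in the text).
    """
    refined = set(tag_system)
    if not resource_known:
        # Cases (1) and (2): new resource, add the top-X title keywords.
        refined.update(title_keywords[:x])
    elif not user_known:
        # Case (3): known resource, new user.
        refined.update(popular_tags[:y])
        refined.update(title_keywords[:x])
    # Case (4): both known, the history labels already in C are used as-is.
    return refined


keywords = ["key1", "key2", "key3", "key4"]
popular = ["pop1", "pop2", "pop3"]
cold_start = refine_tag_system({"a"}, False, False, keywords, popular)
new_user = refine_tag_system({"a"}, False, True, keywords, popular)
both_known = refine_tag_system({"a"}, True, True, keywords, popular)
```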

Step S200, mapping the information of the users in the system to a two-dimensional matrix to construct a user model, and storing the user model in a user-label two-dimensional matrix form.

In specific implementation, a user model is constructed by mapping the information of the K users in the system to a two-dimensional matrix. The mapping result is the user-tag feature matrix QT, in which each row vector VU_k = (w(T₁); w(T₂); …; w(T_i); …; w(T_n)) represents the model of one user, T_i represents the i-th tag related to user U_k, and w(T_i) represents the weight of tag T_i in the vector VU_k.

The step S200 specifically includes:

step S201, mapping information of K users in the system to a two-dimensional matrix to construct a user model, and storing mapping results in a user-label characteristic matrix;

step S202, each row vector VU_k = (w(T₁); w(T₂); …; w(T_i); …; w(T_n)) in the matrix represents the model of one user, where T_i represents the i-th tag related to user U_k, w(T_i) represents the weight of tag T_i in the vector VU_k, and the weight formula includes a term indicating the number of users who have used tag T_i at least once.
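The weight formula itself is not reproduced in this text. Because the quantities it names (a use count tf, a total N, and the number of users who have used the tag at least once) match a TF-IDF weighting, the sketch below assumes w(T_i) = tf(T_i, U_k) · log(N / n_i), where n_i is that number of users. Both the formula and the function name `user_tag_weights` are assumptions, not the patent's stated method.

```python
import math


def user_tag_weights(tag_counts, n_total, users_per_tag):
    """Compute one user's row vector of tag weights.

    tag_counts: dict tag -> tf(T_i, U_k), uses of the tag by this user.
    n_total: the total N from the text.
    users_per_tag: dict tag -> n_i, users who used the tag at least once.
    ASSUMPTION: w(T_i) = tf * log(N / n_i); 0.0 if the tag has no users.
    """
    weights = {}
    for tag, tf in tag_counts.items():
        n_i = users_per_tag.get(tag, 0)
        weights[tag] = tf * math.log(n_total / n_i) if n_i else 0.0
    return weights


# Hypothetical counts: "b" is used by as many users as N, so its weight is 0.
w = user_tag_weights({"a": 2, "b": 1}, 10, {"a": 5, "b": 10})
```

Under this assumed form, a tag that nearly every user applies contributes little to the user model, while a tag the user applies often but few others use dominates it.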

Step S300, obtaining a model vector of the current user, and calculating the similarity between the current user and a neighbor user in the system.

In practical implementation, a neighbor user is a user with a high correlation with the current user, such as a user in the same area. The user models in the label recommendation system are stored as a user-label two-dimensional matrix, and the similarity between the current user and another user in the system is obtained by calculating the cosine similarity of their user model vectors in the matrix. Specifically, the model vector of the current user is obtained, and the similarity sim(prof_u, prof_v) between the current user and a neighbor user in the system is calculated.
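A minimal sketch of this step, assuming each user model vector is stored sparsely as a dict from tag to weight, is shown below. The function names, the choice of k, and the sample profiles are illustrative assumptions.

```python
import math


def cosine_similarity(prof_u, prof_v):
    """Cosine similarity of two sparse user model vectors (tag -> weight)."""
    dot = sum(w * prof_v.get(t, 0.0) for t, w in prof_u.items())
    norm_u = math.sqrt(sum(w * w for w in prof_u.values()))
    norm_v = math.sqrt(sum(w * w for w in prof_v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def nearest_neighbors(current, profiles, k=2):
    """Return the k users most similar to `current` as (user, sim) pairs."""
    sims = [(v, cosine_similarity(profiles[current], p))
            for v, p in profiles.items() if v != current]
    sims.sort(key=lambda pair: (-pair[1], pair[0]))  # most similar first
    return sims[:k]


# Hypothetical user-tag weight vectors.
profiles = {
    "u": {"a": 1.0, "b": 1.0},
    "v1": {"a": 1.0, "b": 1.0},
    "v2": {"a": 1.0},
    "v3": {"c": 1.0},
}
neighbors = nearest_neighbors("u", profiles, k=2)
```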

Step S400, calculating the model quality of the neighbor users in the system.

In specific implementation, the user model quality theory shows that user model quality is influenced by how many users use a tag, the similarity of the user group, the word frequency of the tag and the specificity of the tag. The model quality Q_u(v) of the neighbor users in the system is calculated, wherein k_i is the i-th tag of user v, and the formula uses, among other terms, the normalized value of the number of users of k_i and the average user similarity of k_i.
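The text states that a neighbor user's model quality is the average quality of that user's tags, but the formula combining the four per-tag factors is not reproduced here. The sketch below assumes, purely for illustration, that the factors are multiplied; the function names and sample values are hypothetical.

```python
def tag_quality(norm_users, avg_similarity, word_freq, specificity):
    """Quality of one tag k_i.

    ASSUMPTION: the four factors are multiplied. The source formula is not
    available in this text, so another combination may be intended.
    """
    return norm_users * avg_similarity * word_freq * specificity


def model_quality(tag_factors):
    """Model quality of a neighbor user: the average quality of its tags.

    tag_factors: list of (norm_users, avg_similarity, word_freq, specificity)
    tuples, one per tag of the neighbor user v.
    """
    if not tag_factors:
        return 0.0
    return sum(tag_quality(*f) for f in tag_factors) / len(tag_factors)


# Two hypothetical tags of neighbor v, each with quality 0.5.
q_v = model_quality([(0.5, 0.5, 1.0, 2.0), (1.0, 1.0, 0.5, 1.0)])
```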

Step S500, generating the optimal recommendation from the model quality of the neighbor users in the system using the improved collaborative filtering recommendation algorithm.

In specific implementation, in the tag recommendation system the neighbor users act as recommenders for the current user, so the quality of their user models has an important influence on the recommendation effect. The collaborative filtering tag recommendation algorithm is therefore improved. The optimal recommendation result of the improved collaborative filtering recommendation algorithm is denoted T(u, r), and in its calculation formula:

δ(v, r, t) := 1 if (v, r, t) ∈ U×R×T else 0.
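The improved scoring formula is not reproduced in this text. As a hedged sketch, the code below assumes each neighbor's contribution is its similarity to the current user multiplied by its model quality, so that score(t) is the sum over neighbors of sim · Q · δ(v, r, t). This weighting, the function name `recommend_with_quality` and the sample data are assumptions.

```python
from collections import defaultdict


def recommend_with_quality(neighbors, assignments, resource, top_n=3):
    """Rank labels for `resource`, weighting neighbors by similarity and quality.

    neighbors: dict neighbor v -> (sim(prof_u, prof_v), model quality of v).
    assignments: set of (user, resource, tag) triples recorded in the system.
    ASSUMPTION: score(t) = sum over neighbors of sim * quality * delta(v, r, t).
    """
    scores = defaultdict(float)
    for v, r, t in assignments:
        if r == resource and v in neighbors:  # delta(v, r, t) = 1
            sim, quality = neighbors[v]
            scores[t] += sim * quality
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [t for t, _ in ranked[:top_n]]


# Hypothetical data: v1 is more similar, but v2 has the better user model.
neighbors = {"v1": (0.9, 0.1), "v2": (0.5, 0.8)}
assignments = {("v1", "r", "a"), ("v2", "r", "b")}
best = recommend_with_quality(neighbors, assignments, "r", top_n=2)
```

In this example similarity alone would rank label "a" first, while the quality weighting promotes label "b" from the higher-quality neighbor.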

Step S600, returning the optimal recommendation result to the user interface through the WEB server.

In specific implementation, the optimal recommendation result is returned to the user interface through the WEB server. Users may use different interfaces; if the user is using a television interface, the result is returned to the user's television interface.

The invention also provides a flow chart of a specific application embodiment of the collaborative filtering tag recommendation method based on the user quality model, introduced by taking a user television interface as an example, as shown in fig. 2.

Specifically, the television is connected with a WEB server, and the WEB server is further connected with the database. The database comprises a user information base in which user history information is stored, a resource information base in which resource information is stored, and a tag information base in which tag information is stored.

When a user watches television through the user television interface, the user's viewing information is sent to the WEB server, which performs data preprocessing on it. User history information is obtained from the user information base, and a current user quality model is generated from it. A core recommendation model is then generated from the current user quality model, the resource information in the resource information base and the label information in the label information base. A recommendation result is generated from the core recommendation model and sent to the WEB server, which returns it to the user television interface through a recommendation page for the user to view.

The invention also provides a functional schematic block diagram of a collaborative filtering label recommendation system based on a user quality model, as shown in fig. 3, wherein the system comprises:

The label system improvement module 100 is used for detecting information input by a user, acquiring a training set in a label classification information database, extracting all labels in the training set to form the label system of the existing system, and improving the label system according to whether the resource and the user have appeared in the existing system; as described above.

The user model building module 200 is used for mapping the information of the users in the system to a two-dimensional matrix to build a user model and storing the user model in a user-label two-dimensional matrix form; as described above.

The similarity calculation module 300 is configured to obtain a model vector of a current user, and calculate a similarity between the current user and a neighbor user in the system; as described above.

A model quality calculation module 400 for calculating the model quality of the neighbor users in the system; as described above.

The optimal recommendation generation module 500 is used for generating optimal recommendations according to the model quality of neighbor users in the system and an improved collaborative filtering recommendation algorithm; as described above.

A result feedback module 600, configured to return the optimal recommendation result to the user interface through the WEB server; as described above.
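
As an illustration of the kind of computation the model quality calculation module 400 performs, the sketch below covers two ingredients: the average similarity of the users of a tag, and the model quality of a user taken as the average quality of that user's tags. It assumes the average similarity is the mean of sim(x, y) over all unordered pairs of distinct users in the group; how the per-tag quality itself is combined from its factors is not shown, and the function names and sample values are hypothetical.

```python
from itertools import combinations


def average_pairwise_similarity(users, sim):
    """Mean of sim(x, y) over all unordered pairs of distinct users."""
    pairs = list(combinations(users, 2))
    if not pairs:
        return 0.0  # assumption: a group with fewer than two users scores 0
    return sum(sim(x, y) for x, y in pairs) / len(pairs)


def model_quality(tag_qualities):
    """qu(v) taken as the average quality of the tags used by user v."""
    if not tag_qualities:
        return 0.0  # assumption: a user with no tags has quality 0
    return sum(tag_qualities) / len(tag_qualities)


# Hypothetical similarity values for three users of one tag.
table = {
    frozenset(("a", "b")): 0.5,
    frozenset(("a", "c")): 0.25,
    frozenset(("b", "c")): 0.75,
}


def lookup_sim(x, y):
    """Symmetric lookup of a precomputed user model similarity."""
    return table[frozenset((x, y))]


group_similarity = average_pairwise_similarity(["a", "b", "c"], lookup_sim)
quality = model_quality([0.5, 1.0])
print(group_similarity, quality)
```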

A judging unit for judging whether the resource R_i and the user U_i have appeared in the existing system S; as described above.

A first processing unit for, if R_i ∉ S, i.e. the resource has not appeared in the existing system, extracting the first X title keywords of the resource R_i with the highest weight and adding them into the system label system C; as described above.

A second processing unit for, if R_i ∈ S and U_i ∉ S, i.e. the resource has appeared in the system but the user has not, extracting the Y most frequently used labels of the resource R_i and the X resource title keywords with the highest weight and adding them into the system label system C; as described above.

A third processing unit for adopting historical label information if U_i ∈ S and R_i ∈ S, i.e. both the user and the resource have appeared in the system; as described above.
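
The three cases handled by the processing units can be sketched as a single selection function. This is only an illustration under assumptions: the function name candidate_labels, the dictionary-based inputs, the default values of x and y, and the rule of dropping keywords that duplicate an already selected label are all hypothetical.

```python
def candidate_labels(user_seen, resource_seen, title_keywords,
                     label_counts, history, x=2, y=2):
    """Choose the label source according to what the system already knows.

    title_keywords -- dict: keyword -> weight for the resource title
    label_counts   -- dict: label -> usage frequency on the resource
    history        -- labels from the historical label information
    """
    # The X title keywords with the highest weight.
    top_keywords = sorted(
        title_keywords, key=lambda k: (-title_keywords[k], k))[:x]
    if not resource_seen:
        # Resource has not appeared in the system: title keywords only.
        return top_keywords
    if not user_seen:
        # Resource has appeared, user has not: the Y most frequently used
        # labels plus the top title keywords (duplicates dropped).
        top_labels = sorted(
            label_counts, key=lambda k: (-label_counts[k], k))[:y]
        return top_labels + [k for k in top_keywords if k not in top_labels]
    # Both user and resource have appeared: historical label information.
    return list(history)


# Hypothetical sample data.
title_keywords = {"football": 0.8, "final": 0.5, "live": 0.3}
label_counts = {"sports": 10, "football": 7, "hd": 1}
history = ["sports", "weekend"]

new_resource = candidate_labels(False, False, title_keywords, label_counts, history)
new_user = candidate_labels(False, True, title_keywords, label_counts, history)
known_both = candidate_labels(True, True, title_keywords, label_counts, history)
print(new_resource, new_user, known_both)
```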

The storage unit is used for mapping the information of K users in the system to a two-dimensional matrix to construct a user model, and storing the mapping result in a user-label characteristic matrix; as described above.

indicating the number of users who have used the tag T_i at least once; as described above.
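
A minimal sketch of building such a user-label matrix. It assumes a TF-IDF style weight in which the frequency of the tag T_i in a user's own tagging history is multiplied by log(K / n_i), where K is the number of users and n_i is the number of users who have used T_i at least once. That weighting and the name build_user_tag_matrix are assumptions made for illustration; the exact weight formula is not reproduced here.

```python
import math
from collections import Counter


def build_user_tag_matrix(user_tags):
    """Build one weight row per user over a shared, sorted tag vocabulary.

    user_tags -- dict: user id -> list of tags applied (repeats allowed)
    Returns (users, tags, matrix), where matrix[k][i] is w(T_i) for user k.
    """
    users = sorted(user_tags)
    tags = sorted({t for ts in user_tags.values() for t in ts})
    k = len(users)
    # n_i: number of users who used tag T_i at least once.
    users_per_tag = Counter(t for ts in user_tags.values() for t in set(ts))
    matrix = []
    for u in users:
        counts = Counter(user_tags[u])
        total = len(user_tags[u]) or 1  # avoid division by zero
        row = [
            (counts[t] / total) * math.log(k / users_per_tag[t])
            for t in tags
        ]
        matrix.append(row)
    return users, tags, matrix


# Hypothetical sample data.
users, tags, matrix = build_user_tag_matrix({
    "u1": ["comedy", "comedy", "drama"],
    "u2": ["drama"],
    "u3": ["news", "drama"],
})
print(tags)
print(matrix)
```

Under this assumed weighting, a tag used by every user (here "drama") gets weight 0 in every row, since it does not help distinguish one user model from another.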

Wherein prof_u and prof_v are the user model vectors of the current user u and the neighbor user v, respectively; as described above.
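
A short sketch of one common way to compute sim(prof_u, prof_v) from two user model vectors. Cosine similarity is used here as an assumption; the function name and the sample vectors are hypothetical.

```python
import math


def cosine_similarity(prof_u, prof_v):
    """Cosine of the angle between two equal-length user model vectors."""
    dot = sum(a * b for a, b in zip(prof_u, prof_v))
    norm_u = math.sqrt(sum(a * a for a in prof_u))
    norm_v = math.sqrt(sum(b * b for b in prof_v))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # assumption: an empty profile is similar to nothing
    return dot / (norm_u * norm_v)


# Hypothetical user model vectors over the same three tags.
prof_u = [0.7, 0.0, 0.2]
prof_v = [0.5, 0.1, 0.0]
print(cosine_similarity(prof_u, prof_v))
```

The k nearest neighbors N_u of the current user can then be taken as the k users with the highest similarity values.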

In summary, the present invention provides a collaborative filtering label recommendation method and system based on a user quality model, the method includes: perfecting a label system for the conditions occurring in the existing system; mapping information of users in the system to a two-dimensional matrix to construct a user model, and storing the user model in a user-label two-dimensional matrix form; obtaining a model vector of a current user, and calculating the similarity between the current user and a neighbor user in a system; calculating the model quality of neighbor users in the system; generating optimal recommendation according to the model quality of neighbor users in the system and an improved collaborative filtering recommendation algorithm; and returning the optimal recommendation result to the user interface through the WEB server. The method optimizes the optimal recommended user selection process in the traditional algorithm, improves the accuracy and recall rate of recommendation, and evolves and updates the current label system of the system; and according to the appearance of the user and the resource in the system, a proper label source is selected, so that the problems of cold start and single label source are solved.

It is to be understood that the invention is not limited to the examples described above, but that modifications and variations may be effected thereto by those of ordinary skill in the art in light of the foregoing description, and that all such modifications and variations are intended to be within the scope of the invention as defined by the appended claims.

Claims

1. A collaborative filtering label recommendation method based on a user quality model is characterized by comprising the following steps:

D. calculating the model quality of neighbor users in the system;

F. returning the optimal recommendation result to the user interface through the WEB server;

the step D specifically comprises: calculating the model quality qu(v) of the neighbor users in the system,

wherein:

in the above formula, k_i is the i-th tag of the user v,

is the normalized value of the number of users of k_i,

is the average similarity of the users of k_i,

is the word frequency of k_i, w(l, k_i) is the weight of k_i, and the model quality of a neighbor user is the average label quality of that neighbor user;

denotes the set of all users who use the label k_i to define the resource l, |u_l,k_i| represents the number of users in the user group, u_simx and u_simy represent any two different users in the user group, and sim(u_simx, u_simy) represents the user model similarity of the two users; N is the total number of all resources,

is the number of resources defined by the label k_i at least once;

the optimal recommendation result in the improved collaborative filtering recommendation algorithm in the step E is denoted as T (u, r), and the calculation formula is as follows:

δ(v, r, t) := 1 if δ(v, r, t) ∈ U×R×T, else 0,

in the above formula, U is a user set, T is a tag set, R is a resource set, N_u is the set of k nearest neighbor users of the current user u, T(u, r) is the optimal recommendation result of the algorithm, sim(prof_u, prof_v) is the similarity between the current user u and the neighbor user v, and δ(v, r, t) ∈ U×R×T indicates that the user v has a label definition relation to the resource r; the operator := denotes dynamic assignment.

2. The collaborative filtering label recommendation method based on the user quality model according to claim 1, wherein the step a specifically includes:

a3, if

a4, if

3. The collaborative filtering label recommendation method based on the user quality model according to claim 2, wherein the step B specifically includes:

b2, each row vector VU_k = (w(T₁); w(T₂); …; w(T_i); w(T_n)) in the matrix represents the model of one user, where T_i represents the i-th tag related to the user U_k, and w(T_i) represents the weight of the tag T_i in the vector VU_k,

indicating the number of users who have used the tag T_i at least once.

4. The collaborative filtering label recommendation method based on the user quality model according to claim 3, wherein the step C specifically comprises: obtaining a model vector of the current user, and calculating the similarity sim(prof_u, prof_v) between the current user and a neighbor user in the system.

5. A collaborative filtering tag recommendation system based on a user quality model, the system comprising:

the model quality calculation module is used for calculating the model quality of the neighbor users in the system, and specifically for calculating the model quality qu(v) of the neighbor users in the system,

wherein:

in the above formula, k_i is the i-th tag of the user v,

is the normalized value of the number of users of k_i,

is the average similarity of the users of k_i,

is the word frequency of k_i, w(l, k_i) is the weight of k_i, and the model quality of a neighbor user is the average label quality of that neighbor user;

is the number of resources defined by the label k_i at least once;

the optimal recommendation generation module is used for generating optimal recommendations according to the model quality of neighbor users in the system and an improved collaborative filtering recommendation algorithm, and is further used for denoting the optimal recommendation result in the improved collaborative filtering recommendation algorithm as T(u, r), the calculation formula being as follows:

δ(v, r, t) := 1 if δ(v, r, t) ∈ U×R×T, else 0,

in the above formula, U is a user set, T is a tag set, R is a resource set, N_u is the set of k nearest neighbor users of the current user u, T(u, r) is the optimal recommendation result of the algorithm, sim(prof_u, prof_v) is the similarity between the current user u and the neighbor user v, and δ(v, r, t) ∈ U×R×T indicates that the user v has a label definition relation to the resource r; the operator := denotes dynamic assignment.

6. The collaborative filtering label recommendation system based on the user quality model according to claim 5, wherein the label system improvement module specifically comprises:

a first processing unit for processing if

a second processing unit for processing if

a third processing unit for adopting historical label information if U_i ∈ S and R_i ∈ S, i.e. both the user and the resource have appeared in the system.

7. The collaborative filtering label recommendation system based on user quality model according to claim 6, wherein the user model building module specifically comprises:

indicating the number of users who have used the tag T_i at least once.

8. The collaborative filtering label recommendation system based on the user quality model according to claim 7, wherein the similarity calculation module is specifically used for: obtaining a model vector of the current user, and calculating the similarity sim(prof_u, prof_v) between the current user and a neighbor user in the system,

wherein prof_u and prof_v are the user model vectors of the current user u and the neighbor user v, respectively.