CN114564639A

CN114564639A - Course recommendation method based on deep session interest interaction model

Info

Publication number: CN114564639A
Application number: CN202210114867.4A
Authority: CN
Inventors: 刘铁园; 吴琼; 古天龙; 常亮
Original assignee: Guilin University of Electronic Technology
Current assignee: Guilin University of Electronic Technology
Priority date: 2022-01-31
Filing date: 2022-01-31
Publication date: 2022-05-31

Abstract

The invention discloses a course recommendation method based on a deep session interest interaction model, which is characterized by mainly comprising the following steps of: screening and preprocessing user data, sequencing behavior information of users and projects according to time, and dividing sessions by taking one day as a time interval; in order to depict the interest of the dynamic change of the user, enrich the interest representation of the user, and apply GRU to capture the dynamic preference of the user; next, inputting the recent behavior data and the dynamic interest representation of the user into a second-layer Attention network to obtain a multi-angle interest representation of the user; and finally, performing inner product on multi-angle interest representation and course vector representation of the user, and selecting each candidate item with high score to recommend the student, thereby solving the problems that the influence of noise items existing in the interaction process of the user and the items is not considered and static and low-rank vectors cannot sufficiently express the interest of the user in the current course-based recommendation method.

Description

Course recommendation method based on deep session interest interaction model

(I) the technical field

The invention relates to the technical fields of machine learning, deep learning, data mining and the like, in particular to a course recommendation method for capturing multi-angle interest preferences of a user.

(II) background of the invention

In recent years, the traditional offline education mode is stressed by the rising labor cost, meanwhile, the personalized requirements of consumers cannot be met, and the challenges of high cost, low profit and the like faced by offline education are increasingly highlighted under the background. And the rapid development of the internet and artificial intelligence is accompanied, so that the online education network environment is greatly improved and promoted. Online education refers to a method for rapid learning and content dissemination through internet technology. The existing online education platform MOOC is popular in online cloud classrooms and the like. Compared with the traditional education mode, the online education has the characteristics of high efficiency, convenience, low threshold and sufficient education resources. However, a large number of courses exist in the online learning platform, so that users are often overwhelmed, and the difficulty of decision making is increased. In order to solve the problem of information overload, course recommendation systems have come to work, aiming to provide personalized services for appropriate users at appropriate times.

The invention discloses a course recommending method and system based on dynamic weight of a graph-convolution neural network, which is published at present, and is disclosed as CN110580314A, wherein a user-course matrix is obtained by acquiring the score value of each course of a user and preprocessing the score value, the graph-convolution neural network is constructed on the basis of the user-course matrix, the user-course score matrix is predicted, and the user-course score matrix is subjected to sequence mode mining to obtain the recommended course sequence of each user. The method does not consider the noise course in the user interactive course, which can cause inaccurate recommending result and dynamic change of the user interest, but not invariable. The invention describes a course recommending method based on a deep conversation interest interaction model, which utilizes time sequence information of user and project interaction, removes noise courses of a user in course sessions by using an attention network, extracts the preference of each session, then simulates the interest of the user in dynamic change by using GRU, finally obtains the interest representation of the user, and obtains the score of each course by combining the course representation so as to recommend the course to students.

Disclosure of the invention

The invention aims to solve the problem that the influence of noise items existing in the interaction process of a user and the items is not considered in the interest extraction process of user conversation in the current course recommendation method; meanwhile, a static and low-rank vector cannot fully express the interest of the user, and the interest of the user is not constant and changes along with time, so that the generated recommendation model cannot recommend personalized courses for the user, and the like.

In order to solve the problems, the invention is realized by the following technical scheme:

and screening and preprocessing the original online learning related behavior data of the downloaded good MOOCCube data set.

And sequencing the behavior information of the user and the project according to time, and dividing the conversation by taking one day as a time interval.

Since all items in a conversation may not be the items that the user really wants to interact with, the attention network is applied to cull noise courses for each conversation and extract the interest of each conversation.

In order to characterize the dynamically changing interest of the user, the user interest expression is enriched, and the GRU is good at capturing the sequence relation, so the GRU is applied to capturing the dynamic preference of the user.

Inputting the latest behavior data of the user and the dynamic interest representation obtained in the fourth step into the second layer Attention network to obtain the multi-angle interest representation of the user.

And finally, performing inner product on the multi-angle interest representation and the course vector representation of the user, obtaining the score of each candidate item, and selecting the score with high score to recommend to the student.

Compared with the prior art, the invention has the following advantages:

in the aspect of data modeling, the time sequence behaviors of the user are divided, courses browsed by the user in one day are divided into one session, and the fact that the characteristics of the courses in the same session are highly homogeneous, and the characteristics of the courses in different sessions are greatly different.

Based on the conversation divided by sequential time modeling, the model can distribute different weights to different items by using an attention mechanism, can effectively eliminate noise courses, extracts interest point information contained in the conversation, and is beneficial to improving the accuracy of recommendation.

In the aspect of user interest modeling, interest may change due to the influence of various factors on the time interval of the conversation of a user, and a learning behavior is a continuous behavior.

After considering the dynamically changing user interest, the model also notes that items recently viewed by the user are also an important contributor to course recommendations. The invention introduces a second layer of Attention, distributes different weights for the dynamic interest representation of the user and the recently interacted items, and finally obtains the multi-angle interest representation of the user, thereby improving the performance.

Description of the drawings

FIG. 1 is a block diagram of a model of the present invention.

FIG. 2 is a flowchart of an overview of course recommendation based on a deep session interest interaction model.

FIG. 3 is a diagram illustrating extraction of session interests.

Fig. 4 is a flow chart of input data generation.

FIG. 5 is a flow chart of extracting session interests.

FIG. 6 is a flow chart of user dynamic interest generation.

FIG. 7 is a flow chart of multi-angle preference generation for a user.

(V) detailed description of the preferred embodiments

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings in combination with specific examples.

The invention describes the specific implementation process of the method by taking course recommendation based on the deep session interest interaction model as an example. The model framework of the invention is shown in FIG. 1, and the overall process of course recommendation based on the deep session interest interaction model is shown in FIG. 2. The specific steps are explained by combining a schematic diagram:

Step 1, downloading a MOOCCube data set in an MOOCData official network, and preprocessing the data after screening.

And 2, after the screened data are obtained in the step 1, arranging the data of the students according to a time sequence, and finally dividing the sequence behavior of each user into sessions.

And 3, embedding the student course records in each session of the user into an embedding layer to obtain a user session vector matrix, and obtaining a session interest vector through an Attention layer, wherein the session interest vector is shown in fig. 3.

And 4, inputting the interest of each session into a GRU module, and simulating the interest of the dynamic change of the user so as to obtain a dynamic interest vector of the user.

And step 5, considering that the influence of each session interest item on the user is possibly different, and the items recently browsed by the user are important factors influencing the preference of the user, taking the recently browsed item set as short-term interest preference, and simultaneously inputting the short-term interest preference and the user dynamic interest vector preference into the Attention layer to obtain multi-angle user interest vector representation.

And 6, performing inner product on the multi-angle preference representation obtained in the step 5 and the candidate course vectors to obtain candidate course scores, sequencing the candidate courses, and selecting the top 10 courses to realize course recommendation.

Fig. 4 shows a flowchart of the input data generation of the present example, which includes the following specific steps:

step 1, the paper published in ACL2020 discloses an open data warehouse for large-scale online education, and this data set includes 706 real online courses, 38181 teaching videos, 114563 concepts, hundreds of thousands of courses for 199999 MOOC users, and resource libraries related to course concepts such as video viewing records. Course data and student behavior data come from the real use environment of the classroom online. The data set is subjected to primary processing, useless data are cleaned, and user.csv files in the selected data set contain the records of the learning behaviors of students. csv contains course information including the course name, the course type, the course profile and other relevant information. The student's learning behavior record is composed of different attributes, which have different meanings. For example, in the user.csv dataset, id represents the student number, name represents the student name, court _ order represents the lesson that the student watched, and enroll _ time represents the time that the student watching the corresponding lesson occurred. Csv, id denotes the course's course _ id, name denotes the course's information, course _ type denotes the type to which the corresponding course belongs, course _ info denotes the course profile, and video _ order denotes the order of the videos contained in this course.

And 2, further processing the two data sets, firstly arranging student behavior records in the user csv data set according to a time sequence, screening users with the number of watching courses larger than 10, then dividing sessions for the student behaviors by taking one day as a unit, and screening users with the number of sessions larger than 4 again. For the court.csv data set, only three types of data, namely user _ id, court _ id and roll _ time, are reserved, and other types of data are discarded. Finally, after data preprocessing, 14580 pieces of student behavior data meeting requirements, including 994 students and 632 courses, are obtained. Resulting in the final desired data set.

Through the processing, the conversation of the user at the corresponding time stamp is achieved, and due to the fact that the behavior data of the user has the following characteristics that firstly, a large number of noise items exist in each conversation of the user, secondly, the items are highly homogeneous in the same conversation, therefore, the noise course is removed for each conversation by applying the attention network, and the interest of each conversation is extracted.

FIG. 5 is a flowchart showing the steps of extracting session interests, including:

step 1, for each user U e U, dividing the session from the sequential data by day unit,

Representing the user's sequential sessions, where N represents the total number of sessions made by user u,

representing the nth session (set of items) of user u. Using the last session in the model

To extract the user's short-term preferences, which is also an important factor in recommending courses for the user. On the other hand, for the dynamic preference of the user, the user can select the dynamic preference

And (4) extracting. The invention inputs the course vector in the conversation into the Attention layer, screens out the course noise through the Attention, in form, pay Attention to the definition of the network:

wherein the content of the first and second substances,

are parameters of the model. We assume that the set of items in the conversation are successively marked from 1 to W, and c_jA dense embedded vector representing item j. Will course vector c_jObtaining a hidden representation h as input to a multilayer perceptron (MLP)_1j. The function Φ (·) is the RELU nonlinear activation function.

And 2, taking the embedded u of the user as a context vector, and measuring the attention score of the user by using a softmax function. Finally, the session features represent

Our invention applies GRUs to model the dynamically changing interests of users, considering that the interests of users are not constant and dynamically change over time.

FIG. 6 is a flowchart showing the dynamic interest generation of a user, and the specific steps include:

Step 1, obtaining conversation characteristics I_kInput into a gated neural unit (GRU) and modeling variable length sequence data can handle temporal dynamics of user item interactions and sequential patterns of user behavior, and can solve the problem of "long term dependence".

And 2, modeling the change of the session interest by applying GRU. The formula for GRU is as follows:

z_t＝σ(W_z·[h_t-1，I_t])

r_t＝σ(W_r·[h_t-1，I_i])

where σ (·) is the sigmoid function, z and r are the update gate and reset gate, respectively, and their vector magnitude and input I_tAnd the same, wherein (·) represents dot product and (·) represents cross product.

Step 3, h obtained in step 2_tIs a hidden state of the GRU, representing the user's dynamic preferences.

In addition to the dynamic interests of the user, the short-term interests of the user should be considered, firstly, ignoring the short-term interests may lead to inaccurate recommendation results, secondly, when in sparse data sets, since the data set demand of the short-term preferences is small, the short-term benefits may be a main determinant for predicting the next item and relieving the sparseness, and finally, the short-term preferences may effectively capture the interests of the user in each dimension in terms of fine granularity. The dynamic interests of the user need to be inferred in their long-term behavior, while the short-term interests of the user are reflected in their short-term behavior and captured by the embedding of the user.

FIG. 7 is a flowchart showing the multi-angle preference generation of a user, comprising the following steps:

step 1, inputting the dynamic preference ht of the user into a second layer of the Attention layer, and assigning the weight to the Attention layer.

Step 2, similar to modeling user dynamic preferences, we also turn to attention network, assigning different weights for long-term representation and item embedding in short-term item sets to capture high-level representations of users.

Wherein

Is a parameter of the model, and j > 0, c_jIs a course of short-term preference sets

When j is 0, c_j＝h_tSimilar to the user interest extraction layer, user embedding is used as a context vector to obtain a mixed representation of user interest.

The computational hybrid user representation is as follows:

wherein, beta₀Is a weight of long-term user interest preferences. u. of_tIs a final user interest representation that considers both long-term dynamic preferences and short-term preferences and assigns them different weights using an attention mechanism, which can capture non-linear interactions between the user and the item at the same time.

Step 3, in order to recommend courses to the user, after obtaining the user interest representation, calculating a preference score of the candidate item j as follows:

R_ujt＝u_tc_j

we then rank the candidate courses according to the user's preference score for the candidate purpose, the purpose of the model being to provide the user with a ranked list of courses at time t. Thus, the model utilizes a pairwise ordered objective function for our model, according to the optimization criteria of BPR. We assume that the user compares to item c, which is not observed _kPreference model recommended candidate item c_jTherefore, the following inequality is present:

R_ujt＞R_ukt

wherein R is_uktIs the user not observing the item c at time t_kThe preference score of (1). Thus we have a paired set of items

We then trained the model by maximizing the a posteriori (MAP), as follows:

wherein Θ is { U, C, W ═ W₁，W₂，b₁，b₂Is the parameter set of the model, σ is the logistic regression function, Θ_ucIs a set of embedded matrices for users and courses, Θ_a＝{W₁，W₂，W_z，W_rW is a set of weight matrices for attention layer and GRU, λ ═ λ_uc，λ_aIs the regularization parameter.

It should be noted that, although the above-mentioned embodiments illustrate the present invention, the present invention is not limited thereto, and thus the present invention is not limited to the above-mentioned embodiments. Other embodiments, which can be made by those skilled in the art in light of the teachings of the present invention, are considered to be within the scope of the present invention without departing from its principles.

Claims

1. A course recommendation method based on a deep session interest interaction model is characterized in that: screening and preprocessing user data, sequencing behavior information of users and projects according to time, and dividing sessions by taking one day as a time interval; in order to depict the interest of the dynamic change of the user, enrich the interest representation of the user, and apply GRU to capture the dynamic preference of the user; next, inputting the recent behavior data and the dynamic interest representation of the user into a second-layer Attention network to obtain a multi-angle interest representation of the user; and finally, performing inner product on the multi-angle interest representation and the course vector representation of the user, and selecting each candidate item with high score to recommend the student. Therefore, the problems that the influence of noise items existing in the interaction process of the user and the items is not considered and static and low-rank vectors cannot sufficiently express the interest of the user in the current course-based recommendation method are solved.

2. The course recommendation method based on the deep session interest interaction model as claimed in claim 1, wherein: in the aspect of data modeling, the invention divides the time sequence behaviors of the user, divides the courses browsed by the user in one day into one session, and divides the course characteristics in the same session into a plurality of sessions, wherein the course characteristics are highly homogeneous, and the course characteristics are greatly different among different sessions.

3. The course recommendation method based on the deep session interest interaction model as claimed in claim 1, wherein: based on the conversation divided by sequential time modeling, the model can distribute different weights to different items by using an attention mechanism, can effectively eliminate noise courses, extracts interest point information contained in the conversation, and is beneficial to improving the accuracy of recommendation.

4. The course recommendation method based on the deep session interest interaction model as claimed in claim 1, wherein: in the aspect of user interest modeling, interest may change due to the influence of various factors on the time interval of the conversation of the user, and the learning behavior is a continuous behavior, so that the experience of online learning of students is improved by simulating the change of interest between each conversation of the user by using the GRU.

5. After considering the dynamically changing user interest perspective, the model also notes that items recently viewed by the user are also an important contributor to the course recommendations. The invention introduces a second layer of Attention, distributes different weights for the dynamic interest representation of the user and the recently interacted items, and finally obtains the multi-angle interest representation of the user, thereby improving the performance.