CN115510286A - Multi-relation cognitive diagnosis method based on graph convolution network - Google Patents

Multi-relation cognitive diagnosis method based on graph convolution network

Info

Publication number
CN115510286A
CN115510286A (application number CN202211389531.5A)
Authority
CN
China
Prior art keywords: student, node, formula, ith, students
Prior art date: 2022-11-08
Legal status: Pending
Application number
CN202211389531.5A
Other languages
Chinese (zh)
Inventor
Zhang Kun (张琨)
Wu Kaifang (吴开放)
Yang Yonghui (杨永辉)
Wu Le (吴乐)
Current Assignee
Hefei University of Technology
Original Assignee
Hefei University of Technology
Priority date: 2022-11-08
Filing date: 2022-11-08
Publication date: 2022-12-23
Application filed by Hefei University of Technology filed Critical Hefei University of Technology
Priority to CN202211389531.5A
Publication of CN115510286A

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F 16/00 - Information retrieval; Database structures therefor; File system structures therefor
    • G06F 16/90 - Details of database functions independent of the retrieved data types
    • G06F 16/901 - Indexing; Data structures therefor; Storage structures
    • G06F 16/9024 - Graphs; Linked lists
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 3/00 - Computing arrangements based on biological models
    • G06N 3/02 - Neural networks
    • G06N 3/08 - Learning methods
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 5/00 - Computing arrangements using knowledge-based models
    • G06N 5/02 - Knowledge representation; Symbolic representation
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06Q - INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q 50/00 - Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q 50/10 - Services
    • G06Q 50/20 - Education
    • G06Q 50/205 - Education administration or guidance


Abstract

The invention discloses a multi-relation cognitive diagnosis method based on a graph convolution network, which comprises the following steps: 1. constructing heterogeneous data: the students' historical answer record matrix, the exercise-knowledge point association matrix and the student-knowledge point interaction matrix; 2. performing feature propagation separately over the students' correct and incorrect answer results with graph convolution, and relieving the data sparsity caused by splitting the students' answer records by correctness through graph contrastive learning; 3. modeling the interaction relation between students and knowledge points and the inclusion relation between exercises and knowledge points with an attention mechanism; 4. fusing the different features obtained by the two modules and predicting student performance with a neural cognitive diagnosis model. The invention fully exploits the influence of students' different answer results on their abilities and models students' mastery of different knowledge points from two perspectives, thereby achieving more accurate prediction of student performance and of students' mastery degree on specific knowledge points.

Description

Multi-relation cognitive diagnosis method based on graph convolution network
Technical Field
The invention relates to cognitive diagnosis in the field of intelligent education, in particular to a multi-relation cognitive diagnosis method based on a graph convolution network.
Background
In the field of education, with the wide application of intelligent education systems, how to model students' knowledge mastery and recommend personalized exercises has become a focus of attention.
A cognitive diagnosis model infers students' mastery of different knowledge points from their historical answer records and then uses the diagnosed state to predict how accurately a student will answer exercises he or she has not yet attempted. Early work used manually designed functions to model the linear relation between students and exercises, as in traditional item response theory, which models a single student ability. In existing work, the student and exercise representations obtained by self-training a neural network cannot fully mine the complex interactions between students and exercises, and work that models the hierarchical relation graph among students, exercises and concepts ignores the influence of students' different answer results on knowledge point mastery. In addition, the number of exercises is far larger than the number of knowledge points and one exercise usually contains several knowledge points, so representations obtained only by modeling student-exercise interactions are relatively coarse, and prior cognitive diagnosis work omits the direct interaction between students and knowledge points. How to use students' historical answer records and the association between exercises and knowledge points to obtain students' mastery of different knowledge points and to predict students' answers has therefore become a problem worth studying.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provides a multi-relation cognitive diagnosis method based on a graph convolution network, so that the influence of students' different answer results on their abilities can be fully exploited and students' mastery of different knowledge points can be modeled from two perspectives, thereby achieving more accurate prediction of student performance and of students' mastery degree on specific knowledge points.
The invention adopts the following technical scheme for solving the technical problems:
the invention relates to a multi-relation cognitive diagnosis method based on a graph convolution network, which is characterized by comprising the following steps of:
step 1, obtaining historical answer records of students and constructing heterogeneous data:
Denote the student set by S = {s_1, ..., s_a, ..., s_b, ..., s_M}, where s_a denotes the a-th student, s_b denotes the b-th student, M denotes the total number of students, and 1 ≤ a, b ≤ M; denote the exercise set by E = {e_1, ..., e_i, ..., e_j, ..., e_N}, where e_i denotes the i-th exercise, e_j denotes the j-th exercise, N denotes the total number of exercises, and 1 ≤ i, j ≤ N; denote the knowledge point set by C = {c_1, ..., c_f, ..., c_g, ..., c_K}, where c_f denotes the f-th knowledge point, c_g denotes the g-th knowledge point, K denotes the total number of knowledge points, and 1 ≤ f, g ≤ K;
Let r_ai denote the result of the a-th student s_a answering the i-th exercise e_i, so that the students' historical answer matrix is R = {r_ai}_{M×N}; the historical answer matrix R is split into a correctly-answered exercise matrix R+ and an incorrectly-answered exercise matrix R-; let p_af indicate that the a-th student s_a has answered an exercise containing the f-th knowledge point c_f, so that the student-knowledge point interaction matrix is P = {p_af}_{M×K}; let q_ig indicate that the i-th exercise e_i contains the g-th knowledge point c_g, so that the exercise-knowledge point association matrix is Q = {q_ig}_{N×K};
Step 2, initializing student characterization vectors, exercise characterization vectors and knowledge point characterization vectors;
Step 2.1, initialize the student set S with the Xavier method to obtain the student representation T = {t_1, ..., t_a, ..., t_M}, where t_a denotes the d-dimensional vector of the a-th student s_a;
Step 2.2, initialize the exercise set E with the Xavier method to obtain the exercise representation X = {x_1, ..., x_i, ..., x_N}, where x_i denotes the d-dimensional vector of the i-th exercise e_i;
Step 2.3, initialize the knowledge point set C with the Xavier method to obtain the knowledge point representation O = {o_1, ..., o_f, ..., o_K}, where o_f denotes the d-dimensional vector of the f-th knowledge point c_f;
Step 3, construct the multi-relation cognitive diagnosis model based on the graph convolution network, which comprises: an exercise-level learning module based on relational graphs, a knowledge-point-level learning module based on an attention mechanism, and a neural cognitive diagnosis module;
Step 3.1, processing of the exercise-level learning module based on relational graphs:
Step 3.1.1, according to the two relations of answering correctly and answering incorrectly in the students' historical answer records, construct two student-exercise bipartite graphs G+ and G-, where G+ is the graph formed by connecting students with the exercises they answered correctly, and G- is the graph formed by connecting students with the exercises they answered incorrectly;
Step 3.1.2, for graph G+, compute with formula (1) the feature vector of the a-th student node and the feature vector of the i-th exercise node after the l-th update; in formula (1), the neighborhood of the a-th student node is the set of exercises that student s_a answered correctly, the neighborhood of the i-th exercise node is the set of students who answered exercise e_i correctly, and the inputs are the feature vectors of the a-th student node and of the i-th exercise node after the (l-1)-th update; when l = 1, the initialized vectors t_a and x_i obtained in step 2 are used;
Step 3.1.3, for graph G+, obtain with formula (2) the exercise-level characterization of the a-th student node and the exercise-level characterization of the i-th exercise node on graph G+;
Step 3.1.4, for graph G-, obtain the characterizations of the student nodes and the exercise nodes by the same procedure as steps 3.1.2 and 3.1.3;
Step 3.1.5, obtain with formula (3) the exercise-level characterization of the a-th student node and the exercise-level characterization of the i-th exercise node by fusing the characterizations learned on graphs G+ and G-; in formula (3), R' denotes the relation set containing the two relations of answering correctly and answering incorrectly;
Step 3.1.6, take a student node or an exercise node in graph G+ as the anchor sample, take the corresponding student node or exercise node in graph G- as its positive sample, and take the other student nodes or exercise nodes in graph G+, excluding the anchor sample, as negative samples;
Step 3.1.7, construct with formula (4) the contrastive loss function L_s of the student nodes; in formula (4), T+ denotes the student node characterizations in graph G+, T- denotes the student node characterizations in graph G-, S' denotes the set of student nodes other than the a-th student node, τ is a temperature parameter, and sim denotes cosine similarity;
Step 3.1.8, construct with formula (5) the contrastive loss function L_e of the exercise nodes; in formula (5), X+ denotes the exercise node characterization vectors in graph G+, X- denotes the exercise node characterization vectors in graph G-, and E' denotes the set of exercise nodes other than the i-th exercise node;
Step 3.2, processing of the knowledge-point-level learning module based on the attention mechanism:
Step 3.2.1, obtain the student-knowledge point interaction matrix P from the students' historical answer records R and the exercise-knowledge point association matrix Q; obtain with formula (6) the weight α_af of the a-th student node's degree of mastery of the f-th knowledge point, and then obtain with formula (7) the knowledge-level characterization vector of the a-th student node; formula (6) is computed over the set of knowledge points that have interactions with the a-th student node;
Step 3.2.2, obtain with formula (8) the association weight β_ig between the i-th exercise node and the g-th knowledge point, and then obtain with formula (9) the knowledge-level characterization vector of the i-th exercise node; formula (8) is computed over the set of knowledge points contained in the i-th exercise node;
Step 3.3, processing of the neural cognitive diagnosis module:
Step 3.3.1, obtain with formula (10) the final characterization vector of the a-th student node and the final characterization vector of the i-th exercise node; in formula (10), [;] denotes the concatenation operation, σ(·) denotes the sigmoid activation function, FC_s denotes the fully connected layer that fuses the student characterizations, and FC_e denotes the fully connected layer that fuses the exercise characterizations;
Step 3.3.2, predict with formula (11) the score of the a-th student node correctly answering the i-th exercise node; in formula (11), F(·) denotes a multilayer perceptron with two fully connected layers whose weights are non-negative, and the one-hot encoded vector that maps the i-th exercise to its knowledge points is used; the scalar value obtained by reducing the dimension of the i-th exercise node characterization vector, given by formula (12), is also used in formula (11); in formula (12), FC_1 denotes a fully connected layer;
Step 4, construct the cross-entropy loss function L_se with formula (13); in formula (13), (a, i) denotes a student-exercise sample pair collected during training, where a ∈ S and i ∈ E, and y_ai denotes the true score of the a-th student on the i-th exercise;
Step 5, construct the total loss function Loss with formula (14):
Loss = L_se + λ(L_s + L_e)   (14)
in formula (14), λ is a hyperparameter weight that balances the different loss values;
Step 6, train the cognitive diagnosis model with the Adam optimizer and minimize the total loss function Loss to update the model parameters until convergence, so that the trained cognitive diagnosis model can be used to predict students' answers, and the corresponding students' mastery of different knowledge points is obtained from the final characterization vectors of the student nodes.
The electronic device of the invention comprises a memory and a processor, wherein the memory is used for storing a program that supports the processor in executing the multi-relation cognitive diagnosis method, and the processor is configured to execute the program stored in the memory.
The computer-readable storage medium of the invention stores a computer program which, when executed by a processor, performs the steps of the multi-relation cognitive diagnosis method.
Compared with the prior art, the invention has the beneficial effects that:
1. For students' different answering behaviors, the invention divides the relation between students and exercises into a correct-answer relation and a wrong-answer relation, performs representation learning on the student-exercise interactions in the form of graphs, and propagates and fuses features through graph convolution layers to learn more accurate representations of students, exercises and knowledge points, thereby better diagnosing students' mastery of knowledge points.
2. To perform cognitive diagnosis better, the invention develops a multi-relation cognitive diagnosis method based on a graph convolution network, which learns student features and exercise features from the two perspectives of exercises and knowledge points and fully considers the interactions among multi-source data, which is more conducive to measuring students' abilities at different knowledge points and, at the same time, effectively improves the accuracy of student score prediction.
3. The invention designs a graph contrastive learning method based on relational data, in which positive and negative sample pairs are constructed from the two perspectives of correctly answered and incorrectly answered exercises, effectively relieving the data sparsity problem and thereby effectively improving model performance.
Drawings
FIG. 1 is a flow chart of a multi-relationship cognitive diagnosis model based on a graph convolution network.
Detailed Description
In this embodiment, considering that the correctness of a student's answers influences the degree of knowledge point mastery differently, exercise-level learning based on an adaptive graph convolution network and knowledge-point-level learning based on an attention mechanism are fused at the feature level to obtain more accurate representations of student ability and exercise features, so that students' mastery of knowledge points is diagnosed better. As shown in FIG. 1, the method comprises the following steps:
step 1, obtaining historical answer records of students and constructing heterogeneous data:
Denote the student set by S = {s_1, ..., s_a, ..., s_b, ..., s_M}, where s_a denotes the a-th student, s_b denotes the b-th student, M denotes the total number of students, and 1 ≤ a, b ≤ M; denote the exercise set by E = {e_1, ..., e_i, ..., e_j, ..., e_N}, where e_i denotes the i-th exercise, e_j denotes the j-th exercise, N denotes the total number of exercises, and 1 ≤ i, j ≤ N; denote the knowledge point set by C = {c_1, ..., c_f, ..., c_g, ..., c_K}, where c_f denotes the f-th knowledge point, c_g denotes the g-th knowledge point, K denotes the total number of knowledge points, and 1 ≤ f, g ≤ K;
Let r_ai denote the result of the a-th student s_a answering the i-th exercise e_i, so that the students' historical answer matrix is R = {r_ai}_{M×N}; the historical answer matrix R is split into a correctly-answered exercise matrix R+ and an incorrectly-answered exercise matrix R-; let p_af indicate that the a-th student s_a has answered an exercise containing the f-th knowledge point c_f, so that the student-knowledge point interaction matrix is P = {p_af}_{M×K}; let q_ig indicate that the i-th exercise e_i contains the g-th knowledge point c_g, so that the exercise-knowledge point association matrix is Q = {q_ig}_{N×K};
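By way of illustration only, the following sketch shows one way the heterogeneous matrices of step 1 could be assembled from raw answer logs; the variable names and toy sizes are assumptions for illustration and do not limit the invention.

import numpy as np

# Toy sizes and logs (illustrative only): M students, N exercises, K knowledge points.
M, N, K = 3, 4, 2
logs = [(0, 0, 1), (0, 1, 0), (1, 1, 1), (2, 3, 0)]      # (student a, exercise i, result r_ai)
exercise_concepts = {0: [0], 1: [0, 1], 2: [1], 3: [1]}   # exercise -> contained knowledge points

R = np.zeros((M, N))       # historical answer matrix R
R_pos = np.zeros((M, N))   # correctly answered exercises (R+)
R_neg = np.zeros((M, N))   # incorrectly answered exercises (R-)
Q = np.zeros((N, K))       # exercise-knowledge point association matrix Q
for i, concepts in exercise_concepts.items():
    Q[i, concepts] = 1.0
for a, i, r in logs:
    R[a, i] = 1.0
    (R_pos if r == 1 else R_neg)[a, i] = 1.0

# Student-knowledge point interaction matrix P: student a interacts with knowledge
# point f if some exercise the student answered contains that knowledge point.
P = (R @ Q > 0).astype(float)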
Step 2, initializing student characterization vectors, exercise characterization vectors and knowledge point characterization vectors;
Step 2.1, initialize the student set S with the Xavier method to obtain the student representation T = {t_1, ..., t_a, ..., t_M}, where t_a denotes the d-dimensional vector of the a-th student s_a;
Step 2.2, initialize the exercise set E with the Xavier method to obtain the exercise representation X = {x_1, ..., x_i, ..., x_N}, where x_i denotes the d-dimensional vector of the i-th exercise e_i;
Step 2.3, initialize the knowledge point set C with the Xavier method to obtain the knowledge point representation O = {o_1, ..., o_f, ..., o_K}, where o_f denotes the d-dimensional vector of the f-th knowledge point c_f;
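A minimal sketch of the Xavier initialization of step 2 is given below; PyTorch is assumed, and the dimension d and the sizes are illustrative only.

import torch

M, N, K, d = 3, 4, 2, 64                    # illustrative sizes; d is the embedding dimension
T = torch.nn.Parameter(torch.empty(M, d))   # student characterization vectors
X = torch.nn.Parameter(torch.empty(N, d))   # exercise characterization vectors
O = torch.nn.Parameter(torch.empty(K, d))   # knowledge point characterization vectors
for emb in (T, X, O):
    torch.nn.init.xavier_uniform_(emb)      # Xavier (Glorot) uniform initialization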
Step 3, construct the multi-relation cognitive diagnosis model based on the graph convolution network, which comprises: an exercise-level learning module based on relational graphs, a knowledge-point-level learning module based on an attention mechanism, and a neural cognitive diagnosis module;
Step 3.1, processing of the exercise-level learning module based on relational graphs:
Step 3.1.1, according to the two relations of answering correctly and answering incorrectly in the students' historical answer records, construct two student-exercise bipartite graphs G+ and G-, where G+ is the graph formed by connecting students with the exercises they answered correctly, and G- is the graph formed by connecting students with the exercises they answered incorrectly;
Step 3.1.2, for graph G+, compute with formula (1) the feature vector of the a-th student node and the feature vector of the i-th exercise node after the l-th update; in formula (1), the neighborhood of the a-th student node is the set of exercises that student s_a answered correctly, the neighborhood of the i-th exercise node is the set of students who answered exercise e_i correctly, and the inputs are the feature vectors of the a-th student node and of the i-th exercise node after the (l-1)-th update; when l = 1, the initialized vectors t_a and x_i obtained in step 2 are used;
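Formula (1) itself appears only as an image in the published document; as a hedged illustration, a bipartite propagation layer of this kind is often written in the symmetrically normalized LightGCN form below, with the neighbor sets described in step 3.1.2 (this is an assumed form for illustration, not necessarily the exact formula of the invention):

t_a^{+,(l)} = \sum_{i \in \mathcal{N}_a^{+}} \frac{1}{\sqrt{|\mathcal{N}_a^{+}|}\,\sqrt{|\mathcal{N}_i^{+}|}} \, x_i^{+,(l-1)}, \qquad
x_i^{+,(l)} = \sum_{a \in \mathcal{N}_i^{+}} \frac{1}{\sqrt{|\mathcal{N}_i^{+}|}\,\sqrt{|\mathcal{N}_a^{+}|}} \, t_a^{+,(l-1)}

where \mathcal{N}_a^{+} denotes the set of exercises that student s_a answered correctly and \mathcal{N}_i^{+} denotes the set of students who answered exercise e_i correctly.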
Step 3.1.3, for graph G+, obtain with formula (2) the exercise-level characterization of the a-th student node and the exercise-level characterization of the i-th exercise node on graph G+;
Step 3.1.4, for graph G-, obtain the characterizations of the student nodes and the exercise nodes by the same procedure as steps 3.1.2 and 3.1.3;
Step 3.1.5, obtain with formula (3) the exercise-level characterization of the a-th student node and the exercise-level characterization of the i-th exercise node by fusing the characterizations learned on graphs G+ and G-; in formula (3), R' denotes the relation set containing the two relations of answering correctly and answering incorrectly;
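By way of illustration only, the following PyTorch sketch mirrors the structure of steps 3.1.2 to 3.1.5 on one relation graph and then fuses the two relation graphs; the symmetric normalization, the per-layer mean readout standing in for formula (2), and the plain averaging standing in for formula (3) are assumptions, not the exact formulas of the invention.

import torch

def normalize(adj: torch.Tensor) -> torch.Tensor:
    # Symmetric degree normalization of an (M, N) student-exercise adjacency matrix.
    d_s = adj.sum(1, keepdim=True).clamp(min=1).rsqrt()   # 1/sqrt(student degree)
    d_e = adj.sum(0, keepdim=True).clamp(min=1).rsqrt()   # 1/sqrt(exercise degree)
    return adj * d_s * d_e

def propagate(adj_norm: torch.Tensor, T0: torch.Tensor, X0: torch.Tensor, L: int = 3):
    # Alternate student <-> exercise aggregation on one relation graph (e.g. G+),
    # then read out with a mean over layers (assumed stand-in for formula (2)).
    t, x = T0, X0
    t_layers, x_layers = [t], [x]
    for _ in range(L):
        t, x = adj_norm @ x, adj_norm.t() @ t              # stand-in for formula (1)
        t_layers.append(t)
        x_layers.append(x)
    return torch.stack(t_layers).mean(0), torch.stack(x_layers).mean(0)

def fuse(rep_pos: torch.Tensor, rep_neg: torch.Tensor) -> torch.Tensor:
    # Fusion over the relation set R' = {answer correct, answer wrong};
    # a plain average is assumed here as a stand-in for formula (3).
    return (rep_pos + rep_neg) / 2

Under these assumptions, one would call propagate(normalize(adj_pos), T, X) and propagate(normalize(adj_neg), T, X) on the two graphs and pass the results to fuse to obtain the exercise-level characterizations.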
Step 3.1.6, take a student node or an exercise node in graph G+ as the anchor sample, take the corresponding student node or exercise node in graph G- as its positive sample, and take the other student nodes or exercise nodes in graph G+, excluding the anchor sample, as negative samples;
Step 3.1.7, construct with formula (4) the contrastive loss function L_s of the student nodes; in formula (4), T+ denotes the student node characterizations in graph G+, T- denotes the student node characterizations in graph G-, S' denotes the set of student nodes other than the a-th student node, τ is a temperature parameter, and sim denotes cosine similarity;
Step 3.1.8, construct with formula (5) the contrastive loss function L_e of the exercise nodes; in formula (5), X+ denotes the exercise node characterization vectors in graph G+, X- denotes the exercise node characterization vectors in graph G-, and E' denotes the set of exercise nodes other than the i-th exercise node;
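A hedged sketch of the graph contrastive loss of steps 3.1.6 to 3.1.8 is given below; the InfoNCE-style denominator is an assumed common form, since the exact formulas (4) and (5) appear as images in the published document.

import torch
import torch.nn.functional as F

def graph_contrastive_loss(Z_pos: torch.Tensor, Z_neg: torch.Tensor, tau: float = 0.2):
    # Z_pos: node characterizations in G+ (anchors); Z_neg: the same nodes in G- (positives);
    # the other nodes of G+ serve as negatives; tau is the temperature parameter.
    z1 = F.normalize(Z_pos, dim=1)
    z2 = F.normalize(Z_neg, dim=1)
    sim_cross = (z1 * z2).sum(dim=1) / tau                 # cosine similarity anchor <-> positive
    sim_intra = (z1 @ z1.t()) / tau                        # cosine similarity anchor <-> other G+ nodes
    mask = torch.eye(z1.size(0), dtype=torch.bool, device=z1.device)
    sim_intra = sim_intra.masked_fill(mask, float("-inf"))  # exclude the anchor itself
    denom = torch.exp(sim_cross) + torch.exp(sim_intra).sum(dim=1)
    return -torch.log(torch.exp(sim_cross) / denom).mean()

# L_s would be graph_contrastive_loss over student nodes and L_e over exercise nodes.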
Step 3.2, processing of the knowledge-point-level learning module based on the attention mechanism:
Step 3.2.1, obtain the student-knowledge point interaction matrix P from the students' historical answer records R and the exercise-knowledge point association matrix Q; obtain with formula (6) the weight α_af of the a-th student node's degree of mastery of the f-th knowledge point, and then obtain with formula (7) the knowledge-level characterization vector of the a-th student node; formula (6) is computed over the set of knowledge points that have interactions with the a-th student node;
Step 3.2.2, obtain with formula (8) the association weight β_ig between the i-th exercise node and the g-th knowledge point, and then obtain with formula (9) the knowledge-level characterization vector of the i-th exercise node; formula (8) is computed over the set of knowledge points contained in the i-th exercise node;
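By way of illustration, the following sketch shows a masked attention of the kind described in steps 3.2.1 and 3.2.2; the exact scoring function of formulas (6)-(9) is given as images in the published document, so the dot-product form and the names below are assumptions.

import torch
import torch.nn.functional as F

def knowledge_level_reps(T, X, O, P, Q):
    # T: (M, d) students, X: (N, d) exercises, O: (K, d) knowledge points;
    # P: (M, K) student-knowledge point interaction mask, Q: (N, K) exercise-knowledge point mask.
    def attend(queries, mask):
        scores = queries @ O.t()                               # relevance to each knowledge point
        scores = scores.masked_fill(mask == 0, float("-inf"))  # only interacted / contained points
        weights = torch.nan_to_num(F.softmax(scores, dim=1))   # alpha_af / beta_ig analogues
        return weights @ O                                     # knowledge-level characterization
    return attend(T, P), attend(X, Q)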
Step 3.3, processing of the neural cognitive diagnosis module:
Step 3.3.1, obtain with formula (10) the final characterization vector of the a-th student node and the final characterization vector of the i-th exercise node; in formula (10), [;] denotes the concatenation operation, σ(·) denotes the sigmoid activation function, FC_s denotes the fully connected layer that fuses the student characterizations, and FC_e denotes the fully connected layer that fuses the exercise characterizations;
Step 3.3.2, predict with formula (11) the score of the a-th student node correctly answering the i-th exercise node; in formula (11), F(·) denotes a multilayer perceptron with two fully connected layers whose weights are non-negative, and the one-hot encoded vector that maps the i-th exercise to its knowledge points is used; the scalar value obtained by reducing the dimension of the i-th exercise node characterization vector, given by formula (12), is also used in formula (11); in formula (12), FC_1 denotes a fully connected layer;
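The prediction head of steps 3.3.1 and 3.3.2 can be sketched as follows in the style of NeuralCD; the layer sizes, the way the fused vectors enter the interaction, and the non-negative weight clipping are assumptions for illustration rather than the exact formulas (10)-(12).

import torch
import torch.nn as nn

class DiagnosisPredictor(nn.Module):
    def __init__(self, d: int, K: int):
        super().__init__()
        self.fc_s = nn.Linear(2 * d, K)   # FC_s: fuse exercise-level + knowledge-level student reps
        self.fc_e = nn.Linear(2 * d, K)   # FC_e: fuse exercise-level + knowledge-level exercise reps
        self.fc_1 = nn.Linear(2 * d, 1)   # FC_1: scalar from the exercise characterization (formula (12))
        self.mlp = nn.Sequential(nn.Linear(K, 64), nn.Sigmoid(),
                                 nn.Linear(64, 1), nn.Sigmoid())  # F(.): two non-negative-weight FC layers

    def forward(self, t_graph, t_kc, x_graph, x_kc, k_onehot):
        t = torch.sigmoid(self.fc_s(torch.cat([t_graph, t_kc], dim=1)))     # final student vector
        x = torch.sigmoid(self.fc_e(torch.cat([x_graph, x_kc], dim=1)))     # final exercise vector
        disc = torch.sigmoid(self.fc_1(torch.cat([x_graph, x_kc], dim=1)))  # scalar of formula (12)
        return self.mlp(k_onehot * (t - x) * disc)                          # predicted score

    def clip_negative_weights(self):
        # Keep the prediction MLP weights non-negative after each optimizer step.
        for layer in self.mlp:
            if isinstance(layer, nn.Linear):
                layer.weight.data.clamp_(min=0)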
Step 4, construct the cross-entropy loss function L_se with formula (13); in formula (13), (a, i) denotes a student-exercise sample pair collected during training, where a ∈ S and i ∈ E, and y_ai denotes the true score of the a-th student on the i-th exercise;
Step 5, construct the total loss function Loss with formula (14):
Loss = L_se + λ(L_s + L_e)   (14)
in formula (14), λ is a hyperparameter weight that balances the different loss values;
Step 6, train the cognitive diagnosis model with the Adam optimizer and minimize the total loss function Loss to update the model parameters until convergence, so that the trained cognitive diagnosis model can be used to predict students' answers, and the corresponding students' mastery of different knowledge points is obtained from the final characterization vectors of the student nodes.
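Training as in steps 4 to 6 can be sketched as follows; the toy data, the learning rate and the λ value are placeholder assumptions, and the contrastive terms are left as stubs to keep the example self-contained.

import torch

torch.manual_seed(0)
student_emb = torch.nn.Parameter(torch.randn(3, 8))                 # stand-in trainable parameters
exercise_emb = torch.nn.Parameter(torch.randn(4, 8))
optimizer = torch.optim.Adam([student_emb, exercise_emb], lr=1e-3)  # Adam optimizer of step 6
bce = torch.nn.BCELoss()
lam = 0.1                                                           # lambda of formula (14), assumed value

pairs = [(0, 1, 1.0), (2, 3, 0.0)]                                  # (student a, exercise i, true score y_ai)
a_idx = torch.tensor([p[0] for p in pairs])
i_idx = torch.tensor([p[1] for p in pairs])
y_true = torch.tensor([p[2] for p in pairs])

for epoch in range(100):
    optimizer.zero_grad()
    y_pred = torch.sigmoid((student_emb[a_idx] * exercise_emb[i_idx]).sum(1))
    L_se = bce(y_pred, y_true)                 # cross-entropy loss, formula (13)
    L_s = L_e = torch.zeros(())                # contrastive losses (formulas (4), (5)) stubbed out
    loss = L_se + lam * (L_s + L_e)            # total loss, formula (14)
    loss.backward()
    optimizer.step()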
In this embodiment, an electronic device comprises a memory and a processor, wherein the memory is used for storing a program that supports the processor in executing the above multi-relation cognitive diagnosis method, and the processor is configured to execute the program stored in the memory.
In this embodiment, a computer-readable storage medium stores a computer program which, when executed by a processor, performs the steps of the multi-relation cognitive diagnosis method.
Example:
To verify the effectiveness of the method, two public datasets commonly used in intelligent education research are adopted: ASSISTments and EdNet. In each dataset, only students with no fewer than 15 answer records are kept, exercises that contain no knowledge point information are removed, and 1000 students are randomly selected from the large-scale EdNet dataset for the study.
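By way of illustration, a preprocessing sketch of the kind described above is given below using pandas; the column names are assumptions about the raw log format.

import pandas as pd

logs = pd.DataFrame(columns=["student_id", "exercise_id", "score", "kc_id"])  # raw answer logs
logs = logs.dropna(subset=["kc_id"])                        # drop exercises without knowledge points
counts = logs.groupby("student_id")["exercise_id"].count()
keep = counts[counts >= 15].index                           # students with at least 15 answer records
logs = logs[logs["student_id"].isin(keep)]
unique_students = pd.Series(logs["student_id"].unique())
sampled = unique_students.sample(n=min(1000, len(unique_students)), random_state=42)
logs = logs[logs["student_id"].isin(set(sampled))]          # 1000 students for the EdNet dataset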
For the student performance prediction task, ACC, AUC and RMSE are adopted as evaluation metrics. Five methods are selected for comparison: IRT, MIRT, DINA, NeuralCD and RCD. Table 1 reports the experimental results on the above datasets; it can be observed that the proposed method outperforms the comparison methods on all three metrics of ACC, AUC and RMSE.
Table 1. Student performance prediction results of the inventive method and the comparison methods on the datasets
Ablation experiments are conducted to explore the influence of each component: MRCD-GRAPH denotes performing only the exercise-level representation learning; MRCD-KC denotes performing only the knowledge-point-level representation learning; MRCD-GRAPH-KC(T) denotes not splitting the answer records by correctness; and MRCD-GRAPH-KC(RGCN) denotes using R-GCN to model and learn the student-exercise interaction graph.
Table 2. Student performance prediction results of the ablation experiments of the inventive method
From Table 2 it can be observed that the exercise-level learning module plays the most important role. This module is very important in the MRCD model because it focuses on the high-order interactions between students and exercises and uses a graph contrastive learning framework to alleviate the sparsity problem. Furthermore, when correct-answer and wrong-answer behaviors are treated as the same relation, model performance also degrades, which indicates that modeling the wrong-answer relation between students and exercises is necessary.
The number of graph convolution layers in the module is a hyperparameter verified by experiments, and the results are shown in Table 3 below: model performance improves as the number of GCN layers increases, indicating that the information learned from high-order interactions is helpful for student performance prediction.
Table 3. Performance comparison of the inventive method under different propagation depths D
In conclusion, the proposed method is significantly superior to the various comparison methods, which demonstrates its feasibility.

Claims (3)

1. A multi-relation cognitive diagnosis method based on a graph convolution network is characterized by comprising the following steps:
step 1, obtaining historical answer records of students and constructing heterogeneous data:
Denote the student set by S = {s_1, ..., s_a, ..., s_b, ..., s_M}, where s_a denotes the a-th student, s_b denotes the b-th student, M denotes the total number of students, and 1 ≤ a, b ≤ M; denote the exercise set by E = {e_1, ..., e_i, ..., e_j, ..., e_N}, where e_i denotes the i-th exercise, e_j denotes the j-th exercise, N denotes the total number of exercises, and 1 ≤ i, j ≤ N; denote the knowledge point set by C = {c_1, ..., c_f, ..., c_g, ..., c_K}, where c_f denotes the f-th knowledge point, c_g denotes the g-th knowledge point, K denotes the total number of knowledge points, and 1 ≤ f, g ≤ K;
Let r_ai denote the result of the a-th student s_a answering the i-th exercise e_i, so that the students' historical answer matrix is R = {r_ai}_{M×N}; the historical answer matrix R is split into a correctly-answered exercise matrix R+ and an incorrectly-answered exercise matrix R-; let p_af indicate that the a-th student s_a has answered an exercise containing the f-th knowledge point c_f, so that the student-knowledge point interaction matrix is P = {p_af}_{M×K}; let q_ig indicate that the i-th exercise e_i contains the g-th knowledge point c_g, so that the exercise-knowledge point association matrix is Q = {q_ig}_{N×K};
Step 2, initializing student characterization vectors, exercise characterization vectors and knowledge point characterization vectors;
Step 2.1, initialize the student set S with the Xavier method to obtain the student representation T = {t_1, ..., t_a, ..., t_M}, where t_a denotes the d-dimensional vector of the a-th student s_a;
Step 2.2, initialize the exercise set E with the Xavier method to obtain the exercise representation X = {x_1, ..., x_i, ..., x_N}, where x_i denotes the d-dimensional vector of the i-th exercise e_i;
Step 2.3, initialize the knowledge point set C with the Xavier method to obtain the knowledge point representation O = {o_1, ..., o_f, ..., o_K}, where o_f denotes the d-dimensional vector of the f-th knowledge point c_f;
Step 3, construct the multi-relation cognitive diagnosis model based on the graph convolution network, which comprises: an exercise-level learning module based on relational graphs, a knowledge-point-level learning module based on an attention mechanism, and a neural cognitive diagnosis module;
Step 3.1, processing of the exercise-level learning module based on relational graphs:
Step 3.1.1, according to the two relations of answering correctly and answering incorrectly in the students' historical answer records, construct two student-exercise bipartite graphs G+ and G-, where G+ is the graph formed by connecting students with the exercises they answered correctly, and G- is the graph formed by connecting students with the exercises they answered incorrectly;
Step 3.1.2, for graph G+, compute with formula (1) the feature vector of the a-th student node and the feature vector of the i-th exercise node after the l-th update; in formula (1), the neighborhood of the a-th student node is the set of exercises that student s_a answered correctly, the neighborhood of the i-th exercise node is the set of students who answered exercise e_i correctly, and the inputs are the feature vectors of the a-th student node and of the i-th exercise node after the (l-1)-th update; when l = 1, the initialized vectors t_a and x_i obtained in step 2 are used;
Step 3.1.3, for graph G+, obtain with formula (2) the exercise-level characterization of the a-th student node and the exercise-level characterization of the i-th exercise node on graph G+;
Step 3.1.4, for graph G-, obtain the characterizations of the student nodes and the exercise nodes by the same procedure as steps 3.1.2 and 3.1.3;
Step 3.1.5, obtain with formula (3) the exercise-level characterization of the a-th student node and the exercise-level characterization of the i-th exercise node by fusing the characterizations learned on graphs G+ and G-; in formula (3), R' denotes the relation set containing the two relations of answering correctly and answering incorrectly;
Step 3.1.6, take a student node or an exercise node in graph G+ as the anchor sample, take the corresponding student node or exercise node in graph G- as its positive sample, and take the other student nodes or exercise nodes in graph G+, excluding the anchor sample, as negative samples;
Step 3.1.7, construct with formula (4) the contrastive loss function L_s of the student nodes; in formula (4), T+ denotes the student node characterizations in graph G+, T- denotes the student node characterizations in graph G-, S' denotes the set of student nodes other than the a-th student node, τ is a temperature parameter, and sim denotes cosine similarity;
Step 3.1.8, construct with formula (5) the contrastive loss function L_e of the exercise nodes; in formula (5), X+ denotes the exercise node characterization vectors in graph G+, X- denotes the exercise node characterization vectors in graph G-, and E' denotes the set of exercise nodes other than the i-th exercise node;
Step 3.2, processing of the knowledge-point-level learning module based on the attention mechanism:
Step 3.2.1, obtain the student-knowledge point interaction matrix P from the students' historical answer records R and the exercise-knowledge point association matrix Q; obtain with formula (6) the weight α_af of the a-th student node's degree of mastery of the f-th knowledge point, and then obtain with formula (7) the knowledge-level characterization vector of the a-th student node; formula (6) is computed over the set of knowledge points that have interactions with the a-th student node;
Step 3.2.2, obtain with formula (8) the association weight β_ig between the i-th exercise node and the g-th knowledge point, and then obtain with formula (9) the knowledge-level characterization vector of the i-th exercise node; formula (8) is computed over the set of knowledge points contained in the i-th exercise node;
Step 3.3, processing of the neural cognitive diagnosis module:
Step 3.3.1, obtain with formula (10) the final characterization vector of the a-th student node and the final characterization vector of the i-th exercise node; in formula (10), [;] denotes the concatenation operation, σ(·) denotes the sigmoid activation function, FC_s denotes the fully connected layer that fuses the student characterizations, and FC_e denotes the fully connected layer that fuses the exercise characterizations;
Step 3.3.2, predict with formula (11) the score of the a-th student node correctly answering the i-th exercise node; in formula (11), F(·) denotes a multilayer perceptron with two fully connected layers whose weights are non-negative, and the one-hot encoded vector that maps the i-th exercise to its knowledge points is used; the scalar value obtained by reducing the dimension of the i-th exercise node characterization vector, given by formula (12), is also used in formula (11); in formula (12), FC_1 denotes a fully connected layer;
Step 4, construct the cross-entropy loss function L_se with formula (13); in formula (13), (a, i) denotes a student-exercise sample pair collected during training, where a ∈ S and i ∈ E, and y_ai denotes the true score of the a-th student on the i-th exercise;
Step 5, construct the total loss function Loss with formula (14):
Loss = L_se + λ(L_s + L_e)   (14)
in formula (14), λ is a hyperparameter weight that balances the different loss values;
Step 6, train the cognitive diagnosis model with the Adam optimizer and minimize the total loss function Loss to update the model parameters until convergence, so that the trained cognitive diagnosis model can be used to predict students' answers, and the corresponding students' mastery of different knowledge points is obtained from the final characterization vectors of the student nodes.
2. An electronic device comprising a memory and a processor, wherein the memory is configured to store a program that enables the processor to perform the multi-relation cognitive diagnosis method of claim 1, and the processor is configured to execute the program stored in the memory.
3. A computer-readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, performs the steps of the multi-relation cognitive diagnosis method of claim 1.
CN202211389531.5A 2022-11-08 2022-11-08 Multi-relation cognitive diagnosis method based on graph convolution network Pending CN115510286A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211389531.5A CN115510286A (en) 2022-11-08 2022-11-08 Multi-relation cognitive diagnosis method based on graph convolution network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211389531.5A CN115510286A (en) 2022-11-08 2022-11-08 Multi-relation cognitive diagnosis method based on graph convolution network

Publications (1)

Publication Number Publication Date
CN115510286A true CN115510286A (en) 2022-12-23

Family

ID=84513966

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211389531.5A Pending CN115510286A (en) 2022-11-08 2022-11-08 Multi-relation cognitive diagnosis method based on graph convolution network

Country Status (1)

Country Link
CN (1) CN115510286A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116166998A (en) * 2023-04-25 2023-05-26 合肥师范学院 Student performance prediction method combining global and local features
CN116306863A (en) * 2023-01-06 2023-06-23 山东财经大学 Collaborative knowledge tracking modeling method and system based on contrast learning
CN117994092A (en) * 2023-11-30 2024-05-07 安徽大学 Cognitive diagnosis method based on entity internal relation and learning curve theory



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination