CN113283488A - Learning behavior-based cognitive diagnosis method and system - Google Patents

Learning behavior-based cognitive diagnosis method and system Download PDF

Info

Publication number
CN113283488A
CN113283488A CN202110542027.3A CN202110542027A CN113283488A CN 113283488 A CN113283488 A CN 113283488A CN 202110542027 A CN202110542027 A CN 202110542027A CN 113283488 A CN113283488 A CN 113283488A
Authority
CN
China
Prior art keywords
student
test question
course
diagnosis
matrix
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110542027.3A
Other languages
Chinese (zh)
Other versions
CN113283488B (en
Inventor
许斌
毛亦铭
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tsinghua University
Original Assignee
Tsinghua University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tsinghua University filed Critical Tsinghua University
Priority to CN202110542027.3A priority Critical patent/CN113283488B/en
Publication of CN113283488A publication Critical patent/CN113283488A/en
Application granted granted Critical
Publication of CN113283488B publication Critical patent/CN113283488B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance

Abstract

The embodiment of the invention provides a learning behavior-based cognitive diagnosis method and a system, wherein the method comprises the following steps: determining student numbers and answer numbers to be cognized and diagnosed, wherein the student numbers and the answer numbers correspond to student answers and corresponding video records contained in a learning course one by one; inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model; the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training; the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network. The embodiment of the invention realizes the effective prediction of the knowledge level of the student.

Description

Learning behavior-based cognitive diagnosis method and system
Technical Field
The invention relates to the technical field of intelligent education, in particular to a learning behavior-based cognitive diagnosis method and system.
Background
Cognitive diagnosis is an important task proposed for online education scenes, aims to diagnose the knowledge level of students by utilizing the behaviors of the students on online education websites, and is one of key technologies for solving numerous intelligent applications (personalized test question recommendation and learning path planning) of online education.
The general cognitive diagnosis method relies on the question making records of students on an online learning platform, modeling is carried out on both the students and the test questions after the question making sequence and the score of each student are obtained, the students and the test questions are modeled in the modeling process, the answer results of the students are predicted, the student answer prediction is a two-classification problem, the model gradually optimizes parameters in the classification process, and the final student knowledge level mastering degree is obtained by outputting intermediate characteristic variables. Traditional cognitive diagnosis is divided into one-dimensional continuous models and high-dimensional discrete models. However, in the online course learning system, students not only answer questions but also learn knowledge in courses through videos on the system, and the existing method ignores the behavior of learning through videos. In addition, the existing method treats the test questions as independent individuals, and does not dig deep connection among the test questions, for example, the test questions with the same knowledge point are considered to have stronger correlation. Meanwhile, rich information contained in the online course structure is not well utilized.
Disclosure of Invention
The embodiment of the invention provides a learning behavior-based cognitive diagnosis method and system, which are used for solving the problems of part or all of the problems in the cognitive diagnosis method of the existing online education system.
In a first aspect, an embodiment of the present invention provides a cognitive diagnosis method based on learning behaviors, including:
determining student numbers and answer numbers to be cognized and diagnosed, wherein the student numbers and the answer numbers correspond to student answers and corresponding video records contained in a learning course one by one;
inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model;
the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training;
the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
Preferably, the diagnostic model comprises a multi-vector model, a pre-processing model, a predictive model and a parameter update model;
inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model, wherein the method comprises the following steps:
inputting the student number and the answer number to be cognitively diagnosed into the multi-vector model, and outputting a plurality of vectors including a student knowledge level vector, a test question investigation knowledge point vector, a test question discrimination vector, a test question difficulty vector and a video difficulty vector;
inputting the vectors into the preprocessing model, and obtaining a preprocessing result according to the following formula:
Figure BDA0003072176120000021
wherein, FknInvestigating knowledge point vectors for test questions, FsAs student knowledge level vector, FeDivide the vector for the test question FdAs test question difficulty vector, FvIs a video difficulty vector;
inputting the preprocessing result into the prediction model, and outputting a student answer score prediction value based on an interaction function of a fully-connected neural network;
and inputting the predicted values of the student answer scores into the parameter updating model, updating parameters by taking a cross entropy function constructed based on the predicted values of the student answer scores and the real values as a loss function through back propagation, and outputting the cognitive diagnosis results of the students after the loss function is converged.
Preferably, the multi-vector model comprises a trainable matrix of parameters of a student's knowledge level;
after the node information of the course graph is updated through the graph neural network, the node updating result H is updated according to the video sample and the test question samplekRepartitioning of the video node representation matrix VkAnd the test question node representation matrix Ek
The student knowledge level vector is obtained based on the student number and a trainable parameter matrix of the student knowledge level, and the formula is as follows:
Fs=sigmoid(xs×B);
the test question investigation knowledge point vector is obtained based on the answer number and the incidence matrix of the test questions and the knowledge points, and the formula is as follows:
Fkn=xe×Q;
the test question discrimination vector is obtained based on the answer number and the trainable parameter matrix of the test question discrimination capability, and the formula is as follows:
Fe=sigmoid(xe×D);
the test question difficulty vector is obtained based on the answer number and the test question node expression matrix, and the formula is as follows:
Fd=sigmoid(xe×Ek);
the video difficulty vector is obtained based on the answer number and the video node expression matrix, and the formula is as follows:
Fv=sigmoid(xe×Vk);
wherein x isSDenotes student number, xeThe method comprises the steps of representing test question numbers, B representing a trainable parameter matrix of student knowledge levels, Q representing an incidence matrix of test questions and knowledge points, D representing a trainable parameter matrix of test question distinguishing capacity, sigmoid being an activation function and used for mapping student knowledge level vectors, test question distinguishing vectors, test question difficulty vectors and video difficulty vectors to be between 0 and 1, and EkAnd VkAnd respectively representing a matrix for the test question nodes and a matrix for the video nodes.
Preferably, the interaction function of the fully-connected neural network is as follows:
f1=ReLU(W1×xT+b1),
f2=ReLU(W2×f1+b2),
y=sigmoid(W3×f2+b3);
wherein x is the result of the pretreatment, W1、W2And W3Respectively, input product parameters of each layer of the fully-connected neural network, b1、b2And b3The bias parameters are respectively the bias parameters of each layer of the fully-connected neural network, ReLU is an activation function and is used for increasing the nonlinear mapping of the fully-connected neural network, and sigmoid is an activation function and is used for mapping the student answer score predicted value between 0 and 1.
Preferably, the cross entropy function constructed by the predicted student answer score value and the real student answer score value is as follows:
Figure BDA0003072176120000041
wherein r isiFor the real value of student's answer score of the ith test question, yiAnd (4) predicting the answer score of the student of the corresponding ith test question.
Preferably, the step of constructing the curriculum schedule based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label comprises the following steps:
taking the test question sample and the corresponding video sample as course elements wiConstruct course element set W ═ { W ═ Wi∣wi∈Cj,CjE.g., M }, and all course elements w in the course element setiNodes as a course graph; wherein M represents the curriculum corpus, CiRepresenting a course;
according to the question sampleAnd marking the knowledge points to obtain marked knowledge points, performing character string matching on the marked knowledge points and the same knowledge points in the subtitles of the corresponding video samples to generate video marks, and constructing an incidence matrix Q of the test questions and the knowledge points based on the course element set, wherein the incidence matrix Q is { Q ═ Q { (Q) } of the test questions and the knowledge pointsij} | W | × | Kn |; wherein the content of the first and second substances,
Figure BDA0003072176120000042
kn is the set of knowledge points, kiIs a marked knowledge point, Qij1 denotes course element wiContaining knowledge points kjElse, course element wiDoes not contain knowledge point kj,Kr=[(ki,kj)]Kr is a set of knowledge point relationships, kiIs kjFirstly, correcting knowledge points;
inputting a word vector model into the text of the test question sample and the subtitle of the video sample, and obtaining vector representation of the text of the test question sample and the subtitle of the video sample as node characteristics F based on the course element set;
and respectively obtaining an adjacent matrix based on course structure information and an adjacent matrix based on knowledge point association information by taking the distance and the strength of the connection as weights according to the distance between the course elements and the connection between the knowledge points contained in the course elements as edges between the nodes, and merging and normalizing the adjacent matrix based on the course structure information and the adjacent matrix based on the knowledge point association information to obtain an adjacent matrix A of the nodes.
Preferably, the updating node information of the course graph through the graph neural network includes: and (3) performing iterative update on the nodes by adopting the following node update function:
Figure BDA0003072176120000051
where, σ is the activation function,
Figure BDA0003072176120000052
being self-connecting adjacency matricesA is an adjacency matrix of nodes, I is an identity matrix,
Figure BDA0003072176120000053
is that
Figure BDA0003072176120000054
The diagonal matrix of (a).
In a second aspect, an embodiment of the present invention provides a cognitive diagnosis system based on learning behaviors, including:
the system comprises a number determining unit, a learning course determining unit and a learning course judging unit, wherein the number determining unit is used for determining the number of students to be cognitively diagnosed and the number of answers, and the numbers of the students and the answers correspond to the answers of the students and corresponding video records contained in the learning course one by one;
the cognitive diagnosis unit is used for inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model;
the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training;
the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
In a third aspect, an embodiment of the present invention provides an electronic device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor implements the steps of the learning behavior-based cognitive diagnosis method according to any one of the above-mentioned first aspects when executing the program.
In a fourth aspect, an embodiment of the present invention provides a non-transitory computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the learning behavior-based cognitive diagnosis method according to any one of the above-mentioned first aspect.
According to the learning behavior-based cognitive diagnosis method and system provided by the embodiment of the invention, the student cognitive diagnosis result output by the diagnosis model is obtained by inputting the student number and the answer number into the diagnosis model, the diagnosis model is obtained by modeling videos and test questions in the learning course corresponding to the student number and the answer number and training by adopting a graph neural network, and the knowledge level of students can be effectively predicted.
Drawings
In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.
FIG. 1 is a flow chart of a learning behavior-based cognitive diagnosis method provided by the invention;
FIG. 2 is a block diagram of a diagnostic model provided by the present invention;
FIG. 3 is a schematic structural diagram of a learning behavior-based cognitive diagnosis system provided by the present invention;
FIG. 4 is a schematic structural diagram of a cognitive diagnostic unit provided by the present invention;
fig. 5 is a schematic structural diagram of an electronic device provided in the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The technical basis of the invention is as follows: traditional cognitive diagnosis is divided into one-dimensional continuous models and high-dimensional discrete models. The one-dimensional continuity model is represented by an Item Response Theory (IRT) model, the characteristics of each student And each test question are represented by one-dimensional variables, a logistic regression function is used for calculating the probability of the student answering the test question, the high-dimensional discrete model is represented by a Determini Inputs (Noisy-And gate (DINA) model, the high-dimensional vectors are used for representing the students, the student vectors correspond to the knowledge points, And the mastery degree of a specific knowledge point is represented by the value of each dimension. Meanwhile, a Q matrix is constructed to represent the correlation between the test questions and the knowledge points, and error parameters and guess parameters are introduced into the DINA model, so that the student portrait is better modeled. The method of the invention adopts a trainable matrix vector to model students and test questions, and uses an artificial neural network as an interaction function to predict the answer results of the students.
Knowledge point (Knowledge focus) is the content that is mainly taught in the course. For example, "red and black trees," "B-trees," "vectors" are knowledge points for the "data structure" class, and "memory management," "threads," "disk management" are knowledge points for the "operating system" class. It is formally defined as
Figure BDA0003072176120000071
Where Kn is the set of knowledge points, kiIs a specific knowledge point.
The Knowledge Component (Knowledge Component) is defined as a graph composed of Knowledge points, and includes the Knowledge points and the relationships between the Knowledge points. Its formal definition may be expressed as K ═ Kn, Kr, where
Figure BDA0003072176120000072
Indicates a knowledge point, Kr ═ k [ ("k")i,kj)]Representing the relation of knowledge points, the method only considers one relation of sequential revision relations, i.e. if kiIs kjFirst repair knowledge point of (k)i,kj) In Kr.
Course (Course), defined as the Course of on-line education, consisting of video and test questions, formally defined as
Figure BDA0003072176120000081
Wherein C isiThe course is represented by a presentation of the lesson,
Figure BDA0003072176120000082
representing the jth node in the course. t is tijRepresenting the node type, including video and test question, kcijRepresenting the set of knowledge points that the node contains, cijRepresenting the textual content represented by the node.
Curriculum corps, defined as a collection of multiple curriculums. Its formal definition can be expressed as
Figure BDA0003072176120000083
Wherein M represents the curriculum corpus, CiRepresenting a course. For convenience of representation, all Elements in M are reintegrated into a Course element set (Course Elements), whose formal definition may be represented as W ═ { W ═ Wi∣wi∈Cj,CjE.m, where W represents the set.
And the Q matrix (Q-matrix) is defined as an association matrix of the test questions and the knowledge points. Its formal definition is Q ═ Qij} | W | × | Kn |, where Qij1 denotes course element WiContaining knowledge points kj
The Course Graph (Course Graph) is defined as a Graph formed by the questions and videos in the Course corpus, and is formally defined as G ═ a, F, where G denotes the Graph, F denotes the node characteristics, and a denotes the adjacency matrix of the nodes.
The learning behavior-based cognitive diagnosis method and system provided by the invention are described below with reference to fig. 1 to 5.
The embodiment of the invention provides a learning behavior-based cognitive diagnosis method. Fig. 1 is a schematic flowchart of a learning behavior-based cognitive diagnosis method according to an embodiment of the present invention, as shown in fig. 1, the method includes:
step 110, determining the serial numbers of students to be cognized and diagnosed and the serial numbers of answers, wherein the serial numbers of the students and the serial numbers of the answers correspond to the answers of the students and corresponding video records contained in the learning course one by one;
step 120, inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model;
the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training;
the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
Compared with the prior art, the method provided by the embodiment of the invention models the videos and the test questions in the learning course, establishes deep connection between the test questions by using the course structure information and the knowledge point correlation information, adopts the graph neural network for training, provides a new data set aiming at the scene, and obtains the best result on the student answer prediction task by using the data set. When the embodiment of the invention is used for carrying out the graph neural network training, the output vector length is aligned with the vector length representing the knowledge level of the student, so that the subsequent calculation is convenient, and simultaneously, each dimension of the vector can be aligned with a knowledge point in the practical sense to represent the difficulty of investigating the knowledge point, so that the model has better interpretability.
Based on any of the above embodiments, as shown in fig. 2, the diagnosis model includes a multi-vector model, a preprocessing model, a prediction model and a parameter updating model;
inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model, wherein the method comprises the following steps:
inputting the student number and the answer number to be cognitively diagnosed into the multi-vector model, and outputting a plurality of vectors including a student knowledge level vector, a test question investigation knowledge point vector, a test question discrimination vector, a test question difficulty vector and a video difficulty vector;
inputting the vectors into the preprocessing model, and obtaining a preprocessing result according to the following formula (1):
Figure BDA0003072176120000091
wherein, FknInvestigating knowledge point vectors for test questions, FsAs student knowledge level vector, FeDivide the vector for the test question FdAs test question difficulty vector, FvIs a video difficulty vector;
inputting the preprocessing result into the prediction model, and outputting a student answer score prediction value based on an interaction function of a fully-connected neural network;
and inputting the predicted values of the student answer scores into the parameter updating model, updating parameters by taking a cross entropy function constructed based on the predicted values of the student answer scores and the real values as a loss function through back propagation, and outputting the cognitive diagnosis results of the students after the loss function is converged.
Specifically, the three-layer fully-connected neural network is used as an interaction function, the student numbers and the answer test question numbers are input, and a plurality of constructed vectors are obtained. Firstly, preprocessing a plurality of vectors, taking a preprocessing result as the input of a three-layer fully-connected neural network, outputting the prediction probability of a diagnosis model to a test question and answer pair through an interactive function of the fully-connected neural network, then using a cross entropy function as a loss function, updating model parameters through back propagation, and taking out a student knowledge level matrix B after a loss value is converged, namely, a cognitive diagnosis result.
In any of the above embodiments, the multi-vector model comprises a trainable parameter matrix of student knowledge levels;
after the node information of the course graph is updated through the graph neural network, the node updating result H is updated according to the video sample and the test question samplekRepartitioning of the video node representation matrix VkAnd the test question node representation matrix Ek
The student knowledge level vector is obtained based on the student number and a trainable parameter matrix of the student knowledge level, and formula (2) is as follows:
Fs=sigmoid(xs×B); (2)
the test question investigation knowledge point vector is obtained based on the answer number and the incidence matrix of the test question and the knowledge point, and the formula (3) is as follows:
Fkn=xe×Q; (3)
the test question discrimination vector is obtained based on the answer number and the trainable parameter matrix of the test question discrimination capability, and the formula (4) is as follows:
Fe=sigmoid(xe×D); (4)
the test question difficulty vector is obtained based on the answer number and the test question node expression matrix, and the formula (5) is as follows:
Fd=sigmoid(xe×Ek); (5)
the video difficulty vector is obtained based on the answer number and the video node expression matrix, and a formula (6) is as follows:
Fv=sigmoid(xe×Vk); (6)
wherein x isSDenotes student number, xeThe method comprises the steps of representing test question numbers, B representing a trainable parameter matrix of student knowledge levels, Q representing an incidence matrix of test questions and knowledge points, D representing a trainable parameter matrix of test question distinguishing capacity, sigmoid being an activation function and used for mapping student knowledge level vectors, test question distinguishing vectors, test question difficulty vectors and video difficulty vectors to be between 0 and 1, and EkAnd VkAnd respectively representing a matrix for the test question nodes and a matrix for the video nodes.
Specifically, a trainable parameter matrix B representing the knowledge level of students and a trainable parameter matrix D representing the discrimination ability of test questions are firstly constructed, and specifically, each knowledge level vector of students can be represented as Fs=sigmoid(xs×B),xSNumber for indicating studentSigmoid is an activation function, and the student knowledge level vector is mapped between 0 and 1. Simultaneously, the obtained test question nodes are used for representing a matrix EkObtaining a test question difficulty vector: fd=sigmoid(xe×Ek),xeAnd expressing test question numbers, using sigmoid as an activation function, and mapping the test question difficulty vectors to 0-1. The matrix V is expressed by using video nodes in the same waykObtaining a video difficulty vector:
Fv=sigmoid(xe×Vk) And sigmoid is an activation function, and the video difficulty vector is mapped between 0 and 1. And (3) expressing the vector of the examination question investigation knowledge points by using a Q matrix: fkn=xeXQ. Meanwhile, a test question discrimination vector is also defined: fe=sigmoid(xeAnd multiplied by D), sigmoid is an activation function, and the test question discrimination vector is mapped between 0 and 1.
Based on any one of the above embodiments, the interaction functions (7), (8) and (9) of the fully-connected neural network are as follows:
f1=ReLU(W1×xT+b1), (7)
f2=ReLU(W2×f1+b2), (8)
y=sigmoid(W3×f2+b3); (9)
wherein x is the result of the pretreatment, W1、W2And W3Respectively, input product parameters of each layer of the fully-connected neural network, b1、b2And b3And respectively the bias parameters of each layer of the fully-connected neural network, wherein ReLU is an activation function and is used for increasing the nonlinear mapping of the fully-connected neural network, and sigmoid is an activation function and is used for mapping the student answer score predicted value to be between 0 and 1.
Based on any one of the above embodiments, the cross entropy function (10) constructed by the predicted value and the true value of the student answer score is as follows:
Figure BDA0003072176120000121
wherein r isiFor the real value of student's answer score of the ith test question, yiAnd (4) predicting the answer score of the student of the corresponding ith test question.
Based on any of the above embodiments, the constructing of the course graph based on the test question sample and the corresponding knowledge point label and the corresponding video sample and the corresponding video label includes the following steps:
taking the test question sample and the corresponding video sample as course elements wiConstruct course element set W ═ { W ═ Wi∣wi∈Cj,CjE.g., M }, and all course elements w in the course element setiNodes as a course graph; wherein M represents the curriculum corpus, CiRepresenting a course;
specifically, a student answer record and a watching video record are input, and all test questions and videos therein are extracted as elements of W.
Obtaining marked knowledge points according to the knowledge point marks corresponding to the test question samples, carrying out character string matching on the marked knowledge points and the same knowledge points in the subtitles of the corresponding video samples to generate video marks, and constructing an association matrix Q (Q) of the test question and the knowledge points based on the course element setij} | W | × | Kn |; wherein the content of the first and second substances,
Figure BDA0003072176120000122
kn is the set of knowledge points, kiIs a marked knowledge point, Qij1 denotes course element wiContaining knowledge points kjElse, course element wiDoes not contain knowledge point kj,kiIs kjFirstly, correcting knowledge points;
specifically, knowledge point labeling is carried out on all test questions, and then character string matching is carried out on the labeled knowledge points and the same knowledge points in the video captions automatically to generate video labels so as to construct a Q matrix with a label structure. The curriculum graph is constructed using the W and Q matrices described above, using all elements in W as nodes in G. When the student question making result is predicted, only the videos related to the question containing the knowledge points are selected for interaction, and interference of a large number of other videos on the question making result is avoided.
Inputting a word vector model into the text of the test question sample and the subtitle of the video sample, and obtaining vector representation of the text of the test question sample and the subtitle of the video sample as node characteristics F based on the course element set;
specifically, a pre-training model BERT-Chinese is used as a word vector model, video subtitles and test question texts are input, and output vector representation is used as a node feature F: BERT (text (w)).
And respectively obtaining an adjacent matrix based on course structure information and an adjacent matrix based on knowledge point association information by taking the distance and the strength of the connection as weights according to the distance between the video samples and the connection between the knowledge points contained in the video samples, and merging and normalizing the adjacent matrix based on the course structure information and the adjacent matrix based on the knowledge point association information to obtain an adjacent matrix A of the nodes.
Specifically, the method of the embodiment of the invention constructs the adjacency matrix A based on the course structure information and the knowledge point association information. The following two auxiliary functions (11) and (12) are first defined:
Figure BDA0003072176120000131
Figure BDA0003072176120000132
wherein, MS (w)i,wj) 1 represents wiAnd wjBelonging to the same course, MC (w)i) N represents wiIs element number n in the course.
Adjacency matrix weight A based on course structure informationSThe calculation formula (13) is as follows:
Figure BDA0003072176120000133
Figure BDA0003072176120000134
denotes wiAnd wjBased on the weight of the course structure, the calculation method comprises the following steps:
if w isiAnd wjAre elements in the same lesson and their distance in the lesson is less than lambda, then the weight is the reciprocal of their distance, otherwise 0.λ is an artificially defined hyper-parameter.
Adjacency matrix weight A based on knowledge point association informationKThe calculation formula (14) is as follows:
Figure BDA0003072176120000141
wherein the content of the first and second substances,
Figure BDA0003072176120000142
denotes wiAnd wjBased on the weight of the associated information of the knowledge points, if wiAnd wjHaving the same knowledge point, then
Figure BDA0003072176120000143
Is 1 if wiAnd wjIf the owned knowledge points have sequential revision relationship, then
Figure BDA0003072176120000144
Is alpha, otherwise 0. Alpha is an artificially defined hyper-parameter.
Finally merge ASAnd AKObtaining A: a ═ Normalization (A)S+AK). Wherein Normalization is a Normalization function.
Based on any embodiment of the foregoing, the performing node information update on the course graph through the graph neural network includes: the nodes are iteratively updated using a node update function (15) as follows:
Figure BDA0003072176120000145
where, σ is the activation function,
Figure BDA0003072176120000146
is a self-connected adjacency matrix, A is an adjacency matrix of nodes, I is an identity matrix,
Figure BDA0003072176120000147
is that
Figure BDA0003072176120000148
The diagonal matrix of (a).
Specifically, the nodes are updated by using the graph convolution neural network on the constructed curriculum graph G. The node update function of the conventional graph neural network is: hk=M(A,Hk-1;θk) Where M is the information transfer function, θkIs a trainable parameter matrix HkIs a vector representation of the nodes over k iterations. For the graph convolution neural network, the specific node update function is:
Figure BDA0003072176120000149
where σ is the activation function, here ReLU is chosen,
Figure BDA00030721761200001410
is a contiguous matrix of self-connecting,
Figure BDA00030721761200001411
is that
Figure BDA00030721761200001412
The diagonal matrix of (a). After the updating is finished, the H is re-divided according to the video and the test questionskRepresenting a matrix V for video nodeskAnd the test question node representation matrix Ek
The learning behavior-based cognitive diagnosis system provided by the invention is described below, and the learning behavior-based cognitive diagnosis method described below and the learning behavior-based cognitive diagnosis method described above can be referred to correspondingly.
Fig. 3 is a schematic structural diagram of a learning behavior-based cognitive diagnosis system according to an embodiment of the present invention, as shown in fig. 3, the system includes a number determination unit 310 and a cognitive diagnosis unit 320;
a number determining unit 310, configured to determine a student number and an answer number to be subjected to cognitive diagnosis, where the student number and the answer number correspond to student answers and corresponding video records included in a learning course one to one;
the cognitive diagnosis unit 320 is used for inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model;
the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training;
the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
Compared with the prior art, the system provided by the embodiment of the invention models videos and test questions in a learning course, establishes deep connection between the test questions by using course structure information and knowledge point correlation information, adopts a graph neural network for training, provides a new data set aiming at the scene, and obtains the best result on a student answer prediction task by using the data set. When the embodiment of the invention is used for carrying out the graph neural network training, the output vector length is aligned with the vector length representing the knowledge level of the student, so that the subsequent calculation is convenient, and simultaneously, each dimension of the vector can be aligned with a knowledge point in the practical sense to represent the difficulty of investigating the knowledge point, so that the model has better interpretability.
Based on any of the above embodiments, as shown in fig. 4, the cognitive diagnosis unit includes a multi-vector module 410, a preprocessing module 420, a prediction module 430, and a parameter update module 440;
the multi-vector module 410 is configured to input the student number and the answer number to be subjected to cognitive diagnosis, and output a plurality of vectors including a student knowledge level vector, a test question investigation knowledge point vector, a test question discrimination vector, a test question difficulty vector, and a video difficulty vector;
the preprocessing module 420 is configured to input the vectors into the preprocessing model, and obtain a preprocessing result according to the following formula (16):
Figure BDA0003072176120000161
wherein, FknInvestigating knowledge point vectors for test questions, FsAs student knowledge level vector, FeDivide the vector for the test question FdAs test question difficulty vector, FvIs a video difficulty vector;
the prediction module 430 is configured to input the preprocessing result and output a predicted student answer score value based on an interaction function of a fully-connected neural network;
the parameter updating module 440 is configured to input the predicted student answer score values, update parameters by back propagation by using a cross entropy function constructed based on the predicted student answer score values and the actual values as a loss function, and output a student cognitive diagnosis result after the loss function converges.
In any of the above embodiments, the multi-vector module comprises a trainable parameter matrix of student knowledge levels;
after the node information of the course graph is updated through the graph neural network, the node updating result H is updated according to the video sample and the test question samplekRepartitioning of the video node representation matrix VkAnd the test question node representation matrix Ek
The student knowledge level vector is derived based on the student number and a trainable parameter matrix of the student knowledge level, and formula (17) is as follows:
Fs=sigmoid(xs×B); (17)
the test question investigation knowledge point vector is obtained based on the answer number and the incidence matrix of the test question and the knowledge point, and the formula (18) is as follows:
Fkn=xe×Q; (18)
the test question discrimination vector is obtained based on the answer number and the trainable parameter matrix of the test question discrimination capability, and a formula (19) is as follows:
Fe=sigmoid(xe×D); (19)
the test question difficulty vector is obtained based on the answer number and the test question node expression matrix, and a formula (20) is as follows:
Fd=sigmoid(xe×Ek); (20)
the video difficulty vector is obtained based on the answer number and the video node expression matrix, and the formula (21) is as follows:
Fv=sigmoid(xe×Vk); (21)
wherein x isSDenotes student number, xeThe method comprises the steps of representing test question numbers, B representing a trainable parameter matrix of student knowledge levels, Q representing an incidence matrix of test questions and knowledge points, D representing a trainable parameter matrix of test question distinguishing capacity, sigmoid being an activation function and used for mapping student knowledge level vectors, test question distinguishing vectors, test question difficulty vectors and video difficulty vectors to be between 0 and 1, and EkAnd VkAnd respectively representing a matrix for the test question nodes and a matrix for the video nodes.
Based on any one of the above embodiments, the interaction functions (22), (23) and (24) of the fully-connected neural network are as follows:
f1=ReLU(W1×xT+b1), (22)
f2=ReLU(W2×f1+b2), (23)
y=sigmoid(W3×f2+b3); (24)
wherein x is the result of the pretreatment, W1、W2And W3Respectively, input product parameters of each layer of the fully-connected neural network, b1、b2And b3And respectively the bias parameters of each layer of the fully-connected neural network, wherein ReLU is an activation function and is used for increasing the nonlinear mapping of the fully-connected neural network, and sigmoid is an activation function and is used for mapping the student answer score predicted value to be between 0 and 1.
Based on any one of the above embodiments, the cross entropy function (25) constructed by the predicted value and the true value of the student answer score is as follows:
Figure BDA0003072176120000171
wherein r isiFor the real value of student's answer score of the ith test question, yiAnd (4) predicting the answer score of the student of the corresponding ith test question.
Based on any of the above embodiments, the constructing of the course graph based on the test question sample and the corresponding knowledge point label and the corresponding video sample and the corresponding video label includes the following steps:
taking the test question sample and the corresponding video sample as course elements wiConstruct course element set W ═ { W ═ Wi∣wi∈Cj,CjE.g., M }, and all course elements w in the course element setiNodes as a course graph; wherein M represents the curriculum corpus, CiRepresenting a course;
obtaining marked knowledge points according to the knowledge point labels corresponding to the test question samples, performing character string matching on the marked knowledge points and the same knowledge points in the video captions to generate video labels, and constructing an incidence matrix Q (Q) of the test questions and the knowledge points based on the course element setij} | W | × | Kn |; wherein the content of the first and second substances,
Figure BDA0003072176120000181
kn is the set of knowledge points, kiIs a marked knowledge point, Qij1 denotes course element wiContaining knowledge points kjElse, course element wiDoes not contain knowledge point kj,Kr=[(ki,kj)]Kr is a set of knowledge point relationships, kiIs kjFirstly, correcting knowledge points;
inputting a word vector model into the text of the test question sample and the subtitle of the video sample, and obtaining vector representation of the text of the test question sample and the subtitle of the video sample as node characteristics F based on the course element set;
and respectively obtaining an adjacent matrix based on course structure information and an adjacent matrix based on knowledge point association information by taking the distance and the strength of the connection as weights according to the distance between the course elements and the connection between the knowledge points contained in the course elements as edges between the nodes, and merging and normalizing the adjacent matrix based on the course structure information and the adjacent matrix based on the knowledge point association information to obtain an adjacent matrix A of the nodes.
Based on any embodiment of the foregoing, the performing node information update on the course graph through the graph neural network includes: the nodes are iteratively updated using a node update function (26) as follows:
Figure BDA0003072176120000182
where, σ is the activation function,
Figure BDA0003072176120000183
is a self-connected adjacency matrix, A is an adjacency matrix of nodes, I is an identity matrix,
Figure BDA0003072176120000184
is that
Figure BDA0003072176120000185
The diagonal matrix of (a).
For a specific example of practical application, for the learning behavior-based cognitive diagnosis method and system provided by the present invention, since no public data set can provide records for students to watch online videos, data of 12 computer courses are extracted from the large-scale mule course database moococube, in order to ensure the quality of data, records of students with the number of answers less than 8 or with the number of wrong answers less than 2 are filtered, and records of answers and questions are sampled to balance positive and negative examples, and finally 271960 learning records are obtained, including 2093 students, 519 trials, 857 videos and 101 marked knowledge points. The results of the experiment are shown in table 1 below:
TABLE 1
Figure BDA0003072176120000191
From the above table 1, compared with other methods, the method and the system provided by the invention have the best results in the three indexes of accuracy, root mean square error and AUC, and the effectiveness of the method is fully proved.
Aiming at the learning behaviors of students on a admiration course website, the invention firstly proposes that the knowledge state of the students is individually modeled by combining the video learning behavior records of the students with the answer records. The learning behavior of the student is characterized to more perfectly represent the knowledge state of the student. And automatically labeling the related knowledge points of the videos watched by the students, modeling according to the time sequence, and combining the current question making records to give more accurate horizontal diagnosis results of the students.
Aiming at the problem that test questions are independent, the invention designs a course graph to mine rich information in test question texts and course structures. All videos and test questions are taken as nodes on the graph, video subtitles and test question texts are taken as node characteristics through a word vector model, course structures and knowledge point correlation are taken as edge weight basis, and better characteristic representation is obtained through a graph neural network.
The invention provides a cognitive diagnosis data set based on a real admiration platform, which removes records with the number of answers less than eight for students when constructing the data set, avoids the records of students with high lesson leaving rate, and simultaneously ensures that at least two wrong answers are provided in the answers to balance positive and negative examples of samples. The data set comprises complete student watching videos and answer records, and a large number of experiments prove that the method provided by the invention can more effectively predict the knowledge level of students and the output has interpretability.
Fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present invention, and as shown in fig. 5, the electronic device may include: a processor (processor)510, a communication Interface (Communications Interface)520, a memory (memory)530 and a communication bus 540, wherein the processor 510, the communication Interface 520 and the memory 530 communicate with each other via the communication bus 540. Processor 510 may invoke logic instructions in memory 530 to perform a learning behavior-based cognitive diagnostic method comprising: determining student numbers and answer numbers to be cognized and diagnosed, wherein the student numbers and the answer numbers correspond to student answers and corresponding video records contained in a learning course one by one; inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model; the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training; the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
Furthermore, the logic instructions in the memory 530 may be implemented in the form of software functional units and stored in a computer readable storage medium when the software functional units are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
In another aspect, an embodiment of the present invention further provides a computer program product, where the computer program product includes a computer program stored on a non-transitory computer-readable storage medium, the computer program includes program instructions, and when the program instructions are executed by a computer, the computer can execute the learning behavior-based cognitive diagnosis method provided by the above methods, where the method includes: determining student numbers and answer numbers to be cognized and diagnosed, wherein the student numbers and the answer numbers correspond to student answers and corresponding video records contained in a learning course one by one; inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model; the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training; the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
In yet another aspect, an embodiment of the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, the computer program being implemented by a processor to perform the learning behavior-based cognitive diagnosis methods provided above, the method including: determining student numbers and answer numbers to be cognized and diagnosed, wherein the student numbers and the answer numbers correspond to student answers and corresponding video records contained in a learning course one by one; inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model; the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training; the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (10)

1. A learning behavior-based cognitive diagnostic method, comprising:
determining student numbers and answer numbers to be cognized and diagnosed, wherein the student numbers and the answer numbers correspond to student answers and corresponding video records contained in a learning course one by one;
inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model;
the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training;
the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
2. The learning behavior-based cognitive diagnosis method according to claim 1, wherein the diagnosis model comprises a multi-vector model, a preprocessing model, a prediction model and a parameter update model;
inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model, wherein the method comprises the following steps:
inputting the student number and the answer number to be cognitively diagnosed into the multi-vector model, and outputting a plurality of vectors including a student knowledge level vector, a test question investigation knowledge point vector, a test question discrimination vector, a test question difficulty vector and a video difficulty vector;
inputting the vectors into the preprocessing model, and obtaining a preprocessing result according to the following formula:
Figure FDA0003072176110000011
wherein, FknInvestigating knowledge point vectors for test questions, FsAs student knowledge level vector, FeDivide the vector for the test question FdAs test question difficulty vector, FvIs a video difficulty vector;
inputting the preprocessing result into the prediction model, and outputting a student answer score prediction value based on an interaction function of a fully-connected neural network;
and inputting the predicted values of the student answer scores into the parameter updating model, updating parameters by taking a cross entropy function constructed based on the predicted values of the student answer scores and the real values as a loss function through back propagation, and outputting the cognitive diagnosis results of the students after the loss function is converged.
3. The learning behavior-based cognitive diagnostic method of claim 2, wherein the multi-vector model comprises a trainable parameter matrix of student knowledge levels;
after the node information of the course graph is updated through the graph neural network, the node updating result H is updated according to the video sample and the test question samplekRepartitioning of the video node representation matrix VkAnd the test question node representation matrix Ek
The student knowledge level vector is obtained based on the student number and a trainable parameter matrix of the student knowledge level, and the formula is as follows:
Fs=sigmoid(xs×B);
the test question investigation knowledge point vector is obtained based on the answer number and the incidence matrix of the test questions and the knowledge points, and the formula is as follows:
Fkn=xe×Q;
the test question discrimination vector is obtained based on the answer number and the trainable parameter matrix of the test question discrimination capability, and the formula is as follows:
Fe=sigmoid(xe×D);
the test question difficulty vector is obtained based on the answer number and the test question node expression matrix, and the formula is as follows:
Fd=sigmoid(xe×Ek);
the video difficulty vector is obtained based on the answer number and the video node expression matrix, and the formula is as follows:
Fv=sigmoid(xe×Vk);
wherein x isSDenotes student number, xeThe method comprises the steps of representing test question numbers, B representing a trainable parameter matrix of student knowledge levels, Q representing an incidence matrix of test questions and knowledge points, D representing a trainable parameter matrix of test question distinguishing capacity, sigmoid being an activation function and used for mapping student knowledge level vectors, test question distinguishing vectors, test question difficulty vectors and video difficulty vectors to be between 0 and 1, and EkAnd VkAnd respectively representing a matrix for the test question nodes and a matrix for the video nodes.
4. The learning behavior-based cognitive diagnostic method of claim 2, wherein the interaction function of the fully-connected neural network is as follows:
f1=ReLU(W1×xT+b1),
f2=ReLU(W2×f1+b2),
y=sigmoid(W3×f2+b3);
wherein x is the result of the pretreatment, W1、W2And W3Respectively, input product parameters of each layer of the fully-connected neural network, b1、b2And b3And respectively the bias parameters of each layer of the fully-connected neural network, wherein ReLU is an activation function and is used for increasing the nonlinear mapping of the fully-connected neural network, and sigmoid is an activation function and is used for mapping the student answer score predicted value to be between 0 and 1.
5. The learning behavior-based cognitive diagnosis method according to claim 2, wherein the cross entropy function constructed by the predicted student answer score and the actual student answer score is as follows:
Figure FDA0003072176110000031
wherein r isiFor the real value of student's answer score of the ith test question, yiAnd (4) predicting the answer score of the student of the corresponding ith test question.
6. The learning behavior-based cognitive diagnosis method according to claim 1, wherein the step of constructing a curriculum schedule based on the test question samples and corresponding knowledge point labels and corresponding video samples and corresponding video labels comprises the following steps:
taking the test question sample and the corresponding video sample as course elements wiConstruct course element set W ═ { W ═ Wi∣wi∈Cj,CjE.g., M }, and all course elements w in the course element setiNodes as a course graph; wherein M represents the curriculum corpus, CiRepresenting a course;
obtaining marked knowledge points according to the knowledge point marks corresponding to the test question samples, carrying out character string matching on the marked knowledge points and the same knowledge points in the subtitles of the corresponding video samples to generate video marks, and constructing an incidence matrix Q (Q) of the test question, the video and the knowledge points based on the course element setij} | W | × | Kn |; wherein the content of the first and second substances,
Figure FDA0003072176110000032
kn is the set of knowledge points, kiIs a marked knowledge point, Qij1 denotes course element wiContaining knowledge points kjElse, course element wiDoes not contain knowledge point kj,Kr=[(ki,kj)]Kr is a set of knowledge point relationships, kiIs kjFirstly, correcting knowledge points;
inputting a word vector model into the text of the test question sample and the subtitle of the video sample, and obtaining vector representation of the text of the test question sample and the subtitle of the video sample as node characteristics F based on the course element set;
and respectively obtaining an adjacent matrix based on course structure information and an adjacent matrix based on knowledge point association information by taking the distance and the strength of the connection as weights according to the distance between the course elements and the connection between the knowledge points contained in the course elements as edges between the nodes, and merging and normalizing the adjacent matrix based on the course structure information and the adjacent matrix based on the knowledge point association information to obtain an adjacent matrix A of the nodes.
7. The learning behavior-based cognitive diagnosis method according to claim 1 or 3, wherein the node information updating of the curriculum schedule through the graph neural network comprises: and (3) performing iterative update on the nodes by adopting the following node update function:
Figure FDA0003072176110000041
where, σ is the activation function,
Figure FDA0003072176110000042
is a self-connected adjacency matrix, A is an adjacency matrix of nodes, I is an identity matrix,
Figure FDA0003072176110000043
is that
Figure FDA0003072176110000044
The diagonal matrix of (a).
8. A learning behavior-based cognitive diagnostic system, comprising:
the system comprises a number determining unit, a learning course determining unit and a learning course judging unit, wherein the number determining unit is used for determining the number of students to be cognitively diagnosed and the number of answers, and the numbers of the students and the answers correspond to the answers of the students and corresponding video records contained in the learning course one by one;
the cognitive diagnosis unit is used for inputting the student number and the answer number to be subjected to cognitive diagnosis into a diagnosis model to obtain a student cognitive diagnosis result output by the diagnosis model;
the diagnosis model is obtained based on test question samples, corresponding knowledge point labels and corresponding video samples and corresponding video label training;
the diagnosis model is used for constructing a course graph based on the test question sample, the corresponding knowledge point label, the corresponding video sample and the corresponding video label, and conducting corresponding student cognitive diagnosis on the learning course to be subjected to cognitive diagnosis after node information of the course graph is updated through a graph neural network.
9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the steps of the learning behavior based cognitive diagnostic method according to any one of claims 1 to 7 are implemented when the program is executed by the processor.
10. A non-transitory computer readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the learning behavior based cognitive diagnostic method according to any one of claims 1 to 7.
CN202110542027.3A 2021-05-18 2021-05-18 Learning behavior-based cognitive diagnosis method and system Active CN113283488B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110542027.3A CN113283488B (en) 2021-05-18 2021-05-18 Learning behavior-based cognitive diagnosis method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110542027.3A CN113283488B (en) 2021-05-18 2021-05-18 Learning behavior-based cognitive diagnosis method and system

Publications (2)

Publication Number Publication Date
CN113283488A true CN113283488A (en) 2021-08-20
CN113283488B CN113283488B (en) 2021-11-23

Family

ID=77279727

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110542027.3A Active CN113283488B (en) 2021-05-18 2021-05-18 Learning behavior-based cognitive diagnosis method and system

Country Status (1)

Country Link
CN (1) CN113283488B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114090839A (en) * 2022-01-20 2022-02-25 华南师范大学 Method, system, device and storage medium for learner cognitive structure processing

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190114549A1 (en) * 2015-05-11 2019-04-18 Daniel Joseph Olsher Systems and Methods for a Universal Task Independent Simulation and Control Platform for Generating Controlled Actions Using Nuanced Artificial Intelligence
CN110264091A (en) * 2019-06-24 2019-09-20 中国科学技术大学 Student's cognitive diagnosis method
CN111553166A (en) * 2020-04-05 2020-08-18 温州大学 Scene cognition calculation-based online learner dynamic model prediction method
WO2020228141A1 (en) * 2019-05-13 2020-11-19 清华大学 Electromagnetic signal identification method and device for constructing graph convolutional network on basis of implicit knowledge
US10943072B1 (en) * 2019-11-27 2021-03-09 ConverSight.ai, Inc. Contextual and intent based natural language processing system and method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190114549A1 (en) * 2015-05-11 2019-04-18 Daniel Joseph Olsher Systems and Methods for a Universal Task Independent Simulation and Control Platform for Generating Controlled Actions Using Nuanced Artificial Intelligence
WO2020228141A1 (en) * 2019-05-13 2020-11-19 清华大学 Electromagnetic signal identification method and device for constructing graph convolutional network on basis of implicit knowledge
CN110264091A (en) * 2019-06-24 2019-09-20 中国科学技术大学 Student's cognitive diagnosis method
US10943072B1 (en) * 2019-11-27 2021-03-09 ConverSight.ai, Inc. Contextual and intent based natural language processing system and method
CN111553166A (en) * 2020-04-05 2020-08-18 温州大学 Scene cognition calculation-based online learner dynamic model prediction method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
XU BIN (许 斌) 等: "Efficient Composition of Semantic Web Services with End-to-End QoS Optimization", 《TSINGHUA SCIENCE AND TECHNOLOGY》 *
李振 等: "教育知识图谱的概念模型与构建方法研究", 《电化教育研究》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114090839A (en) * 2022-01-20 2022-02-25 华南师范大学 Method, system, device and storage medium for learner cognitive structure processing

Also Published As

Publication number Publication date
CN113283488B (en) 2021-11-23

Similar Documents

Publication Publication Date Title
CN110264091B (en) Student Cognitive Diagnosis Method
Chen et al. Prerequisite-driven deep knowledge tracing
CN111753098A (en) Teaching method and system based on cross-media dynamic knowledge graph
US11631338B2 (en) Deep knowledge tracing with transformers
CN110941723A (en) Method, system and storage medium for constructing knowledge graph
CN111241243A (en) Knowledge measurement-oriented test question, knowledge and capability tensor construction and labeling method
CN112508334B (en) Personalized paper grouping method and system integrating cognition characteristics and test question text information
Rad et al. AI thinking for cloud education platform with personalized learning
CN112257966B (en) Model processing method and device, electronic equipment and storage medium
CN114254127A (en) Student ability portrayal method and learning resource recommendation method and device
CN115510814B (en) Chapter-level complex problem generation method based on dual planning
CN113851020A (en) Self-adaptive learning platform based on knowledge graph
Giabbanelli et al. Overcoming the PBL assessment challenge: Design and development of the incremental thesaurus for assessing causal maps (ITACM)
CN111079018A (en) Exercise personalized recommendation method, exercise personalized recommendation device, exercise personalized recommendation equipment and computer readable storage medium
Lu et al. CMKT: Concept map driven knowledge tracing
CN115455186A (en) Learning situation analysis method based on multiple models
Yang et al. A novel quantitative relationship neural network for explainable cognitive diagnosis model
CN116385937A (en) Method and system for solving video question and answer based on multi-granularity cross-mode interaction framework
CN113283488B (en) Learning behavior-based cognitive diagnosis method and system
Huizhong et al. Research on the automation integration terminal of the education management platform based on big data analysis
Liang et al. HELP-DKT: an interpretable cognitive model of how students learn programming based on deep knowledge tracing
KR102567043B1 (en) Method, device and system for providing book curation service based on reading ability diagnosis using artificial intelligence model
Bhusal Predicting Student's Performance Through Data Mining
Gómez‐Rey et al. Ordinal regression by a gravitational model in the field of educational data mining
CN112785039B (en) Prediction method and related device for answer score rate of test questions

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Xu Bin

Inventor after: Mao Yiming

Inventor after: Yu Jifan

Inventor after: Li Juanzi

Inventor after: Hou Lei

Inventor before: Xu Bin

Inventor before: Mao Yiming