CN114969301A - Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching - Google Patents

Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching Download PDF

Info

Publication number
CN114969301A
CN114969301A CN202210902123.9A CN202210902123A CN114969301A CN 114969301 A CN114969301 A CN 114969301A CN 202210902123 A CN202210902123 A CN 202210902123A CN 114969301 A CN114969301 A CN 114969301A
Authority
CN
China
Prior art keywords
recommendation
learner
answer information
adaptive
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210902123.9A
Other languages
Chinese (zh)
Inventor
刘淇
庄严
黄振亚
陈恩红
苏喻
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Artificial Intelligence of Hefei Comprehensive National Science Center
Original Assignee
Institute of Artificial Intelligence of Hefei Comprehensive National Science Center
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Artificial Intelligence of Hefei Comprehensive National Science Center filed Critical Institute of Artificial Intelligence of Hefei Comprehensive National Science Center
Priority to CN202210902123.9A priority Critical patent/CN114969301A/en
Publication of CN114969301A publication Critical patent/CN114969301A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance
    • G06Q50/2057Career enhancement or continuing education service
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B19/00Teaching not covered by other main groups of this subclass
    • G09B19/0053Computers, e.g. programming
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00Electrically-operated teaching apparatus or devices working with questions and answers
    • G09B7/02Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09BEDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B7/00Electrically-operated teaching apparatus or devices working with questions and answers
    • G09B7/02Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student
    • G09B7/04Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student characterised by modifying the teaching programme in response to a wrong answer, e.g. repeating the question, supplying a further explanation

Abstract

The application discloses a self-adaptive recommendation method, a device, equipment and a storage medium for online programming teaching, wherein the self-adaptive recommendation method for online programming teaching comprises the following steps: acquiring an answer information sample, and dividing the answer information sample into a support set and a query set; based on the support set, minimizing a cross entropy loss function on the support set, and determining a capability estimation algorithm function based on the cross entropy loss function on the support set; performing outer layer optimization on a model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function; acquiring the answer information of the learner, inputting the answer information into a recommendation model, and carrying out test question recommendation processing on the answer information based on the recommendation model to obtain a recommendation strategy of the learner. And (3) analyzing the answer information of the learner and recommending test questions based on a recommendation model obtained by a double-layer optimization method, so as to reduce the deviation of test question recommendation.

Description

Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching
Technical Field
The application relates to the field of artificial intelligence application, in particular to a self-adaptive recommendation method, device, equipment and storage medium for online programming teaching.
Background
At present, a generation method of a recommendation strategy for online programming teaching mainly depends on expert experience related to learning education, and the recommendation strategy is often bound with a diagnosis model, namely, corresponding recommendation strategies are designed according to the characteristics of different diagnosis models. However, the adaptivity of such recommendation strategies is limited, and in the design of the recommendation strategies, expert experience has to be relied on to understand the details and principles of the diagnostic model in order to design the corresponding recommendation strategies, and in the ability assessment of learners, the conventional expert design methods cannot identify the complex behaviors of learners in the process, such as guessing, errors and other factors, and the recommendation deviation is large.
Disclosure of Invention
The application mainly aims to provide a self-adaptive recommendation method, device, equipment and storage medium for online programming teaching, and aims to solve the technical problem that in the prior art, a recommendation strategy depends on manual experience recommendation, and the recommendation deviation is large.
In order to achieve the above object, the present application provides an adaptive recommendation method for online programming teaching, including:
acquiring an answer information sample, and dividing the answer information sample into a support set and a query set;
minimizing a cross entropy loss function on the support set based on the support set, and determining a capability estimation algorithm function based on the cross entropy loss function on the support set;
performing outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function;
and acquiring the answer information of the learner, inputting the answer information into the recommendation model, and recommending the test questions of the answer information based on the recommendation model to obtain the recommendation strategy of the learner.
Optionally, the step of determining the capability estimation algorithm function based on the cross entropy loss function on the support set includes:
and fusing the cross entropy loss function on the support set with a preset potential capacity estimation to obtain the capacity estimation algorithm function.
Optionally, the step of performing outer layer optimization on the model to be optimized based on the query set and the capability estimation algorithm function to determine a recommended policy algorithm function includes:
minimizing a cross entropy loss function on the set of queries based on the capability estimation algorithm function;
and performing outer layer optimization on the model to be optimized based on the cross entropy loss function on the query set, and determining a recommended strategy algorithm function.
Optionally, the step of performing test question recommendation processing on the answer information based on the recommendation model to obtain the recommendation strategy of the learner includes:
based on the recommendation model, evaluating the answer information and determining the ability information of the learner;
determining a recommended strategy for the learner based on the competency information.
Optionally, the step of evaluating the answer information based on the recommendation model to determine the learner's competence information includes:
determining questions answered in pairs and questions answered in wrong in the answer information;
respectively capturing performance information of the question of the answer pair and the question of the wrong answer through self-attention operation based on the recommendation model, and determining first ability information of the learner based on the performance information;
capturing a contradiction between the question of the answer pair and the question of the wrong answer through double attention operation based on the first ability information, and determining second ability information of the learner based on the contradiction;
and obtaining the learner's ability information based on the first ability information and the second ability information.
The application also provides a self-adaptation recommendation device of online programming teaching, the self-adaptation recommendation device of online programming teaching includes:
the system comprises a dividing module, a query module and a processing module, wherein the dividing module is used for obtaining an answer information sample and dividing the answer information sample into a support set and a query set;
the inner-layer optimization module is used for minimizing a cross entropy loss function on the support set based on the support set and determining a capability estimation algorithm function based on the cross entropy loss function on the support set;
the outer layer optimization module is used for carrying out outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function;
and the recommendation module is used for acquiring the answer information of the learner, inputting the answer information into the recommendation model, and recommending the test questions to the answer information based on the recommendation model to obtain the recommendation strategy of the learner.
The present application further provides an adaptive recommendation device for online programming teaching, the adaptive recommendation device for online programming teaching includes: a memory, a processor, and a program stored on the memory for implementing the adaptive recommendation method for online programming teaching,
the memory is used for storing a program for realizing the self-adaptive recommendation method of the online programming teaching;
the processor is used for executing the program for realizing the self-adaptive recommendation method of the online programming teaching, so as to realize the steps of the self-adaptive recommendation method of the online programming teaching.
The present application also provides a storage medium having stored thereon a program for implementing an adaptive recommendation method for online programming teaching, the program for implementing an adaptive recommendation method for online programming teaching being executed by a processor to implement the steps of the adaptive recommendation method for online programming teaching.
Compared with the strategy recommended by depending on human experience in the prior art and the recommended deviation is large, the self-adaptive recommendation method, the self-adaptive recommendation device, the self-adaptive recommendation equipment and the storage medium for online programming teaching, provided by the application, are used for acquiring answer information samples and dividing the answer information samples into a support set and a query set; minimizing a cross entropy loss function on the support set based on the support set, and determining a capability estimation algorithm function based on the cross entropy loss function on the support set; performing outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function; and acquiring the answer information of the learner, inputting the answer information into the recommendation model, and recommending the test questions of the answer information based on the recommendation model to obtain the recommendation strategy of the learner. The method is based on a recommendation model obtained by a double-layer optimization method, the answer information of the learner is analyzed, the test questions with the optimal learning value of the learner are recommended, expert experience is not needed, the complex behaviors of the learner are considered in the identification process, and the test question recommendation deviation is reduced.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and together with the description, serve to explain the principles of the application. In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious for those skilled in the art to obtain other drawings without inventive labor.
FIG. 1 is a schematic diagram of an apparatus architecture of a hardware operating environment according to an embodiment of the present application;
FIG. 2 is a schematic flowchart illustrating a first embodiment of an adaptive recommendation method for online programming teaching according to the present application;
fig. 3 is a schematic flow chart of information processing according to a first embodiment of the adaptive recommendation method for online programming teaching according to the present application.
The implementation, functional features and advantages of the objectives of the present application will be further explained with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
As shown in fig. 1, fig. 1 is a schematic terminal structure diagram of a hardware operating environment according to an embodiment of the present application.
The terminal in the embodiment of the application may be a PC, or may be a mobile terminal device having a display function, such as a smart phone, a tablet computer, an e-book reader, an MP3 (Moving Picture Experts Group Audio Layer III, motion Picture Experts compression standard Audio Layer 3) player, an MP4 (Moving Picture Experts Group Audio Layer IV, motion Picture Experts compression standard Audio Layer 4) player, a portable computer, or the like.
As shown in fig. 1, the terminal may include: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, a communication bus 1002. Wherein a communication bus 1002 is used to enable connective communication between these components. The user interface 1003 may include a Display screen (Display), an input unit such as a Keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface, a wireless interface. The network interface 1004 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface). The memory 1005 may be a high-speed RAM memory or a non-volatile memory (e.g., a magnetic disk memory). The memory 1005 may alternatively be a storage device separate from the processor 1001.
Optionally, the terminal may further include a camera, a Radio Frequency (RF) circuit, a sensor, an audio circuit, a WiFi module, and the like. Such as light sensors, motion sensors, and other sensors. Specifically, the light sensor may include an ambient light sensor that may adjust the brightness of the display screen according to the brightness of ambient light, and a proximity sensor that may turn off the display screen and/or the backlight when the mobile terminal is moved to the ear. As one of the motion sensors, the gravity acceleration sensor can detect the magnitude of acceleration in each direction (generally, three axes), detect the magnitude and direction of gravity when the mobile terminal is stationary, and can be used for applications (such as horizontal and vertical screen switching, related games, magnetometer attitude calibration), vibration recognition related functions (such as pedometer and tapping) and the like for recognizing the attitude of the mobile terminal; of course, the mobile terminal may also be configured with other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which are not described herein again.
Those skilled in the art will appreciate that the terminal structure shown in fig. 1 is not intended to be limiting and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.
As shown in fig. 1, a memory 1005, which is a kind of computer storage medium, may include therein an operating device, a network communication module, a user interface module, and an adaptive recommendation program for online programming teaching.
In the terminal shown in fig. 1, the network interface 1004 is mainly used for connecting to a backend server and performing data communication with the backend server; the user interface 1003 is mainly used for connecting a client (user side) and performing data communication with the client; and processor 1001 may be used to invoke an adaptive recommendation program for online programming teaching stored in memory 1005.
Referring to fig. 2, an embodiment of the present application provides an adaptive recommendation method for online programming teaching, where the adaptive recommendation method for online programming teaching includes:
step S100, obtaining an answer information sample, and dividing the answer information sample into a support set and a query set;
step S200, based on the support set, minimizing a cross entropy loss function on the support set, and based on the cross entropy loss function on the support set, determining a capability estimation algorithm function;
step S300, performing outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function;
and step S400, acquiring the answer information of the learner, inputting the answer information into the recommendation model, and carrying out test question recommendation processing on the answer information based on the recommendation model to obtain the recommendation strategy of the learner.
In this embodiment, the specific application scenarios may be:
at present, a method for generating a recommendation strategy for learning and teaching mainly depends on expert experience related to learning and teaching, and corresponding recommendation strategies are designed according to the characteristics of different diagnosis models. However, the adaptivity of such recommendation strategies is limited, and in the design of the recommendation strategies, expert experience has to be relied on to understand the details and principles of the diagnosis model in order to design the corresponding recommendation strategies, and in the ability assessment of learners, the traditional expert design method cannot identify the complex behaviors of learners in the process, and the recommendation deviation is large.
The method comprises the following specific steps:
step S100, obtaining an answer information sample, and dividing the answer information sample into a support set and a query set;
in this embodiment, the adaptive recommendation method for online programming teaching is applied to an adaptive recommendation device for online programming teaching.
In this embodiment, the answer information samples are answer information of multiple learners and used for training a recommendation model, wherein the content of the answer information samples is the same as the answer information of the learners, and is not described herein again.
In this embodiment, the device divides the answer information sample into support set and query set, i.e. randomly divides the learner i answer record into query set
Figure 991650DEST_PATH_IMAGE001
And supporting set
Figure 424293DEST_PATH_IMAGE002
Two parts.
Step S200, based on the support set, minimizing a cross entropy loss function on the support set, and based on the cross entropy loss function on the support set, determining a capability estimation algorithm function;
in the present embodiment, in the inner-layer optimization, the recommendation strategy is sequentially answered from the support set according to learner i's previous answer
Figure 492743DEST_PATH_IMAGE003
Selecting a question; next, minimize the support set
Figure 490655DEST_PATH_IMAGE004
Cross entropy loss on to estimate the capability in the outer layer
Figure 526613DEST_PATH_IMAGE005
Wherein the formula is as follows:
Figure 104225DEST_PATH_IMAGE006
wherein q represents the question ID, a represents whether the answer of the test question is correct or not, if a is 1, the answer of the test question is correct, and if a is 0, the answer is wrong,
Figure 27181DEST_PATH_IMAGE007
for cross-entropy loss of the capability diagnosis,
Figure 464503DEST_PATH_IMAGE008
is the cross entropy loss to measure the accuracy of the capability estimate.
Specifically, the step S200 includes the following step S210:
and step S210, fusing the cross entropy loss function on the support set with a preset potential capacity estimation to obtain the capacity estimation algorithm function.
In this embodiment, since the inferred capacities from the previous response records are not unique, the apparatus improves the estimation accuracy and robustness by fusing one or more sets of preset potential capacity estimates on the cross-entropy loss function on the support set, wherein the preset potential capacity estimates are:
Figure 253337DEST_PATH_IMAGE009
for example, a programming problem can be solved by a plurality of algorithm methods such as dynamic programming, breadth-first search, brute force search and the like at the same time, and the different using methods determine that the learner can have a plurality of different potential abilities.
Figure 900219DEST_PATH_IMAGE010
Meaning that, due to uncertainty, the learner's previous behavior corresponds to a set of estimates rather than a single value. This approach has two advantages: 1) a set of estimates may be used to describe the learner's competence from different perspectives and to resist perturbations. Further fusing them can improve the accuracy of each step of estimation; 2) the estimation method is general and is based on the objective patterns that exist in the learner's interaction with the test question, without additional restrictions and assumptions being made on the learning process itself.
Adjusted at each turn
Figure 943261DEST_PATH_IMAGE011
In the inner layer optimization (MLE), a diversity regularization term is added
Figure 142030DEST_PATH_IMAGE012
. The specific calculation formula is as follows:
Figure 765297DEST_PATH_IMAGE013
wherein the content of the first and second substances,
Figure 560078DEST_PATH_IMAGE014
Figure 378998DEST_PATH_IMAGE015
wherein the content of the first and second substances,
Figure 748668DEST_PATH_IMAGE016
is the MLE loss, ensuring the accuracy of the estimation.
Figure 653039DEST_PATH_IMAGE017
Is the coefficient weight that controls the diversity regularization term. Due to optimization
Figure 517090DEST_PATH_IMAGE018
It is not possible to get the average value before all m estimates are generated, so a provisional average of the previous i-1 estimates is used
Figure 408428DEST_PATH_IMAGE019
Substitution
Figure 683420DEST_PATH_IMAGE020
. All estimates
Figure 888136DEST_PATH_IMAGE021
Are generated sequentially and balance accuracy and diversity between them.
Step S300, performing outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function;
specifically, the step S300 includes the following steps S310 to S320:
step S310, based on the capability estimation algorithm function, minimizing a cross entropy loss function on the query set;
and S320, performing outer layer optimization on the model to be optimized based on the cross entropy loss function on the query set, and determining a recommended strategy algorithm function.
In the embodiment, in the outer-layer optimization, the cross entropy loss on the learner's query set is minimized to optimize according to the capability estimation algorithm function estimated by the inner layer, so as to generate the final recommendation strategy, and the formula is as follows:
Figure 477250DEST_PATH_IMAGE022
wherein n is the total number of learners in the data set,
Figure 598658DEST_PATH_IMAGE007
for the cross-entropy loss of the capability diagnosis,
Figure 454006DEST_PATH_IMAGE008
to measure the cross-entropy loss of accuracy of the capability estimate,
Figure 677177DEST_PATH_IMAGE005
the learner's competence estimated for the current moment.
In this embodiment, this two-layer optimization presents the following three advantages: 1) the error in the capability estimate is mainly due to the difference in the selected problem, and this further guides
Figure 273244DEST_PATH_IMAGE023
And (4) optimizing. Due to real ability
Figure 452421DEST_PATH_IMAGE024
Is unknown, the present invention utilizes on a query set
Figure 272478DEST_PATH_IMAGE025
The degree of fit of (c) to measure the estimation error in the outer layer. 2) Since the programmed learning may stop at any round/time according to different stopping rules, the present invention simplifies the goal and sums all steps to minimize losses. 3) Policy
Figure 717366DEST_PATH_IMAGE026
Is independent of the type of diagnostic model and, more importantly, it allows for efficient topic selection adaptively based on the characteristics of a given diagnostic model. Once the recommendation strategy is generated, its parameters do not need to be updated in the learning process, and the next test question can be selected adaptively according to the previous answer records.
In another embodiment, a policy will be recommended
Figure 588895DEST_PATH_IMAGE026
Is defined as the objective of an optimization problem. Based on this, an equivalent transformation can be performed on it:
Figure 950475DEST_PATH_IMAGE027
wherein the content of the first and second substances,
Figure 879117DEST_PATH_IMAGE028
is the average cross entropy of the diagnostic model over the query set. Thus, the two-tier optimization is translated into a maximized cumulative expected reward in reinforcement learning, i.e.
Figure 545721DEST_PATH_IMAGE029
. In particular, in the reinforcement learning MDP
Figure 811486DEST_PATH_IMAGE030
Is defined as: 1) the set of states S refers to all the response records obtained in the t-th round, i.e.
Figure 437027DEST_PATH_IMAGE031
(ii) a 2) The action set A is usedIn the question bank, each question/action is selected at most once in the learning process of a learner; 3) the state transition function P refers to the selection of topics
Figure 411937DEST_PATH_IMAGE032
Then observe the learner state as
Figure 80684DEST_PATH_IMAGE033
The probability of (c). Because whether the learner answers the question is unknown in advance, the question is a source of uncertainty in the learning process; 4) the reward function R is the negative loss of the ability estimation on the query set in the tth round, i.e. the
Figure 556665DEST_PATH_IMAGE034
Based on the above definition, the adaptive recommendation of online programming teaching in programming learning is redefined as a decision process: given the learner's previous answers and some diagnostic model, which test question best suits him and most accurately measures his ability. In fact, the decision processes of learner responses, ability estimation and recommendation strategies are mutually influenced and depended, and further become a complex device. The reinforcement learning framework can explore more most suitable test questions for different learners from a long-term perspective. For example, multiple test question combinations may generally provide more comprehensive and accurate learner competency assessment and promotion.
In the embodiment, the self-adaptive recommendation method for the online programming teaching based on reinforcement learning is applied to the self-adaptive recommendation of the online programming teaching on the online programming education platform and used for automatically selecting test questions suitable for learners in the learning process, so that the evaluation accuracy and the programming learning efficiency are improved. The learner data is answer data from the online education platform, and the data can be divided into data describing attributes of the learner and the test question and data describing answer behaviors of the learner. The performance data may then be screened based on the learning context and the target learner group. The filtered learner data is further partitioned into a query set and a support set to generate a recommendation strategy based on the optimization problem objective. The learner's answering behavior is input to the embedding layer and the behavior learning module from two channels according to the correctness, and the test questions are converted into low-dimensional dense feature vectors. Then, the feature vectors with low dimension and density are input into a contradiction capturing module, and the contradiction between the test question pairs is extracted by using a double attention mechanism, so that the guess and the error behavior of the learner are identified. Finally, in order to improve the accuracy of the adaptive recommendation of the online programming teaching, a new ability estimation method is designed based on the multifaceted characteristics of the learner in the learning process, and a group of estimation values are fused to enhance the robustness of the ability estimation in the learning.
And step S400, acquiring the answer information of the learner, inputting the answer information into the recommendation model, and carrying out test question recommendation processing on the answer information based on the recommendation model to obtain the recommendation strategy of the learner.
In this embodiment, a learner is usually a learner on an online learning platform, the answer information of the learner is related to the question that the learner has made a historical answer on the online learning platform, and the answer information of the learner can be divided into the following two categories: one type is data describing various attributes of the learner and the test question, including but not limited to basic information of the learner, a question ID, a knowledge point to which the question relates, and the like. Another type is data describing the learner's response activity including, but not limited to, time of response, topic ID of response, result of response (correct or incorrect), etc.
In this embodiment, the method for acquiring the learner's answer information by the device may be extracted from a database of the online learning platform, or may be acquired by inputting the learner's answer information into the device, which is not limited in this embodiment.
In another embodiment, after the step of obtaining the learner's answer information, the device filters the learner's answer information based on the preset filtering criteria. In the embodiment of the invention, the composition data is screened according to the learning range and the target learner group, for example, if an online programming learning course is organized for primary and middle school students, only the knowledge points are reserved according to the related knowledge pointsProgramming problems that those first-level algorithms can solve. In the process, all answered records of learner i may be represented as tuple set D i = ({ (q, a) }, q denotes the topic ID, and if a is 1, it means that it correctly answers the test question q, and a is 0, it means that the answer is wrong.
In this embodiment, the device inputs the answer information into a preset recommendation model, based on the recommendation model obtained by a double-layer optimization method training, analyzes the answer information of the learner and recommends a test question with the optimal learning value for the learner, without depending on expert experience, and considers the complex behavior of the learner in the recognition process to reduce the deviation of test question recommendation.
Specifically, the step S400 includes the following steps S410 to S420:
step S410, based on the recommendation model, evaluating the answer information and determining the ability information of the learner;
in this embodiment, the device evaluates the answer information based on the recommendation model, and determines the learner's competence information.
Specifically, for each learner, its competency may be expressed as
Figure 377990DEST_PATH_IMAGE035
And is
Figure 648435DEST_PATH_IMAGE036
Representing the true value of his ability to be evaluated, where d refers to the dimension of the ability (e.g., the number of knowledge points to be evaluated). An adaptive recommender for online programming teaching generally comprises two parts: 1) capability diagnostic model
Figure 264531DEST_PATH_IMAGE037
The system is used for predicting the probability of the learner's answer to each test question to model the learner's ability; 2) recommendation strategy
Figure 622831DEST_PATH_IMAGE038
The most suitable topic is selected from the topic library based on the past behavior of the learner. Specifically, in the first step of learningt wheel, which selects the next question
Figure 751193DEST_PATH_IMAGE039
To give the learner a response
Figure 192539DEST_PATH_IMAGE040
Specifically, the step S410 includes the following steps S411 to S414:
step S411, determining questions answered correctly and questions answered incorrectly in the answer information;
in this embodiment, the device determines the right answer and wrong answer in the answer information according to the data of the learner's answer behavior in the answer information, wherein the data of the learner's answer behavior includes the time of answering, the question ID of answering, the answer result (correct or wrong), and the like.
In this embodiment, referring to fig. 3, the network mainly comprises a dual-channel behavior learning module (denoted as PL), a contradiction capturing module (denoted as CL) and a decision module. First, the PL captures two different pieces of performance information, right and wrong, respectively, for learners (false answers are usually much less than correct answers in the real world). Second, the CL identifies and extracts inconsistencies in learner performance in an attempt to mitigate the effects of perturbations (i.e., guesses and miss factors). And finally, the decision layer selects the next question.
Specifically, firstly, embedding and characterizing the test questions in the answer records: given the current state of learner i
Figure 835878DEST_PATH_IMAGE041
Whole question bank
Figure 263449DEST_PATH_IMAGE042
Each test question is embedded into a d-dimensional continuous space to construct an embedded matrix
Figure 921351DEST_PATH_IMAGE043
. Because the learner provides different information for different answer results (correct and wrong) of the same test questionThere are two representations for each question:
Figure 189390DEST_PATH_IMAGE044
and
Figure 195392DEST_PATH_IMAGE045
the embedded matrix is the answer pair and the wrong answer respectively.
Step S412, based on the recommendation model, respectively capturing the performance information of the right-answered question and the wrong-answered question through self-attention operation, and determining first competence information of the learner based on the performance information;
in this embodiment, a two-pass self-care operation is used to handle correct and incorrect answers, respectively. Setting k 1 And k 2 Representing the number of correct and incorrect answers at round t, respectively. The questions to be answered are represented as two embedded matrices,
Figure 426653DEST_PATH_IMAGE046
and is
Figure 201711DEST_PATH_IMAGE047
(where 1 is correct and 0 is wrong) and then entered into the self-attention layer. It is usually composed of two sublayers, namely a self-attention layer and a feed-forward network. Specifically, the self-attention layer is defined as:
Figure 378002DEST_PATH_IMAGE048
wherein the projection matrix
Figure 605721DEST_PATH_IMAGE049
Are the corresponding learnable parameters. The Attention function is implemented by a scaled dot product operation:
Figure 109515DEST_PATH_IMAGE050
wherein C, K and V respectively represent Query (Query) and key(Key) and Value (Value).
Figure 129292DEST_PATH_IMAGE051
Is a scaling factor that prevents the use of large inner product values. Meanwhile, in order to endow the layer with nonlinear characteristics and consider interaction in different potential dimensions, a point-by-point feed-forward network is applied
Figure 348921DEST_PATH_IMAGE052
In (1). The specific calculation method comprises the following steps:
Figure 204881DEST_PATH_IMAGE053
wherein
Figure 639930DEST_PATH_IMAGE054
Is the ReLU activation function; weight matrix
Figure 248634DEST_PATH_IMAGE055
And
Figure 514531DEST_PATH_IMAGE056
has the dimension of
Figure 451263DEST_PATH_IMAGE057
;b (1) And b (2) Are two bias terms. Therefore, the temperature of the molten metal is controlled,
Figure 608443DEST_PATH_IMAGE058
is the output of this two-channel behavior learning module, i.e., the learner's first competency information.
Step S413, capturing a contradiction between the question of the answer pair and the question of the wrong answer through a double attention operation based on the first competence information, and determining second competence information of the learner based on the contradiction;
in this embodiment, the contradiction between the question of the answer pair and the question of the wrong answer is finally defined as the complexity of the behavior of the learner in learning mainly represented by guessing and wrong factors, for example, when the learner encounters a wrong question or algorithm, the learner can test the sample by guessing to obtain a higher score (i.e. guessing factor); when faced with a simple test question, the answer may be wrong (i.e., a failure factor) due to carelessness or insufficient comprehensiveness. To achieve an increase in the efficiency of programming learning, the recommendation strategy should identify and eliminate these perturbations in the learner's performance in order to better select the next test question. For example, if a learner correctly answers a difficult test question, but incorrectly answers a simpler test question, the possible contradictions are: the problem may be guessed or missed in simple questions, or both.
In this embodiment, the device captures the contradiction between the correct and incorrect responses through a double attention operation, and the specific formula is as follows:
Figure 961669DEST_PATH_IMAGE059
wherein
Figure 398467DEST_PATH_IMAGE060
Is a question of examination
Figure 88074DEST_PATH_IMAGE061
And
Figure 721050DEST_PATH_IMAGE062
of the contradiction score, weight matrix
Figure 789500DEST_PATH_IMAGE063
And
Figure 52991DEST_PATH_IMAGE064
has the dimension of
Figure 623037DEST_PATH_IMAGE065
. All of
Figure 200649DEST_PATH_IMAGE066
A fractional matrix is formed
Figure 920343DEST_PATH_IMAGE067
. Then, the contradiction scores
Figure 823577DEST_PATH_IMAGE068
Normalization was performed from the row and column dimensions respectively by the Softmax function:
Figure 815673DEST_PATH_IMAGE069
thereby, respectively constructing a matrix
Figure 996643DEST_PATH_IMAGE070
. To further extract the problem pairs where there are contradictions, the module uses the generated matrices in performance learning
Figure 774106DEST_PATH_IMAGE071
And using fractional matrices
Figure 644979DEST_PATH_IMAGE072
Double attention and feed forward operations are performed:
Figure 655529DEST_PATH_IMAGE073
wherein the content of the first and second substances,
Figure 106102DEST_PATH_IMAGE074
and
Figure 3651DEST_PATH_IMAGE075
and contradictory feature matrixes of the test questions in the two channels, namely second capability information, are respectively obtained.
Step S414, obtaining the learner' S competence information based on the first competence information and the second competence information.
In the present embodiment, it is preferred that,
Figure 313934DEST_PATH_IMAGE076
is the output of this two-channel behavior learning module, i.e. learningThe first capability information of the person who has the first capability,
Figure 749463DEST_PATH_IMAGE077
contradictory feature matrices, i.e., second ability information, of the test questions in the two channels, respectively, the device being based on the first ability information and the second ability information, the learner's ability information being
Figure 613514DEST_PATH_IMAGE078
Step S420, based on the ability information, determining the recommendation strategy of the learner.
In this embodiment, the decision layer in the network is based on
Figure 755783DEST_PATH_IMAGE079
The four matrixes determine the selection of the next question, namely the recommendation strategy of the learner. Here, an average pooling operation is applied to each matrix and they are concatenated into a vector
Figure 561933DEST_PATH_IMAGE080
. Here, the Q-learning algorithm is used, which corresponds to the approximate action value function, noted
Figure 148074DEST_PATH_IMAGE081
The Q-learning algorithm is a reinforcement learning algorithm, and other reinforcement learning algorithms such as a time difference method (TD method), Policy Gradient, Actor-critical, etc. may also be used in this embodiment, which is not limited herein. Using a feed-forward neural network to represent:
Figure 550236DEST_PATH_IMAGE082
wherein
Figure 874907DEST_PATH_IMAGE083
And
Figure 524063DEST_PATH_IMAGE084
is a matrix of the weights that is,
Figure 481655DEST_PATH_IMAGE085
and
Figure 812142DEST_PATH_IMAGE086
is a bias term. To obtain
Figure 56566DEST_PATH_IMAGE087
Then, the optimal strategy is the recommended strategy of the learner
Figure 814307DEST_PATH_IMAGE088
Compared with the strategy recommended by depending on human experience in the prior art, the self-adaptive recommendation method for on-line programming teaching has the advantages that the recommended deviation is large, in the method, answer information samples are obtained and are divided into a support set and a query set; minimizing a cross entropy loss function on the support set based on the support set, and determining a capability estimation algorithm function based on the cross entropy loss function on the support set; performing outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function; and acquiring the answer information of the learner, inputting the answer information into the recommendation model, and recommending the test questions of the answer information based on the recommendation model to obtain the recommendation strategy of the learner. The method is based on a recommendation model obtained by a double-layer optimization method, the answer information of the learner is analyzed, the test questions with the optimal learning value of the learner are recommended, expert experience is not needed, the complex behaviors of the learner are considered in the identification process, and the test question recommendation deviation is reduced.
The application also provides an adaptive recommendation device of online programming teaching, the adaptive recommendation device of online programming teaching includes:
the system comprises a dividing module, a query module and a processing module, wherein the dividing module is used for obtaining an answer information sample and dividing the answer information sample into a support set and a query set;
the inner-layer optimization module is used for minimizing a cross entropy loss function on the support set based on the support set and determining a capability estimation algorithm function based on the cross entropy loss function on the support set;
the outer layer optimization module is used for carrying out outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function;
and the recommendation module is used for acquiring the answer information of the learner, inputting the answer information into the recommendation model, and recommending the test questions of the answer information based on the recommendation model to obtain the recommendation strategy of the learner.
Optionally, the inner layer optimization module includes:
and the fusion module is used for fusing the cross entropy loss function on the support set with a preset potential capability estimation to obtain the capability estimation algorithm function.
Optionally, the outer layer optimization module includes:
a query set minimization loss function module for minimizing a cross entropy loss function on the query set based on the capability estimation algorithm function;
and the recommendation strategy algorithm function determining module is used for performing outer layer optimization on the model to be optimized based on the cross entropy loss function on the query set to determine a recommendation strategy algorithm function.
Optionally, the recommendation module includes:
the evaluation module is used for evaluating the answer information based on the recommendation model and determining the ability information of the learner;
and the recommendation strategy determination module is used for determining the recommendation strategy of the learner based on the capability information.
Optionally, the evaluation module comprises:
the classification module is used for determining questions answered correctly and questions answered incorrectly in the answer information;
the performance learning module is used for respectively capturing performance information of the right-answered question and the wrong-answered question through self-attention operation based on the recommendation model and determining first ability information of the learner based on the performance information;
a contradiction capturing module, configured to capture a contradiction between the question of the answer pair and the question of the wrong answer through a double attention operation based on the first competence information, and determine second competence information of the learner based on the contradiction;
and the competence information determining module is used for obtaining the competence information of the learner based on the first competence information and the second competence information.
The specific implementation manner of the adaptive recommendation device for online programming teaching is basically the same as that of each embodiment of the adaptive recommendation method for online programming teaching, and is not described herein again.
Referring to fig. 1, fig. 1 is a schematic terminal structure diagram of a hardware operating environment according to an embodiment of the present application.
As shown in fig. 1, the terminal may include: a processor 1001, such as a CPU, a network interface 1004, a learner interface 1003, a memory 1005, a communication bus 1002. Wherein a communication bus 1002 is used to enable connective communication between these components. The learner interface 1003 may include a Display screen (Display), an input unit such as a Keyboard (Keyboard), and the optional learner interface 1003 may further include a standard wired interface, a wireless interface. The network interface 1004 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface). The memory 1005 may be a high-speed RAM memory or a non-volatile memory such as a disk memory. The memory 1005 may alternatively be a storage device separate from the processor 1001.
Optionally, the adaptive recommendation device for online programming teaching may further include a rectangular learner interface, a network interface, a camera, a Radio Frequency (RF) circuit, a sensor, an audio circuit, a WiFi module, and the like. The rectangular learner interface may include a Display screen (Display), an input sub-module such as a Keyboard (Keyboard), and the optional rectangular learner interface may further include a standard wired interface, a wireless interface. The network interface may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface).
Those skilled in the art will appreciate that the adaptive recommendation device architecture of the online programming teaching shown in FIG. 1 does not constitute a limitation of the adaptive recommendation device of the online programming teaching, and may include more or fewer components than shown, or some components in combination, or a different arrangement of components.
As shown in fig. 1, a memory 1005, which is a storage medium, may include an operating system, a network communication module, and an adaptive recommendation program for online programming teaching. The operating system is a program that manages and controls the adaptive recommendation device hardware and software resources for online programming teaching, supports the operation of the adaptive recommendation program for online programming teaching, and other software and/or programs. The network communication module is used for communication among the components in the memory 1005 and with other hardware and software in the adaptive recommendation system for online programming teaching.
In the adaptive recommendation device for online programming teaching shown in fig. 1, the processor 1001 is configured to execute an adaptive recommendation program for online programming teaching stored in the memory 1005, and implement any of the steps of the adaptive recommendation method for online programming teaching described above.
The specific implementation manner of the adaptive recommendation device for online programming teaching of the present application is substantially the same as that of each embodiment of the adaptive recommendation method for online programming teaching, and is not described herein again.
The present application also provides a storage medium having stored thereon a program for implementing an adaptive recommendation method for online programming teaching, the program being executed by a processor to implement the adaptive recommendation method for online programming teaching as follows:
acquiring the answering information of the learner;
and inputting the answer information into a preset recommendation model, and carrying out test question recommendation processing on the answer information based on the recommendation model to obtain the recommendation strategy of the learner, wherein the recommendation model is obtained by carrying out double-layer optimization method training on a model to be optimized based on an answer information sample.
Optionally, before the step of obtaining the learner's answer information, the method includes:
acquiring an answer information sample;
dividing the answer information sample into a support set and a query set;
based on the support set, carrying out inner layer optimization on a preset model to be optimized, and determining a capability estimation algorithm function;
and performing outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain the recommendation model based on the recommendation strategy algorithm function.
Optionally, the step of performing inner-layer optimization on a preset model to be optimized based on the support set to determine a capability estimation algorithm function includes:
minimizing a cross entropy loss function on the support set;
determining the capability estimation algorithm function based on a cross entropy loss function on the support set.
Optionally, the step of determining the capability estimation algorithm function based on the cross entropy loss function on the support set includes:
and fusing the cross entropy loss function on the support set with a preset potential capacity estimation to obtain the capacity estimation algorithm function.
Optionally, the step of performing outer layer optimization on the model to be optimized based on the query set and the capability estimation algorithm function to determine a recommended policy algorithm function includes:
minimizing a cross entropy loss function on the set of queries based on the capability estimation algorithm function;
and performing outer layer optimization on the model to be optimized based on the cross entropy loss function on the query set, and determining a recommended strategy algorithm function.
Optionally, the step of performing test question recommendation processing on the answer information based on the recommendation model to obtain the recommendation strategy of the learner includes:
based on the recommendation model, evaluating the answer information and determining the ability information of the learner;
determining a recommendation strategy for the learner based on the competency information.
Optionally, the step of evaluating the answer information based on the recommendation model to determine the learner's competence information includes:
determining questions answered in pairs and questions answered in wrong in the answer information;
respectively capturing performance information of the question of the answer pair and the question of the wrong answer through self-attention operation based on the recommendation model, and determining first ability information of the learner based on the performance information;
capturing contradictions between the questions of the answer pair and the wrong questions through double attention operation based on the first competence information, and determining second competence information of the learner based on the contradictions;
and obtaining the learner's competence information based on the first competence information and the second competence information.
The specific implementation of the storage medium of the present application is substantially the same as the embodiments of the adaptive recommendation method for online programming teaching, and is not described herein again.
The present application also provides a computer program product, comprising a computer program which, when executed by a processor, performs the steps of the above-described adaptive recommendation method for online programming teaching.
The specific implementation of the computer program product of the present application is substantially the same as the embodiments of the adaptive recommendation method for online programming teaching, and is not described herein again.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The above-mentioned serial numbers of the embodiments of the present application are merely for description and do not represent the merits of the embodiments.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present application may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) as described above and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present application.
The above description is only a preferred embodiment of the present application, and not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings of the present application, or which are directly or indirectly applied to other related technical fields, are included in the scope of the present application.

Claims (8)

1. An adaptive recommendation method for online programming teaching is characterized in that the adaptive recommendation method for online programming teaching comprises the following steps:
acquiring an answer information sample, and dividing the answer information sample into a support set and a query set;
minimizing a cross entropy loss function on the support set based on the support set, and determining a capability estimation algorithm function based on the cross entropy loss function on the support set;
performing outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function;
and acquiring the answer information of the learner, inputting the answer information into the recommendation model, and recommending the test questions of the answer information based on the recommendation model to obtain the recommendation strategy of the learner.
2. The adaptive recommendation method for on-line programming teaching of claim 1 wherein said step of determining said capability estimation algorithm function based on a cross entropy loss function on said support set comprises:
and fusing the cross entropy loss function on the support set with a preset potential capacity estimation to obtain the capacity estimation algorithm function.
3. The adaptive recommendation method for on-line programming teaching of claim 1, wherein said step of performing outer layer optimization on said model to be optimized based on said query set and said capability estimation algorithm function, and determining a recommendation policy algorithm function comprises:
minimizing a cross entropy loss function on the query set based on the capability estimation algorithm function;
and performing outer layer optimization on the model to be optimized based on the cross entropy loss function on the query set, and determining a recommended strategy algorithm function.
4. The adaptive recommendation method for on-line programming teaching of claim 1, wherein the step of performing test question recommendation processing on the answer information based on the recommendation model to obtain the recommendation strategy of the learner comprises:
based on the recommendation model, evaluating the answer information and determining the ability information of the learner;
determining a recommendation strategy for the learner based on the competency information.
5. The adaptive recommendation method for on-line programming teaching of claim 4, wherein said step of evaluating said answer information based on said recommendation model to determine said learner's competency information comprises:
determining questions answered in pairs and questions answered in wrong in the answer information;
respectively capturing performance information of the question of the answer pair and the question of the wrong answer through self-attention operation based on the recommendation model, and determining first ability information of the learner based on the performance information;
capturing a contradiction between the question of the answer pair and the question of the wrong answer through double attention operation based on the first ability information, and determining second ability information of the learner based on the contradiction;
and obtaining the learner's ability information based on the first ability information and the second ability information.
6. An adaptive recommendation device for online programming teaching, the adaptive recommendation device comprising:
the system comprises a dividing module, a query module and a processing module, wherein the dividing module is used for obtaining an answer information sample and dividing the answer information sample into a support set and a query set;
the inner-layer optimization module is used for minimizing a cross entropy loss function on the support set based on the support set and determining a capability estimation algorithm function based on the cross entropy loss function on the support set;
the outer layer optimization module is used for carrying out outer layer optimization on a preset model to be optimized based on the query set and the capability estimation algorithm function, determining a recommendation strategy algorithm function, and training to obtain a recommendation model based on the recommendation strategy algorithm function;
and the recommendation module is used for acquiring the answer information of the learner, inputting the answer information into the recommendation model, and recommending the test questions of the answer information based on the recommendation model to obtain the recommendation strategy of the learner.
7. An adaptive recommendation device for online programming teaching, the adaptive recommendation device comprising: a memory, a processor, and a program stored on the memory for implementing the adaptive recommendation method for online programming teaching,
the memory is used for storing a program for realizing the self-adaptive recommendation method of the online programming teaching;
the processor is used for executing a program for implementing the adaptive recommendation method of the online programming teaching to implement the steps of the adaptive recommendation method of the online programming teaching according to any one of claims 1 to 5.
8. A storage medium having stored thereon a program for implementing an adaptive recommendation method for online programming teaching, the program being executed by a processor to implement the steps of the adaptive recommendation method for online programming teaching according to any one of claims 1 to 5.
CN202210902123.9A 2022-07-29 2022-07-29 Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching Pending CN114969301A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210902123.9A CN114969301A (en) 2022-07-29 2022-07-29 Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210902123.9A CN114969301A (en) 2022-07-29 2022-07-29 Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching

Publications (1)

Publication Number Publication Date
CN114969301A true CN114969301A (en) 2022-08-30

Family

ID=82969473

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210902123.9A Pending CN114969301A (en) 2022-07-29 2022-07-29 Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching

Country Status (1)

Country Link
CN (1) CN114969301A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110930274A (en) * 2019-12-02 2020-03-27 中山大学 Practice effect evaluation and learning path recommendation system and method based on cognitive diagnosis
US20210397941A1 (en) * 2020-06-12 2021-12-23 International Business Machines Corporation Task-oriented machine learning and a configurable tool thereof on a computing environment
US20220215032A1 (en) * 2020-02-13 2022-07-07 Tencent Technology (Shenzhen) Company Limited Ai-based recommendation method and apparatus, electronic device, and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110930274A (en) * 2019-12-02 2020-03-27 中山大学 Practice effect evaluation and learning path recommendation system and method based on cognitive diagnosis
US20220215032A1 (en) * 2020-02-13 2022-07-07 Tencent Technology (Shenzhen) Company Limited Ai-based recommendation method and apparatus, electronic device, and storage medium
US20210397941A1 (en) * 2020-06-12 2021-12-23 International Business Machines Corporation Task-oriented machine learning and a configurable tool thereof on a computing environment

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
AAAI2022: "Fully Adaptive Framework: Neural Computerized Adaptive Testing for Online Education", 《AAAI2022》 *
HYUNBIN LOH.等: "Recommendation for Effective Standardized Exam Preparation", 《11TH INTERNATIONAL LEARNING ANALYTICS AND KNOWLEDGE CONFERECENCE》 *
SHUANGHONG SHEN等: "Assessing student"s Dynamic Knowledge State by Exploring the Question Difficulty Effect", 《PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR COFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION》 *
YAN ZHUANG等: "A Robust Computerized Adaptive Testing Approach in Educational Question Retrieval", 《SIGIR22》 *
研行笔录: "NCAT:用于在线教育的神经计算机自适应测试(AAAI2022论文解析)", 《CSDN》 *
魏思等: "融合通用题目表征学习的神经知识追踪方法研究", 《中文信息学报》 *

Similar Documents

Publication Publication Date Title
CN110399541B (en) Topic recommendation method and device based on deep learning and storage medium
KR101834003B1 (en) Cognitive learning and estimation system
KR102571069B1 (en) User knowledge tracking device, system and operation method thereof based on artificial intelligence learning
CN109934744B (en) Method, system and computer readable recording medium for providing educational service based on knowledge unit
CN112184500A (en) Extraclass learning tutoring system based on deep learning and knowledge graph and implementation method
Geden et al. Predictive student modeling in educational games with multi-task learning
CN111581529B (en) Course recommendation method and device combining student fitness and course collocation degree
CN109559576B (en) Child accompanying learning robot and early education system self-learning method thereof
CN111797854A (en) Scene model establishing method and device, storage medium and electronic equipment
Zhang et al. Personalized task difficulty adaptation based on reinforcement learning
CN112819024A (en) Model processing method, user data processing method and device and computer equipment
Chen et al. LogCF: Deep Collaborative Filtering with Process Data for Enhanced Learning Outcome Modeling.
US20070094172A1 (en) Neural network based rating system
JP7293658B2 (en) Information processing device, information processing method and program
CN114969301A (en) Self-adaptive recommendation method, device, equipment and storage medium for online programming teaching
US20220122481A1 (en) User Knowledge Tracking Device, System, and Operation Method thereof Based on Artificial Intelligence Learning
CN114358988B (en) Teaching mode pushing method and device based on AI technology
US11922827B2 (en) Learning management systems and methods therefor
CN114973815A (en) System and method for testing students in online teaching
Thabet et al. Towards intelligent serious games: deep knowledge tracing with hybrid prediction models
Pagliuca et al. Integrating learning by experience and demonstration in autonomous robots
Smith et al. Integrating heuristic and simulation-based reasoning in intuitive physics
KR102639379B1 (en) Method, apparatus and system for providing of user-customized golf professionals matching platform service based on artificial intelligence model
KR102603914B1 (en) Method, apparatus and system for providing of yoga exercise guide and curriculum using user-costomized anatomy based on image analysis
KR102411190B1 (en) Holistic student assessment framework based on multi-task learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20220830