CN112035654A

CN112035654A - Automatic topic derivation and generation method

Info

Publication number: CN112035654A
Application number: CN202010905871.3A
Authority: CN
Inventors: 王鑫
Original assignee: Shanghai Squirrel Classroom Artificial Intelligence Technology Co Ltd
Current assignee: Shanghai Squirrel Classroom Artificial Intelligence Technology Co Ltd
Priority date: 2020-09-01
Filing date: 2020-09-01
Publication date: 2020-12-04

Abstract

The invention discloses a topic automatic derivation generation method, which comprises the following steps: step S10, obtaining student information and obtaining an identification point to be observed corresponding to the student information; step S20, determining question patterns matched with the student information and the knowledge points to be investigated according to the student information and the knowledge points to be investigated; and step S30, generating a question corresponding to the knowledge point to be examined and consistent with the question type according to the question type and the knowledge point to be examined. The technical scheme can automatically generate questions according to the recognition points to be observed of the students, the generated questions and the existing questions are low in repetition probability, the question generation efficiency is improved, and the examination effect of the questions on the students is improved.

Description

Automatic topic derivation and generation method

Technical Field

The invention relates to the technical field of data processing, in particular to an automatic topic derivation generation method.

Background

With the development and popularization of computer technology and the internet, the work, study and life style of people are greatly changed, for example, people increasingly utilize computers to obtain knowledge, and the computers also provide more and more convenient services for people; in the field of education, computer technology also provides convenience to teachers and students.

At present, aiming at practice problems or test problems which are usually contacted by teachers or students, the teachers basically search questions from a question bank according to the teaching requirements of the subject, the questions are easy to repeat with the existing questions, the question repetition rate is high, and the examination of the students is not facilitated.

Disclosure of Invention

The invention provides an automatic topic derivation and generation method.

The invention provides an automatic topic derivation generation method, which comprises the following steps of S10-S30:

step S10, obtaining student information and obtaining an identification point to be observed corresponding to the student information;

step S20, determining question patterns matched with the student information and the knowledge points to be investigated according to the student information and the knowledge points to be investigated;

and step S30, generating a question corresponding to the knowledge point to be examined and consistent with the question type according to the question type and the knowledge point to be examined.

Preferably, in step S10, the acquiring student information and acquiring the to-be-inspected identification point corresponding to the student information includes:

acquiring student information, wherein the student information comprises student characteristic information, student grade information, student capability information and historical test information; the student characteristic information comprises any one or more of student number, student name, student class and student age;

acquiring subject information matched with the student grade information according to the student grade information;

extracting all subject knowledge points matched with the student information in the subject information according to the subject information;

acquiring mastered knowledge points corresponding to the student information according to the student capability information and the historical test information;

and comparing the mastered knowledge points with all corresponding subject knowledge points to obtain the mastered knowledge points corresponding to the student information as the to-be-observed knowledge points corresponding to the student information.

Preferably, the determining the question type matched with the student information and the knowledge point to be investigated according to the student information and the knowledge point to be investigated includes:

acquiring a difficulty coefficient corresponding to the identification point to be observed;

according to the student capability information, acquiring a difficulty coefficient range of a subject which is acceptable by a student and corresponds to the student information;

judging whether the difficulty coefficient corresponding to the knowledge point to be inspected exceeds the difficulty coefficient range or not;

if the difficulty coefficient range is not exceeded, determining a first question type as the question type matched with the student information and the knowledge point to be investigated; the first question type comprises any one of question types of question answering questions and blank filling questions;

and if the difficulty coefficient range is exceeded, determining a second question type as the question type matched with the student information and the knowledge point to be investigated, wherein the second question type comprises any one of a judgment question and a selection question.

Preferably, the method further comprises:

pushing the generated questions to student terminals corresponding to the student information, so that students can perform corresponding tests based on the generated questions;

and obtaining a test result obtained by testing the student corresponding to the student information based on the generated question and feedback information of the student.

Preferably, the method further comprises:

searching a system question bank according to the generated question, and identifying whether the existing question which is the same as the question exists in the system question bank;

if the existing questions same as the questions do not exist, storing the questions to the system questions;

if there is an existing topic that is the same as the topic, step S30 is re-executed to generate a new topic.

Preferably, when there are a plurality of knowledge points to be investigated, the generating a topic corresponding to the knowledge point to be investigated and consistent with the topic type according to the topic type and the knowledge point to be investigated includes steps a1-a 6:

a1, acquiring a text from a preset text library, and splitting the text to obtain a text abstract and a text main body content corresponding to the text;

step A2, judging whether the text abstract comprises at least M1 recognition points to be observed, if so, continuing the step A3; if not, return to step A1; m1 is a positive integer equal to or greater than 1;

a3, segmenting the text main body content to obtain a plurality of sub-contents, removing the sub-contents which do not comprise any knowledge point to be inspected in the plurality of sub-contents, and leaving a plurality of first target sub-contents, wherein each first target sub-content comprises at least one knowledge point to be inspected;

step A4, calculating the weight ratio of the knowledge points to be examined in the text main body content, wherein the knowledge points are included in each first target sub-content;

a5, screening the plurality of first target sub-contents according to the weight proportion to obtain a second remaining target sub-content;

and A6, generating a question which contains the knowledge point to be examined and is consistent with the question type according to the second target sub-content.

Preferably, the step a4, calculating the weight proportion of the knowledge point to be investigated included in each first target sub-content in the main text content, includes:

calculating a degree of association between the plurality of first target sub-contents according to the following formula (1):

in the formula (1), g_iRepresenting the ratio of the knowledge points to be mastered included in the ith first target sub-content to all the knowledge points to be mastered; g_i+1Representing the ratio of the knowledge points to be mastered included in the (i + 1) th first target sub-content to all the knowledge points to be mastered; g_i-1Representing the ratio of the knowledge points to be mastered included in the first target sub-content to all the knowledge points to be mastered; d_iRepresenting the ratio of the storage space occupied by the ith first target sub-content to the storage space occupied by the text main body content; d_i+1Indicating that the (i + 1) th first target sub-content occupiesA ratio of storage space to storage space occupied by the text body content; d_i-1Representing the ratio of the storage space occupied by the i-1 st first target sub-content to the storage space occupied by the text main body content; n is the total number of first target sub-content;

calculating the space proportion s of all the first target sub-contents including the kth recognition point to be observed in the text main body content by using the formula (2)_1kThen, there are:

wherein m represents a total number of paragraphs of the text body content; h_kjRepresenting the number of lines occupied by a first target sub-content including a k-th recognition point to be observed in the j section; h is_jRepresents the total number of lines included in the j section; delta_1kDenotes s_1kThe corresponding space-to-space ratio correction value is a preset value with the value range of [0.01, 0.05 ]]；

Calculating the occurrence frequency ratio s of all first target sub-contents including the k-th recognition point to be observed in the text main body content by using the formula (3)_2kThen, there are:

wherein, P_kjRepresenting the occurrence times of the kth to-be-observed recognition point in the jth segment; g_kjRepresenting the total occurrence number of each knowledge point to be examined appearing in the j section; delta_2kDenotes s_2kThe corresponding appearance frequency ratio correction value is a preset value with the value range of [0.01, 0.02 ]]；

According to said F, said s_1kS said_2kAnd the following formula (4) is used for calculating the weight ratio w of the kth recognition point to be observed in the main body content of the text_kThen, there are:

in the formula (4), a represents and s_1kThe correlation constant is 0.7, and b represents s_2kThe correlation constant takes a value of 0.3.

Preferably, the step a5, screening the plurality of first target sub-contents according to the weight percentage, to obtain a remaining second target sub-content, includes:

determining a target knowledge point to be inspected, wherein the corresponding weight ratio is equal to or greater than a preset weight ratio threshold;

acquiring each first target sub-content comprising the target knowledge point to be inspected, and taking each first target sub-content comprising the target knowledge point to be inspected as the second target sub-content;

step A6, generating a question containing the knowledge point to be examined and consistent with the question type according to the second target sub-content, comprising:

performing the following operations for each second target sub-content: extracting target knowledge points to be examined from the current second target sub-content; and generating a question corresponding to the extracted target knowledge point to be examined and consistent with the question type according to the current second target sub-content and the extracted target knowledge point to be examined.

The question automatic derivation generation method can intelligently and automatically generate questions according to the recognition points to be examined of the students, avoids repetition with the existing test questions, has low question repetition rate, can improve the examination and examination effects of the students, has high question generation speed, improves the question generation efficiency and accuracy, and improves the intelligence and objectivity of the question generation.

Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

The technical solution of the present invention is further described below by means of the accompanying drawings and examples.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:

FIG. 1 is a schematic workflow diagram of an embodiment of a topic automatic derivation generation method of the present invention.

Detailed Description

The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.

The invention provides an automatic topic derivation generation method, which aims to automatically derive and generate a corresponding topic according to related knowledge points and replace a traditional topic generation mode of manual topic generation.

FIG. 1 is a schematic diagram of a workflow of an embodiment of a topic automatic derivation generation method according to the present invention; the title automatic derivation generation method of the present invention can be implemented as steps S10-S30 as follows:

and step S10, obtaining student information and obtaining an identification point to be observed corresponding to the student information.

And step S20, determining question patterns matched with the student information and the knowledge points to be investigated according to the student information and the knowledge points to be investigated.

In an embodiment, the step S10 of obtaining the student information and obtaining the to-be-observed recognition point corresponding to the student information may be implemented as:

In one embodiment, the determining the topic type matching the student information and the knowledge point to be reviewed according to the student information and the knowledge point to be reviewed may be implemented as:

In one embodiment, the method may further comprise the steps of:

pushing the questions generated in the step S30 to student terminals corresponding to the student information, so that students can perform corresponding tests based on the generated questions;

In one embodiment, the method may further comprise the steps of:

The technical scheme can conveniently and quickly expand the questions in the system questions and improve the question expansion speed of the system question bank.

In one embodiment, when there are a plurality of knowledge points to be investigated (for example, when the recognition points to be investigated are a plurality of knowledge points in english, the knowledge points in english may be words, idioms, grammars, fixed sentences or structures, etc.), in this case, generating topics corresponding to the knowledge points to be investigated and corresponding to the topic models according to the topic models and the knowledge points to be investigated may include steps a1-a 6:

step A1, acquiring a text (such as an English novel) from a preset text library, and splitting the text to obtain a text abstract and a text main body content corresponding to the text;

In one embodiment, the step a4 of calculating the weight proportion of the knowledge point to be investigated included in each first target sub-content in the main text content includes:

in the formula (1), g_iRepresenting the ratio of the knowledge points to be mastered included in the ith first target sub-content to all the knowledge points to be mastered; g_i+1Representing the ratio of the knowledge points to be mastered included in the (i + 1) th first target sub-content to all the knowledge points to be mastered; g_i-1Representing the ratio of the knowledge points to be mastered included in the first target sub-content to all the knowledge points to be mastered; d_iRepresenting the ratio of the storage space occupied by the ith first target sub-content to the storage space occupied by the text main body content; d_i+1Representing the ratio of the storage space occupied by the (i + 1) th first target sub-content to the storage space occupied by the text main body content; d_i-1Representing the ratio of the storage space occupied by the i-1 st first target sub-content to the storage space occupied by the text main body content; n is the total number of first target sub-content;

In an embodiment, the step a5 of screening the plurality of first target sub-contents according to the weight percentage to obtain a remaining second target sub-content includes:

For example, when the question type is a choice question, the current second target sub-content is a section of English which comprises three sentences, the section of English comprises two knowledge points to be investigated, one of the knowledge points is an English vocabulary, and the other is a fixed sentence pattern structure; then, "generating a topic corresponding to the extracted target knowledge point to be examined and consistent with the topic type according to the current second target sub-content and the extracted target knowledge point to be examined" may be implemented as follows: and eliminating the target knowledge points to be examined in the current second target sub-content, replacing the target knowledge points with brackets, and giving a plurality of corresponding selectable filling items for each bracket so as to generate the title.

It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.

Claims

1. A method for generating automatic derivation of topics, the method comprising steps S10-S30:

2. The topic automatic derivation generation method of claim 1, wherein the step S10 of obtaining student information and obtaining an identification point to be observed corresponding to the student information comprises:

3. The topic automatic derivation generation method according to claim 2, wherein the determining the topic type matching the student information and the knowledge point to be reviewed according to the student information and the knowledge point to be reviewed comprises:

4. The method for automatic topic derivation generation according to claim 1, wherein the method further comprises:

5. The method for automatic topic derivation generation according to claim 1, wherein the method further comprises:

6. The method for automatically deriving and generating topics according to any one of claims 1 to 5, wherein when there are a plurality of knowledge points to be investigated, the generating topics corresponding to the knowledge points to be investigated and corresponding to the topic type according to the topic type and the knowledge points to be investigated comprises steps A1-A6:

7. The topic automatic derivation generation method according to claim 6, wherein the step a4 of calculating the weight ratio of the knowledge points to be investigated included in each first target sub-content in the main text content comprises:

8. The topic automatic derivation generation method of claim 6, wherein the step a5 of filtering the plurality of first target sub-contents according to the weight ratio to obtain a remaining second target sub-content comprises: