WO2023108991A1

WO2023108991A1 - Model training method and apparatus, knowledge classification method and apparatus, and device and medium

Info

Publication number: WO2023108991A1
Application number: PCT/CN2022/090718
Authority: WO
Inventors: 舒畅; 陈又新
Original assignee: 平安科技（深圳）有限公司
Priority date: 2021-12-15
Filing date: 2022-04-29
Publication date: 2023-06-22
Also published as: CN114238571A

Abstract

The present application belongs to the technical field of machine learning. Provided are a model training method and apparatus, a knowledge classification method and apparatus, and a device and a medium. The model training method comprises: acquiring original annotation data, wherein the original annotation data comprises question stem data, choice data and answer data; encoding the question stem data to obtain a question stem representation vector; encoding the choice data and the answer data according to a preset knowledge graph, so as to obtain a choice attribute value and an answer attribute value; performing word segmentation and splicing processing on the choice attribute value and the answer attribute value, so as to obtain a choice answer representation vector; performing vector splicing on the question stem representation vector and the choice answer representation vector, so as to obtain question data; and training a preset pre-training model according to the question data, so as to obtain a knowledge classification model, wherein the knowledge classification model is used for performing knowledge classification on a target question. The knowledge classification model obtained in the embodiments of the present disclosure can improve the accuracy and efficiency of knowledge classification.

Description

Model training method, knowledge classification method, device, equipment, medium

This application claims the priority of the Chinese patent application with the application number 202111536048.0 submitted to the China Patent Office on December 15, 2021, and the invention title is "model training method, knowledge classification method, device, equipment, medium", the entire content of which Incorporated in this application by reference.

technical field

The present application relates to the technical field of machine learning, and in particular to a model training method, a knowledge classification method, a device, a device, and a medium.

Background technique

With the development of artificial intelligence technology, there are currently technical solutions that can process various data based on artificial intelligence technology. For example, machine reading comprehension technology can be used to give answers to questions. Machine reading comprehension is a technology that enables machines to understand natural language texts and answer corresponding answers given questions and documents. This technology can be applied in many fields such as text question answering, information extraction in knowledge graph and event graph, and dialogue system.

technical problem

The following are the technical problems of the prior art that the inventors are aware of:

In some application scenarios, there is a lack of technical solutions for classifying knowledge. For example, in the English online education scenario, it is necessary to classify the topics for examining relevant English knowledge points, so as to divide the topics of the same knowledge points and provide special training for users. Since the number of English questions is too large, and some new questions are developed every year; if each question is divided manually, the workload is heavy, the efficiency is low, and it is easy to make mistakes.

technical solution

In the first aspect, the embodiment of the present application proposes a training method for a knowledge classification model, and the training method for the knowledge classification model includes:

Obtaining original annotation data; the original annotation data includes question stem data, option data and answer data;

Encoding the question stem data to obtain a question stem representation vector;

Encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

Segmenting and concatenating the option attribute value and the answer attribute value to obtain an option answer representation vector;

performing vector splicing of the question stem characterization vector and the option answer characterization vector to obtain question data;

The preset pre-training model is trained according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on the target topic to obtain the type of knowledge points.

In the second aspect, the embodiment of the present application proposes a knowledge classification method for multiple-choice questions, and the knowledge classification method for multiple-choice questions includes:

Obtain multiple-choice data to be classified; wherein, the multiple-choice data includes question stem data;

Inputting the stem characterization vector into a knowledge classification model; wherein, the knowledge classification model is obtained by training according to the method described in the first aspect above;

performing feature extraction on the stem data through the knowledge classification model to obtain feature vector information;

Knowledge classification processing is performed according to the feature vector information to obtain knowledge point types.

In the third aspect, the embodiment of the present application proposes a training device for a knowledge classification model, and the training device for the knowledge classification model includes:

The original data acquisition module is used to obtain the original annotation data; the original annotation data includes question stem data, option data and answer data;

A question stem coding module, configured to encode the question stem data to obtain a question stem representation vector;

The option answer encoding module is used to encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

A word segmentation and splicing module, used to perform word segmentation and splicing processing on the option attribute value and the answer attribute value to obtain an option answer representation vector;

A vector splicing module, configured to splice the question stem representation vector and the option answer representation vector to obtain topic data;

The classification model training module is used to train the preset pre-training model according to the topic data to obtain the knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on the target topic to obtain the type of knowledge points.

In the fourth aspect, the embodiment of the present application proposes a knowledge classification device for multiple-choice questions, and the knowledge classification device for multiple-choice questions includes:

The multiple-choice data acquisition module is used to obtain multiple-choice data to be classified; wherein, the multiple-choice data includes question stem data, option data and answer data;

A data input module, configured to input the data of the multiple-choice questions into the knowledge classification model; wherein, the knowledge classification model is trained according to the method described in the first aspect above;

A feature extraction module, configured to perform feature extraction on the multiple-choice question data through the knowledge classification model to obtain feature vector information;

The knowledge classification module is configured to perform knowledge classification processing according to the feature vector information to obtain knowledge point types.

In the fifth aspect, the embodiment of the present application proposes a computer device, including:

at least one memory;

at least one processor;

at least one program;

The program is stored in the memory, and the processor executes the at least one program to implement a knowledge classification model training method or a multiple-choice knowledge classification method, wherein the knowledge classification model training method includes: obtaining Original labeling data; wherein, the original labeling data includes question stem data, option data and answer data; encoding the question stem data to obtain question stem representation vectors; Carry out encoding processing to obtain the option attribute value and the answer attribute value; perform word segmentation and splicing processing on the option attribute value and the answer attribute value to obtain an option answer representation vector; represent the question stem representation vector and the option answer The vectors are spliced to obtain topic data; the preset pre-training model is trained according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on the target topic to obtain knowledge points type. The knowledge classification method for multiple-choice questions includes: obtaining multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data; encoding the question stem data to obtain question stem representation vectors; The stem representation vector is input to the knowledge classification model; wherein, the knowledge classification model is obtained by training according to the above-mentioned knowledge classification model training method; the feature extraction is performed on the question stem data through the knowledge classification model to obtain feature vector information; Knowledge classification processing is performed according to the feature vector information to obtain knowledge point types.

In the sixth aspect, the embodiment of the present application provides a storage medium, the storage medium is a computer-readable storage medium, and the computer-readable storage medium stores computer-executable instructions, and the computer-executable instructions are used to make a computer execute A method for training a knowledge classification model or a method for classifying knowledge for multiple-choice questions, wherein the method for training the knowledge classification model includes: obtaining original label data; wherein the original label data includes question stem data, option data and answer data; encoding the question stem data to obtain a question stem representation vector; encoding the option data and answer data according to a preset knowledge graph to obtain an option attribute value and an answer attribute value; encoding the option attribute value Perform word segmentation and splicing processing with the answer attribute value to obtain the option answer characterization vector; carry out vector splicing of the question stem characterization vector and the option answer characterization vector to obtain topic data; The training model is trained to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on target topics to obtain knowledge point types. The knowledge classification method for multiple-choice questions includes: obtaining multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data; encoding the question stem data to obtain question stem representation vectors; The stem representation vector is input to the knowledge classification model; wherein, the knowledge classification model is obtained by training according to the above-mentioned knowledge classification model training method; the feature extraction is performed on the question stem data through the knowledge classification model to obtain feature vector information; Knowledge classification processing is performed according to the feature vector information to obtain knowledge point types.

Beneficial effect

The training method of the knowledge classification model proposed in the embodiment of the present application, the knowledge classification method of multiple choice questions, the training device of knowledge classification model, the knowledge classification device of multiple choice questions, the computer equipment, the storage medium, the knowledge classification model that the training obtains can be used for the target topic Carrying out knowledge classification processing to obtain knowledge point types that meet requirements can improve the accuracy and efficiency of knowledge classification.

Description of drawings

The accompanying drawings are used to provide a further understanding of the technical solution of the present application, and constitute a part of the specification, and are used together with the embodiments of the present application to explain the technical solution of the present application, and do not constitute a limitation to the technical solution of the present application.

FIG. 1 is a flowchart of a training method for a knowledge classification model provided by an embodiment of the present disclosure;

Fig. 2 is the flowchart of step 102 in Fig. 1;

Fig. 3 is a partial flowchart of the training method of the knowledge classification model provided by another embodiment;

Fig. 4 is the flowchart of step 103 in Fig. 1;

Fig. 5 is a flowchart of step 104 in Fig. 1;

FIG. 6 is a flow chart of a knowledge classification method for multiple-choice questions provided by an embodiment of the present disclosure;

7 is a functional block diagram of a training device for a knowledge classification model provided by an embodiment of the present disclosure;

Fig. 8 is a functional block diagram of a multiple-choice knowledge classification method device provided by an embodiment of the present disclosure;

FIG. 9 is a schematic diagram of a hardware structure of a computer device provided by an embodiment of the present disclosure.

Embodiments of the present invention

In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application.

It should be noted that although the functional modules are divided in the schematic diagram of the device, and the logical sequence is shown in the flowchart, in some cases, it can be executed in a different order than the module division in the device or the flowchart in the flowchart. steps shown or described. The terms "first", "second" and the like in the specification and claims and the above drawings are used to distinguish similar objects, and not necessarily used to describe a specific sequence or sequence.

Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the technical field to which this application belongs. The terms used herein are only for the purpose of describing the embodiments of the present application, and are not intended to limit the present application.

First, analyze some nouns involved in this application:

Artificial Intelligence (AI): It is a new technical science that studies and develops theories, methods, technologies and application systems for simulating, extending and expanding human intelligence; artificial intelligence is a branch of computer science. Intelligence attempts to understand the essence of intelligence and produce a new intelligent machine that can respond in a manner similar to human intelligence. Research in this field includes robotics, language recognition, image recognition, natural language processing, and expert systems. Artificial intelligence can simulate the information process of human consciousness and thinking. Artificial intelligence is also a theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results.

Natural language processing (NLP): NLP uses computers to process, understand and use human languages (such as Chinese, English, etc.). NLP belongs to a branch of artificial intelligence and is an interdisciplinary subject between computer science and linguistics. Known as computational linguistics. Natural language processing includes syntax analysis, semantic analysis, text understanding, etc. Natural language processing is often used in technical fields such as machine translation, handwritten and printed character recognition, speech recognition and text-to-speech conversion, information retrieval, information extraction and filtering, text classification and clustering, public opinion analysis and opinion mining. It involves language processing Related data mining, machine learning, knowledge acquisition, knowledge engineering, artificial intelligence research and linguistics research related to language computing, etc.

Knowledge Graph: It combines the theories and methods of applied mathematics, graphics, information visualization technology, information science and other disciplines with metrology citation analysis, co-occurrence analysis and other methods, and uses the visual graph to display the subject visually. The core structure, development history, frontier fields and overall knowledge structure of modern theory to achieve the purpose of multidisciplinary integration. The main goal of the knowledge map is to describe various entities and concepts that exist in the real world, as well as the strong relationship between them. We use relationships to describe the association between two entities.

Entity: Refers to something that is distinguishable and exists independently. Such as a certain person, a certain city, a certain plant, a certain commodity, etc. Everything in the world is made up of concrete things, which refer to entities. Entities are the most basic elements in knowledge graphs, and different entities have different relationships.

Concept: A collection of entities of a certain type.

Semantic class (concept): A collection of entities with the same characteristics, such as countries, nations, books, computers, etc. Concepts mainly refer to collections, categories, object types, and types of things, such as people, geography, etc.

Relationship (Relationship): There is a certain relationship between entities and entities, between different concepts and concepts, and between concepts and entities. The relation is formalized as a function that maps k points to a Boolean value. On a knowledge graph, a relation is a function that maps kk graph nodes (entities, semantic classes, attribute values) to Boolean values.

Attribute (value): The value of an entity-specific attribute, which is the attribute value pointed from an entity to it. Different attribute types correspond to edges with different types of attributes. The attribute value mainly refers to the value of the specified attribute of the object. For example: "area", "population", "capital" are several different attributes. The attribute value mainly refers to the value of the specified attribute of the object, such as 9.6 million square kilometers, etc.

Triple: triple ({E,R}) is a general representation of knowledge graph; the basic form of triple mainly includes (entity 1-relationship-entity 2) and (entity-attribute-attribute value) wait. Each entity (the extension of the concept) can be identified by a globally unique ID, each attribute-value pair (AVP) can be used to describe the intrinsic characteristics of the entity, and the relationship can be used to connect two entities. the connection between them. For example, in a knowledge graph example, China is an entity, Beijing is an entity, China-capital-Beijing is a triplet example of (entity-relationship-entity), Beijing is an entity, and population is a Attributes, 20.693 million are attribute values. Beijing-population-20.693 million constitutes an example triplet of (entity-attribute-attribute value).

token: token is the basic unit of indexing, representing each indexed character; if a field is tokenized, it means that the field has passed an analysis program that can convert the content into a token string; in the process of tokenization , the parser applies any transformation logic (such as removing stop words such as "a" or "the", performing a stemming search, converting all text without case sensitivity to lowercase, etc.), the extraction should be compiled Text content to be indexed.

BERT (Bidirectional Encoder Representation from Transformers) model: The BERT model further increases the generalization ability of the word vector model, fully describes the character-level, word-level, sentence-level and even inter-sentence relationship features, and is built based on Transformer.

Large-scale pre-training models such as the BERT model have achieved good results in natural language processing tasks and have been recognized by the industry. However, these large-scale pre-training models usually have huge parameters (for example, the BERT-base model has 110 million parameters, and the BERT-large model has 340 million parameters), which brings great challenges to fine-tuning and online deployment. The massive parameters make these The speed of model fine-tuning and deployment is slow, and the calculation cost is high, which causes great delay and capacity limitation for real-time applications, so model compression is of great significance.

The embodiments of the present application may acquire and process relevant data based on artificial intelligence technology. Among them, artificial intelligence (AI) is the theory, method, technology and application system that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence, perceive the environment, acquire knowledge and use knowledge to obtain the best results. .

Artificial intelligence basic technologies generally include technologies such as sensors, dedicated artificial intelligence chips, cloud computing, distributed storage, big data processing technology, operation/interaction systems, and mechatronics. Artificial intelligence software technology mainly includes computer vision technology, robotics technology, biometrics technology, speech processing technology, natural language processing technology, and machine learning/deep learning.

In some application scenarios, such as English online education scenarios, it is necessary to classify the topics for examining relevant English knowledge points, so as to divide the topics of the same knowledge points and conduct special training for users. Since the number of English questions is too large, and some new questions are developed every year; if each question is divided manually, the workload is heavy, the efficiency is low, and it is easy to make mistakes.

Based on this, an embodiment of the present disclosure provides a training method for a knowledge classification model, a knowledge classification method for multiple-choice questions, a training device for a knowledge classification model, a knowledge classification device for multiple-choice questions, a computer device, and a storage medium, which can improve the ability of the model to classify knowledge. accuracy and efficiency.

The knowledge classification model training method, knowledge classification method for multiple-choice questions, training device for knowledge classification model, knowledge classification device for multiple-choice questions, computer equipment, and storage media provided by the embodiments of the present disclosure will be specifically described through the following embodiments. The training method of the knowledge classification model in the embodiment of the present disclosure.

The training method of the knowledge classification model provided by the embodiment of the present disclosure relates to the technical field of machine learning. The training method of the knowledge classification model provided by the embodiments of the present disclosure may be applied to a terminal, may also be applied to a server, and may also be software running on the terminal or the server. In some embodiments, the terminal can be a smart phone, a tablet computer, a notebook computer, a desktop computer, or a smart watch; the server end can be configured as an independent physical server, or as a server cluster composed of multiple physical servers or as a distributed The system can also be configured to provide basic cloud computing such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and big data and artificial intelligence platforms. The cloud server of the service; the software can be the application of the training method for realizing the knowledge classification model, etc., but it is not limited to the above forms.

FIG. 1 is an optional flow chart of a method for training a knowledge classification model provided by an embodiment of the present disclosure. The method in FIG. 1 may include but not limited to steps 101 to 106 .

Step 101, obtaining original annotation data; the original annotation data includes question stem data, option data and answer data;

Step 102, encoding the question stem data to obtain a question stem representation vector;

Step 103: Encoding the option data and answer data according to the preset knowledge map to obtain option attribute values and answer attribute values;

Step 104, performing word segmentation and splicing processing on the option attribute value and the answer attribute value to obtain the option answer representation vector;

Step 105, performing vector concatenation of the question stem representation vector and the option answer representation vector to obtain question data;

Step 106: Train the preset pre-training model according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on the target topic to obtain the type of knowledge points.

Specifically, in step 101 of an application scenario, it is necessary to obtain a certain amount of original labeling data, for example, 1 million pieces of original labeling data. The original labeling data may be manually labeled topic data. The type of knowledge points investigated by the topic, that is, the label of the original labeled data is the type of knowledge point. For example, the type of knowledge point investigated in [attributive clause] is an attributive clause, and the type of knowledge point investigated in [adverbial clause] is an adverbial clause. In this embodiment, 1 million labeled data are used to train the model, so tens of millions or even more English questions can be automatically classified at the cost of only 1 million data.

Furthermore, in some application scenarios, such as the application scenario of English online education, the original annotation data is the question stem data, option data and answer data of English multiple-choice questions. In the application scenario of English online education, it is necessary to classify the topic data for examining relevant English knowledge points, so as to divide the topics of the same knowledge point and conduct special training for users. Because the amount of question data is too large, and some new questions are developed every year; if you rely on manual division of each question, the workload is heavy, the efficiency is low, and it is easy to make mistakes. Therefore, in the embodiment of the present disclosure, by obtaining the original annotation data, and encoding the question stem data in the original annotation data, the question stem representation vector is obtained, and the option data and answer in the original annotation data are analyzed according to the preset knowledge graph. The data is encoded, so that the option attribute value and the answer attribute value can be obtained, and then the option attribute value and the answer attribute value are subjected to word segmentation and splicing processing to obtain the option answer representation vector, and then the question stem representation vector and the option answer The characterization vectors are spliced to obtain the topic data, and finally the preset pre-training model is trained according to the topic data to obtain a knowledge classification model, which can be used to perform knowledge classification processing on the target topic to obtain knowledge points type, the knowledge classification model obtained in the embodiments of the present disclosure can improve the accuracy and efficiency of knowledge classification.

Take an English multiple-choice question as an example. In a multiple-choice question that investigates attributive clauses, a sentence containing clauses is given in the question stem data: My house, which I bought last year, has got a lovely garden. The question stem requires judging the clause type of the clause "which I bought last year". The option data is: A, B, C, D four options, where option A is an adverbial clause, option B is a main clause, option C is an attributive clause, and option D is an predicative clause. There is only one answer, and the answer data corresponds to: attributive clause. That is, the answer to this multiple choice question is option C.

Please refer to Fig. 2, in step 102 of some embodiments, the question stem data is encoded to obtain the question stem representation vector, specifically including:

Step 201, preprocessing the question stem data to obtain a preliminary question stem sequence;

Step 202, perform word segmentation processing on the preliminary question stem sequence to obtain a question stem representation vector.

In a specific application scenario, step 201 includes:

Convert the English content of the question stem data to lowercase to obtain the preliminary question stem sequence.

For example, if the English content of the question stem data includes: I lOVE YOU, all I lOVE YOU are converted into lowercase, and the obtained preliminary question stem sequence is: i love you.

Further, step 201 also includes:

The English abbreviated content of the question stem data is restored to the English full name, and the preliminary question stem sequence is obtained.

For example, if the English abbreviation content of the question stem data is: i'm, then the preliminary question stem sequence obtained after restoring the I'm containing the English abbreviation to the English full name is: i am.

In a specific application scenario, in step 202, word segmentation processing is performed on the preliminary question stem sequence to obtain a question stem representation vector, specifically including:

Tokenize the preliminary question stem sequence to obtain the question stem representation vector. In some embodiments, the preliminary stem sequence is:

i am playing

The stem representation vector obtained after tokenizing i am playing is:

[i,am,play,ing]

Please refer to FIG. 3 , in some embodiments, before step 103, the training method of the knowledge classification model also includes: building a knowledge map, which may specifically include but not limited to steps 301 to 303:

Step 301, acquiring preset knowledge points;

Step 302, constructing a first triplet and a second triplet according to preset knowledge points;

Step 303, constructing a knowledge graph based on the first triple and the second triple; wherein, the first triple includes the first knowledge entity, relationship, and second knowledge entity, and the second triple includes the second knowledge entity , attribute, attribute value.

In step 301 of some embodiments, technical means such as a web crawler may be used to crawl relevant data such as preset knowledge points; relevant data may also be obtained from a preset database. In some application scenarios, the preset knowledge points are preset English knowledge points, such as English test points in the English online education scenario.

In step 302 of some embodiments, the principle of constructing the English knowledge map is: constructing the first triplet and the second triplet according to each knowledge point of the preset knowledge points, wherein the first triplet includes the first knowledge Entity, relation, second knowledge entity, the second triple group includes second knowledge entity, attribute, attribute value. Through the first triple group, the association relationship between the first knowledge entity and the second knowledge entity is established, specifically, the connection of the association relationship between the first knowledge entity and the second knowledge entity is established through an undirected edge. Explanation on the first triple: if there is a relationship between two knowledge nodes, then the two knowledge nodes with the relationship are connected together by an undirected edge. The knowledge node is called an entity, and the undirected edge represents The relationship between the two knowledge nodes, in the embodiment of the present disclosure, the two knowledge nodes correspond to the first knowledge entity and the second knowledge entity. In the second triplet, the second knowledge entity represents the name of the corresponding English knowledge point, and the second triplet represents: the name of the corresponding English knowledge point, the attribute of the English knowledge point, and the attribute value corresponding to the attribute .

In a specific application scenario, the first triple can be expressed as: clause-include-attributive clause; or the first triple can be expressed as: clause-include-adverbial clause; where [clause] is the corresponding English knowledge point , this English knowledge point includes [attributive clause] and [adverbial clause] two knowledge points, and the internal relationship is containment.

In a specific application scenario, the second triple can be expressed as: attributive clause-grade-grade 8, attributive clause-relative word-which; among them, the [attributive clause] has an attribute of [grade], and the [grade] The attribute value of is [Grade 8], which means that the [attributive clause] is a knowledge point of [Grade 8]. At the same time, the [attributive clause] also has an attribute value of [relative word], and the attribute value of this [relative word] is which.

In the embodiment of the present disclosure, by constructing a knowledge map of English knowledge points, the composition structure of English knowledge points and the inspection points of English knowledge points can be clearly known; in addition, the sum of edges between two knowledge points can be calculated to Whether two knowledge points are similar knowledge points can be judged with reference to related technologies, which is not limited in the embodiments of the present disclosure.

Please refer to FIG. 4, in step 103 of some embodiments, the preset knowledge graph includes the first triplet and the second triplet, and the option data and answer data are encoded according to the preset knowledge graph to obtain the option Attribute values and answer attribute values, which may specifically include but are not limited to include:

In some embodiments, the knowledge graph includes a first triplet and multiple second triplets, and the option data and answer data are encoded according to a preset knowledge graph to obtain option attribute values and answer attribute values, including:

Step 401: Encoding option data according to the first triplet and multiple second triplets to obtain option attribute values; wherein, the option attribute value includes attribute values of multiple second triplets;

Step 402: Encode the answer data according to the first triplet and one of the second triplets to obtain the answer attribute value; wherein, the answer attribute value is one of the multiple attribute values in the option attribute value .

Specifically, in order to improve the accuracy of the model, the embodiment of the present disclosure introduces the knowledge information of the knowledge map to the encoding stage of the options and answers. The options and answers of the questions are used to obtain knowledge entities through the relevant information of the first triplet and the second triplet of the knowledge graph. Specifically, in a specific application scenario, take an English multiple-choice question as an example. In a multiple-choice question that investigates attributive clauses, a sentence containing clause content is given in the question stem data: My house, which I bought last year, has got a lovely garden. In the question stem data, it is required to judge the clause type of the clause "which I bought last year". The option data is: A, B, C, D four options, where option A is an adverbial clause, option B is a main clause, option C is an attributive clause, and option D is an predicative clause. There is only one answer, and the answer data corresponds to: attributive clause. That is, the answer to this multiple choice question is option C. The first triple of the knowledge map is expressed as: clause-contains-attributive clause, and the second triple is: attributive clause-relative word-which. The "which" in the clause "which I bought last year" is a relative word, and the type of the corresponding clause is "attributive clause", which is the expression of the second triple: attributive clause-relative word-which. The answer corresponding to the type of the clause "which I bought last year" is: the clause is a defining clause, and the answer corresponds to the expression of the first triple: clause-contains-attributive clause. The option data is encoded according to the first triplet and multiple second triplets, and the obtained option attribute values are: adverbial clause, subject clause, attributive clause, and predicative clause. The answer data is encoded according to the first triplet and one of the second triplets, and the obtained answer attribute value is: attributive clause (that is, the attributive clause in the option attribute value); in this application scenario, the English knowledge investigated The point is the judgment of the attributive clause in the clause.

Referring to Fig. 5, in step 104 of some embodiments, word segmentation and splicing are performed on the option attribute value and the answer attribute value to obtain the option answer representation vector, which may specifically include but not limited to include:

Step 501, perform word vectorization on the option attribute value and answer attribute value, and obtain the option attribute value and answer attribute value of word vectorization;

Step 502 , concatenate the option attribute values and answer attribute values quantized to obtain option answer representation vectors.

Specifically, in some embodiments, the knowledge words corresponding to the option attribute value and the answer attribute value are vectorized into a vector token corresponding to the option attribute value and a vector token corresponding to the answer attribute value, and then the two vectors The tokens are spliced to obtain the option answer representation vector.

It should be understood that in other embodiments, the attribute value of the option and the attribute value of the answer can be spliced first to obtain the attribute value of the option answer, and then the attribute value of the option answer is vectorized into a vector token corresponding to the option answer, that is, the option answer representation vector.

In a specific application scenario, the option attribute value is a sequence of sentences A, the answer attribute value is a sentence B, and the two sentences A and B are concatenated into an option answer representation vector. Specifically, the option answer characterization vector can be a sequence with a length of 320; if the length of the option answer characterization vector is not 320, the option answer characterization vector needs to be zero-filled; and because the option attribute value may be very long, Therefore, it is necessary to truncate the attribute value of the option, and cut off the tail of a longer sentence each time until the length of the entire option answer representation vector is 320.

In an application scenario of a multiple-choice question examining attributive clauses, the question stem data is given a clause content, and it is required to judge the clause type of the clause content. The options are A, B, C, and D. Option A is an adverbial Dependent clauses, option B is the main clause, option C is the attributive clause, and option D is the predicative clause; the answer data corresponds to: attributive clause; that is, the option attribute value includes the adverbial clause, the subject clause, the attributive clause, and the predicative clause; the answer attribute value as an attributive clause. Therefore, the option answer representation vector obtained after word segmentation and concatenation of the option attribute value and the answer attribute value is expressed as [adverbial clause, subject clause, attributive clause, predicative clause, attributive clause].

In step 105 of some embodiments, the question stem characterization vector and the option answer characterization vector are vector concatenated to obtain question data, which may specifically include but not limited to include:

The question stem representation vector and the option answer representation vector are vector-spliced through separators to obtain the question data.

In some embodiments, the delimiter can be a pair of placeholders: a first placeholder [CLS] and a second placeholder [SEP], wherein the first placeholder [CLS] represents the beginning of the sequence, and the second The placeholder [SEP] indicates the end of the sequence. Among them, CLS (classifer token), also called classifier identifier or identifier, is a special token whose word embedding is usually used for classification tasks; SEP (sentence separator) is also called sentence separation identifier or separation A character is also a special token that can be used to separate two sentences.

The question stem characterization vector and the option answer characterization vector are vector-spliced through the separator to obtain the question data, including:

The question stem characterization vector is set between the first placeholder and the second placeholder, the second placeholder is set between the question stem characterization vector and the option answer characterization vector, and the question stem characterization vector and the option answer characterization vector Perform vector splicing to obtain the topic data. Specifically, the representation form of the question data is: [<CLS>, question stem representation vector, <SEP>, option answer representation vector]

The following describes specific application scenarios:

For example, the stem representation vector is: i, am, play, ing

The option answer representation vector is: [adverbial clause, subject clause, attributive clause, predicative clause, attributive clause]

Then the question data obtained by vector splicing the question stem representation vector and the option answer representation vector through the separator is:

[<CLS>,i,am,play,ing,<SEP>, adverbial clause, attributive clause, attributive clause, predicative clause, attributive clause]

In step 106 of some embodiments, the preset pre-training model can be a BERT model; specifically, according to the topic data obtained in step 105 as the input of the BERT model, the BERT model is trained to obtain a knowledge classification model, the knowledge classification model The basic framework of BERT is the BERT model; the knowledge classification model is used to predict the knowledge type of the target topic; specifically, the knowledge classification model includes a softmax classifier; the knowledge classification model obtains the feature vector information corresponding to <CLS> according to the input topic data , <CLS> can predict the knowledge type of the target topic after passing through a softmax classifier. Wherein, the target topic is a topic input into the knowledge classification model, for example, it may be a multiple-choice topic, and more specifically, in the case of an English multiple-choice question, the target topic may be a multiple-choice question examining attributive clauses.

It should be understood that for each token-level word, it includes: token embedding, position embedding, and segment embedding; wherein the token embedding is a vector representation of the word on the entire corpus obtained by the token after the model is pre-trained on the corpus; The positional embedding is the position index of the current token in the sequence; the segmental embedding is to mark whether it is sentence A or sentence B in this sequence, where the segmental embedding of the token belonging to sentence A is 0, and the segmental embedding of the token belonging to sentence B is 1. The three embeddings of token embedding, position embedding, and segment embedding are spliced together to form the word embedding of each token, and the embedding of the entire sequence is input into the multi-layer bidirectional Transformer encoder, and the first one of the last hidden layer is taken The vector corresponding to the token (namely [CLS]) is used as the aggregate representation of the entire sentence, that is, the vector represents the vector representation of the entire option sequence. In this embodiment, the knowledge type of the topic can be predicted by passing the sequence represented by the topic data through the softmax classifier.

In the embodiment of the present disclosure, by obtaining the original annotation data and encoding the question stem data in the original annotation data, the question stem representation vector is obtained, and the option data and answer data in the original annotation data are processed according to the preset knowledge graph. Encoding processing, so that the option attribute value and the answer attribute value can be obtained, and then the option attribute value and the answer attribute value are subjected to word segmentation and splicing processing to obtain the option answer representation vector, and then the question stem representation vector and the option answer representation vector Perform vector splicing to obtain topic data, and finally train the preset pre-training model according to the topic data to obtain a knowledge classification model. This knowledge classification model can be used to perform knowledge classification processing on the target topic to obtain the type of knowledge points. The knowledge classification model obtained in the embodiments of the present disclosure can improve the accuracy and efficiency of knowledge classification.

In the embodiment of the present disclosure, based on the knowledge map and deep learning, the topics of English multiple-choice questions are classified, and the model can be used to automatically distinguish the knowledge points investigated by the questions. Compared with conventional classification methods, the technical solutions of the embodiments of the present disclosure can improve the accuracy and efficiency of knowledge classification, and by introducing the knowledge map coding information (triple information) of options and answers, it is possible to more accurately predict the content of the topic. knowledge type. With a fixed cost of labeled samples, new topics can be classified more efficiently.

Please refer to FIG. 6 , the embodiment of the present disclosure also provides a knowledge classification method for multiple-choice questions. The knowledge classification method for multiple-choice questions provided by the embodiment of the present disclosure relates to the technical field of machine learning. The multiple-choice knowledge classification method provided by the embodiments of the present disclosure can be applied to a terminal, can also be applied to a server, and can also be software running on the terminal or the server. In some embodiments, the terminal can be a smart phone, a tablet computer, a notebook computer, a desktop computer, or a smart watch; the server end can be configured as an independent physical server, or as a server cluster composed of multiple physical servers or as a distributed The system can also be configured to provide basic cloud computing such as cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN, and big data and artificial intelligence platforms. The cloud server of the service; the software can be the application of knowledge classification methods to realize multiple-choice questions, but it is not limited to the above forms.

Fig. 6 is an optional flow chart of the multiple-choice knowledge classification method provided by the embodiment of the present disclosure. The method in Fig. 6 may include but not limited to steps 601 to 604:

Step 601. Obtain multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data, option data and answer data;

Step 602, input multiple-choice question data into the knowledge classification model; wherein, the knowledge classification model is obtained through training according to the method of the first aspect above;

Step 603, perform feature extraction on the multiple-choice question data through the knowledge classification model, and obtain feature vector information;

Step 604, performing knowledge classification processing according to the feature vector information to obtain knowledge point types.

Specifically, in step 601, multiple-choice question data to be classified includes question stem data, option data and answer data. The multiple choice data is different from the original label data: the original label data includes knowledge point types, and the multiple choice data does not include knowledge point types.

It should be understood that the aforementioned target questions include multiple-choice question data to be classified.

In some embodiments, the knowledge classification model includes a softmax classifier.

In the knowledge classification method for multiple-choice questions, feature extraction is performed on the data of multiple-choice questions through the knowledge classification model, and the feature vector information corresponding to <CLS> is obtained, and the obtained feature vector information includes question stem representation vectors and option answer representation vectors; where, the The question stem characterization vector is the same as the question stem characterization vector in the above-mentioned knowledge classification model training method, that is, the question stem characterization vector in this embodiment is set between the first placeholder <CLS> and the second placeholder <SEP> , it can also be said that the question stem representation vector includes the first placeholder <CLS>; the knowledge classification method of the multiple-choice question in this embodiment is the same as the training method of the above-mentioned knowledge classification model and also includes: the second placeholder <SEP> set Between the question stem representation vector and the option answer representation vector, it can also be said that the option answer representation vector includes a second placeholder <SEP>.

In step 604 of some embodiments, according to the feature vector information corresponding to <CLS> obtained in step 603, through a softmax classifier, the softmax classifier can perform word count classification processing according to the feature vector information corresponding to <CLS>, thereby predicting The knowledge type of the topic.

In some application scenarios, such as English online education scenarios, it is necessary to classify the topics for examining relevant English knowledge points, so as to divide the topics of the same knowledge points and conduct special training for users. Since the number of questions is too large, and some new questions are developed every year; if each question is divided manually, the workload is heavy, the efficiency is low, and it is easy to make mistakes. In the embodiment of the present disclosure, by constructing a relevant English knowledge map and applying a deep learning method to classify the topics of English multiple-choice questions, the model can be used to automatically distinguish the knowledge points investigated by the questions.

Please refer to FIG. 7, an embodiment of the present disclosure also provides a training device for a knowledge classification model, which can implement the above-mentioned training method for the knowledge classification model. The training device for the knowledge classification model includes: an original data acquisition module for obtaining original label data ; The original labeling data includes question stem data, option data and answer data; the question stem encoding module is used to encode the question stem data to obtain the question stem representation vector; the option answer encoding module is used to obtain the question stem representation vector according to the preset knowledge The map encodes the option data and the answer data to obtain the option attribute value and the answer attribute value; the word segmentation and splicing module is used to perform word segmentation and splicing processing on the option attribute value and the answer attribute value to obtain the option answer representation vector ; A vector splicing module, used for vector splicing the question stem representation vector and the option answer representation vector to obtain topic data; a classification model training module, used for training a preset pre-training model according to the topic data , to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on the target topic to obtain the type of knowledge points.

The knowledge classification model training device in the embodiment of the present disclosure is used to execute the knowledge classification model training method in the above embodiment, and its specific processing process is the same as the knowledge classification model training method in the above embodiment, which will not be repeated here. A repeat.

Please refer to FIG. 8 , an embodiment of the present disclosure also provides a knowledge classification device for multiple-choice questions, which can realize the knowledge classification method for the above-mentioned multiple-choice questions. The knowledge classification device for multiple-choice questions includes: a data acquisition module for multiple-choice questions, used to obtain Multiple-choice question data; wherein, the multiple-choice question data includes question stem data, option data and answer data; the data input module is used to input the multiple-choice question data into the knowledge classification model; wherein, the knowledge classification model is the knowledge according to the above-mentioned first aspect The training method of the classification model is trained; the feature extraction module is used to extract the features of the multiple choice data through the knowledge classification model to obtain the feature vector information; the knowledge classification module is used to perform knowledge classification processing according to the feature vector information to obtain the knowledge point type .

The knowledge classification device for multiple-choice questions in the embodiment of the present disclosure is used to implement the knowledge classification method for multiple-choice questions in the above-mentioned embodiments, and its specific processing process is the same as the knowledge classification method for multiple-choice questions in the above-mentioned embodiments, and will not be repeated here. repeat.

An embodiment of the present disclosure also provides a computer device, including:

at least one memory;

at least one processor;

at least one program;

The program is stored in the memory, and the processor executes the at least one program to implement the above-mentioned knowledge classification model training method or multiple choice question knowledge classification method in the present disclosure. The computer device may be any intelligent terminal including a mobile phone, a tablet computer, a personal digital assistant (PDA for short), a vehicle-mounted computer, and the like.

Referring to FIG. 9, FIG. 9 illustrates a hardware structure of a computer device in another embodiment, and the computer device includes:

The processor 701 can be implemented by a general-purpose CPU (Central Processing Unit, central processing unit), a microprocessor, an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits, and is used to execute related programs to Realize the technical solutions provided by the embodiments of the present disclosure;

The memory 702 may be implemented in the form of a ROM (ReadOnly Memory, read only memory), a static storage device, a dynamic storage device, or a RAM (Random Access Memory, random access memory). The memory 702 can store operating systems and other application programs. When implementing the technical solutions provided by the embodiments of this specification through software or firmware, the relevant program codes are stored in the memory 702 and called by the processor 701 to execute the implementation of the present disclosure. The training method of the knowledge classification model of the example or the knowledge classification method of the multiple choice questions; wherein, the training method of the knowledge classification model includes: obtaining the original label data; wherein, the original label data includes question stem data, option data and answer data; The data is encoded to obtain the question stem representation vector; the option data and answer data are encoded according to the preset knowledge map to obtain the option attribute value and answer attribute value; the option attribute value and answer attribute value are word-segmented and spliced, Obtain the option answer representation vector; perform vector splicing of the question stem representation vector and the option answer representation vector to obtain the topic data; train the preset pre-training model according to the topic data to obtain the knowledge classification model; where the knowledge classification model is used for The target topic is processed by knowledge classification to obtain the type of knowledge points. The knowledge classification method for multiple-choice questions includes: obtaining multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data; encoding the question stem data to obtain question stem characterization vectors; The vector is input to the knowledge classification model; wherein, the knowledge classification model is obtained by training according to the above-mentioned knowledge classification model training method; the feature extraction is performed on the stem data through the knowledge classification model to obtain feature vector information; according to the The feature vector information is processed by knowledge classification to obtain the type of knowledge points.

The input/output interface 703 is used to realize information input and output;

The communication interface 704 is used to realize the communication interaction between the device and other devices, and the communication can be realized through a wired method (such as USB, network cable, etc.), or can be realized through a wireless method (such as a mobile network, WIFI, Bluetooth, etc.); and

A bus 705, which transmits information between various components of the device (such as a processor 701, a memory 702, an input/output interface 703, and a communication interface 704);

The processor 701 , the memory 702 , the input/output interface 703 and the communication interface 704 are connected to each other within the device through the bus 705 .

An embodiment of the present disclosure also provides a storage medium, which is a computer-readable storage medium, and the computer-readable storage medium may be non-volatile or volatile. The computer-readable storage medium stores computer-executable instructions, and the computer-executable instructions are used to make the computer execute the above-mentioned knowledge classification model training method or multiple-choice knowledge classification method; wherein, the knowledge classification model training method includes: obtaining the original Labeling data; wherein, the original labeling data includes question stem data, option data, and answer data; the question stem data is encoded to obtain the question stem representation vector; the option data and answer data are encoded according to the preset knowledge map to obtain option attribute value and answer attribute value; the option attribute value and answer attribute value are segmented and spliced to obtain the option answer representation vector; the question stem representation vector and the option answer representation vector are vector spliced to obtain the topic data; according to the topic data The preset pre-training model is trained to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on the target topic to obtain the type of knowledge points. The knowledge classification method for multiple-choice questions includes: obtaining multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data; encoding the question stem data to obtain question stem characterization vectors; The vector is input to the knowledge classification model; wherein, the knowledge classification model is obtained by training according to the above-mentioned knowledge classification model training method; the feature extraction is performed on the stem data through the knowledge classification model to obtain feature vector information; according to the The feature vector information is processed by knowledge classification to obtain the type of knowledge points.

The training method of the knowledge classification model, the knowledge classification method of the multiple choice questions, the training device of the knowledge classification model, the knowledge classification device of the multiple choice questions, the computer equipment, and the storage medium proposed by the embodiments of the present disclosure obtain the original labeling data, and the original labeling The question stem data in the data is encoded to obtain the question stem representation vector, and the option data and answer data in the original annotation data are encoded according to the preset knowledge map, so that the option attribute value and answer attribute value can be obtained, and then Segment and concatenate the option attribute value and the answer attribute value to obtain the option answer characterization vector, and then perform vector splicing on the question stem characterization vector and the option answer characterization vector, so that the topic data can be obtained, and finally according to the topic data. The preset pre-training model is trained to obtain a knowledge classification model. The knowledge classification model can be used to perform knowledge classification processing on the target topic to obtain the type of knowledge points. The knowledge classification model obtained in the embodiment of the present disclosure can improve the accuracy of knowledge classification. accuracy and efficiency.

As a non-transitory computer-readable storage medium, memory can be used to store non-transitory software programs and non-transitory computer-executable programs. In addition, the memory may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage devices. In some embodiments, the memory optionally includes memory located remotely from the processor, and these remote memories may be connected to the processor via a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.

The embodiments described in the embodiments of the present disclosure are to illustrate the technical solutions of the embodiments of the present disclosure more clearly, and do not constitute limitations on the technical solutions provided by the embodiments of the present disclosure. Those skilled in the art know that with the evolution of technology and new For the emergence of application scenarios, the technical solutions provided by the embodiments of the present disclosure are also applicable to similar technical problems.

Those skilled in the art can understand that the technical solutions shown in FIGS. 1-6 do not constitute a limitation to the embodiments of the present disclosure, and may include more or fewer steps than those shown in the illustrations, or combine certain steps, or be different. A step of.

The device embodiments described above are only illustrative, and the units described as separate components may or may not be physically separated, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment.

Those of ordinary skill in the art can understand that all or some of the steps in the methods disclosed above, the functional modules/units in the system, and the device can be implemented as software, firmware, hardware, and an appropriate combination thereof.

The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.

In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.

The preferred embodiments of the embodiments of the present disclosure have been described above with reference to the accompanying drawings, which do not limit the scope of rights of the embodiments of the present disclosure. Any modifications, equivalent replacements and improvements made by those skilled in the art without departing from the scope and essence of the embodiments of the present disclosure shall fall within the scope of rights of the embodiments of the present disclosure.

Claims

A training method for a knowledge classification model, comprising:

Obtaining original annotation data; wherein, the original annotation data includes question stem data, option data and answer data;

Encoding the question stem data to obtain a question stem representation vector;

Encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

Segmenting and concatenating the option attribute value and the answer attribute value to obtain an option answer representation vector;

performing vector splicing of the question stem characterization vector and the option answer characterization vector to obtain question data;

A preset pre-training model is trained according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on a target topic to obtain knowledge point types.
The method according to claim 1, wherein said subject data is encoded to obtain a subject characterization vector, comprising:

Preprocessing the stem data, converting the English content of the stem data to lowercase to obtain a preliminary stem sequence;

Word segmentation is performed on the preliminary question stem sequence to obtain a question stem representation vector.
The method according to claim 1, wherein, before the option data and answer data are encoded according to the preset knowledge map to obtain option attribute values and answer attribute values, the method further includes: constructing the knowledge Spectrum, including:

Obtain preset knowledge points;

Constructing a first triplet and a second triplet according to the preset knowledge points;

Construct the knowledge map according to the first triple and the second triple; wherein, the first triple includes a first knowledge entity, a relationship, a second knowledge entity, and the second triple A group includes a second knowledge entity, an attribute, and an attribute value.
The method according to claim 3, wherein the knowledge graph includes a first triplet and a plurality of second triplets, and the option data and answer data are encoded according to the preset knowledge graph to obtain option attribute values and answer attribute values, including:

The option data is encoded according to the first triplet and the plurality of second triplets to obtain the option attribute value; wherein the option attribute value includes the values of the plurality of second triplets attribute value;

The answer data is encoded according to the first triplet and one of the second triplets to obtain the answer attribute value; wherein the answer attribute value is a plurality of attributes in the option attribute value One of the attribute values in the value.
The method according to any one of claims 1 to 4, wherein said performing word segmentation and splicing processing on said option attribute value and said answer attribute value to obtain an option answer representation vector, comprising:

Carry out word vectorization with described option attribute value and described answer attribute value, obtain the item attribute value and answer attribute value of word vectorization;

Concatenate the item attribute value and answer attribute value of word vectorization to obtain the option answer representation vector.
The method according to any one of claims 1 to 4, wherein said vector splicing of said question stem representation vector and said option answer representation vector to obtain topic data includes:

The question stem characterization vector and the option answer characterization vector are vector-spliced through a separator to obtain topic data; wherein, the separator includes a first placeholder and a second placeholder, and the question stem is separated by a separator The characterization vector and the option answer characterization vector are vector spliced to obtain the topic data, which specifically includes:

The question stem characterization vector is set between the first placeholder and the second placeholder, the second placeholder is set between the question stem characterization vector and the option answer characterization vector, and the question stem characterization vector and the option answer characterization vector Perform vector splicing to obtain the subject data.
A knowledge classification method for multiple-choice questions, including:

Obtain multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data, option data and answer data;

The multiple-choice data is input into the knowledge classification model; wherein, the knowledge classification model is obtained by training according to the method described in any one of claims 1 to 6;

Using the knowledge classification model to extract the features of the multiple-choice data row to obtain feature vector information;

Knowledge classification processing is performed according to the feature vector information to obtain knowledge point types.
A training device for a knowledge classification model, comprising:

The original data acquisition module is used to obtain the original annotation data; the original annotation data includes question stem data, option data and answer data;

A question stem coding module, configured to encode the question stem data to obtain a question stem representation vector;

The option answer encoding module is used to encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

A word segmentation and splicing module, used to perform word segmentation and splicing processing on the option attribute value and the answer attribute value to obtain an option answer representation vector;

A vector splicing module, configured to splice the question stem representation vector and the option answer representation vector to obtain topic data;

The classification model training module is used to train the preset pre-training model according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on the target topic to obtain the type of knowledge points.
A computer device, comprising:

at least one memory;

at least one processor;

at least one program;

The program is stored in the memory, and the processor executes the at least one program to implement a knowledge classification model training method, wherein the knowledge classification model training method includes:

Obtaining original annotation data; wherein, the original annotation data includes question stem data, option data and answer data;

Encoding the question stem data to obtain a question stem representation vector;

Encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

Segmenting and concatenating the option attribute value and the answer attribute value to obtain an option answer representation vector;

performing vector splicing of the question stem characterization vector and the option answer characterization vector to obtain question data;

A preset pre-training model is trained according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on a target topic to obtain knowledge point types.
The computer device according to claim 9, wherein said performing encoding processing on said question stem data to obtain a question stem characterization vector, comprising:

Preprocessing the stem data, converting the English content of the stem data to lowercase to obtain a preliminary stem sequence;

Word segmentation is performed on the preliminary question stem sequence to obtain a question stem representation vector.
The computer device according to claim 9, wherein, before said option data and answer data are encoded according to the preset knowledge map to obtain option attribute values and answer attribute values, the training method of the knowledge classification model further Including: constructing the knowledge map, specifically including:

Obtain preset knowledge points;

Constructing a first triplet and a second triplet according to the preset knowledge points;

Construct the knowledge map according to the first triple and the second triple; wherein, the first triple includes a first knowledge entity, a relationship, a second knowledge entity, and the second triple A group includes a second knowledge entity, an attribute, and an attribute value.
The computer device according to claim 11, wherein the knowledge graph includes a first triplet and a plurality of second triplets, and the option data and answer data are encoded according to the preset knowledge graph to obtain option attributes Value and answer attribute values, including:

The option data is encoded according to the first triplet and the plurality of second triplets to obtain the option attribute value; wherein the option attribute value includes the values of the plurality of second triplets attribute value;

The answer data is encoded according to the first triplet and one of the second triplets to obtain the answer attribute value; wherein the answer attribute value is a plurality of attributes in the option attribute value One of the attribute values in the value.
The computer device according to any one of claims 10 to 12, wherein said performing word segmentation and splicing processing on said option attribute value and said answer attribute value to obtain an option answer characterization vector, comprising:

Carry out word vectorization with described option attribute value and described answer attribute value, obtain the item attribute value and answer attribute value of word vectorization;

Concatenate the item attribute value and answer attribute value of word vectorization to obtain the option answer representation vector.
The computer device according to any one of claims 10 to 12, wherein the vector splicing of the question stem characterization vector and the option answer characterization vector to obtain topic data includes:

The question stem characterization vector and the option answer characterization vector are vector-spliced through a separator to obtain topic data; wherein, the separator includes a first placeholder and a second placeholder, and the question stem is separated by a separator The characterization vector and the option answer characterization vector are vector spliced to obtain the topic data, which specifically includes:

The question stem characterization vector is set between the first placeholder and the second placeholder, the second placeholder is set between the question stem characterization vector and the option answer characterization vector, and the question stem characterization vector and the option answer characterization vector Perform vector splicing to obtain the subject data.
A computer device, comprising:

at least one memory;

at least one processor;

at least one program;

The program is stored in the memory, and the processor executes the at least one program to implement a knowledge classification method for multiple-choice questions, wherein the knowledge classification method for multiple-choice questions includes:

Obtain multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data, option data and answer data;

The multiple-choice question data is input into the knowledge classification model; wherein, the knowledge classification model is obtained by training according to a training method of the knowledge classification model;

Using the knowledge classification model to extract the features of the multiple-choice data row to obtain feature vector information;

performing knowledge classification processing according to the feature vector information to obtain knowledge point types;

Wherein, the training method of the knowledge classification model includes:

Obtaining original annotation data; wherein, the original annotation data includes question stem data, option data and answer data;

Encoding the question stem data to obtain a question stem representation vector;

Encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

Segmenting and concatenating the option attribute value and the answer attribute value to obtain an option answer representation vector;

performing vector splicing of the question stem characterization vector and the option answer characterization vector to obtain question data;

A preset pre-training model is trained according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on a target topic to obtain knowledge point types.
A storage medium, the storage medium is a computer-readable storage medium, wherein the computer-readable storage medium stores computer-executable instructions, and the computer-executable instructions are used to enable a computer to perform training of a knowledge classification model method, wherein the training method of the knowledge classification model includes:

Obtaining original annotation data; wherein, the original annotation data includes question stem data, option data and answer data;

Encoding the question stem data to obtain a question stem representation vector;

Encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

Segmenting and concatenating the option attribute value and the answer attribute value to obtain an option answer representation vector;

performing vector splicing of the question stem characterization vector and the option answer characterization vector to obtain question data;

A preset pre-training model is trained according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on a target topic to obtain knowledge point types.
The storage medium according to claim 16, wherein said performing encoding processing on said question stem data to obtain a question stem characterization vector comprises:

Preprocessing the stem data, converting the English content of the stem data to lowercase to obtain a preliminary stem sequence;

Word segmentation is performed on the preliminary question stem sequence to obtain a question stem representation vector.
The storage medium according to claim 16, wherein, before the option data and answer data are encoded according to the preset knowledge map to obtain option attribute values and answer attribute values, the training method of the knowledge classification model further Including: constructing the knowledge map, specifically including:

Obtain preset knowledge points;

Constructing a first triplet and a second triplet according to the preset knowledge points;

Construct the knowledge map according to the first triple and the second triple; wherein, the first triple includes a first knowledge entity, a relationship, a second knowledge entity, and the second triple A group includes a second knowledge entity, an attribute, and an attribute value.
The computer device according to any one of claims 16 to 18, wherein the vector splicing of the question stem characterization vector and the option answer characterization vector to obtain topic data includes:

The question stem characterization vector and the option answer characterization vector are vector-spliced through a separator to obtain topic data; wherein, the separator includes a first placeholder and a second placeholder, and the question stem is separated by a separator The characterization vector and the option answer characterization vector are vector spliced to obtain the topic data, which specifically includes:

The question stem characterization vector is set between the first placeholder and the second placeholder, the second placeholder is set between the question stem characterization vector and the option answer characterization vector, and the question stem characterization vector and the option answer characterization vector Perform vector splicing to obtain the subject data.
A storage medium, the storage medium is a computer-readable storage medium, wherein the computer-readable storage medium stores computer-executable instructions, and the computer-executable instructions are used to enable a computer to perform a knowledge classification of multiple-choice questions The method, wherein, the knowledge classification method of the multiple-choice questions includes:

Obtain multiple-choice question data to be classified; wherein, the multiple-choice question data includes question stem data, option data and answer data;

The multiple-choice question data is input into the knowledge classification model; wherein, the knowledge classification model is obtained by training according to a training method of the knowledge classification model;

Using the knowledge classification model to extract the features of the multiple-choice data row to obtain feature vector information;

performing knowledge classification processing according to the feature vector information to obtain knowledge point types;

Wherein, the training method of the knowledge classification model includes:

Obtaining original annotation data; wherein, the original annotation data includes question stem data, option data and answer data;

Encoding the question stem data to obtain a question stem representation vector;

Encode the option data and answer data according to the preset knowledge graph to obtain option attribute values and answer attribute values;

Segmenting and concatenating the option attribute value and the answer attribute value to obtain an option answer representation vector;

performing vector splicing of the question stem characterization vector and the option answer characterization vector to obtain question data;

A preset pre-training model is trained according to the topic data to obtain a knowledge classification model; wherein, the knowledge classification model is used to perform knowledge classification processing on a target topic to obtain knowledge point types.