CN117112760A - Intelligent education big model based on knowledge base - Google Patents
Intelligent education big model based on knowledge base Download PDFInfo
- Publication number
- CN117112760A CN117112760A CN202311082062.7A CN202311082062A CN117112760A CN 117112760 A CN117112760 A CN 117112760A CN 202311082062 A CN202311082062 A CN 202311082062A CN 117112760 A CN117112760 A CN 117112760A
- Authority
- CN
- China
- Prior art keywords
- module
- text
- model
- voice
- intelligent
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000007781 pre-processing Methods 0.000 claims abstract description 31
- 238000000034 method Methods 0.000 claims abstract description 15
- 238000011160 research Methods 0.000 claims abstract description 14
- 238000012549 training Methods 0.000 claims abstract description 14
- 230000008569 process Effects 0.000 claims abstract description 13
- 238000013461 design Methods 0.000 claims abstract description 10
- 238000012545 processing Methods 0.000 claims abstract description 7
- 230000004044 response Effects 0.000 claims description 12
- 230000002708 enhancing effect Effects 0.000 claims description 7
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 238000012937 correction Methods 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 claims description 6
- 230000011218 segmentation Effects 0.000 claims description 6
- 238000000638 solvent extraction Methods 0.000 claims description 6
- 238000013518 transcription Methods 0.000 claims description 6
- 230000035897 transcription Effects 0.000 claims description 6
- 238000013135 deep learning Methods 0.000 claims description 5
- 238000010801 machine learning Methods 0.000 claims description 5
- 238000013480 data collection Methods 0.000 claims description 3
- 238000002372 labelling Methods 0.000 claims description 3
- 239000000463 material Substances 0.000 claims description 3
- 230000009467 reduction Effects 0.000 claims description 3
- 230000009471 action Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000013508 migration Methods 0.000 description 2
- 230000005012 migration Effects 0.000 description 2
- 238000012805 post-processing Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013075 data extraction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B25/00—Models for purposes not provided for in G09B23/00, e.g. full-sized devices for demonstration purposes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3347—Query execution using vector based model
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/268—Morphological analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
- G06F40/295—Named entity recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/04—Inference or reasoning models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Human Computer Interaction (AREA)
- General Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Databases & Information Systems (AREA)
- Acoustics & Sound (AREA)
- Evolutionary Computation (AREA)
- Computing Systems (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Business, Economics & Management (AREA)
- Educational Technology (AREA)
- Educational Administration (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention belongs to the technical field of intelligent education, in particular to an intelligent education big model based on a knowledge base, which comprises a big model training and fine tuning module, a voice recognition module, a text preprocessing module and an intelligent dialogue module, wherein the voice recognition module and the text preprocessing module cooperatively process user input, and the intelligent dialogue module takes the big model as a base, cooperates with the voice recognition and text preprocessing module, responds to a user request to conduct intelligent reasoning and replies based on prior information provided by a self-research knowledge base. Specifically, the voice recognition module is used for processing voice input of a user in a complex noise environment, and then the voice input is converted into text information by the text preprocessing module and is input into the subsequent intelligent dialogue module. According to the invention, the safety and controllability of the model are enhanced, the safety shielding design is carried out in the process of training corpus and outputting the model, the safety and privacy of educational users can be better protected, and the model is prevented from misleading students or generating harmful information.
Description
Technical Field
The invention belongs to the technical field of intelligent education, and particularly relates to an intelligent education big model based on a knowledge base.
Background
In the development of artificial intelligence, the creation and application of large-scale pre-trained models has achieved significant achievements, especially in terms of data extraction and decision support, however, despite the tremendous advances these models have achieved, significant problems and limitations remain in facing safety controllability and its application in the vertical industry (e.g., education).
Firstly, from the viewpoint of security and controllability, the development of large-scale models originates from abroad, and security risks may exist when the large-scale models are applied in domestic commercialization, particularly in scenes involving confidential information, moreover, training corpus of large models often lacks continuous learning adaptation to Chinese and domestic semantic environments, so that large models based on Chinese are relatively few, and are mainly applied to dialogue questions and answers at present in vertical application scenes such as intelligent education industry, but are still lacking in specific application scene research.
Second, existing pre-training model training data are mostly derived from the internet, and these data knowledge is not equivalent to the expertise of the education field, so when the existing large model faces the migration of the expertise of the education field such as physics, chemistry, english, etc., difficulties may occur, which may lead to misleading information being obtained by users in the education industry, for which an effective solution strategy or solution does not exist.
Accordingly, there is a need for a knowledge base-based intelligent educational large model that addresses the aforementioned problems.
Disclosure of Invention
Aiming at the situation, in order to overcome the defects of the prior art, the invention provides an intelligent education big model based on a knowledge base, which effectively solves the problems in the background art.
In order to achieve the above purpose, the present invention provides the following technical solutions: the intelligent education large model based on the knowledge base comprises a large model training and fine tuning module, a voice recognition module, a text preprocessing module and an intelligent dialogue module, wherein the voice recognition module and the text preprocessing module cooperatively process user input, and the intelligent dialogue module takes the large model as a base and responds to a user request by the voice recognition and text preprocessing module cooperatively to conduct intelligent reasoning and replying. Specifically, the voice recognition module is used for processing voice input of a user in a complex noise environment, then the voice input is converted into text information by the text preprocessing module and is input into the subsequent intelligent dialogue module, and the text information is submitted to the large model for reasoning prediction and replying. Meanwhile, the safety and the controllability of the model are enhanced, the safety shielding design is carried out on the training corpus and the output post-processing of the model, the safety and the privacy of educational users can be better protected, and the model is prevented from misleading students or generating harmful information.
And a voice recognition module: the module is mainly used for processing the voice input of a user in a complex noisy environment, and is responsible for a series of audio preprocessing works such as voice denoising, and the semi-transcribed voice signal is sent to a subsequent text preprocessing module for further voice text conversion flow. The design and the workflow of the voice recognition module are as follows:
s1, fine tuning a voice recognition module by using an audio data set containing various background sounds/noises, so that the robustness of the voice recognition module is improved;
s2, collecting pronunciation of a user for a specified text, determining pronunciation accuracy and pronunciation preference of the user, and further fine-tuning a voice recognition module to improve recognition accuracy;
s3, in the use process of the system, preprocessing steps such as noise reduction and correction are carried out on the audio input by the user, and the audio is output to a subsequent module;
text preprocessing module: the module is used for receiving the data from the voice recognition module and completing the tasks of voice transcription, word segmentation, text vectorization and the like. The design and workflow of the text preprocessing module are as follows:
s1, performing transcription work of converting voice into text for output from a voice recognition module; aiming at the converted text, word segmentation is carried out, and the converted text is segmented into a sequence of words or subwords;
s2, analyzing whether words in sentences contain non-learning related words and illegal contents, if so, sending corresponding prompt information to remind a user, and requiring the user to input or reject a response again;
s3, labeling the parts of speech of each vocabulary;
s4, identifying and marking named entities in the text;
s5, dividing the text into independent sentences to be used as input of the intelligent dialogue module;
intelligent dialogue module of collaborative self-research knowledge base: the module aims at further fine tuning by taking a domestic large model as a base, and is cooperated with a voice recognition and text preprocessing module to respond to a user request based on prior information provided by a self-research knowledge base to perform intelligent reasoning and reply. The workflow of the intelligent dialogue module is as follows:
s1, performing fine adjustment on high-quality education data by using a domestic Chinese base model based on a machine learning and deep learning algorithm, so that the system has the capabilities of grammar error recognition, confusion word correction and the like, is more suitable for education scenes, and obtains a special large model facing education;
s2, a user sets preference scenes of voice exercise, such as business dialogue, english speech, daily dialogue and the like, so that a using field Jing Weidiao of the intelligent dialogue module is realized;
s3, updating and maintaining a self-research knowledge base, and timely updating and expanding knowledge reserves of the model;
s4, generating a response text required by a user based on the background knowledge of the trimmed language big model, the scene self-adaptive prompt words and the knowledge text with high timeliness;
s5, generating voice corresponding to the response text by using the voice conversion sub-module in the intelligent dialogue module, and feeding back the voice to the user.
Further, the machine learning and deep learning algorithm is used for fine tuning S1 on high-quality educational data by using a domestic Chinese base model, and the working flow is as follows:
s1-1, constructing a tens of millions of scale fine tuning data set from Internet data by using a crawler and other technologies, performing supervised training on a base model, and enhancing the response capability and reasoning performance of the base model to education-related knowledge;
s1-2, aiming at the education industry, pertinently collecting high-quality data with strong correlation to carry out further parameter fine adjustment, and enhancing knowledge understanding of the model in the field of refinement profession;
s1-3, based on feedback of the professional education practitioner on the output content of the model, the model learns the language style of the professional and the analysis process of the specific problem, and the output professionality of the model is enhanced.
Further, the knowledge base is updated and maintained, and the knowledge reserve S3 of the model is updated and expanded in time, and the workflow is as follows:
s3-1, capturing and updating relevant document knowledge of each subject in a database, such as teaching materials, teaching plans, text information in video media and the like, from the Internet in real time based on a data collection sub-module to form a massive and timely database;
s3-2, classifying and partitioning according to knowledge features based on a data partitioning sub-module to construct a vector database for model analysis and use;
s3-3, based on the user query vectorization module, projecting the structured text information to an embedded space, extracting the feature vector of the text, and acquiring and returning the most relevant feature vector from a database by combining a vector retrieval technology, thereby realizing the rapid query of knowledge.
Compared with the prior art, the invention has the beneficial effects that:
1. according to the invention, through the special large model for self-research education, the real data of tens of millions of scales in the industry are collected, and the efficient intelligent technical deployment and the floor application of the education industry are completed.
2. The invention has the application expansion capability, and can realize the migration of the technical application from English to more education subjects such as Chinese, mathematics, physics, chemistry and the like by enhancing the proprietary training data of the model, thereby realizing wider social benefits.
3. According to the invention, the safety and controllability of the model are enhanced, the safety shielding design is carried out in the process of training corpus and outputting the model, the safety and privacy of educational users can be better protected, and the model is prevented from misleading students or generating harmful information.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate the invention and together with the embodiments of the invention, serve to explain the invention. In the drawings:
FIG. 1 is a schematic block diagram of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all embodiments of the invention; all other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The first embodiment of the invention is given by fig. 1, and discloses an intelligent education big model based on a knowledge base, which comprises a big model training and fine tuning module, a voice recognition module, a text preprocessing module and an intelligent dialogue module, wherein the voice recognition module and the text preprocessing module cooperatively process user input, and the intelligent dialogue module takes the big model as a base and responds to a user request by the voice recognition and text preprocessing module cooperatively to conduct intelligent reasoning and reply. Specifically, the voice recognition module is used for processing voice input of a user in a complex noise environment, then the voice input is converted into text information by the text preprocessing module and is input into the subsequent intelligent dialogue module, and the text information is submitted to the large model for reasoning prediction and replying. Meanwhile, the safety and the controllability of the model are enhanced, the safety shielding design is carried out on the training corpus and the output post-processing of the model, the safety and the privacy of educational users can be better protected, and the model is prevented from misleading students or generating harmful information.
And a voice recognition module: the module is mainly used for processing the voice input of a user in a complex noisy environment, and is responsible for a series of audio preprocessing works such as voice denoising, and the semi-transcribed voice signal is sent to a subsequent text preprocessing module for further voice text conversion flow. The design and the workflow of the voice recognition module are as follows:
s1, fine tuning a voice recognition module by using an audio data set containing various background sounds/noises, so that the robustness of the voice recognition module is improved;
s2, collecting pronunciation of a user for a specified text, determining pronunciation accuracy and pronunciation preference of the user, and further fine-tuning a voice recognition module to improve recognition accuracy;
s3, in the use process of the system, preprocessing steps such as noise reduction and correction are carried out on the audio input by the user, and the audio is output to a subsequent module;
text preprocessing module: the module is used for receiving the data from the voice recognition module and completing the tasks of voice transcription, word segmentation, text vectorization and the like. The design and workflow of the text preprocessing module are as follows:
s1, performing transcription work of converting voice into text for output from a voice recognition module; aiming at the converted text, word segmentation is carried out, and the converted text is segmented into a sequence of words or subwords;
s2, analyzing whether words in sentences contain non-learning related words and illegal contents, if so, sending corresponding prompt information to remind a user, and requiring the user to input or reject a response again;
s3, labeling the parts of speech of each vocabulary;
s4, identifying and marking named entities in the text;
s5, dividing the text into independent sentences to be used as input of the intelligent dialogue module;
intelligent dialogue module of collaborative self-research knowledge base: the module aims at further fine tuning by taking a domestic large model as a base, and is cooperated with a voice recognition and text preprocessing module to respond to a user request based on prior information provided by a self-research knowledge base to perform intelligent reasoning and reply. The workflow of the intelligent dialogue module is as follows:
s1, performing fine adjustment on high-quality education data by using a domestic Chinese base model based on a machine learning and deep learning algorithm, so that the system has the capabilities of grammar error recognition, confusion word correction and the like, is more suitable for education scenes, and obtains a special large model facing education;
s2, a user sets preference scenes of voice exercise, such as business dialogue, english speech, daily dialogue and the like, so that a using field Jing Weidiao of the intelligent dialogue module is realized;
s3, updating and maintaining a self-research knowledge base, and timely updating and expanding knowledge reserves of the model;
s4, generating a response text required by a user based on the background knowledge of the trimmed language big model, the scene self-adaptive prompt words and the knowledge text with high timeliness;
s5, generating voice corresponding to the response text by using the voice conversion sub-module in the intelligent dialogue module, and feeding back the voice to the user.
Further, the machine learning and deep learning algorithm is used for fine tuning S1 on high-quality educational data by using a domestic Chinese base model, and the working flow is as follows:
s1-1, constructing a tens of millions of scale fine tuning data set from Internet data by using a crawler and other technologies, performing supervised training on a base model, and enhancing the response capability and reasoning performance of the base model to education-related knowledge;
s1-2, aiming at the education industry, pertinently collecting high-quality data with strong correlation to carry out further parameter fine adjustment, and enhancing knowledge understanding of the model in the field of refinement profession;
s1-3, based on feedback of the professional education practitioner on the output content of the model, the model learns the language style of the professional and the analysis process of the specific problem, and the output professionality of the model is enhanced.
Further, the knowledge base is updated and maintained, and the knowledge reserve S3 of the model is updated and expanded in time, and the workflow is as follows:
s3-1, capturing and updating relevant document knowledge of each subject in a database, such as teaching materials, teaching plans, text information in video media and the like, from the Internet in real time based on a data collection sub-module to form a massive and timely database;
s3-2, classifying and partitioning according to knowledge features based on a data partitioning sub-module to construct a vector database for model analysis and use;
s3-3, based on the user query vectorization module, projecting the structured text information to an embedded space, extracting the feature vector of the text, and acquiring and returning the most relevant feature vector from a database by combining a vector retrieval technology, thereby realizing the rapid query of knowledge.
It is noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Although embodiments of the present invention have been shown and described, it will be understood by those skilled in the art that various changes, modifications, substitutions and alterations can be made therein without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.
Claims (3)
1. The utility model provides an intelligent education big model based on knowledge base, includes speech recognition module, text processing module and intelligent dialogue module, its characterized in that: the intelligent dialogue module takes a large model as a base, and the intelligent dialogue module is cooperated with the voice recognition and text preprocessing module to perform intelligent reasoning and replying based on prior information provided by a self-research knowledge base;
and a voice recognition module: the module is mainly used for processing voice input of a user in a complex noisy environment, and is responsible for a series of audio preprocessing works such as voice denoising, and a semi-transcribed voice signal of the module is sent to a subsequent text preprocessing module for further voice text conversion flow; the design and the workflow of the voice recognition module are as follows:
s1, fine tuning a voice recognition module by using an audio data set containing various background sounds/noises, so that the robustness of the voice recognition module is improved;
s2, collecting pronunciation of a user for a specified text, determining pronunciation accuracy and pronunciation preference of the user, and further fine-tuning a voice recognition module to improve recognition accuracy;
s3, in the use process of the system, preprocessing steps such as noise reduction and correction are carried out on the audio input by the user, and the audio is output to a subsequent module;
text preprocessing module: the module is used for receiving the data from the voice recognition module and completing the tasks of voice transcription, word segmentation, text vectorization and the like; the design and workflow of the text preprocessing module are as follows:
s1, performing transcription work of converting voice into text for output from a voice recognition module; aiming at the converted text, word segmentation is carried out, and the converted text is segmented into a sequence of words or subwords;
s2, analyzing whether words in sentences contain non-learning related words and illegal contents, if so, sending corresponding prompt information to remind a user, and requiring the user to input or reject a response again;
s3, labeling the parts of speech of each vocabulary;
s4, identifying and marking named entities in the text;
s5, dividing the text into independent sentences to be used as input of the intelligent dialogue module;
intelligent dialogue module of collaborative self-research knowledge base: the module aims at further fine tuning by taking a domestic large model as a base, and is cooperated with a voice recognition and text preprocessing module to respond to a user request based on prior information provided by a self-research knowledge base to perform intelligent reasoning and reply; the workflow of the intelligent dialogue module is as follows:
s1, performing fine adjustment on high-quality education data by using a domestic Chinese base model based on a machine learning and deep learning algorithm, so that the system has the capabilities of grammar error recognition, confusion word correction and the like, is more suitable for education scenes, and obtains a special large model facing education;
s2, a user sets preference scenes of voice exercise, such as business dialogue, english speech, daily dialogue and the like, so that a using field Jing Weidiao of the intelligent dialogue module is realized;
s3, updating and maintaining a self-research knowledge base, and timely updating and expanding knowledge reserves of the model;
s4, generating a response text required by a user based on the background knowledge of the trimmed language big model, the scene self-adaptive prompt words and the knowledge text with high timeliness;
s5, generating voice corresponding to the response text by using the voice conversion sub-module in the intelligent dialogue module, and feeding back the voice to the user.
2. The knowledge base based intelligent educational large model according to claim 1, wherein: fine tuning S1 on high quality educational data using the domestic chinese base model, the workflow is as follows:
s1-1, constructing a tens of millions of scale fine tuning data set from Internet data by using a crawler and other technologies, performing supervised training on a base model, and enhancing the response capability and reasoning performance of the base model to education-related knowledge;
s1-2, aiming at the education industry, pertinently collecting high-quality data with strong correlation to carry out further parameter fine adjustment, and enhancing knowledge understanding of the model in the field of refinement profession;
s1-3, based on feedback of the professional education practitioner on the output content of the model, the model learns the language style of the professional and the analysis process of the specific problem, and the output professionality of the model is enhanced.
3. The knowledge base based intelligent educational large model according to claim 1, wherein: updating and maintaining a self-research knowledge base, and timely updating and expanding knowledge reserves S3 of a model, wherein the working flow is as follows:
s3-1, capturing and updating relevant document knowledge of each subject in a database, such as teaching materials, teaching plans, text information in video media and the like, from the Internet in real time based on a data collection sub-module to form a massive and timely database;
s3-2, classifying and partitioning according to knowledge features based on a data partitioning sub-module to construct a vector database for model analysis and use;
s3-3, based on the user query vectorization module, projecting the structured text information to an embedded space, extracting the feature vector of the text, and acquiring and returning the most relevant feature vector from a database by combining a vector retrieval technology, thereby realizing the rapid query of knowledge.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311082062.7A CN117112760A (en) | 2023-08-26 | 2023-08-26 | Intelligent education big model based on knowledge base |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311082062.7A CN117112760A (en) | 2023-08-26 | 2023-08-26 | Intelligent education big model based on knowledge base |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117112760A true CN117112760A (en) | 2023-11-24 |
Family
ID=88808753
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311082062.7A Pending CN117112760A (en) | 2023-08-26 | 2023-08-26 | Intelligent education big model based on knowledge base |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117112760A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117875304A (en) * | 2024-01-11 | 2024-04-12 | 西安西维迈创科技有限公司 | Corpus construction method, system and storage medium for subway field |
CN117994101A (en) * | 2024-04-03 | 2024-05-07 | 北京师范大学珠海校区 | Teaching design generation method and device based on large language model |
-
2023
- 2023-08-26 CN CN202311082062.7A patent/CN117112760A/en active Pending
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117875304A (en) * | 2024-01-11 | 2024-04-12 | 西安西维迈创科技有限公司 | Corpus construction method, system and storage medium for subway field |
CN117994101A (en) * | 2024-04-03 | 2024-05-07 | 北京师范大学珠海校区 | Teaching design generation method and device based on large language model |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104050160B (en) | Interpreter's method and apparatus that a kind of machine is blended with human translation | |
CN117112760A (en) | Intelligent education big model based on knowledge base | |
CN110910283A (en) | Method, device, equipment and storage medium for generating legal document | |
CN117076693A (en) | Method for constructing digital human teacher multi-mode large language model pre-training discipline corpus | |
CN108710653B (en) | On-demand method, device and system for reading book | |
CN112287090A (en) | Financial question asking back method and system based on knowledge graph | |
US11727915B1 (en) | Method and terminal for generating simulated voice of virtual teacher | |
CN112349294B (en) | Voice processing method and device, computer readable medium and electronic equipment | |
CN117932018A (en) | Large-model-based intelligent document question-answering method | |
CN117851871A (en) | Multi-mode data identification method for overseas Internet social network site | |
CN116542817A (en) | Intelligent digital lawyer consultation method and system | |
CN116561274A (en) | Knowledge question-answering method based on digital human technology and natural language big model | |
CN113393841B (en) | Training method, device, equipment and storage medium of voice recognition model | |
CN114925206A (en) | Artificial intelligence body, voice information recognition method, storage medium and program product | |
CN117592564A (en) | Question-answer interaction method, device, equipment and medium | |
CN117198267A (en) | Local dialect voice intelligent recognition and question-answering method, system, equipment and medium | |
CN116306653A (en) | Regularized domain knowledge-aided named entity recognition method | |
CN112233678B (en) | Course making method, display method, device and storage medium based on marking script | |
CN110858268B (en) | Method and system for detecting unsmooth phenomenon in voice translation system | |
CN112015920A (en) | Intelligent auxiliary learning system based on knowledge graph and edge calculation | |
Macintyre et al. | The 2021 International Conference on Machine Learning and Big Data Analytics for IoT Security and Privacy: SPIoT-2021 Volume 1 | |
Yang et al. | An automatic analysis and evaluation system used for teaching quality in mooc environment | |
CN117094335B (en) | External cross utterance emotion analysis and intelligent translation system and method under political equivalent framework | |
CN117313748B (en) | Multi-feature fusion semantic understanding method and device for government affair question and answer | |
CN118155623B (en) | Speech recognition method based on artificial intelligence |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |