WO2024016695A1

WO2024016695A1 - Multiview learning-based teaching knowledge graph construction and retrieval method and system

Info

Publication number: WO2024016695A1
Application number: PCT/CN2023/082103
Authority: WO
Inventors: 孙善宝
Original assignee: 山东浪潮科学研究院有限公司
Priority date: 2022-07-22
Filing date: 2023-03-17
Publication date: 2024-01-25
Also published as: CN115292513A

Abstract

A multiview learning-based teaching knowledge graph construction and retrieval method and system. The method comprises the following steps: constructing a graph construction and retrieval model on the basis of a multiview feature extractor, a teaching resource profile model, a knowledge graph construction model, a user profile model and a retrieval recommendation model, and performing model training on the multiview feature extractor, the teaching resource profile model, the knowledge graph construction model, the user profile model and the retrieval recommendation model in sequence; and constructing a teaching knowledge point knowledge graph by means of the trained graph construction and retrieval model, constructing an index on the basis of the teaching knowledge point knowledge graph, and providing a retrieval recommendation service.

Description

Teaching knowledge graph construction and retrieval method and system based on multi-view learning

Technical field

The present invention relates to the technical field of teaching resource recommendation, specifically to a teaching knowledge graph construction and retrieval method and system based on multi-view learning.

Background technique

Knowledge Graph is a large-scale semantic network based on data. As a knowledge representation form, it describes domain entities, concepts and various semantic relationships between them. Google proposed the "Google Knowledge Graph" in 2012, and the knowledge graph began to attract widespread attention from academia and industry. After continuous development in recent years, it has been used in many fields such as search optimization, e-commerce, intelligent recommendations, and social media. It has been applied in practice and has gradually become a necessary way to manage massive information.

In recent years, artificial intelligence technology has developed rapidly, and its commercialization speed has exceeded expectations. Artificial intelligence will bring disruptive changes to the entire society and has become an important development strategy for various countries in the future. In particular, the evolution of algorithms with deep learning as the core has strong evolutionary capabilities. With the support of big data, large-scale neural networks similar to the human brain structure can be constructed through training, which can already solve various problems.

With the rapid development of Internet technology, the traditional education industry has also ushered in the new model of "Internet +". Massive online teaching resources have changed the traditional teaching methods, from the Internet to the mobile Internet, creating a life and work that spans time and space. and learning styles, the way knowledge is acquired and explored has undergone fundamental changes. A large number of teaching resources are characterized by diversity, including textbooks, lesson plans, teaching videos, teaching voices, speeches and other forms of presentation of knowledge points. At the same time, the relationships between knowledge points are also more complex, and for different learners , also has its own needs for personalized learning. In this case, how to use deep learning technology, combined with multi-view learning and user portrait technology, to effectively utilize massive teaching resources, automatically build a more accurate and reasonable teaching knowledge map, and achieve personalized knowledge point retrieval and recommendation has become an urgent solution technical issues.

Contents of the invention

The technical task of the present invention is to address the above shortcomings and provide a teaching knowledge graph construction and retrieval method and system based on multi-view learning to solve how to use deep learning technology, combined with multi-view learning and user portrait technology, to effectively utilize massive teaching resources and automatically Technical issues to construct a more accurate and reasonable teaching knowledge graph and realize personalized knowledge point retrieval and recommendation.

In the first aspect, a teaching knowledge graph construction and retrieval method based on multi-view learning of the present invention method, including the following steps:

Construct a multi-view feature extractor, which is used to perform feature extraction and feature fusion on teaching resource data in multiple view modes to obtain the knowledge point ontology structure and associated knowledge points;

A teaching resource portrait model is constructed based on a convolutional neural network. The teaching resource portrait model takes teaching resource data and teaching resource Internet-related extended data as inputs to profile teaching resources and output teaching resource attributes. The teaching resource attributes include teaching resources. Basic attributes and extended attributes of teaching resources;

Build a knowledge graph construction model based on a neural network. The knowledge graph construction model uses the ontology structure of knowledge points, associated knowledge points and teaching resource attributes as input to generate a knowledge graph of teaching knowledge points;

Construct a user portrait model, which takes user information as input, profiles the user, and outputs user attributes;

Construct a retrieval recommendation model. The retrieval recommendation model is used to build an index based on the knowledge graph of teaching knowledge points and provide retrieval recommendation services. It is used to output multiple sets of knowledge points and recommended resources for users through the input retrieval content and combined with user attributes. choose;

Based on the multi-view feature extractor, teaching resource portrait model, knowledge graph construction model, user portrait model and retrieval recommendation model, a graph construction and retrieval model is constructed, and the multi-view feature extractor, teaching resource portrait model, knowledge graph are constructed in sequence. Carry out model training on the graph construction model, user portrait model and retrieval recommendation model, and obtain the trained graph construction and retrieval model;

A knowledge graph of teaching knowledge points is constructed through the trained graph construction and retrieval model, an index is constructed based on the knowledge graph of teaching knowledge points and retrieval recommendation services are provided, and multiple sets of knowledge points and recommended resources are output to users for selection.

Preferably, the data sources of the teaching resources include teaching materials and books, lesson plans, teaching videos, teaching voices and speech scripts. There are four viewing modes of the teaching resources, namely video, audio, image and text;

The basic attributes of the teaching resources include knowledge points, content structure, processes, principles, concepts, and tools;

The extended attributes of the teaching resources include Internet evaluation information and presentation forms;

The view form of the retrieval content includes text, voice, video and image;

The knowledge map of teaching knowledge points includes semantic network, basic data of teaching resources and teaching resources. Extended attributes, the semantic network is formed based on the ontology structure of knowledge points and the association of knowledge points;

The user information includes basic information and learning status;

The user attributes include knowledge point preferences, mastery level and comprehensive ability.

Preferably, the multi-view feature extractor includes:

A video feature extraction model. The video feature extraction model is a network model built based on a three-dimensional CNN convolutional neural network and is used to extract semantic features of knowledge points from video teaching resource data;

Audio feature extraction model. The audio feature extraction model is a network model built based on a CNN convolutional neural network and is used to extract semantic features of knowledge points from audio-based teaching resource data;

Image feature extraction model. The image feature extraction model is a network model built based on a convolutional neural network and is used to extract semantic features of knowledge points from image-based teaching resource data;

Text feature extraction model. The text feature extraction model is a language model based on BERT, which is used to extract semantic features of knowledge points from text-based teaching resource data;

Feature fusion model, the feature fusion model is used to fuse the semantic features of knowledge points output by the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model to obtain the knowledge point ontology structure and associated knowledge point;

The knowledge graph construction model includes:

Feature encoder, which is used to encode knowledge points so that the vector calculation distance between similar knowledge points is small and used to provide resource index queries;

Generate a network model, which is used to generate a knowledge graph of teaching knowledge points.

Preferably, model training is performed on the multi-view feature extractor, teaching resource portrait model, knowledge graph construction model, user portrait model and retrieval recommendation model in sequence, including the following steps:

Obtain teaching resource data from multiple data sources and annotate the data based on its materials;

Based on the teaching resource data, perform model pre-training on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor;

The model parameters of the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor are fixed, and the feature fusion model is type for model training;

The video feature extraction model, audio feature extraction model, image feature extraction model, text feature extraction model and feature fusion model in the multi-view feature extractor are combined, and the entire multi-view feature extractor is processed through the teaching resource data. Carry out model training and fine-tune parameters of each model in the multi-view feature extractor;

Collect Internet-related extended data on teaching resources and label them;

Carry out model training on the teaching resource portrait model based on the teaching resource data, teaching resource Internet-related extended data and tags;

Based on the knowledge point ontology structure and associated knowledge points output by the multi-view feature extractor, as well as the teaching resource basic attributes and teaching resource extended attributes output by the teaching resource portrait model, perform model training on the knowledge graph construction model;

Collect user information and label it;

Based on user information and tags, perform model training on the user portrait model;

The retrieval recommendation model is lightweight and tailored based on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor, and the search condition extraction model in the retrieval recommendation model is trained. ;

Based on the user attributes output by the user portrait model and the teaching knowledge point knowledge graph output by the knowledge graph construction model, model training is performed on the retrieval recommendation model.

Preferably, building a knowledge graph of teaching knowledge points through the trained graph construction and retrieval model includes the following steps:

Select a field and collect teaching resource data in the selected field;

Use the trained multi-view feature extractor to perform feature extraction and feature fusion on the teaching resources to obtain the knowledge point ontology structure and associated knowledge points;

Collect the Internet-related extended data of teaching resources, and based on the Internet-related extended data of teaching resources and the teaching resource data, profile the teaching resources through the trained teaching resource portrait model to obtain the teaching resource attributes;

Based on the knowledge point ontology structure, associated knowledge points, and teaching resource attributes, the knowledge points are encoded through the trained knowledge graph construction model, so that the vector calculation distance between similar knowledge point resources is small, which can be used for resource index query and generate teaching Knowledge point knowledge map;

Build an index based on the knowledge graph of teaching knowledge points and provide retrieval and recommendation services, and output multiple sets of knowledge points and recommended resources for users to choose, including the following steps:

Enter the search content, and the view mode of the search content includes text, voice, image and video;

Obtain the user's user information, profile the user through the trained user portrait model, and generate user attributes;

Based on the knowledge graph of teaching knowledge points, build an index through the trained retrieval recommendation model and provide retrieval recommendation services;

Based on indexing and retrieval recommendation services, knowledge points are extracted from the retrieval content input by users, combined with user attributes to form knowledge point feature vectors, and knowledge point queries and feature vector calculations are performed to output multiple sets of knowledge points and recommended resources for users make a choice;

The collected teaching resource data, Internet-related extended data, user information and search content, as well as the output sets of knowledge points and recommended resources, are fed back to the map construction and retrieval model, and model training is performed on the map construction and retrieval model. , to continuously optimize the map construction and retrieval model.

In the second aspect, a teaching knowledge graph construction and retrieval system based on multi-view learning of the present invention is used to construct and retrieve a teaching knowledge graph based on multi-view learning as described in any one of the first aspects. Users provide knowledge points and teaching resource recommendation services. The system includes:

A model building module, which is used to build a graph construction and retrieval model. The graph construction and retrieval model includes a multi-view feature extractor, a teaching resource portrait model, a knowledge graph construction model, a user portrait model and a retrieval recommendation model, The multi-feature view extractor is used to perform feature extraction and feature fusion on teaching resource data in multiple view modes to obtain the knowledge point ontology structure and associated knowledge points; the teaching resource portrait model is a network model based on a convolutional neural network. , using teaching resource data and teaching resource Internet-related extended data as input, the teaching resources are profiled, and the teaching resource attributes are output. The teaching resource attributes include basic teaching resource attributes and teaching resource extended attributes; the knowledge graph construction model is based on The ontology structure of knowledge points, associated knowledge points and teaching resource attributes are used as input to generate a knowledge graph of teaching knowledge points; the user portrait model uses user information as input to profile the user and output user attributes; the retrieval recommendation model is used based on The knowledge graph of teaching knowledge points constructs an index and provides retrieval recommendation services, which is used to output multiple sets of knowledge points and recommended resources for users to choose based on the input retrieval content and combined with user attributes;

a model training module, which is used to extract the multi-view features in sequence model, teaching resource portrait model, knowledge graph construction model, user portrait model and retrieval recommendation model for model training, and the trained graph construction and retrieval model is obtained;

A retrieval recommendation module. The retrieval recommendation module is used to construct a knowledge graph of teaching knowledge points through the trained graph construction and retrieval model, build an index based on the knowledge graph of teaching knowledge points and provide retrieval recommendation services, and output multiple sets of knowledge points for users. and recommend resources for users to choose from.

The view form of the retrieval content includes text, voice, video and image;

The knowledge map of teaching knowledge points includes a semantic network, basic data of teaching resources and extended attributes of teaching resources. The semantic network is formed based on the ontology structure of knowledge points and the association of knowledge points;

The user information includes basic information and learning status;

Preferably, the multi-view feature extractor includes:

The knowledge graph construction model includes:

Preferably, the model sequence module is used to perform model training on the multi-view feature extractor, teaching resource portrait model, knowledge graph construction model, user portrait model and retrieval recommendation model in sequence through the following steps:

Fix the model parameters of the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor, and perform model training on the feature fusion model;

Collect Internet-related extended data on teaching resources and label them;

Collect user information and label it;

Preferably, the retrieval and recommendation module is used to construct a knowledge graph of teaching knowledge points through the following steps, and output multiple sets of knowledge points and recommended resources for the user to select:

Select a field and collect teaching resource data in the selected field;

The teaching knowledge graph construction and retrieval method and system based on multi-view learning of the present invention have the following advantages:

1. Based on massive teaching resource data, effectively utilize deep learning feature extraction technology, fully consider the characteristics of Internet online learning, explore the connections between multiple views such as textbooks, lesson plans, teaching videos, teaching voices, and speech drafts, and combine it with the Internet Resource evaluation information is used to form more reasonable knowledge point attributes through teaching resource portraits, and a more accurate and reasonable teaching knowledge graph is constructed. Compared with the traditional knowledge graph construction and recommendation methods, multi-view learning and deep learning are used to construct query methods. According to Designing neural network models using multi-view data formed by different presentation methods of resources can better consider diversity, explore potential connections within knowledge points, and extract knowledge points more accurately and reasonably;

2. Teaching resource portraits and user portraits have been added to make the knowledge point extraction and retrieval process more targeted and focused. The recommended resources are more in line with learners' learning habits and meet learners' personalized needs. The recommendation results contain multiple sets of data, which increases the accuracy and fault tolerance of recommendations;

3. Learners learn based on recommended learning resources and provide timely feedback to continuously optimize the recommendation model.

Description of drawings

In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments or prior art will be briefly introduced below. Obviously, the drawings in the following description are only illustrative of the present invention. For some embodiments, for those of ordinary skill in the art, other drawings can be obtained based on these drawings without exerting creative efforts.

The present invention will be further described below in conjunction with the accompanying drawings.

Figure 1 is a block diagram of the working principle of the teaching knowledge graph construction and retrieval method based on multi-view learning in Embodiment 1.

Detailed ways

The present invention will be further described below in conjunction with the accompanying drawings and specific examples, so that those skilled in the art can better understand the present invention and implement it. However, the illustrated embodiments are not intended to limit the present invention. In the absence of conflict, Below, the embodiments of the present invention and the technical features in the embodiments can be combined with each other.

Embodiments of the present invention provide teaching knowledge graph construction and retrieval methods and systems based on multi-view learning, which are used to solve how to use deep learning technology, combined with multi-view learning and user portrait technology, to effectively utilize massive teaching resources and automatically build more accurate and reasonable Technical issues of teaching knowledge graph and realizing personalized knowledge point retrieval and recommendation.

Example 1:

The present invention is a teaching knowledge graph construction and retrieval method based on multi-view learning, which includes the following steps:

S100. Construct a multi-view feature extractor CR-Mutiview-Fet. The multi-feature view extractor CR-Mutiview-Fet is used to perform feature extraction and feature fusion on teaching resource data in multiple views to obtain the knowledge point ontology structure and Related knowledge points EP-OR;

S200. Construct a teaching resource profiling model CR-Profiler based on the convolutional neural network. The teaching resource profiling model CR-Profiler takes the teaching resource data CR and the teaching resource Internet-related extended data CR-WWW as inputs to profile the teaching resources and output Teaching resource attributes, which include teaching resource basic attributes CR-Basic and teaching resource extended attributes CR-Ext;

S300. Construct a knowledge graph construction model KG-Gen based on the neural network. The knowledge graph construction model KG-Gen uses the ontology structure of knowledge points, associated knowledge points EP-OR and teaching resource attributes as inputs to generate a knowledge graph EP of teaching knowledge points. -KG;

S400. Construct a user profile model Stu-Profiler. The user profile model Stu-Profiler takes user information Stu-Info as input, profiles the user, and outputs user attributes Stu-Prop;

S500. Construct a retrieval recommendation model CR-Recommend. The retrieval recommendation model CR-Recommend is used to build an index based on the teaching knowledge point knowledge graph EP-KG and provide retrieval recommendation services. It is used to use the input retrieval content and combine it with user attributes Stu -Prop, outputs multiple sets of knowledge points and recommended resources for users to choose;

S600. Graph construction and retrieval based on the multi-view feature extractor CR-Mutiview-Fet, teaching resource profile model CR-Profiler, knowledge graph construction model KG-Gen, user profile model Stu-Profiler and retrieval recommendation model CR-Recommend model, and perform model training on the multi-view feature extractor CR-Mutiview-Fet, teaching resource profile model CR-Profiler, knowledge graph construction model KG-Gen, user profile model Stu-Profiler and retrieval recommendation model CR-Recommend in sequence , obtain the trained graph construction and retrieval model;

S700. Construct a knowledge graph EP-KG of teaching knowledge points through the trained graph construction and retrieval model, build an index based on the knowledge graph EP-KG of teaching knowledge points and provide retrieval recommendation services, and output multiple sets of knowledge points and recommended resources for users. for users to choose.

The method of this embodiment is based on massive teaching resource data CR, effectively utilizes deep learning feature extraction technology, fully explores the connections between multiple views such as teaching materials, teaching plans, teaching videos, teaching voices, and speech drafts, and combines it with Internet resource evaluation information , forming more reasonable attributes of knowledge points through teaching resource portraits, and building a more accurate and reasonable teaching knowledge map. Utilizing the formed knowledge point semantic network and teaching resource map, combined with student user portraits, a retrieval recommendation model CR-Recommend based on the teaching knowledge map is formed to provide knowledge point learning and teaching resources for learners' personalized learning and in line with their own characteristics, to achieve Teach students in accordance with their aptitude.

The data sources of teaching resources collected in this embodiment include teaching materials and books, lesson plans, teaching videos, teaching voices and speech drafts, which are mainly divided into four types: knowledge point video V, knowledge point audio A, knowledge point film P and knowledge point teaching material L view mode.

That is, the multi-view feature extractor CR-Mutiview-Fet is responsible for extracting knowledge point features, outputting the ontology structure of knowledge points and associated knowledge points EP-OR. Four different feature extraction models are used for feature extraction and feature fusion for the above four views. , including video feature extraction module FV, audio feature extraction module FA, image feature extraction module FP, text feature extraction module FL and feature fusion module FF. The core of the video feature extraction module FV is a three-dimensional CNN convolutional neural network, which is responsible for extracting the semantic features of video knowledge points; The core of the audio feature extraction module FA is a CNN convolutional neural network, which is responsible for extracting the semantic features of knowledge points in audio; the core of the image feature extraction module FP is a convolutional neural network, which is responsible for extracting the semantic features of knowledge points in images; the core of the text feature extraction module FL is It is a pre-trained language model based on BERT, responsible for extracting the semantic features of text knowledge points; the feature fusion module FF will be based on the video feature extraction module FV, audio feature extraction module FA, image feature extraction module FP and text feature extraction module FL. Feature vectors are fused to obtain the knowledge point ontology structure and associated knowledge point EP-OR.

The core of the teaching resource profiling model CR-Profiler is the CNN convolutional neural network model, which is responsible for profiling the teaching resources based on the extended data CR-WWW and teaching resource data CR related to the Internet teaching resources, and obtaining the basic attributes of the teaching resources CR-Basic (which Including knowledge points, content structure, processes, principles, concepts, tools, etc.) and teaching resource extension attributes CR-Ext (which includes Internet evaluation information, presentation forms, etc.).

The core of the knowledge graph construction model KG-Gen is a neural network model, including the feature encoder Enc and the generative network model GN. The feature encoder is used to encode knowledge points so that the vector calculation distance between similar knowledge points is small and used to provide Resource index query; the generative network model is used to generate the knowledge graph EP-KG of teaching knowledge points. That is, the knowledge graph construction model KG-Gen is based on the knowledge point ontology structure formed by knowledge point extraction and associated knowledge points EP-OR, and combined with the educational resource attributes obtained by the teaching resource portrait CR-Profiler to generate the teaching knowledge point knowledge graph EP-KG .

The user portrait model Stu-Profiler is based on the user's user information Stu-Info (that is, the learner's personal information and learning situation) to profile the learner to form the user attribute Stu-Prop (that is, the knowledge point preferences, mastery and Comprehensive ability and other labels.)

The retrieval recommendation model CR-Recommend is based on the constructed knowledge graph EP-KG of teaching resource knowledge points to build an index and provide retrieval recommendation functions. Through the input retrieval content (including text, voice, video, image, etc.), combined with user portraits The user attribute Stu-Prop generated by the model Stu-Profiler outputs multiple sets of knowledge points and recommended teaching resources, which users as learners can choose from.

Step S200 combines and connects the multi-view feature extractor CR-Mutiview-Fet, the teaching resource profile model CR-Profiler, the knowledge graph construction model KG-Gen, the user profile model Stu-Profiler and the retrieval recommendation model CR-Recommend to form graph construction and retrieval. model, and perform model training on the multi-view feature extractor CR-Mutiview-Fet, teaching resource profile model CR-Profiler, knowledge graph construction model KG-Gen, user profile model Stu-Profiler and retrieval recommendation model CR-Recommend in sequence , obtain the trained graph construction and retrieval model.

In this step, training is performed through the following steps:

S210. Obtain teaching resource data CR from multiple data sources, and perform data annotation based on its materials;

S220. Based on the teaching resource data CR, perform model pre-training on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor CR-Mutiview-Fet respectively;

S230. Fix the model parameters of the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor CR-Mutiview-Fet, and perform model training on the feature fusion model;

S240. Combine the video feature extraction model, audio feature extraction model, image feature extraction model, text feature extraction model and feature fusion model in the multi-view feature extractor CR-Mutiview-Fet, and use the teaching resource data CR to The multi-view feature extractor CR-Mutiview-Fet performs model training as a whole, and parameters are fine-tuned for each model in the multi-view feature extractor CR-Mutiview-Fet;

S250. Collect the Internet-related extended data CR-WWW of teaching resources and perform labeling;

S260. Perform model training on the teaching resource profile model CR-Profiler based on the teaching resource data CR, teaching resource Internet-related extended data CR-WWW and tags;

S270. Based on the knowledge point ontology structure and associated knowledge points EP-OR output by the multi-view feature extractor CR-Mutiview-Fet, and the teaching resource basic attributes CR-Basic and teaching output by the teaching resource profile model CR-Profiler. Resource extension attribute CR-Ext, perform model training on the knowledge graph construction model KG-Gen;

S280. Collect user information Stu-Info and label it;

S290. Based on the user information Stu-Info and tags, perform model training on the user profile model Stu-Profiler;

S2A0. Based on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor CR-Mutiview-Fet, the retrieval recommendation model CR-Recommend is lightweight and tailored. The search condition extraction model in the retrieval recommendation model CR-Recommend is trained;

S2B0. Model the retrieval recommendation model CR-Recommend based on the user attribute Stu-Prop output by the user portrait model Stu-Profiler and the teaching knowledge point knowledge graph EP-KG output by the knowledge graph construction model KG-Gen. train.

Step S300 constructs a knowledge graph EP-KG of teaching knowledge points through the trained graph construction and retrieval model, including the following steps:

S310. Select a field and collect teaching resource data CR in the selected field;

S320. Use the trained multi-view feature extractor CR-Mutiview-Fet to perform feature extraction and feature fusion on the teaching resources to obtain the knowledge point ontology structure and associated knowledge point EP-OR;

S330. Collect the Internet-related extended data CR-WWW of teaching resources. Based on the Internet-related extended data CR-WWW of teaching resources and the teaching resource data CR, profile the teaching resources through the trained teaching resource profile model CR-Profiler to obtain the teaching resources. Resource attributes;

S340. Based on the knowledge point ontology structure, associated knowledge point EP-OR, and teaching resource attributes, the model KG-Gen is constructed through the trained knowledge graph to encode the knowledge points, so that the vector calculation distance between similar knowledge point resources is small, using Query the resource index and generate a knowledge graph EP-KG of teaching knowledge points.

Build an index based on the teaching knowledge point knowledge graph EP-KG and provide search recommendation services, output multiple sets of knowledge points and recommended resources for users to choose, including the following steps:

S350. Input the retrieval content. The viewing mode of the retrieval content includes text, voice, image and video;

S360: Obtain the user's user information Stu-Info, profile the user through the trained user profile model Stu-Profiler, and generate the user attribute Stu-Prop;

S370. Based on the teaching knowledge point knowledge graph EP-KG, build an index through the trained retrieval recommendation model CR-Recommend and provide retrieval recommendation services;

S380, based on indexing and retrieval recommendation services, extract knowledge points from the retrieval content input by the user, combine it with user attributes Stu-Prop to form a knowledge point feature vector, perform knowledge point query and feature vector calculation, and output multiple sets of knowledge points and recommendations. resources for users to choose;

S390. Feed back the collected teaching resource data CR, Internet-related extended data, user information Stu-Info and search content, as well as the output sets of knowledge points and recommended resources to the map construction and retrieval model, and build the map and retrieval models for model training to continuously optimize the map construction and retrieval models.

The method of this embodiment is based on massive teaching resource data CR, effectively utilizes deep learning feature extraction technology, fully considers the characteristics of Internet online learning, and explores the connections between multiple views such as textbooks, lesson plans, teaching videos, teaching voices, and speech scripts. , and combined with Internet resource evaluation information, more reasonable knowledge point attributes are formed through teaching resource portraits, and a more accurate and reasonable teaching knowledge map is constructed. Compared with traditional knowledge graph construction and recommendation methods, multi-view learning and deep learning are used to construct query methods. The neural network model is designed based on multi-view data formed by different presentation methods of resources, which can better consider diversity and explore Potential connections within the knowledge points enable a more accurate and reasonable extraction of knowledge points; teaching resource portraits and learner user portraits are added to the method, which makes the knowledge point extraction and retrieval process more targeted and focused, and the recommended resources are more effective Conform to learners’ learning habits and satisfy learners personalized needs, and the recommendation results contain multiple sets of data, which increases the accuracy and fault tolerance of recommendations. In addition, learners learn based on recommended learning resources and provide timely feedback to continuously optimize the recommendation model.

Example 2:

The present invention is a teaching knowledge graph construction and retrieval system based on multi-view learning, which includes a model construction module, a model training module and a retrieval recommendation module. The system provides users with knowledge points and teaching resource recommendation services based on the method disclosed in the embodiment.

The model building module is used to build a graph construction and retrieval model, which includes a multi-view feature extractor CR-Mutiview-Fet, a teaching resource profile model CR-Profiler, a knowledge graph construction model KG-Gen, and a user profile model. Stu-Profiler and retrieval recommendation model CR-Recommend.

The multi-view feature extractor CR-Mutiview-Fet is responsible for extracting knowledge point features, outputting the knowledge point ontology structure and associated knowledge point EP-OR. Four different feature extraction models are used for feature extraction and feature fusion for the above four views. It includes video feature extraction module FV, audio feature extraction module FA, image feature extraction module FP, text feature extraction module FL and feature fusion module FF. The core of the video feature extraction module FV is a three-dimensional CNN convolutional neural network, which is responsible for extracting the semantic features of knowledge points in the video; the core of the audio feature extraction module FA is a CNN convolutional neural network, which is responsible for extracting the semantic features of the knowledge points in the audio; the image feature extraction module The FP core is a convolutional neural network, responsible for extracting the semantic features of knowledge points in images; the text feature extraction module FL core is a pre-trained language model based on BERT, responsible for extracting the semantic features of knowledge points in text; the feature fusion module FF will extract the semantic features of knowledge points from the video. The feature vectors of the feature extraction module FV, audio feature extraction module FA, image feature extraction module FP and text feature extraction module FL are fused to obtain the knowledge point ontology structure and associated knowledge point EP-OR.

The core of the knowledge graph construction model KG-Gen is a neural network model, including the feature encoder Enc and the generative network model GN. The feature encoder is used to encode knowledge points so that the vector calculation distance between similar knowledge points is small and used to provide Resource index query; generative network model is used to generate teaching knowledge Point Knowledge Graph EP-KG. That is, the knowledge graph construction model KG-Gen is based on the knowledge point ontology structure and associated knowledge points EP-OR formed by knowledge point extraction, and combined with the educational resource attributes obtained by the teaching resource portrait CR-Profiler to generate the teaching knowledge point knowledge graph EP-KG .

The model training module is used to sequentially train the multi-view feature extractor CR-Mutiview-Fet, the teaching resource profile model CR-Profiler, the knowledge graph construction model KG-Gen, the user profile model Stu-Profiler and the retrieval recommendation model CR-Recommend. Model training to obtain the trained map construction and retrieval model.

As a specific implementation, the model training module is used to train through the following steps:

(1) Obtain teaching resource data CR from multiple data sources, and perform data annotation based on its materials;

(2) Based on the teaching resource data CR, perform model pre-training on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor CR-Mutiview-Fet respectively;

(3) Fix the model parameters of the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor CR-Mutiview-Fet, and perform model training on the feature fusion model ;

(4) Combine the video feature extraction model, audio feature extraction model, image feature extraction model, text feature extraction model and feature fusion model in the multi-view feature extractor CR-Mutiview-Fet, and use the teaching resource data CR Perform model training on the multi-view feature extractor CR-Mutiview-Fet as a whole, and perform parameter fine-tuning on each model in the multi-view feature extractor CR-Mutiview-Fet;

(5) Collect the Internet-related extended data CR-WWW of teaching resources and label them;

(6) Carry out model training on the teaching resource profile model CR-Profiler based on the teaching resource data CR, teaching resource Internet-related extended data CR-WWW and tags;

(7) Based on the ontology structure of knowledge points and associated knowledge points EP-OR output by the multi-view feature extractor CR-Mutiview-Fet, and the teaching resources output by the teaching resource profile model CR-Profiler The basic attribute CR-Basic and the teaching resource extended attribute CR-Ext are used to perform model training on the knowledge graph construction model KG-Gen;

(8) Collect user information Stu-Info and label it;

(9) Based on the user information Stu-Info and tags, perform model training on the user profile model Stu-Profiler;

(10) The retrieval recommendation model CR-Recommend is lightweight and tailored based on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor CR-Mutiview-Fet. The search condition extraction model in the above-mentioned retrieval recommendation model CR-Recommend is trained;

(11) Based on the user attribute Stu-Prop output by the user portrait model Stu-Profiler and the teaching knowledge point knowledge graph EP-KG output by the knowledge graph construction model KG-Gen, perform the retrieval recommendation model CR-Recommend Model training.

The retrieval and recommendation module is used to construct a knowledge graph EP-KG of teaching knowledge points through the trained graph construction and retrieval model, build an index based on the knowledge graph EP-KG of teaching knowledge points and provide retrieval recommendation services, and output multiple sets of knowledge points for users. and recommend resources for users to choose from.

As a specific implementation, the retrieval and recommendation module is used to construct the knowledge graph EP-KG of teaching knowledge points through the following steps, and output multiple sets of knowledge points and recommended resources for users to choose:

(1) Select a field and collect teaching resource data CR in the selected field;

(2) Use the trained multi-view feature extractor CR-Mutiview-Fet to perform feature extraction and feature fusion on the teaching resources to obtain the knowledge point ontology structure and associated knowledge point EP-OR;

(3) Collect the Internet-related extended data CR-WWW of teaching resources. Based on the Internet-related extended data CR-WWW of teaching resources and the teaching resource data CR, profile the teaching resources through the trained teaching resource profile model CR-Profiler, and obtain Teaching resource attributes;

(4) Based on the knowledge point ontology structure and associated knowledge point EP-OR and teaching resource attributes, the trained knowledge graph is used to construct the model KG-Gen to encode the knowledge points, so that the vector calculation distance between similar knowledge point resources is small. Used for resource index query and generation of knowledge graph EP-KG of teaching knowledge points;

(5) Enter the search content, and the view mode of the search content includes text, voice, image and video;

(6) Obtain the user's user information Stu-Info, profile the user through the trained user profile model Stu-Profiler, and generate user attributes Stu-Prop;

(7) Build an index based on the teaching knowledge point knowledge graph EP-KG and the trained retrieval recommendation model CR-Recommend and provide retrieval recommendation services;

(8) Based on indexing and retrieval recommendation services, extract knowledge points from the retrieval content input by the user, form a knowledge point feature vector based on the user attribute Stu-Prop, perform knowledge point query and feature vector calculation, and output multiple sets of knowledge point sums Recommend resources for users to choose from;

(9) Feed back the collected teaching resource data CR, Internet-related extended data, user information Stu-Info and search content, as well as the output sets of knowledge points and recommended resources to the map construction and retrieval model, and analyze the map Build and retrieve models and conduct model training to continuously optimize the map construction and retrieval models.

The present invention has been shown and described in detail through the drawings and preferred embodiments above. However, the present invention is not limited to these disclosed embodiments. Based on the above-mentioned multiple embodiments, those skilled in the art will know that the above-mentioned different embodiments can be combined. The code review means in the method can lead to more embodiments of the present invention, and these embodiments are also within the protection scope of the present invention.

Claims

A teaching knowledge graph construction and retrieval method based on multi-view learning, which is characterized by including the following steps:

Construct a multi-view feature extractor, which is used to perform feature extraction and feature fusion on teaching resource data in multiple view modes to obtain the knowledge point ontology structure and associated knowledge points;

A teaching resource portrait model is constructed based on the convolutional neural network. The teaching resource portrait model takes teaching resource data and teaching resource Internet-related extended data as inputs to profile the teaching resources and outputs teaching resource attributes. The teaching resource attributes include teaching resources. Basic attributes and extended attributes of teaching resources;

Build a knowledge graph construction model based on a neural network. The knowledge graph construction model uses the ontology structure of knowledge points, associated knowledge points and teaching resource attributes as input to generate a knowledge graph of teaching knowledge points;

Construct a user portrait model, which takes user information as input, profiles the user, and outputs user attributes;

Construct a retrieval recommendation model. The retrieval recommendation model is used to build an index based on the knowledge graph of teaching knowledge points and provide retrieval recommendation services. It is used to output multiple sets of knowledge points and recommended resources for users through the input retrieval content and combined with user attributes. choose;

Based on the multi-view feature extractor, teaching resource portrait model, knowledge graph construction model, user portrait model and retrieval recommendation model, a graph construction and retrieval model is constructed, and the multi-view feature extractor, teaching resource portrait model, knowledge graph are constructed in sequence. Carry out model training on the graph construction model, user portrait model and retrieval recommendation model, and obtain the trained graph construction and retrieval model;

A knowledge graph of teaching knowledge points is constructed through the trained graph construction and retrieval model, an index is constructed based on the knowledge graph of teaching knowledge points and retrieval recommendation services are provided, and multiple sets of knowledge points and recommended resources are output to users for selection.
The teaching knowledge graph construction and retrieval method based on multi-view learning according to claim 1, characterized in that the data sources of the teaching resources include teaching materials and books, lesson plans, teaching videos, teaching voices and speech scripts, and the data sources of the teaching resources include There are four view modes, namely video, audio, image and text;

The basic attributes of the teaching resources include knowledge points, content structure, processes, principles, concepts, and tools;

The extended attributes of the teaching resources include Internet evaluation information and presentation forms;

The view form of the retrieval content includes text, voice, video and image;

The knowledge map of teaching knowledge points includes a semantic network, basic data of teaching resources and extended attributes of teaching resources. The semantic network is formed based on the ontology structure of knowledge points and the association of knowledge points;

The user information includes basic information and learning status;

The user attributes include knowledge point preferences, mastery level and comprehensive ability.
The teaching knowledge graph construction and retrieval method based on multi-view learning according to claim 2, characterized in that the multi-view feature extractor includes:

A video feature extraction model. The video feature extraction model is a network model built based on a three-dimensional CNN convolutional neural network and is used to extract semantic features of knowledge points from video teaching resource data;

Audio feature extraction model. The audio feature extraction model is a network model built based on a CNN convolutional neural network and is used to extract semantic features of knowledge points from audio-based teaching resource data;

Image feature extraction model. The image feature extraction model is a network model built based on a convolutional neural network and is used to extract semantic features of knowledge points from image-based teaching resource data;

Text feature extraction model. The text feature extraction model is a language model based on BERT, which is used to extract semantic features of knowledge points from text-based teaching resource data;

Feature fusion model, the feature fusion model is used to fuse the semantic features of knowledge points output by the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model to obtain the knowledge point ontology structure and associated knowledge point;

The knowledge graph construction model includes:

Feature encoder, which is used to encode knowledge points so that the vector calculation distance between similar knowledge points is small and used to provide resource index queries;

Generate a network model, which is used to generate a knowledge graph of teaching knowledge points.
The teaching knowledge graph construction and retrieval method based on multi-view learning according to claim 3, characterized in that the multi-view feature extractor, teaching resource portrait model, knowledge graph construction model, user portrait model and retrieval recommendation model are sequentially Carry out model training, including the following steps:

Obtain teaching resource data from multiple data sources and annotate the data based on its materials;

Based on the teaching resource data, video features are extracted in the multi-view feature extractor model, audio feature extraction model, image feature extraction model and text feature extraction model for model pre-training;

Fix the model parameters of the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor, and perform model training on the feature fusion model;

The video feature extraction model, audio feature extraction model, image feature extraction model, text feature extraction model and feature fusion model in the multi-view feature extractor are combined, and the entire multi-view feature extractor is processed through the teaching resource data. Carry out model training and fine-tune parameters of each model in the multi-view feature extractor;

Collect Internet-related extended data on teaching resources and label them;

Carry out model training on the teaching resource portrait model based on the teaching resource data, teaching resource Internet-related extended data and tags;

Based on the knowledge point ontology structure and associated knowledge points output by the multi-view feature extractor, as well as the teaching resource basic attributes and teaching resource extended attributes output by the teaching resource portrait model, perform model training on the knowledge graph construction model;

Collect user information and label it;

Based on user information and tags, perform model training on the user portrait model;

The retrieval recommendation model is lightweight and tailored based on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor, and the search condition extraction model in the retrieval recommendation model is trained. ;

Based on the user attributes output by the user portrait model and the teaching knowledge point knowledge graph output by the knowledge graph construction model, model training is performed on the retrieval recommendation model.
The teaching knowledge graph construction and retrieval method based on multi-view learning according to claim 3, characterized in that the teaching knowledge point knowledge graph is constructed through the trained graph construction and retrieval model, including the following steps:

Select a field and collect teaching resource data in the selected field;

Use the trained multi-view feature extractor to perform feature extraction and feature fusion on the teaching resources to obtain the knowledge point ontology structure and associated knowledge points;

Collect the Internet-related extended data of teaching resources, and based on the Internet-related extended data of teaching resources and the teaching resource data, profile the teaching resources through the trained teaching resource portrait model to obtain the teaching resource attributes;

Based on the knowledge point ontology structure, associated knowledge points, and teaching resource attributes, the knowledge points are encoded through the trained knowledge graph construction model, so that the vector calculation distance between similar knowledge point resources is small, which can be used for resource index query and generate teaching Knowledge point knowledge map;

Build an index based on the knowledge graph of teaching knowledge points and provide retrieval and recommendation services, and output multiple sets of knowledge points and recommended resources for users to choose, including the following steps:

Enter the search content, and the view mode of the search content includes text, voice, image and video;

Obtain the user's user information, profile the user through the trained user portrait model, and generate user attributes;

Based on the knowledge graph of teaching knowledge points, build an index through the trained retrieval recommendation model and provide retrieval recommendation services;

Based on indexing and retrieval recommendation services, knowledge points are extracted from the retrieval content input by users, combined with user attributes to form knowledge point feature vectors, and knowledge point queries and feature vector calculations are performed to output multiple sets of knowledge points and recommended resources for users make a choice;

The collected teaching resource data, Internet-related extended data, user information and search content, as well as the output sets of knowledge points and recommended resources, are fed back to the map construction and retrieval model, and model training is performed on the map construction and retrieval model. , to continuously optimize the map construction and retrieval model.
A teaching knowledge graph construction and retrieval system based on multi-view learning, which is characterized in that it is used to provide users with a teaching knowledge graph construction and retrieval method based on multi-view learning as described in any one of claims 1-5. Knowledge points and teaching resource recommendation services, the system includes:

A model building module, which is used to build a graph construction and retrieval model. The graph construction and retrieval model includes a multi-view feature extractor, a teaching resource portrait model, a knowledge graph construction model, a user portrait model and a retrieval recommendation model, The multi-feature view extractor is used to perform feature extraction and feature fusion on teaching resource data in multiple view modes to obtain the knowledge point ontology structure and associated knowledge points; the teaching resource portrait model is a network model based on a convolutional neural network. , using teaching resource data and teaching resource Internet-related extended data as input, the teaching resources are profiled, and the teaching resource attributes are output. The teaching resource attributes include basic teaching resource attributes and teaching resource extended attributes; the knowledge graph construction model is based on The ontology structure of knowledge points, associated knowledge points and teaching resource attributes are used as input to generate a knowledge graph of teaching knowledge points; the user portrait model The model takes user information as input, profiles the user, and outputs user attributes; the retrieval recommendation model is used to build an index based on the knowledge graph of teaching knowledge points and provide retrieval recommendation services, and is used to combine the input retrieval content with user attributes. Output multiple sets of knowledge points and recommended resources for users to choose;

A model training module, which is used to sequentially perform model training on the multi-view feature extractor, teaching resource portrait model, knowledge graph construction model, user portrait model and retrieval recommendation model, and obtain the trained graph construction and retrieval Model;

A retrieval recommendation module. The retrieval recommendation module is used to construct a knowledge graph of teaching knowledge points through the trained graph construction and retrieval model, build an index based on the knowledge graph of teaching knowledge points and provide retrieval recommendation services, and output multiple sets of knowledge points for users. and recommend resources for users to choose from.
The teaching knowledge graph construction and retrieval system based on multi-view learning according to claim 6, characterized in that the data sources of the teaching resources include teaching materials and books, lesson plans, teaching videos, teaching voices and speech scripts, and the data sources of the teaching resources include There are four view modes, namely video, audio, image and text;

The basic attributes of the teaching resources include knowledge points, content structure, processes, principles, concepts, and tools;

The extended attributes of the teaching resources include Internet evaluation information and presentation forms;

The view form of the retrieval content includes text, voice, video and image;

The knowledge map of teaching knowledge points includes a semantic network, basic data of teaching resources and extended attributes of teaching resources. The semantic network is formed based on the ontology structure of knowledge points and the association of knowledge points;

The user information includes basic information and learning status;

The user attributes include knowledge point preferences, mastery level and comprehensive ability.
The teaching knowledge graph construction and retrieval system based on multi-view learning according to claim 7, characterized in that the multi-view feature extractor includes:

A video feature extraction model. The video feature extraction model is a network model built based on a three-dimensional CNN convolutional neural network and is used to extract semantic features of knowledge points from video teaching resource data;

Audio feature extraction model. The audio feature extraction model is a network model built based on a CNN convolutional neural network and is used to extract semantic features of knowledge points from audio-based teaching resource data;

Image feature extraction model. The image feature extraction model is a network model built based on a convolutional neural network and is used to extract semantic features of knowledge points from image-based teaching resource data;

Text feature extraction model, the text feature extraction model is a language model based on BERT, Used to extract semantic features of knowledge points from text-based teaching resource data;

Feature fusion model, the feature fusion model is used to fuse the semantic features of knowledge points output by the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model to obtain the knowledge point ontology structure and associated knowledge point;

The knowledge graph construction model includes:

Feature encoder, which is used to encode knowledge points so that the vector calculation distance between similar knowledge points is small and used to provide resource index queries;

Generate a network model, which is used to generate a knowledge graph of teaching knowledge points.
The teaching knowledge graph construction and retrieval system based on multi-view learning according to claim 8, characterized in that the model sequence module is used to sequentially perform the following steps on the multi-view feature extractor, the teaching resource portrait model, and the knowledge graph. Build models, user portrait models and retrieval recommendation models for model training:

Obtain teaching resource data from multiple data sources and annotate the data based on its materials;

Based on the teaching resource data, perform model pre-training on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor;

Fix the model parameters of the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor, and perform model training on the feature fusion model;

The video feature extraction model, audio feature extraction model, image feature extraction model, text feature extraction model and feature fusion model in the multi-view feature extractor are combined, and the entire multi-view feature extractor is processed through the teaching resource data. Carry out model training and fine-tune parameters of each model in the multi-view feature extractor;

Collect Internet-related extended data on teaching resources and label them;

Carry out model training on the teaching resource portrait model based on the teaching resource data, teaching resource Internet-related extended data and tags;

Based on the knowledge point ontology structure and associated knowledge points output by the multi-view feature extractor, as well as the teaching resource basic attributes and teaching resource extended attributes output by the teaching resource portrait model, perform model training on the knowledge graph construction model;

Collect user information and label it;

Based on user information and tags, perform model training on the user portrait model;

The retrieval recommendation model is lightweight and tailored based on the video feature extraction model, audio feature extraction model, image feature extraction model and text feature extraction model in the multi-view feature extractor, and the search condition extraction model in the retrieval recommendation model is trained. ;

Based on the user attributes output by the user portrait model and the teaching knowledge point knowledge graph output by the knowledge graph construction model, model training is performed on the retrieval recommendation model.
The teaching knowledge graph construction and retrieval system based on multi-view learning according to claim 9, characterized in that the retrieval recommendation module is used to construct a knowledge graph of teaching knowledge points through the following steps, and output multiple sets of knowledge points and recommendations for users Resources for users to choose from:

Select a field and collect teaching resource data in the selected field;

Use the trained multi-view feature extractor to perform feature extraction and feature fusion on the teaching resources to obtain the knowledge point ontology structure and associated knowledge points;

Collect the Internet-related extended data of teaching resources, and based on the Internet-related extended data of teaching resources and the teaching resource data, profile the teaching resources through the trained teaching resource portrait model to obtain the teaching resource attributes;

Based on the knowledge point ontology structure, associated knowledge points, and teaching resource attributes, the knowledge points are encoded through the trained knowledge graph construction model, so that the vector calculation distance between similar knowledge point resources is small, which can be used for resource index query and generate teaching Knowledge point knowledge map;

Enter the search content, and the view mode of the search content includes text, voice, image and video;

Obtain the user's user information, profile the user through the trained user portrait model, and generate user attributes;

Based on the knowledge graph of teaching knowledge points, build an index through the trained retrieval recommendation model and provide retrieval recommendation services;

Based on indexing and retrieval recommendation services, knowledge points are extracted from the retrieval content input by the user, combined with user attributes to form knowledge point feature vectors, knowledge point queries and feature vector calculations are performed, and multiple sets of knowledge points and recommended resources are output for users make a choice;

The collected teaching resource data, Internet-related extended data, user information and search content, as well as the output sets of knowledge points and recommended resources, are fed back to the map construction and retrieval model, and model training is performed on the map construction and retrieval model. , to continuously optimize the map construction and retrieval model.