WO2020216286A1 - Method for training teaching style prediction model, and computer storage medium - Google Patents

Method for training teaching style prediction model, and computer storage medium Download PDF

Info

Publication number
WO2020216286A1
WO2020216286A1 PCT/CN2020/086363 CN2020086363W WO2020216286A1 WO 2020216286 A1 WO2020216286 A1 WO 2020216286A1 CN 2020086363 W CN2020086363 W CN 2020086363W WO 2020216286 A1 WO2020216286 A1 WO 2020216286A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
feature
prediction
level
teacher style
Prior art date
Application number
PCT/CN2020/086363
Other languages
French (fr)
Chinese (zh)
Inventor
杨嵩
黄健
杨非
刘子韬
黄琰
Original Assignee
北京新唐思创教育科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京新唐思创教育科技有限公司 filed Critical 北京新唐思创教育科技有限公司
Publication of WO2020216286A1 publication Critical patent/WO2020216286A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation

Definitions

  • the embodiment of the present invention relates to the field of artificial intelligence, and in particular to a training method and computer storage medium of a teacher's style prediction model.
  • Teacher style is the judgment of teacher's individual value, and it is an important content of education evaluation. Predicting the teaching style of teachers can enable the school's teaching management department and teachers to understand the teaching situation, discover problems, sum up experience and reform work, so as to achieve the ultimate goal of improving teaching quality. Therefore, how to predict the teacher's style fairly and accurately has always been a problem explored by the education circle.
  • the input data of the model can include teaching audio and video teaching data of teachers. Because it is difficult to obtain teaching data samples of different teacher styles, the amount of teaching data samples is often small. In addition, the dimensionality of the features extracted in the teaching data sample is often high. Therefore, it is prone to overfitting when training the model, and it is impossible to train a model with better performance. Aiming at the problem of the small amount of teaching data samples and the high dimension of the extracted features, most of the existing processing methods use principal component analysis technology to reduce the dimensions of the high-dimensional features extracted from the teaching data samples, and then use The features after dimensionality reduction train the model.
  • one of the technical problems solved by the embodiments of the present invention is to provide a training method of a teacher's style prediction model and a non-transitory computer-readable storage medium to solve at least one of the above-mentioned problems.
  • the embodiment of the present invention provides a training method of a teacher's style prediction model.
  • the method includes: determining a plurality of sets of second characteristic data of the teaching content sample based on the first characteristic data of the teaching content sample, wherein the first characteristic data has a higher dimension than the second characteristic data;
  • the teacher style prediction model to be trained obtains the teacher style prediction data corresponding to the teaching content sample based on the multiple sets of second feature data; the teacher style annotation data based on the teaching content sample and the teacher style prediction data, Training the teacher style prediction model.
  • the embodiment of the present invention also provides a non-transitory computer-readable storage medium, the non-transitory computer-readable storage medium stores a readable program, and the readable program includes: a first feature used for teaching content samples Data, instructions for determining multiple sets of second feature data of the teaching content sample, wherein the first feature data has a higher dimension than the second feature data; used to predict the style of the teacher to be trained based on The multiple sets of second feature data are instructions for obtaining teacher style prediction data corresponding to the teaching content sample; used to train the teacher style based on the teacher style annotation data of the teaching content sample and the teacher style prediction data Predictive model instructions.
  • the training scheme of the teacher's style prediction model provided by the embodiment of the present invention, based on the first characteristic data of the teaching content sample, multiple sets of second characteristic data of the teaching content sample are determined, wherein the first characteristic data has a higher value than the second characteristic data.
  • the teacher style prediction data corresponding to the teaching content sample is obtained through the teacher style prediction model to be trained, and then based on the teacher style annotation data and teacher style prediction data of the teaching content sample, the teacher is trained
  • the style prediction model greatly reduces the dimensions of the input features of the teacher style prediction model to be trained by grouping the high-dimensional feature data of the teaching content samples into multiple sets of second feature data, thereby making The teacher style prediction performance of the trained teacher style prediction model can be effectively improved.
  • Figure 1 shows a flowchart of the steps of a method for training a teacher's style prediction model according to the first embodiment of the present invention
  • Fig. 2 shows a schematic structural diagram of a teacher style prediction model provided according to the first embodiment of the present invention
  • Fig. 3 shows a flowchart of steps of a method for predicting teacher style according to the second embodiment of the present invention.
  • the training method of the teacher's style prediction model includes the following steps:
  • step S101 based on the first feature data of the teaching content sample, multiple sets of second feature data of the teaching content sample are determined.
  • the teaching content sample may include audio data or video data of the teaching content as the training sample.
  • the first feature data can be understood as a feature vector with a higher dimension, for example, a feature vector of 1000 dimensions, a feature vector of 2000 dimensions, and so on.
  • the high-dimensional feature data of the teaching content sample may be high-dimensional voice acoustic feature data extracted from the audio data, and the voice acoustics
  • the feature data may include prosodic feature data, spectral feature data, and voice quality feature data of the audio, and the voice acoustic feature data is specifically a voice acoustic feature vector.
  • an existing speech acoustic feature extraction algorithm may be used to extract high-dimensional speech acoustic feature data from the audio data.
  • the high-dimensional feature data of the teaching content sample may be high-dimensional facial feature data extracted from the video data, and the person The facial feature data may include the feature data of the mouth region, the feature data of the eye region, the feature data of the cheek region, etc.
  • the facial feature data is specifically a facial feature vector of a human face.
  • an existing facial feature extraction algorithm can be used to extract high-dimensional facial feature data from the video data.
  • correlation analysis is performed on the high-dimensional feature data to determine the high-dimensional feature data.
  • Grouping of feature data based on the grouping of the high-dimensional feature data, the high-dimensional feature data is divided to obtain multiple groups of low-dimensional feature data of the teaching content sample. In this way, the dimension of the input features of the teacher style prediction model is greatly reduced.
  • the high-dimensional feature data is specifically high-dimensional voice acoustic feature data
  • the voice acoustic features include prosodic features, spectral features, and voice quality features. Therefore, it can be based on voice acoustic features.
  • the correlation analysis is performed on the high-dimensional voice acoustic feature data to determine the grouping of the high-dimensional voice acoustic feature data. Then, according to the grouping, the high-dimensional speech acoustic feature data is divided to obtain the prosody feature data, frequency spectrum feature data, and sound quality feature data of the teaching content sample.
  • the types of features included in the voice acoustic features are determined based on the prior knowledge of voice acoustics, and then the high-dimensional voice acoustic feature data is correlated based on the types of features included in the voice acoustic features.
  • the high-dimensional feature data is specifically high-dimensional facial feature data, it can be known from the prior knowledge of the human face that the human face includes the mouth area, the eye area, the nose area, and the cheek area. Therefore, it can be based on the human face.
  • the face includes the prior knowledge of the mouth area, the eye area, the nose area and the cheek area, and the correlation analysis is performed on the high-dimensional facial feature data to determine the grouping of the high-dimensional facial feature data.
  • the high-dimensional face feature data is divided to obtain the mouth area feature data, the eye area feature data, the nose area feature data, and the cheek area feature data of the teaching content sample.
  • the different areas included in the face are determined according to the prior knowledge of the face, and then the correlation analysis of the high-dimensional facial feature data is performed according to the different areas included in the face.
  • the teaching is evaluated in an equal-dimensional manner.
  • the high-dimensional feature data of the content sample is divided to obtain multiple sets of low-dimensional feature data of the teaching content sample.
  • the high-dimensional feature data is specifically 1000-dimensional feature data
  • the 1000-dimensional high-dimensional feature data can be equally divided into 10 groups of low-dimensional feature data, and the dimension of each group of low-dimensional feature data is 100 dimension.
  • how many groups and dimensions are divided into can be set through experiments. In this way, the dimension of the input features of the teacher style prediction model is greatly reduced.
  • the original high-dimensional feature data is v n
  • the divided low-dimensional feature data of the kth group is then Among them, concat (*) means that the feature data are spliced together in turn. If there is no prior knowledge to analyze the correlation of features, it can be equally divided into K parts, which also meets the above relationship.
  • step S102 the teacher style prediction data corresponding to the teaching content sample is obtained based on the multiple sets of second feature data through the teacher style prediction model to be trained.
  • the teacher style prediction model can be any suitable neural network model that can realize feature extraction or target object detection, including but not limited to convolutional neural network, enhanced learning neural network, and generation in counter neural network. Network, deep neural network, etc.
  • the settings of the specific structure in the neural network can be appropriately set by those skilled in the art according to actual requirements, such as the number of convolution layers, the size of the convolution kernel, the number of channels, and so on.
  • the teacher style prediction data may be a predicted teacher style category, or a predicted teacher style value.
  • the teacher style prediction model includes multiple low-level models and high-level models connected to the output terminals of the multiple low-level models, and the multiple low-level models and the high-level models are all deep neural network models .
  • the multiple low-level models are based on the multiple sets of low-dimensional feature data.
  • each of the multiple low-layer models includes a hidden layer and a prediction layer connected to the output end of the hidden layer, and the hidden layer is specifically a fully connected layer or a convolutional layer.
  • the prediction layer is specifically a fully connected layer.
  • the characteristic characterization data is specifically a characteristic characterization vector.
  • feature extraction operations are performed on multiple sets of low-dimensional feature data, which can re-encode multiple sets of low-dimensional feature data, and improve the robustness of feature representation data corresponding to multiple sets of low-dimensional feature data. It can improve the accuracy of the low-level model's preliminary prediction of the teacher style corresponding to the teaching content sample.
  • the high-level feature representation data is specifically a high-level feature representation vector.
  • the high-level feature representation data corresponding to the high-level model is generated, and then through the high-level model, based on the high-level feature representation data, the final prediction data of the teacher style corresponding to the teaching content sample is obtained, which can improve the high-level model's ability to teach content The accuracy of the final prediction of the teacher style corresponding to the sample.
  • the corresponding characteristic characterization data are respectively generated to generate the high-level characteristic characterization data.
  • high-level feature representation data can be generated, which can improve the robustness of the high-level feature representation data, thereby improving the high-level model's ability to respond to the teacher style corresponding to the teaching content sample. The accuracy of the final prediction.
  • the hidden layer in the high-level model is used to compare the Perform a feature extraction operation on the high-level feature characterization data to obtain the feature characterization data corresponding to the high-level feature characterization data; through the prediction layer in the high-level model, perform a mapping operation on the feature characterization data corresponding to the high-level feature characterization data to Obtain final prediction data of teacher style corresponding to the teaching content sample.
  • the hidden layer is specifically a fully connected layer or a convolutional layer
  • the prediction layer is specifically a fully connected layer
  • the feature characterization data is specifically a feature characterization vector.
  • the feature extraction operation of the high-level feature characterization data can re-encode the high-level feature characterization data, improve the robustness of the feature characterization data corresponding to the high-level feature characterization data, and thereby improve the high-level model's ability to teach The accuracy of the final prediction of the teacher's style corresponding to the content sample.
  • the teacher style prediction model includes multiple low-level models and high-level models connected to the output terminals of the multiple low-level models. After dividing the high-dimensional feature data, multiple feature groups are obtained, and then each feature group is input into the corresponding low-level model. Then through the corresponding low-level model, based on the feature grouping, the preliminary prediction of the teaching style of the teaching content sample is made to obtain the preliminary prediction data of the teacher style corresponding to the teaching content sample.
  • the low-level model includes a plurality of sequentially connected hidden layers and a prediction layer connected to the output end of the last hidden layer of the plurality of sequentially connected hidden layers.
  • high-level feature representation data corresponding to the high-level model is generated.
  • the final prediction of the teaching style of the teaching content sample is made to obtain the final prediction data of the teacher style corresponding to the teaching content sample.
  • the high-dimensional feature data is divided into K groups of low-dimensional feature data, and each group of low-dimensional feature data corresponds to a low-level model, then the k-th low-level model and the k-th group of low-dimensional feature data One to one correspondence.
  • f(*) is a non-linear function, usually a sigmoid function;
  • Is the first hidden vector representation of the nth sample data of the kth group of low-level models, and the dimension is
  • the dimension is Is the bias vector of the l kth hidden layer of the kth group of low-level models, and the dimension is Indicates the result calculated according to the weight matrix and bias vector of the layer
  • Is the bias vector of the L k hidden layer of the kth group of low-level models, with the dimension Indicates the result calculated according to the weight matrix and bias vector of the layer;
  • the output of the hidden layer of the k-th low-level model is As the input of the prediction layer of the k-th group of low-level models:
  • the dimension is 1, which is a real value between 0-1.
  • the high-level feature representation data is:
  • the dimension of h n is Combining the preliminary prediction data of teacher style of each low-level model and adding the hidden vector representation of the last hidden layer can obtain more information, so that the high-level model can predict more accurately.
  • the high-level feature representation data is used as the input of the high-level model for final prediction.
  • the high-level model includes a plurality of sequentially connected hidden layers and a prediction layer connected to the output end of the last hidden layer of the plurality of sequentially connected hidden layers .
  • the number of hidden layers of the high-level model is L
  • the hidden node dimension of the l-th hidden layer is D l .
  • the dimension is Is the offset vector of the first hidden layer of the high-level model, with the dimension D 1
  • y 1n indicates the result calculated according to the weight matrix and offset vector of the layer
  • f(*) is a nonlinear function, usually a sigmoid function
  • g 1n is the first hidden vector representation of the n-th sample data of the high-level model
  • the dimension is D 1 ⁇ 1.
  • y ln indicates the result calculated according to the weight matrix and offset vector of the layer;
  • h Ln is the L-th value of the high-level model for the nth sample data
  • Hidden vector representation of a hidden layer the dimension is D L ⁇ 1.
  • the output of the hidden layer of the high-level model is h Ln as the input of the prediction layer of the high-level model:
  • the structure of the low-level model and the high-level model are similar.
  • the reason why the low-level model and the high-level model are used is because the low-level model is used to make preliminary predictions of the teaching style of the teaching content samples.
  • the final prediction of the teaching style of the teaching content sample can improve the accuracy of the teacher style prediction model for the teacher style corresponding to the teaching content sample.
  • directly using a model (such as using only one underlying model) for modeling will cause a "dimension disaster".
  • the model is only applicable to training data, and can not get good performance on test data, which will cause overfitting.
  • step S103 the teacher style prediction model is trained based on the teacher style annotation data of the teaching content sample and the teacher style prediction data.
  • the teacher style annotation data can be understood as the actual teacher style data of the teaching content sample.
  • the target loss function is used to determine the teacher style annotation data and the teacher style prediction data.
  • the difference value between the teacher style prediction data; based on the difference value, the parameters of the teacher style prediction model are adjusted.
  • the teacher style annotation data and the teacher style are determined through the objective loss function
  • adjust the parameters of the teacher style prediction model based on the difference value adjust the parameters of the multiple low-level models and the high-level model in the teacher style prediction model based on the difference value.
  • the objective loss function includes a mean square error term and an L2 regularization term. In this way, the training process of the teacher's style prediction model can be prevented from being affected by overfitting.
  • the teacher style prediction data s n is obtained from the prediction layer of the high-level model.
  • the real teacher style data of the nth sample data is s′ n
  • train the teacher style prediction model to make s n and s′ n as close as possible.
  • the following function is selected as the loss function for training the teacher's style prediction model:
  • s n is the real teacher style data of the nth sample data
  • s′ n is the final teacher style prediction data of the high-level model for the nth sample data
  • W k is the weight matrix of the prediction layer of the low-level model
  • W l is the weight matrix of the hidden layer of the high-level model
  • W is the weight matrix of the high-level model prediction layer
  • is the weight attenuation term.
  • the value is between 0 and 1.
  • the first and second terms of the above formula calculate the mean square error, and the last four terms are added with L2 regularization to prevent the teacher style prediction model from overfitting.
  • the training of the teacher's style prediction model is to combine the low-level model and the high-level model for unified training, and optimize the teacher's style prediction model as a whole through the objective loss function.
  • the entire model is trained using the minimized objective loss function, that is, the parameters of the teacher style prediction model ( W k , W l , W, b k , b l , b).
  • the currently obtained teacher style final prediction data is evaluated as a basis for subsequent training of the teacher style prediction model.
  • the difference value may be transmitted back to the teacher style prediction model, so as to train the teacher style prediction model iteratively.
  • the training of the teacher style prediction model is an iterative process. The embodiments of this application only describe one training process, but those skilled in the art should understand that each training of the teacher style prediction model can be performed. This training method is adopted until the training of the teacher style prediction model is completed.
  • the teacher style prediction model based on the high-dimensional feature data of the teaching content sample, multiple sets of low-dimensional feature data of the teaching content sample are determined, and the teacher style prediction model to be trained is based on multiple sets of With low-dimensional feature data, the teacher style prediction data corresponding to the teaching content sample is obtained, and then based on the teacher style annotation data and the teacher style prediction data of the teaching content sample, the teacher style prediction model is trained.
  • the teaching style The high-dimensional feature data of the content samples are grouped into multiple sets of low-dimensional feature data, which greatly reduces the dimension of the input features of the teacher style prediction model to be trained, so that the teacher style prediction performance of the trained teacher style prediction model can be effectively Promote.
  • FIG. 3 a flowchart of the steps of a method for predicting teacher style according to the second embodiment of the present invention is shown.
  • the teacher style prediction method includes the following steps:
  • step S201 based on the first feature data of the teaching content data, multiple sets of second feature data of the teaching content data are determined.
  • the teaching content data may include audio data or video data of the teaching content.
  • the first feature data of the teaching content data may be high-dimensional voice acoustic feature data extracted from the audio data.
  • the teaching content data is video data of the teaching content
  • the first feature data of the teaching content data may be high-dimensional facial feature data extracted from the video data.
  • step S201 is similar to the specific implementation of step S101 described above, and will not be repeated here.
  • step S202 the teacher style prediction data corresponding to the teaching content data is obtained based on the multiple sets of second feature data of the teaching content data through the trained teacher style prediction model.
  • the multiple low-level models are based on The multiple sets of low-dimensional feature data obtain multiple teacher style preliminary prediction data corresponding to the teaching content data; through the high-level model, based on the multiple teacher style preliminary prediction data, obtain the corresponding teaching content data Teacher style final prediction data.
  • the teacher style prediction model trained in the first embodiment includes multiple low-level models, and the teaching content data is preliminarily predicted for the teaching style, and then the high-level model included in the teacher style prediction model trained in the first embodiment is based on
  • the preliminary prediction result of the teaching style and the final prediction of the teaching style on the teaching content data can improve the accuracy of the teacher style prediction model for predicting the teacher style corresponding to the teaching content data.
  • the hidden layer The multiple sets of low-dimensional feature data are respectively subjected to feature extraction operations to obtain feature representation data corresponding to the multiple sets of low-dimensional feature data; through the prediction layer, features corresponding to the multiple sets of low-dimensional feature data
  • the characterization data is subjected to a mapping operation to obtain a plurality of preliminary prediction data of teacher style corresponding to the teaching content data.
  • the characteristic characterization data is specifically a characteristic characterization vector.
  • feature extraction operations are performed on multiple sets of low-dimensional feature data, which can re-encode multiple sets of low-dimensional feature data, and improve the robustness of feature representation data corresponding to multiple sets of low-dimensional feature data. It can improve the accuracy of the preliminary prediction of the teacher’s style corresponding to the teaching content data by the low-level model.
  • the high-level feature representation data is specifically a high-level feature representation vector.
  • the high-level feature representation data corresponding to the high-level model is generated, and then through the high-level model, based on the high-level feature representation data, the final prediction data of the teacher style corresponding to the teaching content data is obtained, which can improve the high-level model's impact on the teaching content The accuracy of the final prediction of teacher style corresponding to the data.
  • the corresponding characteristic characterization data are respectively generated to generate the high-level characteristic characterization data.
  • high-level feature representation data can be generated, which can improve the robustness of the high-level feature representation data, thereby improving the high-level model's ability to respond to the teacher style corresponding to the teaching content data. The accuracy of the final prediction.
  • the hidden layer in the high-level model is used to compare the Perform a feature extraction operation on the high-level feature characterization data to obtain the feature characterization data corresponding to the high-level feature characterization data; through the prediction layer in the high-level model, perform a mapping operation on the feature characterization data corresponding to the high-level feature characterization data to Obtain final prediction data of teacher style corresponding to the teaching content data.
  • the feature extraction operation of the high-level feature characterization data can re-encode the high-level feature characterization data, improve the robustness of the feature characterization data corresponding to the high-level feature characterization data, and thereby improve the high-level model's ability to teach The accuracy of the final prediction of the teacher style corresponding to the content data.
  • the method further includes: performing a mapping operation on the teacher style prediction data to obtain the teacher style category corresponding to the teaching content data. In this way, the teacher style category corresponding to the teaching content data can be obtained.
  • the teacher style semantic space can be understood as a mapping space between teacher style prediction data and teacher style categories.
  • the teacher style prediction method based on the high-dimensional feature data of the teaching content data, multiple sets of low-dimensional feature data of the teaching content data are determined, and then the teacher style prediction model obtained through training in Embodiment 1 is based on the teaching content Multiple sets of low-dimensional feature data of the data to obtain teacher style prediction data corresponding to the teaching content data.
  • the teacher style prediction model obtained through training in Embodiment 1 is based on the teaching content Multiple sets of low-dimensional feature data of the data to obtain teacher style prediction data corresponding to the teaching content data.
  • the embodiment of the present invention also provides a non-transitory computer-readable storage medium, the non-transitory computer-readable storage medium stores a readable program, and the readable program includes: a high-dimensional feature used for teaching content samples Data, instructions for determining multiple sets of low-dimensional feature data of the teaching content sample; used to obtain the teacher style prediction corresponding to the teaching content sample based on the multiple sets of low-dimensional feature data through the teacher style prediction model to be trained Data instructions; instructions for training the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data.
  • the teacher style prediction model includes a plurality of low-level models and a high-level model connected to the output ends of the plurality of low-level models.
  • the teacher style prediction model used to pass the training is based on the Multiple sets of low-dimensional feature data
  • instructions for obtaining teacher style prediction data corresponding to the teaching content sample include: obtaining the teaching content sample based on the multiple sets of low-dimensional feature data through the multiple low-level models An instruction for corresponding multiple teacher style preliminary prediction data; an instruction for obtaining the teacher style final prediction data corresponding to the teaching content sample based on the multiple teacher style preliminary prediction data through the high-level model.
  • each of the plurality of low-layer models includes a hidden layer and a prediction layer connected to the output terminal of the hidden layer.
  • the method for passing the plurality of low-layer models, Based on the multiple sets of low-dimensional feature data, an instruction to obtain multiple teacher style preliminary prediction data corresponding to the teaching content sample includes: using the hidden layer to perform respective operations on the multiple sets of low-dimensional feature data A feature extraction operation to obtain instructions for feature characterization data corresponding to the multiple sets of low-dimensional feature data; and to perform a mapping operation on feature characterization data corresponding to the multiple sets of low-dimensional feature data through the prediction layer, An instruction to obtain a plurality of preliminary prediction data of teacher style corresponding to the teaching content sample.
  • the instruction for obtaining teacher style final prediction data corresponding to the teaching content sample based on the plurality of teacher style preliminary prediction data through the high-level model includes: Preliminary teacher style prediction data to generate instructions corresponding to the high-level feature characterization data of the high-level model; for obtaining the final prediction data of teacher style corresponding to the teaching content sample based on the high-level feature characterization data through the high-level model instruction.
  • the instruction for generating high-level feature characterization data corresponding to the high-level model based on the plurality of teacher style preliminary prediction data includes: the instruction for generating high-level feature characterization data based on the plurality of teacher style preliminary prediction data and the Multiple sets of low-dimensional feature data correspond to feature characterization data, respectively, to generate instructions for the high-level feature characterization data.
  • the instruction for obtaining the teacher style final prediction data corresponding to the teaching content sample based on the high-level feature characterization data through the high-level model includes: Containing layers, performing feature extraction operations on the high-level feature characterization data to obtain instructions for the feature characterization data corresponding to the high-level feature characterization data; used to characterize the high-level feature data through the prediction layer in the high-level model
  • the corresponding characteristic characterization data performs a mapping operation to obtain an instruction of the final prediction data of teacher style corresponding to the teaching content sample.
  • the instruction for training the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data includes: determining the teacher by a target loss function An instruction for the difference value between the style annotation data and the teacher style final prediction data; an instruction for adjusting the parameters of the multiple low-level models and the high-level model in the teacher style prediction model based on the difference value.
  • the readable program further includes: instructions for determining multiple sets of low-dimensional feature data of the teaching content data based on the high-dimensional feature data of the teaching content data; A predictive model, based on multiple sets of low-dimensional feature data of the teaching content data, obtains instructions for teacher style prediction data corresponding to the teaching content data.
  • the readable program further includes: instructions for performing a mapping operation on the teacher style prediction data to obtain the teacher style category corresponding to the teaching content data.
  • the teacher style prediction model to be trained is based on multiple Group low-dimensional feature data, obtain teacher style prediction data corresponding to the teaching content sample, and then train the teacher style prediction model based on the teacher style annotation data and teacher style prediction data of the teaching content sample.
  • the high-dimensional feature data of the teaching content samples are grouped into groups of low-dimensional feature data, which greatly reduces the dimension of the input features of the teacher style prediction model to be trained, so that the teacher style prediction performance of the trained teacher style prediction model can be effectively To improve.
  • each component/step described in the embodiment of the present invention can be split into more components/steps, or two or more components/steps or partial operations of components/steps can be combined into New components/steps to achieve the purpose of the embodiments of the present invention.
  • the above method according to the embodiments of the present invention can be implemented in hardware, firmware, or implemented as software or computer code that can be stored in a recording medium (such as CD ROM, RAM, floppy disk, hard disk, or magneto-optical disk), or implemented by
  • a recording medium such as CD ROM, RAM, floppy disk, hard disk, or magneto-optical disk
  • the computer code downloaded from the network is originally stored in a remote recording medium or a non-transitory machine-readable medium and will be stored in a local recording medium, so that the method described here can be stored using a general-purpose computer, a dedicated processor or a programmable Or such software processing on a recording medium of dedicated hardware (such as ASIC or FPGA).
  • a computer, processor, microprocessor controller, or programmable hardware includes storage components (for example, RAM, ROM, flash memory, etc.) that can store or receive software or computer code, when the software or computer code is used by the computer, When the processor or hardware is accessed and executed, the training method of the teacher style prediction model described here is implemented.
  • storage components for example, RAM, ROM, flash memory, etc.
  • the training method of the teacher style prediction model described here is implemented.
  • a general-purpose computer accesses the code for implementing the training method of the teacher-style prediction model shown here
  • the execution of the code converts the general-purpose computer into a special-purpose computer for executing the training method of the teacher-style prediction model shown here. computer.

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Operations Research (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Quality & Reliability (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Primary Health Care (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)

Abstract

Provided are a method for training a teaching style prediction model, and a computer storage medium. The method comprises: determining, on the basis of high-dimensional feature data of a teaching content sample, multiple groups of low-dimensional feature data of the teaching content sample; by means of a teaching style prediction model to be trained, acquiring, on the basis of the multiple groups of low-dimensional feature data, teaching style prediction data corresponding to the teaching content sample; and on the basis of teaching style labeling data of the teaching content sample and the teaching style prediction data, training the teaching style prediction model. In the embodiments of the present invention, high-dimensional feature data of a teaching content sample is grouped into multiple groups of low-dimensional feature data, thereby greatly reducing the dimensionality of input features of a teaching style prediction module to be trained, so that the teaching style prediction performance of the trained teaching style prediction module can be effectively improved.

Description

教师风格预测模型的训练方法及计算机存储介质Training method and computer storage medium of teacher's style prediction model
本申请要求于2019年4月23日提交中国专利局、申请号为201910330162.4、发明名称为“教师风格预测模型的训练方法及计算机存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on April 23, 2019, the application number is 201910330162.4, and the invention title is "Training Method for Teacher Style Prediction Model and Computer Storage Medium", the entire content of which is incorporated by reference In this application.
技术领域Technical field
本发明实施例涉及人工智能领域,尤其涉及一种教师风格预测模型的训练方法及计算机存储介质。The embodiment of the present invention relates to the field of artificial intelligence, and in particular to a training method and computer storage medium of a teacher's style prediction model.
背景技术Background technique
教师风格是对教师个体价值的判断,是教育评价的重要内容。对教师的教学风格进行预测,可以使学校教学管理部门和教师本人了解教学情况、发现问题、总结经验并改造工作,以达到最终提高教学质量的目的。因而如何较为公平公正、准确地预测教师风格,一直是教育界探索的一个问题。Teacher style is the judgment of teacher's individual value, and it is an important content of education evaluation. Predicting the teaching style of teachers can enable the school's teaching management department and teachers to understand the teaching situation, discover problems, sum up experience and reform work, so as to achieve the ultimate goal of improving teaching quality. Therefore, how to predict the teacher's style fairly and accurately has always been a problem explored by the education circle.
当前主要采用建模的方式对教师的教学风格进行预测,模型的输入数据可以包括教师的教学音频和视频等教学数据。由于获取不同教师风格的教学数据样本比较困难,教学数据样本的数据量往往较少。此外,在教学数据样本中提取的特征的维度往往较高,因此,在训练模型时容易产生过拟合的问题,无法训练出性能较好的模型。针对教学数据样本的数据量较少及提取的特征维度较高的问题,现有的处理方法大部分是利用主成分分 析技术,对在教学数据样本中提取的高维特征进行降维,然后使用降维后的特征对模型进行训练。然而,这种处理方法不可避免地丧失一部分在教学数据样本中提取的原始特征的特性,无法充分利用提取的原始特征的信息,并且还无法对降维后的特征具体代表的含义进行分析。因此,到目前为止,还没有一种能够有效地提升教师风格预测性能的模型训练方法。Currently, modeling is mainly used to predict the teaching style of teachers. The input data of the model can include teaching audio and video teaching data of teachers. Because it is difficult to obtain teaching data samples of different teacher styles, the amount of teaching data samples is often small. In addition, the dimensionality of the features extracted in the teaching data sample is often high. Therefore, it is prone to overfitting when training the model, and it is impossible to train a model with better performance. Aiming at the problem of the small amount of teaching data samples and the high dimension of the extracted features, most of the existing processing methods use principal component analysis technology to reduce the dimensions of the high-dimensional features extracted from the teaching data samples, and then use The features after dimensionality reduction train the model. However, this processing method inevitably loses part of the characteristics of the original features extracted from the teaching data samples, cannot make full use of the information of the extracted original features, and cannot analyze the specific meaning of the features after dimensionality reduction. Therefore, so far, there is no model training method that can effectively improve the performance of teacher style prediction.
发明内容Summary of the invention
有鉴于此,本发明实施例所解决的技术问题之一在于提供一种教师风格预测模型的训练方法及非瞬时性计算机可读存储介质,用以解决上述问题至少之一。In view of this, one of the technical problems solved by the embodiments of the present invention is to provide a training method of a teacher's style prediction model and a non-transitory computer-readable storage medium to solve at least one of the above-mentioned problems.
本发明实施例提供一种教师风格预测模型的训练方法。所述方法包括:基于教学内容样本的第一特征数据,确定所述教学内容样本的多组第二特征数据,其中所述第一特征数据具有比所述第二特征数据更高的维度;通过待训练的教师风格预测模型,基于所述多组第二特征数据,获得所述教学内容样本对应的教师风格预测数据;基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型。The embodiment of the present invention provides a training method of a teacher's style prediction model. The method includes: determining a plurality of sets of second characteristic data of the teaching content sample based on the first characteristic data of the teaching content sample, wherein the first characteristic data has a higher dimension than the second characteristic data; The teacher style prediction model to be trained obtains the teacher style prediction data corresponding to the teaching content sample based on the multiple sets of second feature data; the teacher style annotation data based on the teaching content sample and the teacher style prediction data, Training the teacher style prediction model.
本发明实施例还提供一种非瞬时性计算机可读存储介质,所述非瞬时性计算机可读存储介质存储有可读程序,所述可读程序包括:用于基于教学内容样本的第一特征数据,确定所述教学内容样本的多组第二特征数据的指令,其中所述第一特征数据具有比所述第二特征数据更高的维度;用于通过待训练的教师风格预测模型,基于所述多组第二特征数据,获得所 述教学内容样本对应的教师风格预测数据的指令;用于基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型的指令。The embodiment of the present invention also provides a non-transitory computer-readable storage medium, the non-transitory computer-readable storage medium stores a readable program, and the readable program includes: a first feature used for teaching content samples Data, instructions for determining multiple sets of second feature data of the teaching content sample, wherein the first feature data has a higher dimension than the second feature data; used to predict the style of the teacher to be trained based on The multiple sets of second feature data are instructions for obtaining teacher style prediction data corresponding to the teaching content sample; used to train the teacher style based on the teacher style annotation data of the teaching content sample and the teacher style prediction data Predictive model instructions.
根据本发明实施例提供的教师风格预测模型的训练方案,基于教学内容样本的第一特征数据,确定教学内容样本的多组第二特征数据,其中第一特征数据具有比第二特征数据更高的维度,并通过待训练的教师风格预测模型,基于多组第二特征数据,获得教学内容样本对应的教师风格预测数据,再基于教学内容样本的教师风格标注数据和教师风格预测数据,训练教师风格预测模型,与现有的其它方式相比,通过将教学内容样本的高维特征数据分组为多组第二特征数据,大降低了待训练的教师风格预测模型的输入特征的维度,从而使得训练得到的教师风格预测模型的教师风格预测性能能够得到有效地提升。According to the training scheme of the teacher's style prediction model provided by the embodiment of the present invention, based on the first characteristic data of the teaching content sample, multiple sets of second characteristic data of the teaching content sample are determined, wherein the first characteristic data has a higher value than the second characteristic data. Based on multiple sets of second feature data, the teacher style prediction data corresponding to the teaching content sample is obtained through the teacher style prediction model to be trained, and then based on the teacher style annotation data and teacher style prediction data of the teaching content sample, the teacher is trained Compared with other existing methods, the style prediction model greatly reduces the dimensions of the input features of the teacher style prediction model to be trained by grouping the high-dimensional feature data of the teaching content samples into multiple sets of second feature data, thereby making The teacher style prediction performance of the trained teacher style prediction model can be effectively improved.
附图说明Description of the drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明实施例中记载的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其他的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some of the embodiments described in the embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained from these drawings.
图1示出了根据本发明实施例一的一种教师风格预测模型的训练方法的步骤流程图;Figure 1 shows a flowchart of the steps of a method for training a teacher's style prediction model according to the first embodiment of the present invention;
图2示出了根据本发明实施例一提供的教师风格预测模型的结构示意 图;Fig. 2 shows a schematic structural diagram of a teacher style prediction model provided according to the first embodiment of the present invention;
图3示出了根据本发明实施例二的一种教师风格预测方法的步骤流程图。Fig. 3 shows a flowchart of steps of a method for predicting teacher style according to the second embodiment of the present invention.
具体实施方式Detailed ways
为了使本领域的人员更好地理解本发明实施例中的技术方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅是本发明实施例一部分实施例,而不是全部的实施例。基于本发明实施例中的实施例,本领域普通技术人员所获得的所有其他实施例,都应当属于本发明实施例保护的范围。In order to enable those skilled in the art to better understand the technical solutions in the embodiments of the present invention, the following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the description The embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments in the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art should fall within the protection scope of the embodiments of the present invention.
下面结合本发明实施例附图进一步说明本发明实施例具体实现。The specific implementation of the embodiments of the present invention will be further described below in conjunction with the accompanying drawings of the embodiments of the present invention.
实施例一Example one
参照图1,示出了根据本发明实施例一的一种教师风格预测模型的训练方法的步骤流程图。1, there is shown a step flow chart of a method for training a teacher's style prediction model according to the first embodiment of the present invention.
具体地,本发明实施例提供的教师风格预测模型的训练方法包括以下步骤:Specifically, the training method of the teacher's style prediction model provided by the embodiment of the present invention includes the following steps:
在步骤S101中,基于教学内容样本的第一特征数据,确定所述教学内容样本的多组第二特征数据。In step S101, based on the first feature data of the teaching content sample, multiple sets of second feature data of the teaching content sample are determined.
在本实施例中,所述教学内容样本可包括作为训练样本的教学内容的音频数据或视频数据。第一特征数据可理解为维度较高的特征向量,例如, 1000维的特征向量,2000维的特征向量等。当所述教学内容样本为作为训练样本的教学内容的音频数据时,所述教学内容样本的高维特征数据可为从所述音频数据中提取的高维的语音声学特征数据,所述语音声学特征数据可包括音频的韵律特征数据、频谱特征数据和音质特征数据等,所述语音声学特征数据具体为语音声学特征向量。在具体的实施方式中,可采用现有的语音声学特征提取算法从所述音频数据中提取得到高维的语音声学特征数据。当所述教学内容样本为作为训练样本的教学内容的视频数据时,所述教学内容样本的高维特征数据可为从所述视频数据中提取的高维的人脸面部特征数据,所述人脸面部特征数据可包括嘴巴区域的特征数据、眼睛区域的特征数据和脸颊区域的特征数据等,所述人脸面部特征数据具体为人脸面部特征向量。在具体的实施方式中,可采用现有的人脸面部特征提取算法从所述视频数据中提取得到高维的人脸面部特征数据。In this embodiment, the teaching content sample may include audio data or video data of the teaching content as the training sample. The first feature data can be understood as a feature vector with a higher dimension, for example, a feature vector of 1000 dimensions, a feature vector of 2000 dimensions, and so on. When the teaching content sample is audio data of the teaching content as a training sample, the high-dimensional feature data of the teaching content sample may be high-dimensional voice acoustic feature data extracted from the audio data, and the voice acoustics The feature data may include prosodic feature data, spectral feature data, and voice quality feature data of the audio, and the voice acoustic feature data is specifically a voice acoustic feature vector. In a specific implementation, an existing speech acoustic feature extraction algorithm may be used to extract high-dimensional speech acoustic feature data from the audio data. When the teaching content sample is video data of teaching content as a training sample, the high-dimensional feature data of the teaching content sample may be high-dimensional facial feature data extracted from the video data, and the person The facial feature data may include the feature data of the mouth region, the feature data of the eye region, the feature data of the cheek region, etc. The facial feature data is specifically a facial feature vector of a human face. In a specific implementation manner, an existing facial feature extraction algorithm can be used to extract high-dimensional facial feature data from the video data.
在本实施例中,在基于教学内容样本的高维特征数据,确定所述教学内容样本的多组低维特征数据时,对所述高维特征数据进行相关性分析,以确定所述高维特征数据的分组;基于所述高维特征数据的分组,对所述高维特征数据进行划分,以获得所述教学内容样本的多组低维特征数据。籍此,大大降低了教师风格预测模型的输入特征的维度。In this embodiment, when multiple sets of low-dimensional feature data of the teaching content sample are determined based on the high-dimensional feature data of the teaching content sample, correlation analysis is performed on the high-dimensional feature data to determine the high-dimensional feature data. Grouping of feature data; based on the grouping of the high-dimensional feature data, the high-dimensional feature data is divided to obtain multiple groups of low-dimensional feature data of the teaching content sample. In this way, the dimension of the input features of the teacher style prediction model is greatly reduced.
具体的,当所述高维特征数据具体为高维的语音声学特征数据时,通过语音声学的先验知识可知,语音声学特征包括韵律特征、频谱特征和音质特征,因此,可基于语音声学特征包括韵律特征、频谱特征和音质特征的先验知识,对高维的语音声学特征数据进行相关性分析,以确定所述高 维的语音声学特征数据的分组。然后,按照所述分组,对高维的语音声学特征数据进行划分,以获得所述教学内容样本的韵律特征数据、频谱特征数据和音质特征数据。简单地说,依据语音声学的先验知识确定语音声学特征包括的特征的种类,再依据语音声学特征包括的特征的种类,对高维的语音声学特征数据进行相关性分析。当所述高维特征数据具体为高维的人脸面部特征数据时,通过人脸面部的先验知识可知,人脸面部包括嘴巴区域、眼睛区域、鼻子区域和脸颊区域,因此,可基于人脸面部包括嘴巴区域、眼睛区域、鼻子区域和脸颊区域的先验知识,对高维的人脸面部特征数据进行相关性分析,以确定所述高维的人脸面部特征数据的分组。然后,按照所述分组,对高维的人脸面部特征数据进行划分,以获得所述教学内容样本的嘴巴区域特征数据、眼睛区域特征数据、鼻子区域特征数据和脸颊区域特征数据。简单地说,依据人脸面部的先验知识确定人脸面部包括的不同区域,再依据人脸面部包括的不同区域,对高维的人脸面部特征数据进行相关性分析。Specifically, when the high-dimensional feature data is specifically high-dimensional voice acoustic feature data, it can be known from the prior knowledge of voice acoustics that the voice acoustic features include prosodic features, spectral features, and voice quality features. Therefore, it can be based on voice acoustic features. Including the prior knowledge of the prosody feature, the frequency spectrum feature, and the voice quality feature, the correlation analysis is performed on the high-dimensional voice acoustic feature data to determine the grouping of the high-dimensional voice acoustic feature data. Then, according to the grouping, the high-dimensional speech acoustic feature data is divided to obtain the prosody feature data, frequency spectrum feature data, and sound quality feature data of the teaching content sample. Simply put, the types of features included in the voice acoustic features are determined based on the prior knowledge of voice acoustics, and then the high-dimensional voice acoustic feature data is correlated based on the types of features included in the voice acoustic features. When the high-dimensional feature data is specifically high-dimensional facial feature data, it can be known from the prior knowledge of the human face that the human face includes the mouth area, the eye area, the nose area, and the cheek area. Therefore, it can be based on the human face. The face includes the prior knowledge of the mouth area, the eye area, the nose area and the cheek area, and the correlation analysis is performed on the high-dimensional facial feature data to determine the grouping of the high-dimensional facial feature data. Then, according to the grouping, the high-dimensional face feature data is divided to obtain the mouth area feature data, the eye area feature data, the nose area feature data, and the cheek area feature data of the teaching content sample. Simply put, the different areas included in the face are determined according to the prior knowledge of the face, and then the correlation analysis of the high-dimensional facial feature data is performed according to the different areas included in the face.
在本实施例中,在基于教学内容样本的高维特征数据,确定所述教学内容样本的多组低维特征数据时,在没有先验知识的情况下,以维度均等的方式对所述教学内容样本的高维特征数据进行划分,以获得所述教学内容样本的多组低维特征数据。举例来说,当所述高维特征数据具体为1000维的特征数据时,可将1000维的高维特征数据均等地分为10组低维特征数据,每组低维特征数据的维度为100维。在具体的实施方式中,具体划分为多少组多少维可以通过实验设置。籍此,大大降低了教师风格预测模 型的输入特征的维度。In this embodiment, when multiple sets of low-dimensional feature data of the teaching content sample are determined based on the high-dimensional feature data of the teaching content sample, without prior knowledge, the teaching is evaluated in an equal-dimensional manner. The high-dimensional feature data of the content sample is divided to obtain multiple sets of low-dimensional feature data of the teaching content sample. For example, when the high-dimensional feature data is specifically 1000-dimensional feature data, the 1000-dimensional high-dimensional feature data can be equally divided into 10 groups of low-dimensional feature data, and the dimension of each group of low-dimensional feature data is 100 dimension. In a specific implementation, how many groups and dimensions are divided into can be set through experiments. In this way, the dimension of the input features of the teacher style prediction model is greatly reduced.
具体的,向系统输入教学内容样本,设置对其中一条样本数据n(设共有N条样本数据,则n=1,2,...,N)的高维特征数据为v n,维度为D,然后通过一定的先验知识对高维特征数据v n进行相关性分析,将高维特征数据v n划分为K组(k=1,2,...,K),每组的维度设为D k,满足 Specifically, input the teaching content sample to the system, set the high-dimensional feature data for one of the sample data n (suppose there are N sample data, then n=1, 2,...,N) as v n and the dimension as D , And then perform correlation analysis on the high-dimensional feature data v n through certain prior knowledge, and divide the high-dimensional feature data v n into K groups (k = 1, 2,..., K), and the dimensionality of each group is set Is D k , which satisfies
Figure PCTCN2020086363-appb-000001
Figure PCTCN2020086363-appb-000001
对于第n条样本数据,原始的高维特征数据为v n,划分后的第k组的低维特征数据为
Figure PCTCN2020086363-appb-000002
Figure PCTCN2020086363-appb-000003
其中,concat(*)表示特征数据依次拼接起来。若没有先验知识可以进行特征的相关性分析,则可均等地划分为K份,同样满足上述关系。
For the nth sample data, the original high-dimensional feature data is v n , and the divided low-dimensional feature data of the kth group is
Figure PCTCN2020086363-appb-000002
then
Figure PCTCN2020086363-appb-000003
Among them, concat (*) means that the feature data are spliced together in turn. If there is no prior knowledge to analyze the correlation of features, it can be equally divided into K parts, which also meets the above relationship.
在步骤S102中,通过待训练的教师风格预测模型,基于所述多组第二特征数据,获得所述教学内容样本对应的教师风格预测数据。In step S102, the teacher style prediction data corresponding to the teaching content sample is obtained based on the multiple sets of second feature data through the teacher style prediction model to be trained.
在本实施例中,所述教师风格预测模型可以是任意适当的可实现特征提取或目标对象检测的神经网络模型,包括但不限于卷积神经网络、增强学习神经网络、对抗神经网络中的生成网络、深度神经网络等等。神经网络中具体结构的设置可以由本领域技术人员根据实际需求适当设定,如卷积层的层数、卷积核的大小、通道数等等。所述教师风格预测数据可为预测的教师风格类别,还可为预测的教师风格数值等。In this embodiment, the teacher style prediction model can be any suitable neural network model that can realize feature extraction or target object detection, including but not limited to convolutional neural network, enhanced learning neural network, and generation in counter neural network. Network, deep neural network, etc. The settings of the specific structure in the neural network can be appropriately set by those skilled in the art according to actual requirements, such as the number of convolution layers, the size of the convolution kernel, the number of channels, and so on. The teacher style prediction data may be a predicted teacher style category, or a predicted teacher style value.
在本实施例中,所述教师风格预测模型包括多个低层模型及与所述多个低层模型的输出端连接的高层模型,所述多个低层模型和所述高层模型 均为深度神经网络模型。在通过待训练的教师风格预测模型,基于所述多组低维特征数据,获得所述教学内容样本对应的教师风格预测数据时,通过所述多个低层模型,基于所述多组低维特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据;通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据。籍此,通过教师风格预测模型包括的多个低层模型,对教学内容样本进行教学风格的初步预测,再通过教师风格预测模型包括的高层模型,基于教学风格的初步预测结果,对教学内容样本进行教学风格的最终预测,能够提高教师风格预测模型对教学内容样本对应的教师风格的预测准确度。In this embodiment, the teacher style prediction model includes multiple low-level models and high-level models connected to the output terminals of the multiple low-level models, and the multiple low-level models and the high-level models are all deep neural network models . When obtaining the teacher style prediction data corresponding to the teaching content sample through the teacher style prediction model to be trained based on the multiple sets of low-dimensional feature data, the multiple low-level models are based on the multiple sets of low-dimensional feature data. Data to obtain a plurality of teacher style preliminary prediction data corresponding to the teaching content sample; through the high-level model, based on the plurality of teacher style preliminary prediction data, obtain the teacher style final prediction data corresponding to the teaching content sample. In this way, through the multiple low-level models included in the teacher style prediction model, preliminary predictions of the teaching style are made on the teaching content samples, and then through the high-level models included in the teacher style prediction model, based on the preliminary prediction results of the teaching style, the teaching content samples are made The final prediction of the teaching style can improve the prediction accuracy of the teacher style prediction model for the teacher style corresponding to the teaching content sample.
在本实施例中,所述多个低层模型中的每个低层模型包括隐含层及与所述隐含层的输出端连接的预测层,所述隐含层具体为全连接层或卷积层,所述预测层具体为全连接层。在通过所述多个低层模型,基于所述多组低维特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据时,通过所述隐含层,对所述多组低维特征数据分别进行特征提取操作,以获得所述多组低维特征数据分别对应的特征表征数据;通过所述预测层,对所述多组低维特征数据分别对应的特征表征数据进行映射操作,以获得所述教学内容样本对应的多个教师风格初步预测数据。其中,所述特征表征数据具体为特征表征向量。籍此,通过隐含层,对多组低维特征数据分别进行特征提取操作,能够对多组低维特征数据分别进行特征再编码,提高多组低维特征数据分别对应的特征表征数据的鲁棒性,进而提高低层模型 对教学内容样本对应的教师风格的初步预测的准确度。In this embodiment, each of the multiple low-layer models includes a hidden layer and a prediction layer connected to the output end of the hidden layer, and the hidden layer is specifically a fully connected layer or a convolutional layer. The prediction layer is specifically a fully connected layer. When obtaining a plurality of preliminary prediction data of teacher style corresponding to the teaching content sample based on the plurality of sets of low-dimensional feature data through the plurality of low-level models, the plurality of sets of low-dimensional The feature data are respectively subjected to feature extraction operations to obtain feature representation data corresponding to the multiple sets of low-dimensional feature data; through the prediction layer, a mapping operation is performed on the feature representation data corresponding to the multiple sets of low-dimensional feature data, To obtain a plurality of preliminary prediction data of teacher style corresponding to the teaching content sample. Wherein, the characteristic characterization data is specifically a characteristic characterization vector. In this way, through the hidden layer, feature extraction operations are performed on multiple sets of low-dimensional feature data, which can re-encode multiple sets of low-dimensional feature data, and improve the robustness of feature representation data corresponding to multiple sets of low-dimensional feature data. It can improve the accuracy of the low-level model's preliminary prediction of the teacher style corresponding to the teaching content sample.
在本实施例中,在通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据时,基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据;通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据。其中,所述高层特征表征数据具体为高层特征表征向量。籍此,基于教师风格初步预测数据,生成对应高层模型的高层特征表征数据,再通过高层模型,基于高层特征表征数据,获得教学内容样本对应的教师风格最终预测数据,能够提高高层模型对教学内容样本对应的教师风格的最终预测的准确度。In this embodiment, when obtaining the teacher style final prediction data corresponding to the teaching content sample through the high-level model based on the plurality of teacher style preliminary prediction data, based on the plurality of teacher style preliminary prediction data, Generate high-level feature representation data corresponding to the high-level model; through the high-level model, based on the high-level feature representation data, obtain final prediction data of teacher style corresponding to the teaching content sample. Wherein, the high-level feature representation data is specifically a high-level feature representation vector. In this way, based on the preliminary prediction data of teacher style, the high-level feature representation data corresponding to the high-level model is generated, and then through the high-level model, based on the high-level feature representation data, the final prediction data of the teacher style corresponding to the teaching content sample is obtained, which can improve the high-level model's ability to teach content The accuracy of the final prediction of the teacher style corresponding to the sample.
在本实施例中,在基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据时,基于所述多个教师风格初步预测数据和所述多组低维特征数据分别对应的特征表征数据,生成所述高层特征表征数据。籍此,基于教师风格初步预测数据和低维特征数据对应的特征表征数据,生成高层特征表征数据,能够提高高层特征表征数据的鲁棒性,进而提高高层模型对教学内容样本对应的教师风格的最终预测的准确度。In this embodiment, when generating high-level feature representation data corresponding to the high-level model based on the plurality of teacher style preliminary prediction data, based on the plurality of teacher style preliminary prediction data and the multiple sets of low-dimensional feature data The corresponding characteristic characterization data are respectively generated to generate the high-level characteristic characterization data. In this way, based on the preliminary prediction data of teacher style and the feature representation data corresponding to the low-dimensional feature data, high-level feature representation data can be generated, which can improve the robustness of the high-level feature representation data, thereby improving the high-level model's ability to respond to the teacher style corresponding to the teaching content sample. The accuracy of the final prediction.
在本实施例中,在通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据时,通过所述高层模型中的隐含层,对所述高层特征表征数据进行特征提取操作,以获得所述高层特征表征数据对应的特征表征数据;通过所述高层模型中的预测层,对所述高层特征表征数据对应的特征表征数据进行映射操作,以获得所述 教学内容样本对应的教师风格最终预测数据。其中,所述隐含层具体为全连接层或卷积层,所述预测层具体为全连接层,所述特征表征数据具体为特征表征向量。籍此,通过隐含层,对高层特征表征数据进行特征提取操作,能够对高层特征表征数据进行特征再编码,提高高层特征表征数据对应的特征表征数据的鲁棒性,进而提高高层模型对教学内容样本对应的教师风格的最终预测的准确度。In this embodiment, when obtaining the teacher style final prediction data corresponding to the teaching content sample through the high-level model and based on the high-level feature characterization data, the hidden layer in the high-level model is used to compare the Perform a feature extraction operation on the high-level feature characterization data to obtain the feature characterization data corresponding to the high-level feature characterization data; through the prediction layer in the high-level model, perform a mapping operation on the feature characterization data corresponding to the high-level feature characterization data to Obtain final prediction data of teacher style corresponding to the teaching content sample. Wherein, the hidden layer is specifically a fully connected layer or a convolutional layer, the prediction layer is specifically a fully connected layer, and the feature characterization data is specifically a feature characterization vector. In this way, through the hidden layer, the feature extraction operation of the high-level feature characterization data can re-encode the high-level feature characterization data, improve the robustness of the feature characterization data corresponding to the high-level feature characterization data, and thereby improve the high-level model's ability to teach The accuracy of the final prediction of the teacher's style corresponding to the content sample.
具体的,如图2所示,所述教师风格预测模型包括多个低层模型及与所述多个低层模型的输出端连接的高层模型。对高维特征数据进行划分后,获得多个特征分组,然后将每个特征分组分别输入到对应的低层模型中。再通过对应的低层模型,基于特征分组,对教学内容样本进行教学风格的初步预测,以获得教学内容样本对应的教师风格初步预测数据。其中,所述低层模型包括多个依次连接的隐含层和与多个依次连接的隐含层中的最后一个隐含层的输出端连接的预测层。再基于多个低层模型输出的教师风格初步预测数据和多个低层模型中最后一层隐含层输出的特征分组的特征表征数据,生成对应高层模型的高层特征表征数据。最后,通过高层模型,基于高层特征表征数据,对教学内容样本进行教学风格的最终预测,以获得教学内容样本对应的教师风格最终预测数据。Specifically, as shown in FIG. 2, the teacher style prediction model includes multiple low-level models and high-level models connected to the output terminals of the multiple low-level models. After dividing the high-dimensional feature data, multiple feature groups are obtained, and then each feature group is input into the corresponding low-level model. Then through the corresponding low-level model, based on the feature grouping, the preliminary prediction of the teaching style of the teaching content sample is made to obtain the preliminary prediction data of the teacher style corresponding to the teaching content sample. Wherein, the low-level model includes a plurality of sequentially connected hidden layers and a prediction layer connected to the output end of the last hidden layer of the plurality of sequentially connected hidden layers. Based on the preliminary prediction data of teacher style output by multiple low-level models and the feature representation data of feature groups output by the last hidden layer of the multiple low-level models, high-level feature representation data corresponding to the high-level model is generated. Finally, through the high-level model, based on the high-level feature representation data, the final prediction of the teaching style of the teaching content sample is made to obtain the final prediction data of the teacher style corresponding to the teaching content sample.
具体的,将高维特征数据划分为K组低维特征数据,每组低维特征数据对应于一个低层模型,则第k个低层模型与第k组低维特征数据
Figure PCTCN2020086363-appb-000004
一一对应。对于第n条样本数据的第k组的低维特征数据
Figure PCTCN2020086363-appb-000005
设低层模型的隐含层层数为L k(l k=1,2,...,L k),第l k个隐含层的隐藏节点维度为
Figure PCTCN2020086363-appb-000006
Specifically, the high-dimensional feature data is divided into K groups of low-dimensional feature data, and each group of low-dimensional feature data corresponds to a low-level model, then the k-th low-level model and the k-th group of low-dimensional feature data
Figure PCTCN2020086363-appb-000004
One to one correspondence. Low-dimensional feature data of the kth group for the nth sample data
Figure PCTCN2020086363-appb-000005
Suppose the number of hidden layers of the low-level model is L k (l k =1, 2,..., L k ), and the hidden node dimension of the l kth hidden layer is
Figure PCTCN2020086363-appb-000006
当l k=1时, When l k =1,
Figure PCTCN2020086363-appb-000007
Figure PCTCN2020086363-appb-000007
其中,
Figure PCTCN2020086363-appb-000008
是第k组低层模型的第一个隐含层的权重矩阵,维度为
Figure PCTCN2020086363-appb-000009
是第k组低层模型的第一个隐含层的偏置向量,维度为
Figure PCTCN2020086363-appb-000010
指示根据该层权重矩阵和偏置向量计算出的结果;f(*)为非线性函数,通常为sigmoid函数;
Figure PCTCN2020086363-appb-000011
是第k组低层模型针对第n条样本数据的第一个隐含层隐向量表示,维度为
Figure PCTCN2020086363-appb-000012
among them,
Figure PCTCN2020086363-appb-000008
Is the weight matrix of the first hidden layer of the kth group of low-level models, with dimensions
Figure PCTCN2020086363-appb-000009
Is the bias vector of the first hidden layer of the kth group of low-level models, with the dimension
Figure PCTCN2020086363-appb-000010
Indicates the result calculated according to the weight matrix and bias vector of this layer; f(*) is a non-linear function, usually a sigmoid function;
Figure PCTCN2020086363-appb-000011
Is the first hidden vector representation of the nth sample data of the kth group of low-level models, and the dimension is
Figure PCTCN2020086363-appb-000012
当1<l k<L k时, When 1<l k <L k ,
Figure PCTCN2020086363-appb-000013
Figure PCTCN2020086363-appb-000013
其中,
Figure PCTCN2020086363-appb-000014
是第k组低层模型第l k个隐含层的权重矩阵,维度为
Figure PCTCN2020086363-appb-000015
是第k组低层模型第l k个隐含层的偏置向量,维度为
Figure PCTCN2020086363-appb-000016
指示根据该层权重矩阵和偏置向量计算出的结果;
Figure PCTCN2020086363-appb-000017
是第k组低层模型针对第n条数据的第l k个隐含层隐向量表示,维度为
Figure PCTCN2020086363-appb-000018
among them,
Figure PCTCN2020086363-appb-000014
Is the weight matrix of the l kth hidden layer of the kth group of low-level models, the dimension is
Figure PCTCN2020086363-appb-000015
Is the bias vector of the l kth hidden layer of the kth group of low-level models, and the dimension is
Figure PCTCN2020086363-appb-000016
Indicates the result calculated according to the weight matrix and bias vector of the layer;
Figure PCTCN2020086363-appb-000017
Is the latent vector representation of the l kth hidden layer of the kth group of low-level models for the nth data, and the dimension is
Figure PCTCN2020086363-appb-000018
当l k=L k时, When l k =L k ,
Figure PCTCN2020086363-appb-000019
Figure PCTCN2020086363-appb-000019
其中,
Figure PCTCN2020086363-appb-000020
是第k组低层模型第L k个隐含层的权重矩阵,维 度为
Figure PCTCN2020086363-appb-000021
是第k组低层模型第L k个隐含层的偏置向量,维度为
Figure PCTCN2020086363-appb-000022
指示根据该层权重矩阵和偏置向量计算出的结果;
Figure PCTCN2020086363-appb-000023
是第k组低层模型针对第n条样本数据的第L k个隐含层隐向量表示,维度为
Figure PCTCN2020086363-appb-000024
among them,
Figure PCTCN2020086363-appb-000020
Is the weight matrix of the L k hidden layer of the kth group of low-level models, with dimensions
Figure PCTCN2020086363-appb-000021
Is the bias vector of the L k hidden layer of the kth group of low-level models, with the dimension
Figure PCTCN2020086363-appb-000022
Indicates the result calculated according to the weight matrix and bias vector of the layer;
Figure PCTCN2020086363-appb-000023
Is the L kth hidden layer latent vector representation of the kth group of low-level models for the nth sample data, the dimension is
Figure PCTCN2020086363-appb-000024
第k组低层模型的隐含层的输出为
Figure PCTCN2020086363-appb-000025
作为第k组低层模型的预测层的输入:
The output of the hidden layer of the k-th low-level model is
Figure PCTCN2020086363-appb-000025
As the input of the prediction layer of the k-th group of low-level models:
Figure PCTCN2020086363-appb-000026
Figure PCTCN2020086363-appb-000026
其中,
Figure PCTCN2020086363-appb-000027
是第k组低层模型的预测层的权重矩阵,维度为
Figure PCTCN2020086363-appb-000028
是第k组低层模型的预测层的偏置向量,维度为1;
Figure PCTCN2020086363-appb-000029
是第k组低层模型针对第n条样本数据的教师风格初步预测数据,维度为1,是在0-1之间的实值。
among them,
Figure PCTCN2020086363-appb-000027
Is the weight matrix of the prediction layer of the kth group of low-level models, with the dimension
Figure PCTCN2020086363-appb-000028
Is the bias vector of the prediction layer of the k-th group of low-level models, with a dimension of 1;
Figure PCTCN2020086363-appb-000029
It is the preliminary prediction data of teacher style of the kth group of low-level models for the nth sample data. The dimension is 1, which is a real value between 0-1.
组合各个低层模型的最后隐含层的隐向量表示以及教师风格初步预测数据,得到高层特征表征数据。则高层特征表征数据为:Combine the hidden vector representation of the last hidden layer of each low-level model and preliminary prediction data of teacher style to obtain high-level feature representation data. Then the high-level feature representation data is:
Figure PCTCN2020086363-appb-000030
Figure PCTCN2020086363-appb-000030
其中,h n的维度为
Figure PCTCN2020086363-appb-000031
组合各个低层模型的教师风格初步预测数据并加入最后隐含层的隐向量表示可以获取更多的信息,使高层模型能够预测更准确。
Among them, the dimension of h n is
Figure PCTCN2020086363-appb-000031
Combining the preliminary prediction data of teacher style of each low-level model and adding the hidden vector representation of the last hidden layer can obtain more information, so that the high-level model can predict more accurately.
高层特征表征数据作为高层模型的输入进行最终预测,所述高层模型包括多个依次连接的隐含层和与多个依次连接的隐含层中的最后一个隐含 层的输出端连接的预测层。设高层模型的隐含层的层数为L,其第l个隐含层的隐藏节点维度为D lThe high-level feature representation data is used as the input of the high-level model for final prediction. The high-level model includes a plurality of sequentially connected hidden layers and a prediction layer connected to the output end of the last hidden layer of the plurality of sequentially connected hidden layers . Suppose the number of hidden layers of the high-level model is L, and the hidden node dimension of the l-th hidden layer is D l .
当l=1时,When l=1,
y 1n=W 1h n+b 1;g 1n=f(y 1n); y 1n = W 1 h n + b 1 ; g 1n = f(y 1n );
其中,
Figure PCTCN2020086363-appb-000032
是高层模型第一个隐含层的权重矩阵,维度为
Figure PCTCN2020086363-appb-000033
是高层模型第一个隐含层的偏置向量,维度为D 1;y 1n指示根据该层权重矩阵和偏置向量计算出的结果;f(*)为非线性函数,通常为sigmoid函数;g 1n是高层模型针对第n条样本数据的第一个隐含层隐向量表示,维度为D 1×1。
among them,
Figure PCTCN2020086363-appb-000032
Is the weight matrix of the first hidden layer of the high-level model, the dimension is
Figure PCTCN2020086363-appb-000033
Is the offset vector of the first hidden layer of the high-level model, with the dimension D 1 ; y 1n indicates the result calculated according to the weight matrix and offset vector of the layer; f(*) is a nonlinear function, usually a sigmoid function; g 1n is the first hidden vector representation of the n-th sample data of the high-level model, and the dimension is D 1 ×1.
当1<l<L时,When 1<l<L,
y ln=W lg (l-1)n+b l
Figure PCTCN2020086363-appb-000034
y ln = W l g (l-1)n + b l ;
Figure PCTCN2020086363-appb-000034
其中,
Figure PCTCN2020086363-appb-000035
是高层模型第l个隐含层的权重矩阵,维度为D l×D l-1
Figure PCTCN2020086363-appb-000036
是高层模型第l个隐含层的偏置向量,维度为D l;y ln指示根据该层权重矩阵和偏置向量计算出的结果;g ln是高层模型针对第n条样本数据的第l个隐含层隐向量表示,维度为D l×1。
among them,
Figure PCTCN2020086363-appb-000035
Is the weight matrix of the l hidden layer of the high-level model, with a dimension of D l × D l-1 ;
Figure PCTCN2020086363-appb-000036
Is the offset vector of the l hidden layer of the high-level model, with the dimension D l ; y ln indicates the result calculated according to the weight matrix and offset vector of the layer; g ln is the l th of the high-level model for the nth sample data Hidden vector representation of a hidden layer, the dimension is D l ×1.
当l k=L k时, When l k =L k ,
y Ln=W Lg (L-1)n+b L;h Ln=f(y Ln); y Ln = W L g (L-1)n + b L ; h Ln = f(y Ln );
其中,
Figure PCTCN2020086363-appb-000037
是高层模型第L个隐含层的权重矩阵,维度为D L×D L-1
Figure PCTCN2020086363-appb-000038
是高层模型第L个隐含层的偏置向量,维度为D L;y ln指示根据该层权重矩阵和偏置向量计算出的结果;h Ln是高层模型针对第n条样本数据的第L个隐含层隐向量表示,维度为D L×1。
among them,
Figure PCTCN2020086363-appb-000037
Is the weight matrix of the Lth hidden layer of the high-level model, with the dimension D L ×D L-1 ;
Figure PCTCN2020086363-appb-000038
Is the offset vector of the Lth hidden layer of the high-level model, with a dimension of D L ; y ln indicates the result calculated according to the weight matrix and offset vector of the layer; h Ln is the L-th value of the high-level model for the nth sample data Hidden vector representation of a hidden layer, the dimension is D L ×1.
高层模型隐含层的输出为h Ln作为高层模型预测层的输入: The output of the hidden layer of the high-level model is h Ln as the input of the prediction layer of the high-level model:
s n=Wh Ln+b s n =Wh Ln +b
其中,
Figure PCTCN2020086363-appb-000039
是高层模型预测层的权重矩阵,维度为1×D L
Figure PCTCN2020086363-appb-000040
是高层模型预测层的偏置向量,维度为1;s n是高层模型针对第n条样本数据的教师风格最终预测数据,维度为1,是在0-1之间的实值。
among them,
Figure PCTCN2020086363-appb-000039
Is the weight matrix of the prediction layer of the high-level model, with a dimension of 1×D L ;
Figure PCTCN2020086363-appb-000040
Is the bias vector of the prediction layer of the high-level model, with a dimension of 1; s n is the final teacher-style prediction data of the high-level model for the nth sample data, with a dimension of 1, which is a real value between 0-1.
由上述描述可知,在具体的实施方式中,低层模型和高层模型的结构是类似的,之所以使用低层模型和高层模型,是因为通过低层模型,对教学内容样本进行教学风格的初步预测,再通过高层模型,基于低层模型的教学风格初步预测结果,对教学内容样本进行教学风格的最终预测,能够提高教师风格预测模型对教学内容样本对应的教师风格的预测准确度。此外,由于教学内容样本的数据量较少,并且教学内容样本的特征数据的维度过高,直接使用一个模型(如只使用一个底层模型)来进行建模,会造成“维度灾难”,训练得到的模型只适用于训练数据,在测试数据上不能得到很好的性能,这会造成过拟合影响。It can be seen from the above description that in the specific implementation, the structure of the low-level model and the high-level model are similar. The reason why the low-level model and the high-level model are used is because the low-level model is used to make preliminary predictions of the teaching style of the teaching content samples. Through the high-level model, based on the preliminary prediction results of the teaching style of the low-level model, the final prediction of the teaching style of the teaching content sample can improve the accuracy of the teacher style prediction model for the teacher style corresponding to the teaching content sample. In addition, due to the small amount of data in the teaching content sample, and the high dimensionality of the feature data of the teaching content sample, directly using a model (such as using only one underlying model) for modeling will cause a "dimension disaster". The model is only applicable to training data, and can not get good performance on test data, which will cause overfitting.
在步骤S103中,基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型。In step S103, the teacher style prediction model is trained based on the teacher style annotation data of the teaching content sample and the teacher style prediction data.
在本实施例中,所述教师风格标注数据可理解为所述教学内容样本的教师风格真实数据。In this embodiment, the teacher style annotation data can be understood as the actual teacher style data of the teaching content sample.
在本实施例中,在基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型时,通过目标损失函数,确定所述教师风格标注数据和所述教师风格预测数据之间的差异值;基于所述差异值,调整所述教师风格预测模型的参数。In this embodiment, when training the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data, the target loss function is used to determine the teacher style annotation data and the teacher style prediction data. The difference value between the teacher style prediction data; based on the difference value, the parameters of the teacher style prediction model are adjusted.
在本实施例中,在通过目标损失函数,确定所述教师风格标注数据和所述教师风格预测数据之间的差异值时,通过目标损失函数,确定所述教师风格标注数据和所述教师风格最终预测数据之间的差异值。在基于所述差异值,调整所述教师风格预测模型的参数时,基于所述差异值,调整所述教师风格预测模型中的所述多个低层模型和所述高层模型的参数。In this embodiment, when the difference value between the teacher style annotation data and the teacher style prediction data is determined through the objective loss function, the teacher style annotation data and the teacher style are determined through the objective loss function The difference between the final predicted data. When adjusting the parameters of the teacher style prediction model based on the difference value, adjust the parameters of the multiple low-level models and the high-level model in the teacher style prediction model based on the difference value.
在本实施例中,所述目标损失函数包括均方误差项和L2正则化项。籍此,能够防止教师风格预测模型的训练过程受到过拟合影响。In this embodiment, the objective loss function includes a mean square error term and an L2 regularization term. In this way, the training process of the teacher's style prediction model can be prevented from being affected by overfitting.
具体的,给定第n条样本数据的高维特征数据v n,可以通过低层模型和高层模型计算,最终从高层模型的预测层得到教师风格预测数据s n。设第n条样本数据的教师风格真实数据为s′ n,训练教师风格预测模型,使得s n与s′ n尽可能接近。在训练的过程中,选用以下函数作为训练教师风格预测模型的损失函数: Specifically, given the high-dimensional feature data v n of the nth sample data, it can be calculated by the low-level model and the high-level model, and finally the teacher style prediction data s n is obtained from the prediction layer of the high-level model. Suppose the real teacher style data of the nth sample data is s′ n , and train the teacher style prediction model to make s n and s′ n as close as possible. In the training process, the following function is selected as the loss function for training the teacher's style prediction model:
Figure PCTCN2020086363-appb-000041
Figure PCTCN2020086363-appb-000041
其中,s n是第n条样本数据的教师风格真实数据,
Figure PCTCN2020086363-appb-000042
是针对第n条样本数据的第k个低层模型的教师风格初步预测数据,s′ n是针对第n条样本数据 的高层模型的教师风格最终预测数据,
Figure PCTCN2020086363-appb-000043
是低层模型隐含层的权重矩阵,W k是低层模型预测层的权重矩阵,W l是高层模型隐含层的权重矩阵,W是高层模型预测层的权重矩阵,λ是权重衰减项,取值在0到1之间。上式的第一项和第二项计算均方误差,后四项加入L2正则化防止教师风格预测模型过拟合。
Among them, s n is the real teacher style data of the nth sample data,
Figure PCTCN2020086363-appb-000042
Is the preliminary teacher style prediction data of the kth low-level model for the nth sample data, s′ n is the final teacher style prediction data of the high-level model for the nth sample data,
Figure PCTCN2020086363-appb-000043
Is the weight matrix of the hidden layer of the low-level model, W k is the weight matrix of the prediction layer of the low-level model, W l is the weight matrix of the hidden layer of the high-level model, W is the weight matrix of the high-level model prediction layer, and λ is the weight attenuation term. The value is between 0 and 1. The first and second terms of the above formula calculate the mean square error, and the last four terms are added with L2 regularization to prevent the teacher style prediction model from overfitting.
教师风格预测模型的训练是将低层模型和高层模型联合起来统一进行训练,通过目标损失函数来整体优化教师风格预测模型。整个模型使用最小化目标损失函数进行训练,即训练得到教师风格预测模型的参数(
Figure PCTCN2020086363-appb-000044
W k,W l,W,
Figure PCTCN2020086363-appb-000045
b k,b l,b)。
The training of the teacher's style prediction model is to combine the low-level model and the high-level model for unified training, and optimize the teacher's style prediction model as a whole through the objective loss function. The entire model is trained using the minimized objective loss function, that is, the parameters of the teacher style prediction model (
Figure PCTCN2020086363-appb-000044
W k , W l , W,
Figure PCTCN2020086363-appb-000045
b k , b l , b).
具体地,通过确定所述教师风格标注数据和所述教师风格最终预测数据之间的差异值,对当前获得的教师风格最终预测数据进行评估,以作为后续训练所述教师风格预测模型的依据。具体地,可将所述差异值反向传输给所述教师风格预测模型,从而迭代地训练所述教师风格预测模型。所述教师风格预测模型的训练是一个迭代的过程,本申请实施例仅对其中的一次训练过程进行了说明,但本领域技术人员应当明了,对所述教师风格预测模型的每次训练都可采用该训练方式,直至完成所述教师风格预测模型的训练。Specifically, by determining the difference value between the teacher style annotation data and the teacher style final prediction data, the currently obtained teacher style final prediction data is evaluated as a basis for subsequent training of the teacher style prediction model. Specifically, the difference value may be transmitted back to the teacher style prediction model, so as to train the teacher style prediction model iteratively. The training of the teacher style prediction model is an iterative process. The embodiments of this application only describe one training process, but those skilled in the art should understand that each training of the teacher style prediction model can be performed. This training method is adopted until the training of the teacher style prediction model is completed.
通过本申请实施例提供的教师风格预测模型的训练方法,基于教学内容样本的高维特征数据,确定教学内容样本的多组低维特征数据,并通过待训练的教师风格预测模型,基于多组低维特征数据,获得教学内容样本对应的教师风格预测数据,再基于教学内容样本的教师风格标注数据和教 师风格预测数据,训练教师风格预测模型,与现有的其它方式相比,通过将教学内容样本的高维特征数据分组为多组低维特征数据,大大降低了待训练的教师风格预测模型的输入特征的维度,从而使得训练得到的教师风格预测模型的教师风格预测性能能够得到有效地提升。Through the training method of the teacher style prediction model provided by the embodiments of the application, based on the high-dimensional feature data of the teaching content sample, multiple sets of low-dimensional feature data of the teaching content sample are determined, and the teacher style prediction model to be trained is based on multiple sets of With low-dimensional feature data, the teacher style prediction data corresponding to the teaching content sample is obtained, and then based on the teacher style annotation data and the teacher style prediction data of the teaching content sample, the teacher style prediction model is trained. Compared with other existing methods, the teaching style The high-dimensional feature data of the content samples are grouped into multiple sets of low-dimensional feature data, which greatly reduces the dimension of the input features of the teacher style prediction model to be trained, so that the teacher style prediction performance of the trained teacher style prediction model can be effectively Promote.
实施例二Example two
参照图3,示出了根据本发明实施例二的一种教师风格预测方法的步骤流程图。Referring to Fig. 3, a flowchart of the steps of a method for predicting teacher style according to the second embodiment of the present invention is shown.
具体地,本发明实施例提供的教师风格预测方法包括以下步骤:Specifically, the teacher style prediction method provided by the embodiment of the present invention includes the following steps:
在步骤S201中,基于教学内容数据的第一特征数据,确定所述教学内容数据的多组第二特征数据。In step S201, based on the first feature data of the teaching content data, multiple sets of second feature data of the teaching content data are determined.
在本实施例中,所述教学内容数据可包括教学内容的音频数据或视频数据。当所述教学内容数据为教学内容的音频数据时,所述教学内容数据的第一特征数据可为从所述音频数据中提取的高维的语音声学特征数据。当所述教学内容数据为教学内容的视频数据时,所述教学内容数据的第一特征数据可为从所述视频数据中提取的高维的人脸面部特征数据。In this embodiment, the teaching content data may include audio data or video data of the teaching content. When the teaching content data is audio data of the teaching content, the first feature data of the teaching content data may be high-dimensional voice acoustic feature data extracted from the audio data. When the teaching content data is video data of the teaching content, the first feature data of the teaching content data may be high-dimensional facial feature data extracted from the video data.
在本实施例中,步骤S201的具体实施方式与上述步骤S101的具体实施方式类似,在此不再赘述。In this embodiment, the specific implementation of step S201 is similar to the specific implementation of step S101 described above, and will not be repeated here.
在步骤S202中,通过训练后的教师风格预测模型,基于所述教学内容数据的多组第二特征数据,获得所述教学内容数据对应的教师风格预测数据。In step S202, the teacher style prediction data corresponding to the teaching content data is obtained based on the multiple sets of second feature data of the teaching content data through the trained teacher style prediction model.
在本实施例中,在通过实施例一训练得到的教师风格预测模型,基于多组低维特征数据,获得所述教学内容数据对应的教师风格预测数据时,通过所述多个低层模型,基于所述多组低维特征数据,获得所述教学内容数据对应的多个教师风格初步预测数据;通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容数据对应的教师风格最终预测数据。籍此,通过实施例一训练得到的教师风格预测模型包括的多个低层模型,对教学内容数据进行教学风格的初步预测,再通过实施例一训练得到的教师风格预测模型包括的高层模型,基于教学风格的初步预测结果,对教学内容数据进行教学风格的最终预测,能够提高教师风格预测模型对教学内容数据对应的教师风格的预测准确度。In this embodiment, when the teacher style prediction model obtained through training in Embodiment 1 is based on multiple sets of low-dimensional feature data to obtain the teacher style prediction data corresponding to the teaching content data, the multiple low-level models are based on The multiple sets of low-dimensional feature data obtain multiple teacher style preliminary prediction data corresponding to the teaching content data; through the high-level model, based on the multiple teacher style preliminary prediction data, obtain the corresponding teaching content data Teacher style final prediction data. In this way, the teacher style prediction model trained in the first embodiment includes multiple low-level models, and the teaching content data is preliminarily predicted for the teaching style, and then the high-level model included in the teacher style prediction model trained in the first embodiment is based on The preliminary prediction result of the teaching style and the final prediction of the teaching style on the teaching content data can improve the accuracy of the teacher style prediction model for predicting the teacher style corresponding to the teaching content data.
在本实施例中,在通过所述多个低层模型,基于所述多组低维特征数据,获得所述教学内容数据对应的多个教师风格初步预测数据时,通过所述隐含层,对所述多组低维特征数据分别进行特征提取操作,以获得所述多组低维特征数据分别对应的特征表征数据;通过所述预测层,对所述多组低维特征数据分别对应的特征表征数据进行映射操作,以获得所述教学内容数据对应的多个教师风格初步预测数据。其中,所述特征表征数据具体为特征表征向量。籍此,通过隐含层,对多组低维特征数据分别进行特征提取操作,能够对多组低维特征数据分别进行特征再编码,提高多组低维特征数据分别对应的特征表征数据的鲁棒性,进而提高低层模型对教学内容数据对应的教师风格的初步预测的准确度。In this embodiment, when the plurality of low-level models are used to obtain the plurality of teacher style preliminary prediction data corresponding to the teaching content data based on the plurality of sets of low-dimensional feature data, the hidden layer The multiple sets of low-dimensional feature data are respectively subjected to feature extraction operations to obtain feature representation data corresponding to the multiple sets of low-dimensional feature data; through the prediction layer, features corresponding to the multiple sets of low-dimensional feature data The characterization data is subjected to a mapping operation to obtain a plurality of preliminary prediction data of teacher style corresponding to the teaching content data. Wherein, the characteristic characterization data is specifically a characteristic characterization vector. In this way, through the hidden layer, feature extraction operations are performed on multiple sets of low-dimensional feature data, which can re-encode multiple sets of low-dimensional feature data, and improve the robustness of feature representation data corresponding to multiple sets of low-dimensional feature data. It can improve the accuracy of the preliminary prediction of the teacher’s style corresponding to the teaching content data by the low-level model.
在本实施例中,在通过所述高层模型,基于所述多个教师风格初步预 测数据,获得所述教学内容数据对应的教师风格最终预测数据时,基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据;通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容数据对应的教师风格最终预测数据。其中,所述高层特征表征数据具体为高层特征表征向量。籍此,基于教师风格初步预测数据,生成对应高层模型的高层特征表征数据,再通过高层模型,基于高层特征表征数据,获得教学内容数据对应的教师风格最终预测数据,能够提高高层模型对教学内容数据对应的教师风格的最终预测的准确度。In this embodiment, when obtaining the teacher style final prediction data corresponding to the teaching content data through the high-level model based on the plurality of teacher style preliminary prediction data, based on the plurality of teacher style preliminary prediction data, Generate high-level feature representation data corresponding to the high-level model; through the high-level model, based on the high-level feature representation data, obtain final prediction data of teacher style corresponding to the teaching content data. Wherein, the high-level feature representation data is specifically a high-level feature representation vector. In this way, based on the preliminary prediction data of teacher style, the high-level feature representation data corresponding to the high-level model is generated, and then through the high-level model, based on the high-level feature representation data, the final prediction data of the teacher style corresponding to the teaching content data is obtained, which can improve the high-level model's impact on the teaching content The accuracy of the final prediction of teacher style corresponding to the data.
在本实施例中,在基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据时,基于所述多个教师风格初步预测数据和所述多组低维特征数据分别对应的特征表征数据,生成所述高层特征表征数据。籍此,基于教师风格初步预测数据和低维特征数据对应的特征表征数据,生成高层特征表征数据,能够提高高层特征表征数据的鲁棒性,进而提高高层模型对教学内容数据对应的教师风格的最终预测的准确度。In this embodiment, when generating high-level feature representation data corresponding to the high-level model based on the plurality of teacher style preliminary prediction data, based on the plurality of teacher style preliminary prediction data and the multiple sets of low-dimensional feature data The corresponding characteristic characterization data are respectively generated to generate the high-level characteristic characterization data. In this way, based on the preliminary prediction data of teacher style and the feature representation data corresponding to the low-dimensional feature data, high-level feature representation data can be generated, which can improve the robustness of the high-level feature representation data, thereby improving the high-level model's ability to respond to the teacher style corresponding to the teaching content data. The accuracy of the final prediction.
在本实施例中,在通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容数据对应的教师风格最终预测数据时,通过所述高层模型中的隐含层,对所述高层特征表征数据进行特征提取操作,以获得所述高层特征表征数据对应的特征表征数据;通过所述高层模型中的预测层,对所述高层特征表征数据对应的特征表征数据进行映射操作,以获得所述教学内容数据对应的教师风格最终预测数据。籍此,通过隐含层,对高层特征表征数据进行特征提取操作,能够对高层特征表征数据进行特征再编 码,提高高层特征表征数据对应的特征表征数据的鲁棒性,进而提高高层模型对教学内容数据对应的教师风格的最终预测的准确度。In this embodiment, when obtaining the final prediction data of teacher style corresponding to the teaching content data through the high-level model based on the high-level feature characterization data, the hidden layer in the high-level model is used to compare the Perform a feature extraction operation on the high-level feature characterization data to obtain the feature characterization data corresponding to the high-level feature characterization data; through the prediction layer in the high-level model, perform a mapping operation on the feature characterization data corresponding to the high-level feature characterization data to Obtain final prediction data of teacher style corresponding to the teaching content data. In this way, through the hidden layer, the feature extraction operation of the high-level feature characterization data can re-encode the high-level feature characterization data, improve the robustness of the feature characterization data corresponding to the high-level feature characterization data, and thereby improve the high-level model's ability to teach The accuracy of the final prediction of the teacher style corresponding to the content data.
在本实施例中,所述方法还包括:对所述教师风格预测数据进行映射操作,以获得所述教学内容数据对应的教师风格类别。籍此,能够获得教学内容数据对应的教师风格类别。In this embodiment, the method further includes: performing a mapping operation on the teacher style prediction data to obtain the teacher style category corresponding to the teaching content data. In this way, the teacher style category corresponding to the teaching content data can be obtained.
具体的,基于教师风格预测数据,在预先构建好的教师风格语义空间中进行映射操作,以获得所述教学内容数据对应的教师风格类别。其中,所述教师风格语义空间可理解为教师风格预测数据与教师风格类别之间的映射空间。Specifically, based on the teacher style prediction data, a mapping operation is performed in the pre-built teacher style semantic space to obtain the teacher style category corresponding to the teaching content data. Wherein, the teacher style semantic space can be understood as a mapping space between teacher style prediction data and teacher style categories.
通过本申请实施例提供的教师风格预测方法,基于教学内容数据的高维特征数据,确定教学内容数据的多组低维特征数据,再通过实施例一训练得到的教师风格预测模型,基于教学内容数据的多组低维特征数据,获得教学内容数据对应的教师风格预测数据,与现有的其它方式相比,通过将教学内容数据的高维特征数据分组为多组低维特征数据,大大降低了训练得到的教师风格预测模型的输入特征的维度,从而有效地提升了教师风格预测模型的教师风格预测性能。Through the teacher style prediction method provided in the embodiments of this application, based on the high-dimensional feature data of the teaching content data, multiple sets of low-dimensional feature data of the teaching content data are determined, and then the teacher style prediction model obtained through training in Embodiment 1 is based on the teaching content Multiple sets of low-dimensional feature data of the data to obtain teacher style prediction data corresponding to the teaching content data. Compared with other existing methods, by grouping the high-dimensional feature data of the teaching content data into multiple sets of low-dimensional feature data, it greatly reduces The dimension of the input features of the trained teacher style prediction model is improved, thereby effectively improving the teacher style prediction performance of the teacher style prediction model.
实施例三Example three
本发明实施例还提供一种非瞬时性计算机可读存储介质,所述非瞬时性计算机可读存储介质存储有可读程序,所述可读程序包括:用于基于教学内容样本的高维特征数据,确定所述教学内容样本的多组低维特征数据 的指令;用于通过待训练的教师风格预测模型,基于所述多组低维特征数据,获得所述教学内容样本对应的教师风格预测数据的指令;用于基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型的指令。The embodiment of the present invention also provides a non-transitory computer-readable storage medium, the non-transitory computer-readable storage medium stores a readable program, and the readable program includes: a high-dimensional feature used for teaching content samples Data, instructions for determining multiple sets of low-dimensional feature data of the teaching content sample; used to obtain the teacher style prediction corresponding to the teaching content sample based on the multiple sets of low-dimensional feature data through the teacher style prediction model to be trained Data instructions; instructions for training the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data.
可选地,所述教师风格预测模型包括多个低层模型及与所述多个低层模型的输出端连接的高层模型,对应地,所述用于通过待训练的教师风格预测模型,基于所述多组低维特征数据,获得所述教学内容样本对应的教师风格预测数据的指令,包括:用于通过所述多个低层模型,基于所述多组低维特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据的指令;用于通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据的指令。Optionally, the teacher style prediction model includes a plurality of low-level models and a high-level model connected to the output ends of the plurality of low-level models. Correspondingly, the teacher style prediction model used to pass the training is based on the Multiple sets of low-dimensional feature data, and instructions for obtaining teacher style prediction data corresponding to the teaching content sample include: obtaining the teaching content sample based on the multiple sets of low-dimensional feature data through the multiple low-level models An instruction for corresponding multiple teacher style preliminary prediction data; an instruction for obtaining the teacher style final prediction data corresponding to the teaching content sample based on the multiple teacher style preliminary prediction data through the high-level model.
可选地,所述多个低层模型中的每个低层模型包括隐含层及与所述隐含层的输出端连接的预测层,对应地,所述用于通过所述多个低层模型,基于所述多组低维特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据的指令,包括:用于通过所述隐含层,对所述多组低维特征数据分别进行特征提取操作,以获得所述多组低维特征数据分别对应的特征表征数据的指令;用于通过所述预测层,对所述多组低维特征数据分别对应的特征表征数据进行映射操作,以获得所述教学内容样本对应的多个教师风格初步预测数据的指令。Optionally, each of the plurality of low-layer models includes a hidden layer and a prediction layer connected to the output terminal of the hidden layer. Correspondingly, the method for passing the plurality of low-layer models, Based on the multiple sets of low-dimensional feature data, an instruction to obtain multiple teacher style preliminary prediction data corresponding to the teaching content sample includes: using the hidden layer to perform respective operations on the multiple sets of low-dimensional feature data A feature extraction operation to obtain instructions for feature characterization data corresponding to the multiple sets of low-dimensional feature data; and to perform a mapping operation on feature characterization data corresponding to the multiple sets of low-dimensional feature data through the prediction layer, An instruction to obtain a plurality of preliminary prediction data of teacher style corresponding to the teaching content sample.
可选地,所述用于通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据的指令,包 括:用于基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据的指令;用于通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据的指令。Optionally, the instruction for obtaining teacher style final prediction data corresponding to the teaching content sample based on the plurality of teacher style preliminary prediction data through the high-level model includes: Preliminary teacher style prediction data to generate instructions corresponding to the high-level feature characterization data of the high-level model; for obtaining the final prediction data of teacher style corresponding to the teaching content sample based on the high-level feature characterization data through the high-level model instruction.
可选地,所述用于基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据的指令,包括:用于基于所述多个教师风格初步预测数据和所述多组低维特征数据分别对应的特征表征数据,生成所述高层特征表征数据的指令。Optionally, the instruction for generating high-level feature characterization data corresponding to the high-level model based on the plurality of teacher style preliminary prediction data includes: the instruction for generating high-level feature characterization data based on the plurality of teacher style preliminary prediction data and the Multiple sets of low-dimensional feature data correspond to feature characterization data, respectively, to generate instructions for the high-level feature characterization data.
可选地,所述用于通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据的指令,包括:用于通过所述高层模型中的隐含层,对所述高层特征表征数据进行特征提取操作,以获得所述高层特征表征数据对应的特征表征数据的指令;用于通过所述高层模型中的预测层,对所述高层特征表征数据对应的特征表征数据进行映射操作,以获得所述教学内容样本对应的教师风格最终预测数据的指令。Optionally, the instruction for obtaining the teacher style final prediction data corresponding to the teaching content sample based on the high-level feature characterization data through the high-level model includes: Containing layers, performing feature extraction operations on the high-level feature characterization data to obtain instructions for the feature characterization data corresponding to the high-level feature characterization data; used to characterize the high-level feature data through the prediction layer in the high-level model The corresponding characteristic characterization data performs a mapping operation to obtain an instruction of the final prediction data of teacher style corresponding to the teaching content sample.
可选地,所述用于基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型的指令,包括:用于通过目标损失函数,确定所述教师风格标注数据和所述教师风格最终预测数据的差异值的指令;用于基于所述差异值,调整所述教师风格预测模型中的所述多个低层模型和所述高层模型的参数的指令。Optionally, the instruction for training the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data includes: determining the teacher by a target loss function An instruction for the difference value between the style annotation data and the teacher style final prediction data; an instruction for adjusting the parameters of the multiple low-level models and the high-level model in the teacher style prediction model based on the difference value.
可选地,所述可读程序还包括:用于基于教学内容数据的高维特征数据,确定所述教学内容数据的多组低维特征数据的指令;用于通过训练后 的所述教师风格预测模型,基于所述教学内容数据的多组低维特征数据,获得所述教学内容数据对应的教师风格预测数据的指令。Optionally, the readable program further includes: instructions for determining multiple sets of low-dimensional feature data of the teaching content data based on the high-dimensional feature data of the teaching content data; A predictive model, based on multiple sets of low-dimensional feature data of the teaching content data, obtains instructions for teacher style prediction data corresponding to the teaching content data.
可选地,所述可读程序还包括:用于对所述教师风格预测数据进行映射操作,以获得所述教学内容数据对应的教师风格类别的指令。Optionally, the readable program further includes: instructions for performing a mapping operation on the teacher style prediction data to obtain the teacher style category corresponding to the teaching content data.
通过本申请实施例提供的非瞬时性计算机可读存储介质,基于教学内容样本的高维特征数据,确定教学内容样本的多组低维特征数据,并通过待训练的教师风格预测模型,基于多组低维特征数据,获得教学内容样本对应的教师风格预测数据,再基于教学内容样本的教师风格标注数据和教师风格预测数据,训练教师风格预测模型,与现有的其它方式相比,通过将教学内容样本的高维特征数据分组为多组低维特征数据,大大降低了待训练的教师风格预测模型的输入特征的维度,从而使得训练得到的教师风格预测模型的教师风格预测性能能够得到有效地提升。Through the non-transitory computer-readable storage medium provided by the embodiments of this application, based on the high-dimensional feature data of the teaching content sample, multiple sets of low-dimensional feature data of the teaching content sample are determined, and the teacher style prediction model to be trained is based on multiple Group low-dimensional feature data, obtain teacher style prediction data corresponding to the teaching content sample, and then train the teacher style prediction model based on the teacher style annotation data and teacher style prediction data of the teaching content sample. Compared with other existing methods, The high-dimensional feature data of the teaching content samples are grouped into groups of low-dimensional feature data, which greatly reduces the dimension of the input features of the teacher style prediction model to be trained, so that the teacher style prediction performance of the trained teacher style prediction model can be effectively To improve.
需要指出,根据实施的需要,可将本发明实施例中描述的各个部件/步骤拆分为更多部件/步骤,也可将两个或多个部件/步骤或者部件/步骤的部分操作组合成新的部件/步骤,以实现本发明实施例的目的。It should be pointed out that, according to the needs of implementation, each component/step described in the embodiment of the present invention can be split into more components/steps, or two or more components/steps or partial operations of components/steps can be combined into New components/steps to achieve the purpose of the embodiments of the present invention.
上述根据本发明实施例的方法可在硬件、固件中实现,或者被实现为可存储在记录介质(诸如CD ROM、RAM、软盘、硬盘或磁光盘)中的软件或计算机代码,或者被实现通过网络下载的原始存储在远程记录介质或非暂时机器可读介质中并将被存储在本地记录介质中的计算机代码,从而在此描述的方法可被存储在使用通用计算机、专用处理器或者可编程或专用硬件(诸如ASIC或FPGA)的记录介质上的这样的软件处理。可以理 解,计算机、处理器、微处理器控制器或可编程硬件包括可存储或接收软件或计算机代码的存储组件(例如,RAM、ROM、闪存等),当所述软件或计算机代码被计算机、处理器或硬件访问且执行时,实现在此描述的教师风格预测模型的训练方法。此外,当通用计算机访问用于实现在此示出的教师风格预测模型的训练方法的代码时,代码的执行将通用计算机转换为用于执行在此示出的教师风格预测模型的训练方法的专用计算机。The above method according to the embodiments of the present invention can be implemented in hardware, firmware, or implemented as software or computer code that can be stored in a recording medium (such as CD ROM, RAM, floppy disk, hard disk, or magneto-optical disk), or implemented by The computer code downloaded from the network is originally stored in a remote recording medium or a non-transitory machine-readable medium and will be stored in a local recording medium, so that the method described here can be stored using a general-purpose computer, a dedicated processor or a programmable Or such software processing on a recording medium of dedicated hardware (such as ASIC or FPGA). It can be understood that a computer, processor, microprocessor controller, or programmable hardware includes storage components (for example, RAM, ROM, flash memory, etc.) that can store or receive software or computer code, when the software or computer code is used by the computer, When the processor or hardware is accessed and executed, the training method of the teacher style prediction model described here is implemented. In addition, when a general-purpose computer accesses the code for implementing the training method of the teacher-style prediction model shown here, the execution of the code converts the general-purpose computer into a special-purpose computer for executing the training method of the teacher-style prediction model shown here. computer.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及方法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明实施例的范围。A person of ordinary skill in the art may realize that the units and method steps of the examples described in combination with the embodiments disclosed herein can be implemented by electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are executed by hardware or software depends on the specific application and design constraint conditions of the technical solution. Professionals and technicians can use different methods for each specific application to implement the described functions, but such implementation should not be considered as going beyond the scope of the embodiments of the present invention.
以上实施方式仅用于说明本发明实施例,而并非对本发明实施例的限制,有关技术领域的普通技术人员,在不脱离本发明实施例的精神和范围的情况下,还可以做出各种变化和变型,因此所有等同的技术方案也属于本发明实施例的范畴,本发明实施例的专利保护范围应由权利要求限定。The above implementations are only used to illustrate the embodiments of the present invention, and are not intended to limit the embodiments of the present invention. Those of ordinary skill in the relevant technical field can also make various modifications without departing from the spirit and scope of the embodiments of the present invention. Changes and modifications, therefore, all equivalent technical solutions also belong to the scope of the embodiments of the present invention, and the patent protection scope of the embodiments of the present invention should be defined by the claims.

Claims (18)

  1. 一种教师风格预测模型的训练方法,其特征在于,所述方法包括:A training method of a teacher's style prediction model, characterized in that the method comprises:
    基于教学内容样本的第一特征数据,确定所述教学内容样本的多组第二特征数据,其中所述第一特征数据具有比所述第二特征数据更高的维度;Determine multiple sets of second feature data of the teaching content sample based on the first feature data of the teaching content sample, wherein the first feature data has a higher dimension than the second feature data;
    通过待训练的教师风格预测模型,基于所述多组第二特征数据,获得所述教学内容样本对应的教师风格预测数据;以及Obtain the teacher style prediction data corresponding to the teaching content sample based on the plurality of sets of second feature data through the teacher style prediction model to be trained; and
    基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型。Training the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data.
  2. 根据权利要求1所述的方法,其特征在于,所述教师风格预测模型包括多个低层模型及与所述多个低层模型的输出端连接的高层模型,The method according to claim 1, wherein the teacher style prediction model comprises a plurality of low-level models and a high-level model connected to the output terminals of the plurality of low-level models,
    所述通过待训练的教师风格预测模型,基于所述多组第二特征数据,获得所述教学内容样本对应的教师风格预测数据,包括:The obtaining the teacher style prediction data corresponding to the teaching content sample through the teacher style prediction model to be trained based on the multiple sets of second feature data includes:
    通过所述多个低层模型,基于所述多组第二特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据;以及Obtain multiple preliminary prediction data of teacher style corresponding to the teaching content sample based on the multiple sets of second feature data through the multiple low-level models; and
    通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据。Through the high-level model, based on the plurality of preliminary prediction data of teacher style, the final prediction data of the teacher style corresponding to the teaching content sample is obtained.
  3. 根据权利要求2所述的方法,其特征在于,所述多个低层模型中的每个低层模型包括隐含层及与所述隐含层的输出端连接的预测层,The method according to claim 2, wherein each of the plurality of low-layer models includes a hidden layer and a prediction layer connected to the output terminal of the hidden layer,
    所述通过所述多个低层模型,基于所述多组第二特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据,包括:The obtaining multiple preliminary prediction data of teacher style corresponding to the teaching content sample based on the multiple sets of second feature data through the multiple low-level models includes:
    通过所述隐含层,对所述多组第二特征数据分别进行特征提取操作,以获得所述多组第二特征数据分别对应的特征表征数据;以及Performing feature extraction operations on the multiple sets of second feature data respectively through the hidden layer to obtain feature representation data corresponding to the multiple sets of second feature data; and
    通过所述预测层,对所述多组第二特征数据分别对应的特征表征数据进行映射操作,以获得所述教学内容样本对应的多个教师风格初步预测数据。Through the prediction layer, a mapping operation is performed on the feature representation data corresponding to the multiple sets of second feature data to obtain multiple teacher style preliminary prediction data corresponding to the teaching content sample.
  4. 根据权利要求3所述的方法,其特征在于,所述通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据,包括:The method according to claim 3, wherein the obtaining, through the high-level model, based on the plurality of preliminary prediction data of teacher style, the final prediction data of the teacher style corresponding to the teaching content sample comprises:
    基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据;以及Generate high-level feature representation data corresponding to the high-level model based on the plurality of preliminary prediction data of teacher style; and
    通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据。Through the high-level model, based on the high-level feature characterization data, the final prediction data of the teacher style corresponding to the teaching content sample is obtained.
  5. 根据权利要求4所述的方法,其特征在于,所述基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据,包括:The method according to claim 4, wherein the generating high-level feature characterization data corresponding to the high-level model based on the preliminary prediction data of the multiple teacher styles comprises:
    基于所述多个教师风格初步预测数据和所述多组第二特征数据分别对应的特征表征数据,生成所述高层特征表征数据。The high-level characteristic characteristic data is generated based on the characteristic characterization data corresponding to the plurality of preliminary prediction data of teacher style and the plurality of sets of second characteristic data respectively.
  6. 根据权利要求4所述的方法,其特征在于,所述通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据,包括:The method according to claim 4, wherein said obtaining, through the high-level model, based on the high-level feature characterization data, the final prediction data of teacher style corresponding to the teaching content sample comprises:
    通过所述高层模型中的隐含层,对所述高层特征表征数据进行特征提取操作,以获得所述高层特征表征数据对应的特征表征数据;以及Performing a feature extraction operation on the high-level feature characterization data through the hidden layer in the high-level model to obtain feature characterization data corresponding to the high-level feature characterization data; and
    通过所述高层模型中的预测层,对所述高层特征表征数据对应的特征表征数据进行映射操作,以获得所述教学内容样本对应的教师风格最终预测数据。Through the prediction layer in the high-level model, a mapping operation is performed on the feature representation data corresponding to the high-level feature representation data to obtain final prediction data of teacher style corresponding to the teaching content sample.
  7. 根据权利要求2所述的方法,其特征在于,所述基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型,包括:The method according to claim 2, wherein the training of the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data comprises:
    通过目标损失函数,确定所述教师风格标注数据和所述教师风格最终预测数据的差异值;以及Determine the difference value between the teacher style annotation data and the teacher style final prediction data through a target loss function; and
    基于所述差异值,调整所述教师风格预测模型中的所述多个低层模型和所述高层模型的参数。Based on the difference value, the parameters of the multiple low-level models and the high-level model in the teacher style prediction model are adjusted.
  8. 根据权利要求1-7中任意一项权利要求所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-7, wherein the method further comprises:
    基于教学内容数据的第一特征数据,确定所述教学内容数据的多组第二特征数据;以及Based on the first feature data of the teaching content data, determining multiple sets of second feature data of the teaching content data; and
    通过训练后的教师风格预测模型,基于所述教学内容数据的多组第二特征数据,获得所述教学内容数据对应的教师风格预测数据。Through the trained teacher style prediction model, based on the multiple sets of second feature data of the teaching content data, the teacher style prediction data corresponding to the teaching content data is obtained.
  9. 根据权利要求8所述的方法,其特征在于,所述方法还包括:The method according to claim 8, wherein the method further comprises:
    对所述教师风格预测数据进行映射操作,以获得所述教学内容数据对应的教师风格类别。A mapping operation is performed on the teacher style prediction data to obtain the teacher style category corresponding to the teaching content data.
  10. 一种非瞬时性计算机可读存储介质,其特征在于,所述非瞬时性计算机可读存储介质存储有可读程序,所述可读程序被处理器执行时使所述处理器执行以下步骤:A non-transitory computer-readable storage medium, wherein the non-transitory computer-readable storage medium stores a readable program, and when the readable program is executed by a processor, the processor executes the following steps:
    基于教学内容样本的第一特征数据,确定所述教学内容样本的多组第二特征数据,其中所述第一特征数据具有比所述第二特征数据更高的维度;Determine multiple sets of second feature data of the teaching content sample based on the first feature data of the teaching content sample, wherein the first feature data has a higher dimension than the second feature data;
    通过待训练的教师风格预测模型,基于所述多组第二特征数据,获得所述教学内容样本对应的教师风格预测数据;以及Obtain the teacher style prediction data corresponding to the teaching content sample based on the plurality of sets of second feature data through the teacher style prediction model to be trained; and
    基于所述教学内容样本的教师风格标注数据和所述教师风格预测数据,训练所述教师风格预测模型。Training the teacher style prediction model based on the teacher style annotation data of the teaching content sample and the teacher style prediction data.
  11. 根据权利要求10所述的非瞬时性计算机可读存储介质,其特征在于,所述教师风格预测模型包括多个低层模型及与所述多个低层模型的输出端连接的高层模型,The non-transitory computer-readable storage medium of claim 10, wherein the teacher style prediction model comprises a plurality of low-level models and a high-level model connected to the output terminals of the plurality of low-level models,
    所述通过待训练的教师风格预测模型,基于所述多组第二特征数据, 获得所述教学内容样本对应的教师风格预测数据,包括:The obtaining the teacher style prediction data corresponding to the teaching content sample through the teacher style prediction model to be trained based on the multiple sets of second feature data includes:
    通过所述多个低层模型,基于所述多组第二特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据;以及Obtain multiple preliminary prediction data of teacher style corresponding to the teaching content sample based on the multiple sets of second feature data through the multiple low-level models; and
    通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据。Through the high-level model, based on the plurality of preliminary prediction data of teacher style, the final prediction data of the teacher style corresponding to the teaching content sample is obtained.
  12. 根据权利要求11所述的非瞬时性计算机可读存储介质,其特征在于,所述多个低层模型中的每个低层模型包括隐含层及与所述隐含层的输出端连接的预测层,The non-transitory computer-readable storage medium of claim 11, wherein each of the plurality of low-level models includes a hidden layer and a prediction layer connected to an output terminal of the hidden layer ,
    所述通过所述多个低层模型,基于所述多组第二特征数据,获得所述教学内容样本对应的多个教师风格初步预测数据,包括:The obtaining multiple preliminary prediction data of teacher style corresponding to the teaching content sample based on the multiple sets of second feature data through the multiple low-level models includes:
    通过所述隐含层,对所述多组第二特征数据分别进行特征提取操作,以获得所述多组第二特征数据分别对应的特征表征数据;以及Performing feature extraction operations on the multiple sets of second feature data respectively through the hidden layer to obtain feature representation data corresponding to the multiple sets of second feature data; and
    通过所述预测层,对所述多组第二特征数据分别对应的特征表征数据进行映射操作,以获得所述教学内容样本对应的多个教师风格初步预测数据。Through the prediction layer, a mapping operation is performed on the feature representation data corresponding to the multiple sets of second feature data to obtain multiple teacher style preliminary prediction data corresponding to the teaching content sample.
  13. 根据权利要求12所述的非瞬时性计算机可读存储介质,其特征在于,所述通过所述高层模型,基于所述多个教师风格初步预测数据,获得所述教学内容样本对应的教师风格最终预测数据,包括:The non-transitory computer-readable storage medium according to claim 12, wherein said high-level model is used to obtain the final teacher style corresponding to said teaching content sample based on said plurality of teacher style preliminary prediction data Forecast data, including:
    基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层 特征表征数据;以及Generate high-level feature representation data corresponding to the high-level model based on the plurality of preliminary prediction data of teacher style; and
    通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据。Through the high-level model, based on the high-level feature characterization data, the final prediction data of the teacher style corresponding to the teaching content sample is obtained.
  14. 根据权利要求13所述的非瞬时性计算机可读存储介质,其特征在于,所述基于所述多个教师风格初步预测数据,生成对应所述高层模型的高层特征表征数据,包括:The non-transitory computer-readable storage medium of claim 13, wherein the generating high-level feature characterization data corresponding to the high-level model based on the preliminary prediction data of the plurality of teacher styles comprises:
    基于所述多个教师风格初步预测数据和所述多组第二特征数据分别对应的特征表征数据,生成所述高层特征表征数据。The high-level characteristic characteristic data is generated based on the characteristic characterization data corresponding to the plurality of preliminary prediction data of teacher style and the plurality of sets of second characteristic data respectively.
  15. 根据权利要求13所述的非瞬时性计算机可读存储介质,其特征在于,所述通过所述高层模型,基于所述高层特征表征数据,获得所述教学内容样本对应的教师风格最终预测数据,包括:The non-transitory computer-readable storage medium according to claim 13, wherein the final prediction data of teacher style corresponding to the teaching content sample is obtained through the high-level model and based on the high-level feature characterization data, include:
    通过所述高层模型中的隐含层,对所述高层特征表征数据进行特征提取操作,以获得所述高层特征表征数据对应的特征表征数据;以及Performing a feature extraction operation on the high-level feature characterization data through the hidden layer in the high-level model to obtain feature characterization data corresponding to the high-level feature characterization data; and
    通过所述高层模型中的预测层,对所述高层特征表征数据对应的特征表征数据进行映射操作,以获得所述教学内容样本对应的教师风格最终预测数据。Through the prediction layer in the high-level model, a mapping operation is performed on the feature representation data corresponding to the high-level feature representation data to obtain final prediction data of teacher style corresponding to the teaching content sample.
  16. 根据权利要求11所述的非瞬时性计算机可读存储介质,其特征在于,所述基于所述教学内容样本的教师风格标注数据和所述教师风格预测 数据,训练所述教师风格预测模型,包括:The non-transitory computer-readable storage medium of claim 11, wherein the teacher style annotation data based on the teaching content sample and the teacher style prediction data to train the teacher style prediction model comprises :
    通过目标损失函数,确定所述教师风格标注数据和所述教师风格最终预测数据的差异值;以及Determine the difference value between the teacher style annotation data and the teacher style final prediction data through a target loss function; and
    基于所述差异值,调整所述教师风格预测模型中的所述多个低层模型和所述高层模型的参数。Based on the difference value, the parameters of the multiple low-level models and the high-level model in the teacher style prediction model are adjusted.
  17. 根据权利要求10-16中任意一项权利要求所述的非瞬时性计算机可读存储介质,其特征在于,所述可读程序被所述处理器执行时使所述处理器进一步执行以下步骤:The non-transitory computer-readable storage medium according to any one of claims 10-16, wherein when the readable program is executed by the processor, the processor further executes the following steps:
    基于教学内容数据的第一特征数据,确定所述教学内容数据的多组第二特征数据;以及Based on the first feature data of the teaching content data, determining multiple sets of second feature data of the teaching content data; and
    通过训练后的教师风格预测模型,基于所述教学内容数据的多组第二特征数据,获得所述教学内容数据对应的教师风格预测数据。Through the trained teacher style prediction model, based on the multiple sets of second feature data of the teaching content data, the teacher style prediction data corresponding to the teaching content data is obtained.
  18. 根据权利要求17所述的非瞬时性计算机可读存储介质,其特征在于,所述可读程序被所述处理器执行时使所述处理器进一步执行以下步骤:The non-transitory computer-readable storage medium of claim 17, wherein when the readable program is executed by the processor, the processor further executes the following steps:
    对所述教师风格预测数据进行映射操作,以获得所述教学内容数据对应的教师风格类别。A mapping operation is performed on the teacher style prediction data to obtain the teacher style category corresponding to the teaching content data.
PCT/CN2020/086363 2019-04-23 2020-04-23 Method for training teaching style prediction model, and computer storage medium WO2020216286A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910330162.4 2019-04-23
CN201910330162.4A CN111832787B (en) 2019-04-23 2019-04-23 Teacher style prediction model training method and computer storage medium

Publications (1)

Publication Number Publication Date
WO2020216286A1 true WO2020216286A1 (en) 2020-10-29

Family

ID=72911994

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/086363 WO2020216286A1 (en) 2019-04-23 2020-04-23 Method for training teaching style prediction model, and computer storage medium

Country Status (2)

Country Link
CN (1) CN111832787B (en)
WO (1) WO2020216286A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113408571A (en) * 2021-05-08 2021-09-17 浙江智慧视频安防创新中心有限公司 Image classification method and device based on model distillation, storage medium and terminal
CN114780785A (en) * 2022-06-23 2022-07-22 新缪斯(深圳)音乐科技产业发展有限公司 Music teaching recommendation method and system based on knowledge graph

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7684390B2 (en) * 2004-12-30 2010-03-23 Intel Corporation Integrated circuit capable of transmitting probe packets across a stack of switches
CN106250403A (en) * 2016-07-19 2016-12-21 北京奇艺世纪科技有限公司 Customer loss Forecasting Methodology and device
CN107045673A (en) * 2017-03-31 2017-08-15 杭州电子科技大学 Public bicycles changes in flow rate amount Forecasting Methodology based on heap Model Fusion
CN107992887A (en) * 2017-11-28 2018-05-04 东软集团股份有限公司 Classifier generation method, sorting technique, device, electronic equipment and storage medium
CN108073888A (en) * 2017-08-07 2018-05-25 中国科学院深圳先进技术研究院 A kind of teaching auxiliary and the teaching auxiliary system using this method
CN108090857A (en) * 2017-12-29 2018-05-29 复旦大学 A kind of multi-modal student classroom behavior analysis system and method

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104392174B (en) * 2014-10-23 2016-04-06 腾讯科技(深圳)有限公司 The generation method of the proper vector of application program dynamic behaviour and device
CN104933588A (en) * 2015-07-01 2015-09-23 北京京东尚科信息技术有限公司 Data annotation platform for expanding merchandise varieties and data annotation method
KR102570278B1 (en) * 2017-07-31 2023-08-24 삼성전자주식회사 Apparatus and method for generating training data used to training student model from teacher model
CN107423442B (en) * 2017-08-07 2020-09-25 火烈鸟网络(广州)股份有限公司 Application recommendation method and system based on user portrait behavior analysis, storage medium and computer equipment
CN107577943B (en) * 2017-09-08 2021-07-13 北京奇虎科技有限公司 Sample prediction method and device based on machine learning and server
CN108021947B (en) * 2017-12-25 2018-10-26 北京航空航天大学 A kind of layering extreme learning machine target identification method of view-based access control model
CN108897834A (en) * 2018-06-22 2018-11-27 招商信诺人寿保险有限公司 Data processing and method for digging

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7684390B2 (en) * 2004-12-30 2010-03-23 Intel Corporation Integrated circuit capable of transmitting probe packets across a stack of switches
CN106250403A (en) * 2016-07-19 2016-12-21 北京奇艺世纪科技有限公司 Customer loss Forecasting Methodology and device
CN107045673A (en) * 2017-03-31 2017-08-15 杭州电子科技大学 Public bicycles changes in flow rate amount Forecasting Methodology based on heap Model Fusion
CN108073888A (en) * 2017-08-07 2018-05-25 中国科学院深圳先进技术研究院 A kind of teaching auxiliary and the teaching auxiliary system using this method
CN107992887A (en) * 2017-11-28 2018-05-04 东软集团股份有限公司 Classifier generation method, sorting technique, device, electronic equipment and storage medium
CN108090857A (en) * 2017-12-29 2018-05-29 复旦大学 A kind of multi-modal student classroom behavior analysis system and method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113408571A (en) * 2021-05-08 2021-09-17 浙江智慧视频安防创新中心有限公司 Image classification method and device based on model distillation, storage medium and terminal
CN114780785A (en) * 2022-06-23 2022-07-22 新缪斯(深圳)音乐科技产业发展有限公司 Music teaching recommendation method and system based on knowledge graph
CN114780785B (en) * 2022-06-23 2022-09-13 新缪斯(深圳)音乐科技产业发展有限公司 Music teaching recommendation method and system based on knowledge graph

Also Published As

Publication number Publication date
CN111832787A (en) 2020-10-27
CN111832787B (en) 2022-12-09

Similar Documents

Publication Publication Date Title
US10176811B2 (en) Neural network-based voiceprint information extraction method and apparatus
CN110674880B (en) Network training method, device, medium and electronic equipment for knowledge distillation
WO2020232860A1 (en) Speech synthesis method and apparatus, and computer readable storage medium
US11282503B2 (en) Voice conversion training method and server and computer readable storage medium
US10580432B2 (en) Speech recognition using connectionist temporal classification
JP2020056982A (en) Speech evaluation method, device, apparatus and readable storage medium
US20220067588A1 (en) Transforming a trained artificial intelligence model into a trustworthy artificial intelligence model
WO2018161763A1 (en) Training method for voice data set, computer device and computer readable storage medium
WO2020216286A1 (en) Method for training teaching style prediction model, and computer storage medium
CN111081230A (en) Speech recognition method and apparatus
WO2021227259A1 (en) Accent detection method and device and non-transitory storage medium
CN114676687A (en) Aspect level emotion classification method based on enhanced semantic syntactic information
JPWO2017146073A1 (en) Voice quality conversion device, voice quality conversion method and program
CN116051388A (en) Automatic photo editing via language request
US20080126094A1 (en) Data Modelling of Class Independent Recognition Models
CN113689514B (en) Theme-oriented image scene graph generation method
CN115132174A (en) Voice data processing method and device, computer equipment and storage medium
CN110808036B (en) Incremental voice command word recognition method
WO2020162190A1 (en) Acoustic model learning device, acoustic model learning method, and program
CN111292715A (en) Speech synthesis method, speech synthesis device, electronic equipment and computer-readable storage medium
CN111599342A (en) Tone selecting method and system
CN112951270B (en) Voice fluency detection method and device and electronic equipment
JP7359028B2 (en) Learning devices, learning methods, and learning programs
US11538474B2 (en) Electronic device and method for controlling the electronic device thereof
WO2023281606A1 (en) Learning device, learning method, and learning program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20796009

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20796009

Country of ref document: EP

Kind code of ref document: A1