CN114596882B

CN114596882B - Editing method capable of realizing rapid positioning of course content

Info

Publication number: CN114596882B
Application number: CN202210226590.4A
Authority: CN
Inventors: 卢小燕; 崔峻; 黎佳佳; 葛瑞兵
Original assignee: Yunxuetang Information Technology Jiangsu Co ltd
Current assignee: Yunxuetang Information Technology Jiangsu Co ltd
Priority date: 2022-03-09
Filing date: 2022-03-09
Publication date: 2024-02-02
Anticipated expiration: 2042-03-09
Also published as: CN114596882A

Abstract

The invention discloses a clipping method capable of realizing rapid positioning of course content, and relates to the field of clipping methods for rapid positioning of course content. The method for quickly positioning and editing the course content directly aims at the course content, does not need to download the course content to process local audios and videos, and aims at the special property of the course content, the course is divided from PPT pages, sentences and words in three dimensions by acquiring PPT page cutting actions and generating corresponding subtitles, and visual descriptive display is carried out on all fragments, and the whole course content editing and content positioning process is more efficient and convenient by taking the actual course content as the reference basis of editing.

Description

Editing method capable of realizing rapid positioning of course content

Technical Field

The invention relates to the field of editing methods for quickly positioning course contents, in particular to an editing method capable of quickly positioning course contents.

Background

In the prior art, a classroom process including a lecturer explain video pictures and a slide show part is recorded into a complete video file, and the complete video file is saved and downloaded into a local file and then clipped by professional clipping software. In the editing process, firstly, a course producer is required to play the classroom video completely, confirm and record the approximate time range of the clips required to be edited, including the starting time and the ending time of the clips, secondly, the course producer is required to play the audio and video starting time and the ending time repeatedly, cut the nodes required to be edited, and repeatedly listen to the cut content after the cut to ensure that the text of the cut content is consistent and complete, and likewise, if a certain clip is required to be deleted, the user is required to repeatedly listen to the cut content after the deletion to ensure that the content of the rest clips is still continuous and complete, and if deviation of the clipped content is found, the operations of cancelling and re-editing are required to be repeatedly executed to ensure that the content of the classroom is not lost.

The professional audio and video editing software is complex to operate, constitutes larger learning cost for professional training staff, and in the editing process, the playback operation needs to be repeatedly performed on video files with longer duration to find a certain time point or a certain frame which needs to be edited so as to confirm that the editing needs to be performed, and because the operation accuracy of fingers or a mouse is insufficient, the cutting point needs to be repeatedly selected for a plurality of times in the actual operation process to complete one time of cutting, so that the whole course content is not coherent easily, the whole course editing process is time-consuming and labor-consuming, and the rapid generation and the rapid updating iteration of knowledge content are not facilitated.

Disclosure of Invention

The invention provides the following technical scheme: a clipping method capable of realizing rapid positioning of course content is characterized in that: the specific implementation steps are as follows:

s1, segmenting the classroom content according to PPT page cutting;

s2, after course recording is finished, processing the fragments into audio, cutting the audio into a proper file size according to fixed time length, recognizing the voice into words by means of an ASR technology, sending an HTTPS POST request with audio data to a server, and receiving a JSON character string with a recognition result as a return result;

s3, obtaining course segmentation modes from the PPT page, the sentence and the word, and determining logic inclusion relations of course fragments formed by the three segmentation modes according to a time axis;

s4, the user can search a certain text, namely, a course segment containing a certain character string is searched, and the user can directly locate to the corresponding PPT page, sentence and word;

in the segmentation processing of the classroom content according to PPT page cutting in S1, when a client initiates to switch pages of the PPT, a server records corresponding time points, after course recording is finished, the recorded time nodes are mapped to a time axis of an overall course, the time for starting playing and finishing playing of each PPT page is found, the overall course file is segmented into course fragments with different sizes from the switching dimension of the PPT page, the PPT page is used as visual description information of the course fragments, according to the audio time length of the fragment, the text content of each sentence in the fragment, the starting and finishing time points of the occurrence of the sentence, the content of each word in the sentence and the starting and finishing time points of the occurrence of the word in the sentence in the S2, the time points of the return result are mapped with the overall audio file one by one, finding out the starting and ending time of each sentence or word on a time axis, dividing the whole course file into course segments with different sizes based on the content, and displaying the corresponding character string information of the segment as the visual description information of the course segment, wherein in the editing process according to the course dividing mode in S3, the user judges the content of the course segment based on the visual description information of each course segment and comprehensively judges in combination with the visual description of other dimensions, and selects, plays or deletes one or more course segments according to the actual requirement, the process can be carried out on any dimension, namely the user can randomly select, play or delete any PPT page, any sentence or even any word, thereby achieving the aim of quickly editing the course, the content of the course in S1 takes the time axis as the core, and integrating PPT animation and page cutting actions in a classroom and other classroom actions in the classroom on a time axis in different signaling modes, analyzing the uploaded PPT content into a structured json file, and playing the structured json file and an audio/video file recorded by a lecturer on a playing clipping end together in the same time axis to form clipping and playing effects.

And preferably, according to the step S5 of searching a certain text, directly selecting, playing or deleting the searched course segment, thereby achieving the purpose of rapidly positioning and rapidly editing the course content.

The invention has the technical effects and advantages that: the invention provides a method for rapidly positioning and editing course contents, which is used for directly editing the course contents without downloading local audio and video, aiming at the particularity of the course contents, the course is divided from PPT pages, sentences and words in three dimensions by acquiring PPT page cutting actions and generating corresponding subtitles, and visual descriptive display is carried out on all fragments, and the actual course contents are used as the reference basis of the editing, so that the whole course contents editing and content positioning process is more efficient and convenient.

Drawings

FIG. 1 is a flow chart of the method of the present invention.

FIG. 2 is a block diagram of an embodiment of the present invention.

Detailed Description

The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

DETAILED DESCRIPTION OF EMBODIMENT (S) OF INVENTION

As shown in fig. 1 and 2, it should be noted in advance that we use classroom content instead of video content as a clipping object, the classroom content uses a time axis as a core, and PPT animation and page cutting actions in the classroom and other classroom actions in the classroom are integrated on the time axis in different signaling manners, and the uploaded PPT content is parsed into a structured json file and played together with an audio/video file recorded by a lecturer at a clipping end in the same time axis, so as to form clipping and playing effects.

As shown in fig. 1 and 2, firstly, the classroom content is segmented according to PPT cutting, when a client initiates to switch pages of PPT in the course of teaching, the server records corresponding time points, after the course recording is finished, the recorded time nodes are mapped to the time axis of the whole course, the time for starting playing and finishing playing of each PPT page is found, the whole course file is divided into course segments with different sizes from the switching dimension of the PPT page, and the PPT page is displayed as visual description information of the course segments.

After the course recording is finished, the fragments are processed into audio, the audio is cut into proper file sizes according to fixed time length, the voice is recognized as words by means of ASR technology, an HTTPS POST request with audio data is sent to a server, a JSON character string with a recognition result is received as a return result, the audio time length of the fragments is returned, the word content of each sentence and the starting and ending time points of the occurrence of the sentences in the fragments, the starting and ending time points of the occurrence of the content of each word and the starting and ending time points of the occurrence of the words in the sentences are mapped with the whole audio file one by one, the starting and ending time of each sentence or word on the time axis is found, the whole course file is divided into the course fragments with different sizes based on the content, and the corresponding character string information of the fragments is used as visual description information of the course fragments.

As shown in fig. 1 and 2, we obtain a course segmentation mode from three dimensions of PPT page, sentence, word and determining the logic containing relation of course segments formed by the three segmentation modes according to a time axis, in the process of editing, a user judges the content of the course segments based on visual description information of each course segment and comprehensively judges in combination with visual description of other dimensions, and according to actual requirements, one or more course segments are selected, played or deleted.

As shown in fig. 1 and 2, at the same time, a user may search a certain text, that is, search a lesson segment containing a certain character string, directly locate to include a corresponding PPT page, sentence and word, and directly select, play or delete the searched lesson segment according to actual requirements, so as to achieve the purpose of quickly locating, and then quickly editing the lesson content.

As shown in fig. 1 and 2, the above is to cut segments of content from different dimensions including but not limited to PPT cutting pages, sentences, words, etc., and define visual description presentation for each cut segment, when the classroom content is clipped, we directly clip the classroom content, and directly operate the segments with the visual description as reference basis for the classroom segments.

Finally, it should be noted that: the foregoing description is only illustrative of the preferred embodiments of the present invention, and although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that modifications may be made to the embodiments described, or equivalents may be substituted for elements thereof, and any modifications, equivalents, improvements or changes may be made without departing from the spirit and principles of the present invention.

Claims

1. A clipping method capable of realizing rapid positioning of course content is characterized in that: the specific implementation steps are as follows:

s1, segmenting the classroom content according to PPT page cutting;

2. The editing method for enabling rapid positioning of curriculum content as recited in claim 1, wherein: and (3) according to the text searching in the step (S4), directly selecting, playing or deleting the searched course segment, thereby achieving the purpose of rapidly positioning the course content and further rapidly editing.