CN114596882B - Editing method capable of realizing rapid positioning of course content - Google Patents
Editing method capable of realizing rapid positioning of course content Download PDFInfo
- Publication number
- CN114596882B CN114596882B CN202210226590.4A CN202210226590A CN114596882B CN 114596882 B CN114596882 B CN 114596882B CN 202210226590 A CN202210226590 A CN 202210226590A CN 114596882 B CN114596882 B CN 114596882B
- Authority
- CN
- China
- Prior art keywords
- course
- content
- ppt
- sentence
- editing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 25
- 239000012634 fragment Substances 0.000 claims abstract description 20
- 230000000007 visual effect Effects 0.000 claims abstract description 16
- 230000008569 process Effects 0.000 claims abstract description 13
- 230000011218 segmentation Effects 0.000 claims description 8
- 230000000694 effects Effects 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- 230000011664 signaling Effects 0.000 claims description 3
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000001427 coherent effect Effects 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Television Signal Processing For Recording (AREA)
Abstract
The invention discloses a clipping method capable of realizing rapid positioning of course content, and relates to the field of clipping methods for rapid positioning of course content. The method for quickly positioning and editing the course content directly aims at the course content, does not need to download the course content to process local audios and videos, and aims at the special property of the course content, the course is divided from PPT pages, sentences and words in three dimensions by acquiring PPT page cutting actions and generating corresponding subtitles, and visual descriptive display is carried out on all fragments, and the whole course content editing and content positioning process is more efficient and convenient by taking the actual course content as the reference basis of editing.
Description
Technical Field
The invention relates to the field of editing methods for quickly positioning course contents, in particular to an editing method capable of quickly positioning course contents.
Background
In the prior art, a classroom process including a lecturer explain video pictures and a slide show part is recorded into a complete video file, and the complete video file is saved and downloaded into a local file and then clipped by professional clipping software. In the editing process, firstly, a course producer is required to play the classroom video completely, confirm and record the approximate time range of the clips required to be edited, including the starting time and the ending time of the clips, secondly, the course producer is required to play the audio and video starting time and the ending time repeatedly, cut the nodes required to be edited, and repeatedly listen to the cut content after the cut to ensure that the text of the cut content is consistent and complete, and likewise, if a certain clip is required to be deleted, the user is required to repeatedly listen to the cut content after the deletion to ensure that the content of the rest clips is still continuous and complete, and if deviation of the clipped content is found, the operations of cancelling and re-editing are required to be repeatedly executed to ensure that the content of the classroom is not lost.
The professional audio and video editing software is complex to operate, constitutes larger learning cost for professional training staff, and in the editing process, the playback operation needs to be repeatedly performed on video files with longer duration to find a certain time point or a certain frame which needs to be edited so as to confirm that the editing needs to be performed, and because the operation accuracy of fingers or a mouse is insufficient, the cutting point needs to be repeatedly selected for a plurality of times in the actual operation process to complete one time of cutting, so that the whole course content is not coherent easily, the whole course editing process is time-consuming and labor-consuming, and the rapid generation and the rapid updating iteration of knowledge content are not facilitated.
Disclosure of Invention
The invention provides the following technical scheme: a clipping method capable of realizing rapid positioning of course content is characterized in that: the specific implementation steps are as follows:
s1, segmenting the classroom content according to PPT page cutting;
s2, after course recording is finished, processing the fragments into audio, cutting the audio into a proper file size according to fixed time length, recognizing the voice into words by means of an ASR technology, sending an HTTPS POST request with audio data to a server, and receiving a JSON character string with a recognition result as a return result;
s3, obtaining course segmentation modes from the PPT page, the sentence and the word, and determining logic inclusion relations of course fragments formed by the three segmentation modes according to a time axis;
s4, the user can search a certain text, namely, a course segment containing a certain character string is searched, and the user can directly locate to the corresponding PPT page, sentence and word;
in the segmentation processing of the classroom content according to PPT page cutting in S1, when a client initiates to switch pages of the PPT, a server records corresponding time points, after course recording is finished, the recorded time nodes are mapped to a time axis of an overall course, the time for starting playing and finishing playing of each PPT page is found, the overall course file is segmented into course fragments with different sizes from the switching dimension of the PPT page, the PPT page is used as visual description information of the course fragments, according to the audio time length of the fragment, the text content of each sentence in the fragment, the starting and finishing time points of the occurrence of the sentence, the content of each word in the sentence and the starting and finishing time points of the occurrence of the word in the sentence in the S2, the time points of the return result are mapped with the overall audio file one by one, finding out the starting and ending time of each sentence or word on a time axis, dividing the whole course file into course segments with different sizes based on the content, and displaying the corresponding character string information of the segment as the visual description information of the course segment, wherein in the editing process according to the course dividing mode in S3, the user judges the content of the course segment based on the visual description information of each course segment and comprehensively judges in combination with the visual description of other dimensions, and selects, plays or deletes one or more course segments according to the actual requirement, the process can be carried out on any dimension, namely the user can randomly select, play or delete any PPT page, any sentence or even any word, thereby achieving the aim of quickly editing the course, the content of the course in S1 takes the time axis as the core, and integrating PPT animation and page cutting actions in a classroom and other classroom actions in the classroom on a time axis in different signaling modes, analyzing the uploaded PPT content into a structured json file, and playing the structured json file and an audio/video file recorded by a lecturer on a playing clipping end together in the same time axis to form clipping and playing effects.
And preferably, according to the step S5 of searching a certain text, directly selecting, playing or deleting the searched course segment, thereby achieving the purpose of rapidly positioning and rapidly editing the course content.
The invention has the technical effects and advantages that: the invention provides a method for rapidly positioning and editing course contents, which is used for directly editing the course contents without downloading local audio and video, aiming at the particularity of the course contents, the course is divided from PPT pages, sentences and words in three dimensions by acquiring PPT page cutting actions and generating corresponding subtitles, and visual descriptive display is carried out on all fragments, and the actual course contents are used as the reference basis of the editing, so that the whole course contents editing and content positioning process is more efficient and convenient.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
FIG. 2 is a block diagram of an embodiment of the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
DETAILED DESCRIPTION OF EMBODIMENT (S) OF INVENTION
As shown in fig. 1 and 2, it should be noted in advance that we use classroom content instead of video content as a clipping object, the classroom content uses a time axis as a core, and PPT animation and page cutting actions in the classroom and other classroom actions in the classroom are integrated on the time axis in different signaling manners, and the uploaded PPT content is parsed into a structured json file and played together with an audio/video file recorded by a lecturer at a clipping end in the same time axis, so as to form clipping and playing effects.
As shown in fig. 1 and 2, firstly, the classroom content is segmented according to PPT cutting, when a client initiates to switch pages of PPT in the course of teaching, the server records corresponding time points, after the course recording is finished, the recorded time nodes are mapped to the time axis of the whole course, the time for starting playing and finishing playing of each PPT page is found, the whole course file is divided into course segments with different sizes from the switching dimension of the PPT page, and the PPT page is displayed as visual description information of the course segments.
After the course recording is finished, the fragments are processed into audio, the audio is cut into proper file sizes according to fixed time length, the voice is recognized as words by means of ASR technology, an HTTPS POST request with audio data is sent to a server, a JSON character string with a recognition result is received as a return result, the audio time length of the fragments is returned, the word content of each sentence and the starting and ending time points of the occurrence of the sentences in the fragments, the starting and ending time points of the occurrence of the content of each word and the starting and ending time points of the occurrence of the words in the sentences are mapped with the whole audio file one by one, the starting and ending time of each sentence or word on the time axis is found, the whole course file is divided into the course fragments with different sizes based on the content, and the corresponding character string information of the fragments is used as visual description information of the course fragments.
As shown in fig. 1 and 2, we obtain a course segmentation mode from three dimensions of PPT page, sentence, word and determining the logic containing relation of course segments formed by the three segmentation modes according to a time axis, in the process of editing, a user judges the content of the course segments based on visual description information of each course segment and comprehensively judges in combination with visual description of other dimensions, and according to actual requirements, one or more course segments are selected, played or deleted.
As shown in fig. 1 and 2, at the same time, a user may search a certain text, that is, search a lesson segment containing a certain character string, directly locate to include a corresponding PPT page, sentence and word, and directly select, play or delete the searched lesson segment according to actual requirements, so as to achieve the purpose of quickly locating, and then quickly editing the lesson content.
As shown in fig. 1 and 2, the above is to cut segments of content from different dimensions including but not limited to PPT cutting pages, sentences, words, etc., and define visual description presentation for each cut segment, when the classroom content is clipped, we directly clip the classroom content, and directly operate the segments with the visual description as reference basis for the classroom segments.
Finally, it should be noted that: the foregoing description is only illustrative of the preferred embodiments of the present invention, and although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that modifications may be made to the embodiments described, or equivalents may be substituted for elements thereof, and any modifications, equivalents, improvements or changes may be made without departing from the spirit and principles of the present invention.
Claims (2)
1. A clipping method capable of realizing rapid positioning of course content is characterized in that: the specific implementation steps are as follows:
s1, segmenting the classroom content according to PPT page cutting;
s2, after course recording is finished, processing the fragments into audio, cutting the audio into a proper file size according to fixed time length, recognizing the voice into words by means of an ASR technology, sending an HTTPS POST request with audio data to a server, and receiving a JSON character string with a recognition result as a return result;
s3, obtaining course segmentation modes from the PPT page, the sentence and the word, and determining logic inclusion relations of course fragments formed by the three segmentation modes according to a time axis;
s4, the user can search a certain text, namely, a course segment containing a certain character string is searched, and the user can directly locate to the corresponding PPT page, sentence and word;
in the segmentation processing of the classroom content according to PPT page cutting in S1, when a client initiates to switch pages of the PPT, a server records corresponding time points, after course recording is finished, the recorded time nodes are mapped to a time axis of an overall course, the time for starting playing and finishing playing of each PPT page is found, the overall course file is segmented into course fragments with different sizes from the switching dimension of the PPT page, the PPT page is used as visual description information of the course fragments, according to the audio time length of the fragment, the text content of each sentence in the fragment, the starting and finishing time points of the occurrence of the sentence, the content of each word in the sentence and the starting and finishing time points of the occurrence of the word in the sentence in the S2, the time points of the return result are mapped with the overall audio file one by one, finding out the starting and ending time of each sentence or word on a time axis, dividing the whole course file into course segments with different sizes based on the content, and displaying the corresponding character string information of the segment as the visual description information of the course segment, wherein in the editing process according to the course dividing mode in S3, the user judges the content of the course segment based on the visual description information of each course segment and comprehensively judges in combination with the visual description of other dimensions, and selects, plays or deletes one or more course segments according to the actual requirement, the process can be carried out on any dimension, namely the user can randomly select, play or delete any PPT page, any sentence or even any word, thereby achieving the aim of quickly editing the course, the content of the course in S1 takes the time axis as the core, and integrating PPT animation and page cutting actions in a classroom and other classroom actions in the classroom on a time axis in different signaling modes, analyzing the uploaded PPT content into a structured json file, and playing the structured json file and an audio/video file recorded by a lecturer on a playing clipping end together in the same time axis to form clipping and playing effects.
2. The editing method for enabling rapid positioning of curriculum content as recited in claim 1, wherein: and (3) according to the text searching in the step (S4), directly selecting, playing or deleting the searched course segment, thereby achieving the purpose of rapidly positioning the course content and further rapidly editing.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210226590.4A CN114596882B (en) | 2022-03-09 | 2022-03-09 | Editing method capable of realizing rapid positioning of course content |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210226590.4A CN114596882B (en) | 2022-03-09 | 2022-03-09 | Editing method capable of realizing rapid positioning of course content |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114596882A CN114596882A (en) | 2022-06-07 |
CN114596882B true CN114596882B (en) | 2024-02-02 |
Family
ID=81807105
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210226590.4A Active CN114596882B (en) | 2022-03-09 | 2022-03-09 | Editing method capable of realizing rapid positioning of course content |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114596882B (en) |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013001537A1 (en) * | 2011-06-30 | 2013-01-03 | Human Monitoring Ltd. | Methods and systems of editing and decoding a video file |
WO2013009695A1 (en) * | 2011-07-08 | 2013-01-17 | Percy 3Dmedia, Inc. | 3d user personalized media templates |
CN105608226A (en) * | 2016-01-21 | 2016-05-25 | 南京南瑞集团公司 | Table component based on WEB webpage |
CN107920280A (en) * | 2017-03-23 | 2018-04-17 | 广州思涵信息科技有限公司 | The accurate matched method and system of video, teaching materials PPT and voice content |
WO2018072390A1 (en) * | 2016-10-19 | 2018-04-26 | 深圳市鹰硕技术有限公司 | Classroom teaching recording and requesting method and system |
CN109274913A (en) * | 2018-10-17 | 2019-01-25 | 北京竞业达数码科技股份有限公司 | A kind of video intelligent slice clipping method and system |
CN110414352A (en) * | 2019-06-26 | 2019-11-05 | 深圳市容会科技有限公司 | The method and relevant device of PPT the file information are extracted from video file |
CN110415319A (en) * | 2019-08-07 | 2019-11-05 | 深圳市前海手绘科技文化有限公司 | Animation method, device and electronic equipment and storage medium based on PPT |
CN111460220A (en) * | 2020-04-13 | 2020-07-28 | 赵琰 | Method for making word flash card video and video product |
CN112287914A (en) * | 2020-12-27 | 2021-01-29 | 平安科技(深圳)有限公司 | PPT video segment extraction method, device, equipment and medium |
WO2021205362A1 (en) * | 2020-04-08 | 2021-10-14 | Docebo Spa a Socio Unico | Method and system for automated generation and editing of educational and training materials |
WO2021259221A1 (en) * | 2020-06-23 | 2021-12-30 | 北京字节跳动网络技术有限公司 | Video translation method and apparatus, storage medium, and electronic device |
-
2022
- 2022-03-09 CN CN202210226590.4A patent/CN114596882B/en active Active
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013001537A1 (en) * | 2011-06-30 | 2013-01-03 | Human Monitoring Ltd. | Methods and systems of editing and decoding a video file |
WO2013009695A1 (en) * | 2011-07-08 | 2013-01-17 | Percy 3Dmedia, Inc. | 3d user personalized media templates |
CN105608226A (en) * | 2016-01-21 | 2016-05-25 | 南京南瑞集团公司 | Table component based on WEB webpage |
WO2018072390A1 (en) * | 2016-10-19 | 2018-04-26 | 深圳市鹰硕技术有限公司 | Classroom teaching recording and requesting method and system |
CN107920280A (en) * | 2017-03-23 | 2018-04-17 | 广州思涵信息科技有限公司 | The accurate matched method and system of video, teaching materials PPT and voice content |
CN109274913A (en) * | 2018-10-17 | 2019-01-25 | 北京竞业达数码科技股份有限公司 | A kind of video intelligent slice clipping method and system |
CN110414352A (en) * | 2019-06-26 | 2019-11-05 | 深圳市容会科技有限公司 | The method and relevant device of PPT the file information are extracted from video file |
CN110415319A (en) * | 2019-08-07 | 2019-11-05 | 深圳市前海手绘科技文化有限公司 | Animation method, device and electronic equipment and storage medium based on PPT |
WO2021205362A1 (en) * | 2020-04-08 | 2021-10-14 | Docebo Spa a Socio Unico | Method and system for automated generation and editing of educational and training materials |
CN111460220A (en) * | 2020-04-13 | 2020-07-28 | 赵琰 | Method for making word flash card video and video product |
WO2021259221A1 (en) * | 2020-06-23 | 2021-12-30 | 北京字节跳动网络技术有限公司 | Video translation method and apparatus, storage medium, and electronic device |
CN112287914A (en) * | 2020-12-27 | 2021-01-29 | 平安科技(深圳)有限公司 | PPT video segment extraction method, device, equipment and medium |
Also Published As
Publication number | Publication date |
---|---|
CN114596882A (en) | 2022-06-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113709561B (en) | Video editing method, device, equipment and storage medium | |
CN107230397B (en) | Parent-child audio generation and processing method and device for preschool education | |
CN110121116A (en) | Video generation method and device | |
US20130007043A1 (en) | Voice description of time-based media for indexing and searching | |
US9348829B2 (en) | Media management system and process | |
US20080046925A1 (en) | Temporal and spatial in-video marking, indexing, and searching | |
US20070027844A1 (en) | Navigating recorded multimedia content using keywords or phrases | |
US20080177536A1 (en) | A/v content editing | |
KR20070121810A (en) | Synthesis of composite news stories | |
US8612384B2 (en) | Methods and apparatus for searching and accessing multimedia content | |
AU2017200461A1 (en) | Method of searching recorded media content | |
CN110781328A (en) | Video generation method, system, device and storage medium based on voice recognition | |
CN110750996B (en) | Method and device for generating multimedia information and readable storage medium | |
WO2018120819A1 (en) | Method and device for producing presentation | |
CN112287168A (en) | Method and apparatus for generating video | |
CN102623034B (en) | Method and device for realizing mutual positioning and character fast recording of video data and text data | |
CN110619673B (en) | Method for generating and playing sound chart, method, system and equipment for processing data | |
CN114596882B (en) | Editing method capable of realizing rapid positioning of course content | |
KR20220135901A (en) | Devices, methods and programs for providing customized educational content | |
JP2001502858A (en) | Digital image system having a database of digital audio and image information coded data. | |
JP2019197210A (en) | Speech recognition error correction support device and its program | |
KR101783872B1 (en) | Video Search System and Method thereof | |
KR100882857B1 (en) | Method for reproducing contents by using discriminating code | |
KR20140137219A (en) | Method for providing s,e,u-contents by easily, quickly and accurately extracting only wanted part from multimedia file | |
JP2019128850A (en) | Information processing device, moving-image search method, generation method, and program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |