CN114596882B - Editing method capable of realizing rapid positioning of course content - Google Patents


Publication number
CN114596882B
Authority
CN
China
Prior art keywords
course
content
ppt
sentence
editing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210226590.4A
Other languages
Chinese (zh)
Other versions
CN114596882A (en)
Inventor
卢小燕
崔峻
黎佳佳
葛瑞兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yunxuetang Information Technology Jiangsu Co ltd
Original Assignee
Yunxuetang Information Technology Jiangsu Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yunxuetang Information Technology Jiangsu Co ltd
Priority to CN202210226590.4A
Publication of CN114596882A
Application granted
Publication of CN114596882B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02 Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031 Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10 Indexing; Addressing; Timing or synchronising; Measuring tape travel

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The invention discloses an editing method that enables rapid positioning of course content, and relates to the field of course content editing. The method edits the course content directly, without downloading the recording and processing audio and video files locally. Exploiting the particular nature of course content, it captures PPT page-switch actions and generates the corresponding subtitles, so that the course is segmented along three dimensions: PPT page, sentence and word. Every segment is displayed with a visual description, and the actual course content serves as the reference for editing, which makes editing and locating course content more efficient and convenient.

Description

Editing method capable of realizing rapid positioning of course content
Technical Field
The invention relates to the field of course content editing, and in particular to an editing method that enables rapid positioning of course content.
Background
In the prior art, a classroom session, including the video of the lecturer's explanation and the slide presentation, is recorded into a single complete video file, which is saved, downloaded as a local file, and then edited with professional editing software. During editing, the course producer must first play the classroom video in full and note the approximate time range of each clip to be edited, including its start and end times. The producer must then replay the audio and video around those start and end times repeatedly, cut at the required points, and listen to the result again to make sure that the text of the cut content is coherent and complete. Likewise, if a clip is to be deleted, the user must listen again after the deletion to make sure that the remaining content is still continuous and complete. If the edited content is found to deviate, the undo and re-edit operations must be repeated so that no classroom content is lost.
Professional audio and video editing software is complex to operate and imposes a considerable learning cost on professional training staff. During editing, a long video file has to be played back repeatedly to find the time point or frame at which a cut is needed. Because a finger or a mouse is not precise enough, the cut point often has to be selected several times before a single cut is completed, which easily leaves the course content incoherent. The whole editing process is time-consuming and laborious, and it hinders the rapid production and iterative updating of knowledge content.
Disclosure of Invention
The invention provides the following technical solution: an editing method that enables rapid positioning of course content, comprising the following steps:
S1, segmenting the classroom content according to PPT page switches;
S2, after course recording is finished, converting the recorded fragments into audio, cutting the audio into files of suitable size at a fixed duration, and recognizing the speech as text by means of ASR technology: an HTTPS POST request carrying the audio data is sent to a server, and a JSON string carrying the recognition result is received in return;
S3, obtaining course segmentations along three dimensions, namely PPT page, sentence and word, and determining, according to the time axis, the logical containment relations among the course segments formed by the three segmentations;
S4, the user can search for a text, that is, search for the course segments containing a given character string, and be taken directly to the corresponding PPT page, sentence and word.
In S1, when the client initiates a PPT page switch, the server records the corresponding time point. After course recording is finished, the recorded time points are mapped onto the time axis of the whole course and the start and end playing time of each PPT page is found. The whole course file is thereby divided, along the page-switch dimension, into course segments of different sizes, and the PPT page is displayed as the visual description of each course segment.
In S2, the return result contains the audio duration of the fragment, the text of each sentence in the fragment with the start and end time points at which it occurs, and the content of each word in the sentence with the start and end time points at which it occurs. These time points are mapped one by one onto the whole audio file and the start and end time of each sentence or word on the time axis is found. The whole course file is thereby divided, based on content, into course segments of different sizes, and the corresponding character string is displayed as the visual description of each course segment.
When editing according to the segmentations of S3, the user judges the content of a course segment from its visual description, combined with the visual descriptions in the other dimensions, and selects, plays or deletes one or more course segments as required. This can be done in any dimension: the user can select, play or delete any PPT page, any sentence or even any word, so that the course is edited quickly.
The classroom content in S1 takes the time axis as its core: PPT animations, page switches and other classroom actions are integrated on the time axis as different signaling messages. The uploaded PPT content is parsed into a structured JSON file, which is played at the playback and editing end together with the audio/video file recorded by the lecturer on the same time axis, producing the editing and playback effects.
Preferably, following the text search in step S4, the course segments found are directly selected, played or deleted, so that the course content is located quickly and then edited quickly.
The technical effects and advantages of the invention are as follows: the invention provides a method for rapidly positioning and editing course content that edits the course content directly, without downloading and processing local audio and video files. Exploiting the particular nature of course content, the method captures PPT page-switch actions and generates the corresponding subtitles, so that the course is segmented along three dimensions: PPT page, sentence and word. Every segment is displayed with a visual description, and the actual course content serves as the reference for editing, which makes editing and locating course content more efficient and convenient.
Drawings
FIG. 1 is a flow chart of the method of the present invention.
FIG. 2 is a block diagram of an embodiment of the present invention.
Detailed Description
The embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The embodiments described are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those skilled in the art on the basis of these embodiments without inventive effort fall within the scope of protection of the invention.
As shown in FIG. 1 and FIG. 2, it should be noted first that the object being edited is the classroom content rather than the video content. The classroom content takes the time axis as its core: PPT animations, page switches and other classroom actions are integrated on the time axis as different signaling messages. The uploaded PPT content is parsed into a structured JSON file, which is played at the editing end together with the audio/video file recorded by the lecturer on the same time axis, producing the editing and playback effects.
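The idea of a shared time axis can be illustrated with a minimal Python sketch. The event fields (`t`, `type`, `page`, `step`) and the rule that a page switch resets the animation step are assumptions made for illustration only; the patent does not specify the signaling format.

```python
import heapq

def merge_timeline(*streams):
    """Merge per-signal event lists (each already sorted by 't') into one timeline."""
    return list(heapq.merge(*streams, key=lambda event: event["t"]))

def state_at(timeline, t):
    """Replay events up to time t and return the slide state shown at that moment."""
    state = {"page": None, "animation_step": 0}
    for event in timeline:
        if event["t"] > t:
            break
        if event["type"] == "page":
            state["page"] = event["page"]
            state["animation_step"] = 0  # assumption: a page switch resets animations
        elif event["type"] == "animation":
            state["animation_step"] = event["step"]
    return state

# Hypothetical signaling records, in seconds from the start of the course.
page_events = [
    {"t": 0.0, "type": "page", "page": 1},
    {"t": 30.0, "type": "page", "page": 2},
]
animation_events = [{"t": 12.0, "type": "animation", "step": 1}]

timeline = merge_timeline(page_events, animation_events)
print(state_at(timeline, 20.0))
```

Because the slide state is rebuilt from events rather than read from video frames, a player built this way could seek to any time point and show the matching page next to the lecturer's recording.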
As shown in FIG. 1 and FIG. 2, the classroom content is first segmented according to PPT page switches. When the client initiates a PPT page switch during teaching, the server records the corresponding time point. After course recording is finished, the recorded time points are mapped onto the time axis of the whole course and the start and end playing time of each PPT page is found. The whole course file is thereby divided, along the page-switch dimension, into course segments of different sizes, and the PPT page is displayed as the visual description of each course segment.
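The page-switch segmentation can be sketched as follows. This is a minimal Python illustration, assuming the server stores each switch as a hypothetical `(wall_clock_seconds, page_number)` pair and that the first record marks the page shown when recording starts; none of these names come from the patent.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Segment:
    label: str    # visual description of the segment, e.g. "page 3"
    start: float  # seconds from the start of the course
    end: float

def page_segments(switches, course_start, course_end):
    """Turn recorded page-switch times into page segments on the course time axis."""
    ordered = sorted(switches)
    total = course_end - course_start
    segments = []
    for i, (clock, page) in enumerate(ordered):
        start = max(clock - course_start, 0.0)
        # A page plays until the next switch, or until the recording ends.
        end = ordered[i + 1][0] - course_start if i + 1 < len(ordered) else total
        if end > start:
            segments.append(Segment(f"page {page}", start, end))
    return segments

# Hypothetical switch log: (server clock in seconds, page number).
switches = [(1000.0, 1), (1030.0, 2), (1075.5, 3)]
for seg in page_segments(switches, course_start=1000.0, course_end=1100.0):
    print(seg)
```

A page that is revisited simply yields a second segment with the same label, which keeps every segment contiguous on the time axis.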
After course recording is finished, the recorded fragments are converted into audio and the audio is cut into files of suitable size at a fixed duration. The speech is recognized as text by means of ASR technology: an HTTPS POST request carrying the audio data is sent to a server, and a JSON string carrying the recognition result is received in return. The result contains the audio duration of the fragment, the text of each sentence in the fragment with the start and end time points at which it occurs, and the content of each word in the sentence with the start and end time points at which it occurs. These time points are mapped one by one onto the whole audio file and the start and end time of each sentence or word on the time axis is found. The whole course file is thereby divided, based on content, into course segments of different sizes, and the corresponding character string is used as the visual description of each course segment.
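Mapping the recognition results back onto the whole audio file amounts to adding each fragment's offset to the times it reports. The Python sketch below assumes a hypothetical JSON shape (`sentences`, `words`, `text`, `start`, `end`, with times in seconds relative to the fragment); the actual ASR service and its response format are not specified in the patent.

```python
import json

def chunk_offsets(total_seconds, chunk_seconds):
    """Start offset of every fixed-duration fragment of the course audio."""
    offsets = []
    t = 0.0
    while t < total_seconds:
        offsets.append(t)
        t += chunk_seconds
    return offsets

def to_global(result_json, offset):
    """Shift fragment-relative sentence and word times onto the course time axis."""
    result = json.loads(result_json)
    sentences = []
    for s in result["sentences"]:
        words = [
            {"text": w["text"], "start": w["start"] + offset, "end": w["end"] + offset}
            for w in s["words"]
        ]
        sentences.append(
            {"text": s["text"], "start": s["start"] + offset,
             "end": s["end"] + offset, "words": words}
        )
    return sentences

# Hypothetical return value for the second 60-second fragment.
sample = json.dumps({
    "duration": 60.0,
    "sentences": [{
        "text": "course content",
        "start": 1.5, "end": 2.5,
        "words": [
            {"text": "course", "start": 1.5, "end": 2.0},
            {"text": "content", "start": 2.0, "end": 2.5},
        ],
    }],
})
print(chunk_offsets(150.0, 60.0))
print(to_global(sample, offset=60.0))
```

One design point worth noting: cutting at a fixed duration can split a sentence across two fragments, so a real pipeline would need to handle fragment boundaries, which this sketch does not attempt.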
As shown in FIG. 1 and FIG. 2, course segmentations are obtained along three dimensions, namely PPT page, sentence and word, and the logical containment relations among the course segments formed by the three segmentations are determined according to the time axis. During editing, the user judges the content of a course segment from its visual description, combined with the visual descriptions in the other dimensions, and selects, plays or deletes one or more course segments as required.
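One way to derive the containment relation from the time axis is to assign each finer segment to the coarser segment that contains its start time. The following Python sketch makes that assumption explicit; the patent does not say how a sentence that straddles a page switch is handled, so the start-time rule here is illustrative only.

```python
from bisect import bisect_right

def nest(parents, children):
    """Group children under the parent whose [start, end) interval contains the child's start.

    Parents must be sorted by start time and must not overlap.
    """
    starts = [p["start"] for p in parents]
    grouped = [[] for _ in parents]
    for child in children:
        i = bisect_right(starts, child["start"]) - 1
        if i >= 0 and child["start"] < parents[i]["end"]:
            grouped[i].append(child)
    return grouped

pages = [
    {"label": "page 1", "start": 0.0, "end": 30.0},
    {"label": "page 2", "start": 30.0, "end": 75.5},
]
sentences = [
    {"text": "welcome", "start": 2.0, "end": 4.5},
    {"text": "next topic", "start": 28.0, "end": 33.0},  # straddles the page switch
    {"text": "details", "start": 31.0, "end": 36.0},
]

for page, group in zip(pages, nest(pages, sentences)):
    print(page["label"], [s["text"] for s in group])
```

The same function can be applied a second time with sentences as parents and words as children, which gives the three-level page, sentence, word hierarchy.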
As shown in FIG. 1 and FIG. 2, the user can also search for a text, that is, search for the course segments containing a given character string, and be taken directly to the corresponding PPT page, sentence and word. The course segments found can then be selected, played or deleted as required, so that the course content is located quickly and then edited quickly.
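A text search over the segmented course might look like the Python sketch below. It assumes that each sentence's text is the concatenation of its word texts, so that a character match can be mapped back to word time stamps; this assumption, the field names and the choice to report only the first match per sentence are all illustrative.

```python
def find_hits(sentences, pages, query):
    """Locate a character string and return the page, sentence and time span of each hit."""
    if not query:
        return []
    hits = []
    for sentence in sentences:
        text = "".join(w["text"] for w in sentence["words"])
        pos = text.find(query)  # first occurrence in this sentence only
        if pos < 0:
            continue
        end_pos = pos + len(query)
        matched, cursor = [], 0
        for word in sentence["words"]:
            word_end = cursor + len(word["text"])
            if cursor < end_pos and word_end > pos:  # word overlaps the matched span
                matched.append(word)
            cursor = word_end
        start = matched[0]["start"]
        page = next((p["label"] for p in pages if p["start"] <= start < p["end"]), None)
        hits.append({"page": page, "sentence": text,
                     "start": start, "end": matched[-1]["end"]})
    return hits

pages = [
    {"label": "page 1", "start": 0.0, "end": 30.0},
    {"label": "page 2", "start": 30.0, "end": 75.5},
]
sentences = [{
    "words": [
        {"text": "course ", "start": 31.0, "end": 31.5},
        {"text": "content ", "start": 31.5, "end": 32.0},
        {"text": "editing", "start": 32.0, "end": 32.6},
    ],
}]
print(find_hits(sentences, pages, "content edit"))
```

The returned time span can be passed straight to a player for playback or to an editing step for deletion, which is how search and editing connect in the method described above.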
As shown in FIG. 1 and FIG. 2, the above cuts the content into segments along different dimensions, including but not limited to PPT page switches, sentences and words, and defines a visual description for each segment. When classroom content is edited, the classroom content itself is edited directly: the segments are operated on with their visual descriptions as the reference.
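Deleting segments without touching the underlying media can be modelled as computing the time intervals that remain to be played. The Python sketch below is one possible representation, assuming deleted segments are given as hypothetical `(start, end)` pairs in seconds; the patent does not describe how deletions are stored.

```python
def remaining_intervals(duration, deleted):
    """Return the parts of [0, duration] that are left after removing the deleted spans."""
    keep = []
    cursor = 0.0
    for start, end in sorted(deleted):
        # Clamp each span to the course duration before using it.
        start = min(max(start, 0.0), duration)
        end = min(max(end, 0.0), duration)
        if start > cursor:
            keep.append((cursor, start))
        cursor = max(cursor, end)
    if cursor < duration:
        keep.append((cursor, duration))
    return keep

# Delete one whole page, one sentence, and a word that lies inside the deleted page.
deleted = [(30.0, 75.5), (2.0, 4.5), (40.0, 50.0)]
print(remaining_intervals(100.0, deleted))
```

Since segments from all three dimensions live on the same time axis, overlapping deletions (a word inside an already deleted page, for instance) collapse naturally into a single gap.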
Finally, it should be noted that the foregoing describes only preferred embodiments of the present invention. Although the invention has been described in detail with reference to these embodiments, those skilled in the art may still modify the embodiments described or substitute equivalents for some of their elements. Any modification, equivalent substitution or improvement made within the spirit and principles of the present invention falls within its scope of protection.

Claims (2)

1. An editing method enabling rapid positioning of course content, characterized in that the method comprises the following steps:
S1, segmenting the classroom content according to PPT page switches;
S2, after course recording is finished, converting the recorded fragments into audio, cutting the audio into files of suitable size at a fixed duration, recognizing the speech as text by means of ASR technology, sending an HTTPS POST request carrying the audio data to a server, and receiving a JSON string carrying the recognition result as the return result;
S3, obtaining course segmentations along three dimensions, namely PPT page, sentence and word, and determining, according to the time axis, the logical containment relations among the course segments formed by the three segmentations;
S4, the user can search for a text, that is, search for the course segments containing a given character string, and be taken directly to the corresponding PPT page, sentence and word;
wherein, in the segmentation of the classroom content according to PPT page switches in S1, when the client initiates a PPT page switch, the server records the corresponding time point; after course recording is finished, the recorded time points are mapped onto the time axis of the whole course, the start and end playing time of each PPT page is found, the whole course file is divided, along the page-switch dimension, into course segments of different sizes, and the PPT page is displayed as the visual description of each course segment;
wherein the return result in S2 contains the audio duration of the fragment, the text of each sentence in the fragment with the start and end time points at which it occurs, and the content of each word in the sentence with the start and end time points at which it occurs; the time points of the return result are mapped one by one onto the whole audio file, the start and end time of each sentence or word on the time axis is found, the whole course file is divided, based on content, into course segments of different sizes, and the corresponding character string is displayed as the visual description of each course segment;
wherein, when editing according to the course segmentations of S3, the user judges the content of a course segment from its visual description, combined with the visual descriptions in the other dimensions, and selects, plays or deletes one or more course segments as required; this can be done in any dimension, that is, the user can select, play or delete any PPT page, any sentence or even any word, so that the course is edited quickly;
and wherein the classroom content in S1 takes the time axis as its core, PPT animations, page switches and other classroom actions being integrated on the time axis as different signaling messages, the uploaded PPT content being parsed into a structured JSON file, and the structured JSON file being played at the playback and editing end together with the audio/video file recorded by the lecturer on the same time axis, producing the editing and playback effects.
2. The editing method enabling rapid positioning of course content according to claim 1, characterized in that, following the text search in step S4, the course segments found are directly selected, played or deleted, so that the course content is located quickly and then edited quickly.
CN202210226590.4A 2022-03-09 2022-03-09 Editing method capable of realizing rapid positioning of course content Active CN114596882B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210226590.4A CN114596882B (en) 2022-03-09 2022-03-09 Editing method capable of realizing rapid positioning of course content

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210226590.4A CN114596882B (en) 2022-03-09 2022-03-09 Editing method capable of realizing rapid positioning of course content

Publications (2)

Publication Number Publication Date
CN114596882A (en) 2022-06-07
CN114596882B (en) 2024-02-02

Family

ID=81807105

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210226590.4A Active CN114596882B (en) 2022-03-09 2022-03-09 Editing method capable of realizing rapid positioning of course content

Country Status (1)

Country Link
CN (1) CN114596882B (en)

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013001537A1 (en) * 2011-06-30 2013-01-03 Human Monitoring Ltd. Methods and systems of editing and decoding a video file
WO2013009695A1 (en) * 2011-07-08 2013-01-17 Percy 3Dmedia, Inc. 3d user personalized media templates
CN105608226A (en) * 2016-01-21 2016-05-25 南京南瑞集团公司 Table component based on WEB webpage
CN107920280A (en) * 2017-03-23 2018-04-17 广州思涵信息科技有限公司 The accurate matched method and system of video, teaching materials PPT and voice content
WO2018072390A1 (en) * 2016-10-19 2018-04-26 深圳市鹰硕技术有限公司 Classroom teaching recording and requesting method and system
CN109274913A (en) * 2018-10-17 2019-01-25 北京竞业达数码科技股份有限公司 A kind of video intelligent slice clipping method and system
CN110414352A (en) * 2019-06-26 2019-11-05 深圳市容会科技有限公司 The method and relevant device of PPT the file information are extracted from video file
CN110415319A (en) * 2019-08-07 2019-11-05 深圳市前海手绘科技文化有限公司 Animation method, device and electronic equipment and storage medium based on PPT
CN111460220A (en) * 2020-04-13 2020-07-28 赵琰 Method for making word flash card video and video product
CN112287914A (en) * 2020-12-27 2021-01-29 平安科技(深圳)有限公司 PPT video segment extraction method, device, equipment and medium
WO2021205362A1 (en) * 2020-04-08 2021-10-14 Docebo Spa a Socio Unico Method and system for automated generation and editing of educational and training materials
WO2021259221A1 (en) * 2020-06-23 2021-12-30 北京字节跳动网络技术有限公司 Video translation method and apparatus, storage medium, and electronic device

Also Published As

Publication number Publication date
CN114596882A (en) 2022-06-07

Similar Documents

Publication Publication Date Title
CN113709561B (en) Video editing method, device, equipment and storage medium
CN107230397B (en) Parent-child audio generation and processing method and device for preschool education
CN110121116A (en) Video generation method and device
US20130007043A1 (en) Voice description of time-based media for indexing and searching
US9348829B2 (en) Media management system and process
US20080046925A1 (en) Temporal and spatial in-video marking, indexing, and searching
US20070027844A1 (en) Navigating recorded multimedia content using keywords or phrases
US20080177536A1 (en) A/v content editing
KR20070121810A (en) Synthesis of composite news stories
US8612384B2 (en) Methods and apparatus for searching and accessing multimedia content
AU2017200461A1 (en) Method of searching recorded media content
CN110781328A (en) Video generation method, system, device and storage medium based on voice recognition
CN110750996B (en) Method and device for generating multimedia information and readable storage medium
WO2018120819A1 (en) Method and device for producing presentation
CN112287168A (en) Method and apparatus for generating video
CN102623034B (en) Method and device for realizing mutual positioning and character fast recording of video data and text data
CN110619673B (en) Method for generating and playing sound chart, method, system and equipment for processing data
CN114596882B (en) Editing method capable of realizing rapid positioning of course content
KR20220135901A (en) Devices, methods and programs for providing customized educational content
JP2001502858A (en) Digital image system having a database of digital audio and image information coded data.
JP2019197210A (en) Speech recognition error correction support device and its program
KR101783872B1 (en) Video Search System and Method thereof
KR100882857B1 (en) Method for reproducing contents by using discriminating code
KR20140137219A (en) Method for providing s,e,u-contents by easily, quickly and accurately extracting only wanted part from multimedia file
JP2019128850A (en) Information processing device, moving-image search method, generation method, and program

Legal Events

Code Title
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant