CN111970538A - Teaching video processing method and system - Google Patents
- Publication number: CN111970538A
- Application number: CN202010875240.1A
- Authority
- CN
- China
- Prior art keywords
- video
- teaching
- subdata
- image
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- H04N21/23424: Processing of video elementary streams involving splicing one content stream with another, e.g. for inserting or substituting an advertisement
- G09B5/065: Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
- H04N21/233: Processing of audio elementary streams (server side)
- H04N21/439: Processing of audio elementary streams (client side)
- H04N21/44016: Processing of video elementary streams involving splicing one content stream with another, e.g. for substituting a video clip
- H04N21/4756: End-user interface for inputting end-user data for rating content, e.g. scoring a recommended movie
- H04N21/816: Monomedia components involving special video data, e.g. 3D video
Abstract
The invention provides a teaching video processing method and system. A teaching process is shot from multiple angles to generate corresponding original teaching video data; the original teaching video data then undergo video quality preprocessing, splicing processing and quality evaluation processing; finally, the spliced teaching video with the best video quality is taken as the final teaching video.
Description
Technical Field
The invention relates to the technical field of intelligent teaching, and in particular to a teaching video processing method and system.
Background
At present, intelligent teaching relies on corresponding teaching videos to deliver preset courses. Existing teaching videos are generally pre-recorded flat (2D) videos. Although such videos can be produced in a standardized way, which improves production efficiency, the resulting content is too uniform and the video image quality monotonous: students cannot get an immersive experience while watching, and the presentation quality of the teaching videos is reduced as well.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a teaching video processing method and system. A preset teaching process is shot from two different orientations to correspondingly obtain two original video data related to the preset teaching process. Video quality preprocessing is performed on each of the two original video data, correspondingly yielding two preprocessed video data that meet a preset image quality condition and/or a preset sound quality condition. The preprocessed video data are screened according to shooting orientation, giving two video subdata sets associated with the different shooting orientations. The video subdata contained in the two sets are spliced in shooting-time order, correspondingly producing teaching spliced videos. The degree of image picture transition consistency and the degree of sound transition consistency of each teaching spliced video are determined, all the teaching spliced videos are ranked by quality, and the best teaching spliced video is converted into a three-dimensional teaching video. The teaching video processing method and system thus ensure that the teaching video is upgraded to a higher presentation quality in both image and sound, improving the user's immersion while watching and the viewing quality of the teaching video.
The invention provides a teaching video processing method, which comprises the following steps:
step S1, shooting a preset teaching process from two different orientations so as to correspondingly obtain two original video data related to the preset teaching process, and respectively performing video quality preprocessing on the two original video data so as to correspondingly obtain two preprocessed video data meeting a preset image quality condition and/or a preset sound quality condition;
step S2, according to the shooting direction, obtaining two video subdata sets corresponding to two shooting directions from the two preprocessed video data, wherein each video subdata set corresponds to one shooting direction, each video subdata set comprises N video subdata which are connected back and forth on a shooting time sequence, the lengths of the shooting time periods corresponding to all the video subdata are equal, and the N video subdata correspond to the N shooting time periods respectively;
step S3, performing M splicing operations on the video subdata according to the shooting time sequence to obtain M teaching spliced videos; wherein each splicing operation comprises: randomly selecting N1 video subdata from one video subdata set and N2 video subdata from the other video subdata set, where N1 + N2 = N; the shooting time periods corresponding to the N1 video subdata and those corresponding to the N2 video subdata jointly constitute the N shooting time periods; and splicing the N1 video subdata and the N2 video subdata in order of shooting time, thereby obtaining the teaching spliced video corresponding to that splicing operation;
and step S4, determining the video goodness of each of the M teaching spliced videos, determining the goodness ranking of the M teaching spliced videos according to the video goodness, and taking the teaching spliced video with the best video goodness as the final teaching video.
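As an illustrative sketch rather than the patented implementation, the splicing operations of step S3 can be modeled in Python as follows. The function and variable names, and the use of a seeded random generator, are assumptions of this sketch:

```python
import random

def splice_candidates(set_a, set_b, m, seed=0):
    """Perform m splicing operations (step S3). set_a and set_b are the two
    video subdata sets; set_a[t] and set_b[t] cover the same shooting time
    period t, so each of the N slots of a candidate takes its clip from
    exactly one set (N1 slots from set_a, N2 = N - N1 from set_b)."""
    assert len(set_a) == len(set_b), "both orientations must cover the same N time periods"
    rng = random.Random(seed)
    n = len(set_a)
    candidates = []
    for _ in range(m):
        # For every shooting time period, pick which orientation supplies the clip.
        picks = [rng.choice((0, 1)) for _ in range(n)]
        video = [set_a[t] if p == 0 else set_b[t] for t, p in enumerate(picks)]
        candidates.append(video)
    return candidates

clips_a = [f"A{t}" for t in range(4)]  # orientation-1 subdata for time periods 0..3
clips_b = [f"B{t}" for t in range(4)]  # orientation-2 subdata for time periods 0..3
for candidate in splice_candidates(clips_a, clips_b, m=3):
    print(candidate)
```

Because the choice is made independently per time period, every candidate still covers all N shooting time periods exactly once, as the step requires.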
In one embodiment, in step S1, shooting a preset teaching process from two different orientations to correspondingly obtain two original video data related to the preset teaching process, and respectively performing video quality preprocessing on the two original video data to correspondingly obtain two preprocessed video data meeting a preset image quality condition and/or a preset sound quality condition, specifically includes:
step S101, shooting the preset teaching process from two different view angle directions which are mutually crossed, so as to obtain two groups of original video data which have parallax relation and are related to the preset teaching process;
step S102, performing image and sound separation processing on each group of original video data to obtain image data information and sound data information corresponding to each group of original video data, and respectively determining an average image resolution value and an average sound signal-to-noise ratio value corresponding to each group of original video data according to the image data information and the sound data information corresponding to each group of original video data;
step S103, if the average image resolution is smaller than a preset image resolution threshold, performing pixel detail restoration on the image data information, so as to enable the average image resolution to be greater than or equal to the preset image resolution threshold, or, if the average sound signal-to-noise ratio value is smaller than a preset signal-to-noise ratio threshold, performing background noise filtering processing on the sound data information, so as to enable the average signal-to-noise ratio value to be greater than or equal to the preset signal-to-noise ratio threshold;
step S104, recombining the image data information of which the average image resolution is greater than or equal to the preset image resolution threshold and the sound data information of which the average sound signal-to-noise ratio is greater than or equal to the preset signal-to-noise ratio threshold, thereby obtaining the preprocessed video data.
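The preprocessing of steps S102 to S104 can be sketched in Python as below. The thresholds, the data layout, and the two repair stubs are illustrative assumptions; in practice the restoration and denoising steps would be real image and audio processing:

```python
RESOLUTION_THRESHOLD = 1280 * 720  # hypothetical preset image resolution threshold
SNR_THRESHOLD_DB = 30.0            # hypothetical preset signal-to-noise ratio threshold

def restore_pixel_detail(image):
    # Stand-in for pixel detail restoration (e.g. super-resolution).
    return {**image, "avg_resolution": float(RESOLUTION_THRESHOLD)}

def filter_background_noise(sound):
    # Stand-in for background noise filtering.
    return {**sound, "avg_snr_db": SNR_THRESHOLD_DB}

def preprocess(raw):
    """Steps S102-S104 for one group of original video data, represented as
    {"frames": [...], "avg_resolution": float, "avg_snr_db": float}."""
    # S102: separate the image stream from the sound stream.
    image = {"frames": raw["frames"], "avg_resolution": raw["avg_resolution"]}
    sound = {"avg_snr_db": raw["avg_snr_db"]}
    # S103: repair whichever stream falls below its quality threshold.
    if image["avg_resolution"] < RESOLUTION_THRESHOLD:
        image = restore_pixel_detail(image)
    if sound["avg_snr_db"] < SNR_THRESHOLD_DB:
        sound = filter_background_noise(sound)
    # S104: recombine the (possibly repaired) image and sound streams.
    return {**image, **sound}

clean = preprocess({"frames": ["f0", "f1"], "avg_resolution": 640 * 480, "avg_snr_db": 22.0})
print(clean["avg_resolution"] >= RESOLUTION_THRESHOLD, clean["avg_snr_db"] >= SNR_THRESHOLD_DB)
```

Separating the two streams first lets each repair be applied only where its own threshold is violated, which is the point of performing S103 per stream.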
In one embodiment, in step S4, determining the video goodness of each of the M teaching spliced videos, determining the goodness ranking of the M teaching spliced videos according to the video goodness, and taking the teaching spliced video with the best video goodness as the final teaching video specifically includes:
step S401, executing the following steps S4011–S4013 for each teaching spliced video, so as to obtain the video goodness of each of the M teaching spliced videos:
step S4011, determining the degree of image picture transition consistency between the last image frame of the previous video subdata and the first image frame of the next video subdata, for every two adjacent video subdata in the current teaching spliced video;
step S4012, determining the degree of sound transition consistency between the last sound segment of the previous video subdata and the first sound segment of the next video subdata, for every two adjacent video subdata in the current teaching spliced video;
step S4013, determining the video goodness of the current teaching spliced video according to the degree of image picture transition consistency and the degree of sound transition consistency corresponding to every two adjacent video subdata in the current teaching spliced video;
and step S402, determining the goodness ranking of the M teaching spliced videos according to the video goodness, and taking the teaching spliced video with the best video goodness as the final teaching video.
In one embodiment, in step S4011, determining the degree of image picture transition consistency between the last image frame of the previous video subdata and the first image frame of the next video subdata, for every two adjacent video subdata in the current teaching spliced video, specifically includes:
Let m_ij and m_i(j+1) denote, respectively, the last image frame of the j-th video subdata and the first image frame of the (j+1)-th video subdata in the i-th teaching spliced video. Both image frames have resolution p × q, where p is the maximum number of pixels of the image frame in the horizontal direction and q the maximum number in the vertical direction; (x, y) denotes the coordinates of any pixel point in m_ij and m_i(j+1), where x takes any integer value from 0 to p and y any integer value from 0 to q. Let m_ijs(x, y) and m_i(j+1)s(x, y) denote the normalized color values of pixel point (x, y) in m_ij and m_i(j+1), respectively, calculated as follows:
m_ijs(x, y) = 10^6 · m_ijR(x, y) + 10^3 · m_ijG(x, y) + m_ijB(x, y)   (1)
m_i(j+1)s(x, y) = 10^6 · m_i(j+1)R(x, y) + 10^3 · m_i(j+1)G(x, y) + m_i(j+1)B(x, y)   (2)
In formulas (1) and (2), m_ijR(x, y), m_ijG(x, y) and m_ijB(x, y) denote the red, green and blue color values of pixel point (x, y) in image frame m_ij, and m_i(j+1)R(x, y), m_i(j+1)G(x, y) and m_i(j+1)B(x, y) denote the red, green and blue color values of pixel point (x, y) in image frame m_i(j+1);
then, the image frame m is calculated according to the following formula (3)ijImage frame mi(j+1)Color difference value at pixel point (x, y)
Then, the image frame m is calculated according to the following formula (4)ijImage frame mi(j+1)Evaluation value of transition consistency of image frames
Wherein when saidThe larger the value of (a), the image frame m is indicatedijImage frame mi(j+1)The higher the image picture transition consistency degree between the two is;
and then, calculating a comprehensive evaluation value of the transition consistency degree of the image picture corresponding to the ith teaching spliced video according to the following formula (5):
wherein, FiThe comprehensive evaluation value of the transition consistency degree of the image picture corresponding to the ith teaching spliced video is represented; h represents the total number of video subdata in the ith teaching spliced video.
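Since formulas (3) and (4) are not legible in the source, the following Python sketch shows one plausible realization built on the packed color values of formulas (1) and (2): the color difference is taken as the absolute difference of the packed values, and the consistency score is scaled so identical frames score 1.0. The function names, the scaling, and the reconstructed channel weights are assumptions:

```python
def packed_color(r, g, b):
    """Formulas (1)/(2): pack an 8-bit RGB triple into a single comparable
    "normalized color value" (weights 10**6, 10**3 and 1 as reconstructed)."""
    return 10**6 * r + 10**3 * g + b

def frame_transition_consistency(frame_a, frame_b):
    """Plausible stand-in for formulas (3)-(4): the mean absolute packed-color
    difference over all pixels, mapped so identical frames score 1.0 and
    maximally different frames score 0.0. Frames are equal-sized 2-D lists
    of (r, g, b) tuples."""
    max_diff = packed_color(255, 255, 255)
    total = count = 0
    for row_a, row_b in zip(frame_a, frame_b):
        for (ra, ga, ba), (rb, gb, bb) in zip(row_a, row_b):
            total += abs(packed_color(ra, ga, ba) - packed_color(rb, gb, bb))
            count += 1
    return 1.0 - total / (count * max_diff)

black = [[(0, 0, 0)] * 2] * 2
white = [[(255, 255, 255)] * 2] * 2
print(frame_transition_consistency(black, black))  # identical frames -> 1.0
print(frame_transition_consistency(black, white))  # maximally different -> 0.0
```

Averaging such per-splice scores over the h clips of a candidate would then give a comprehensive value in the spirit of formula (5).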
In one embodiment, in step S4012, determining the degree of sound transition consistency between the last sound segment of the previous video subdata and the first sound segment of the next video subdata, for every two adjacent video subdata in the current teaching spliced video, specifically includes:
Let n_ij and n_i(j+1) denote, respectively, the decibel level of the ending sound corresponding to the last image frame of the j-th video subdata and the decibel level of the starting sound corresponding to the first image frame of the (j+1)-th video subdata in the i-th teaching spliced video.
The degree of sound transition consistency between the j-th video subdata and the (j+1)-th video subdata in the i-th teaching spliced video is calculated according to formula (6); the larger this value, the higher the degree of sound transition consistency between the two.
A comprehensive evaluation value W_i of the degree of sound transition consistency corresponding to the i-th teaching spliced video is then calculated according to formula (7).
(Formulas (6) and (7) are rendered as images in the source and are not reproduced here.)
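Formulas (6) and (7) are likewise illegible in the source; a plausible sketch, assuming consistency decreases linearly with the decibel jump at each splice point (the 120 dB normalizer and all names are assumptions):

```python
def sound_transition_consistency(end_db, start_db, max_db=120.0):
    """Plausible stand-in for formula (6): the score is closer to 1 the
    smaller the decibel jump between the ending and starting sounds."""
    return 1.0 - abs(end_db - start_db) / max_db

def overall_sound_consistency(boundary_pairs):
    """Stand-in for formula (7): W_i as the average per-splice consistency
    over the h - 1 splice points of the i-th spliced video."""
    scores = [sound_transition_consistency(e, s) for e, s in boundary_pairs]
    return sum(scores) / len(scores)

# Decibel levels at the three splice points of a four-clip spliced video.
print(overall_sound_consistency([(60.0, 60.0), (60.0, 72.0), (72.0, 66.0)]))
```

Any monotone-decreasing function of the decibel gap would serve the same ranking purpose; the linear form is chosen only for simplicity.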
In one embodiment, in step S4013, determining the video goodness of the current teaching spliced video according to the degree of image picture transition consistency and the degree of sound transition consistency corresponding to every two adjacent video subdata in the current teaching spliced video specifically includes:
calculating the video goodness evaluation value P_i of the i-th teaching spliced video according to the following formula (8):
P_i = α_1 · F_i + α_2 · W_i   (8)
wherein α_1 denotes the preset weight value corresponding to F_i and α_2 the preset weight value corresponding to W_i; both are values greater than 0 and less than 1, and α_1 + α_2 = 1.
The larger the value of P_i, the better the goodness of the i-th teaching spliced video.
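Formula (8) and the ranking of step S402 translate directly into code; the 0.6/0.4 weight split and all names below are illustrative, not values from the patent:

```python
def video_goodness(f_i, w_i, alpha1=0.6, alpha2=0.4):
    """Formula (8): P_i = alpha1 * F_i + alpha2 * W_i, with both weights in
    (0, 1) and alpha1 + alpha2 = 1. The 0.6/0.4 split is illustrative."""
    assert 0 < alpha1 < 1 and 0 < alpha2 < 1 and abs(alpha1 + alpha2 - 1.0) < 1e-9
    return alpha1 * f_i + alpha2 * w_i

def pick_final_video(candidates):
    """Step S402: rank (name, F_i, W_i) candidates by goodness, best first,
    and return the name of the final teaching video."""
    ranked = sorted(candidates, key=lambda c: video_goodness(c[1], c[2]), reverse=True)
    return ranked[0][0]

candidates = [("splice-1", 0.80, 0.70), ("splice-2", 0.90, 0.60), ("splice-3", 0.75, 0.95)]
print(pick_final_video(candidates))  # the candidate with the highest P_i wins
```

With these illustrative weights, "splice-3" scores 0.6 × 0.75 + 0.4 × 0.95 = 0.83, beating the other two candidates.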
The embodiment of the invention also provides a teaching video processing system, which comprises:
the shooting processing module is used for shooting a preset teaching process from two different directions so as to correspondingly obtain two original video data related to the preset teaching process, and respectively carrying out video quality processing preprocessing on the two original video data so as to correspondingly obtain two preprocessed video data meeting a preset image quality condition and/or a preset sound quality condition;
the video processing module is used for obtaining, from the two preprocessed video data and according to shooting orientation, two video subdata sets corresponding to the two shooting orientations, wherein each video subdata set corresponds to one shooting orientation, each video subdata set comprises N video subdata which are consecutive in shooting time sequence, the shooting time periods corresponding to all the video subdata are of equal length, and the N video subdata correspond to the N shooting time periods respectively;
the video splicing module is used for performing M splicing operations on the video subdata according to the shooting time sequence to obtain M teaching spliced videos; wherein each splicing operation comprises: randomly selecting N1 video subdata from one video subdata set and N2 video subdata from the other video subdata set, where N1 + N2 = N; the shooting time periods corresponding to the N1 video subdata and those corresponding to the N2 video subdata jointly constitute the N shooting time periods; and splicing the N1 video subdata and the N2 video subdata in order of shooting time, thereby obtaining the teaching spliced video corresponding to that splicing operation;
and the final determining module is used for determining the video goodness of each of the M teaching spliced videos, determining the goodness ranking of the M teaching spliced videos according to the video goodness, and taking the teaching spliced video with the best video goodness as the final teaching video.
Compared with the prior art, in the teaching video processing method and system a preset teaching process is shot from two different orientations to correspondingly obtain two original video data related to the preset teaching process. Video quality preprocessing is performed on each of the two original video data, correspondingly yielding two preprocessed video data that meet a preset image quality condition and/or a preset sound quality condition. The preprocessed video data are screened according to shooting orientation, giving two video subdata sets associated with the different shooting orientations. The video subdata contained in each video subdata set are spliced in shooting-time order, correspondingly producing teaching spliced videos. The degree of image picture transition consistency and the degree of sound transition consistency of each teaching spliced video are determined, all the teaching spliced videos are ranked by quality, and the best teaching spliced video is converted into a three-dimensional teaching video. The teaching video processing method and system thus ensure that the teaching video is upgraded to a higher presentation quality in both image and sound, improving the user's immersion while watching and the viewing quality of the teaching video.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings used in the description of the embodiments or the prior art are briefly introduced below. It is obvious that the drawings in the following description are only some embodiments of the present invention, and that those skilled in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flow chart of a teaching video processing method according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a schematic flow chart of a teaching video processing method according to an embodiment of the present invention. The teaching video processing method comprises the following steps:
step S1, shooting a preset teaching process from two different orientations so as to correspondingly obtain two original video data related to the preset teaching process, and respectively performing video quality preprocessing on the two original video data so as to correspondingly obtain two preprocessed video data meeting a preset image quality condition and/or a preset sound quality condition;
step S2, according to the shooting direction, obtaining two video subdata sets corresponding to two shooting directions from the two preprocessed video data, wherein each video subdata set corresponds to one shooting direction, each video subdata set comprises N video subdata which are connected back and forth on a shooting time sequence, the lengths of the shooting time periods corresponding to all the video subdata are equal, and the N video subdata correspond to the N shooting time periods respectively;
step S3, performing M splicing operations on the video subdata according to the shooting time sequence to obtain M teaching spliced videos; wherein each splicing operation comprises: randomly selecting N1 video subdata from one video subdata set and N2 video subdata from the other video subdata set, where N1 + N2 = N; the shooting time periods corresponding to the N1 video subdata and those corresponding to the N2 video subdata jointly constitute the N shooting time periods; and splicing the N1 video subdata and the N2 video subdata in order of shooting time, thereby obtaining the teaching spliced video corresponding to that splicing operation;
and step S4, determining the video goodness of each of the M teaching spliced videos, determining the goodness ranking of the M teaching spliced videos according to the video goodness, and taking the teaching spliced video with the best video goodness as the final teaching video.
The beneficial effects of the above technical scheme are: the teaching video processing method generates corresponding original teaching video data in a multi-angle shooting mode, then carries out video quality processing pretreatment, splicing treatment and quality judgment treatment on the original teaching video data, and finally takes the teaching spliced video with the optimal video goodness as the final teaching video.
In one embodiment, in step S1, shooting a preset teaching process from two different orientations to correspondingly obtain two original video data related to the preset teaching process, and respectively performing video quality preprocessing on the two original video data to correspondingly obtain two preprocessed video data meeting a preset image quality condition and/or a preset sound quality condition, specifically includes:
step S101, shooting the preset teaching process from two different view angle directions which are mutually crossed, so as to obtain two groups of original video data which have parallax relation and are related to the preset teaching process;
step S102, performing image and sound separation processing on each group of original video data to obtain image data information and sound data information corresponding to each group of original video data, and respectively determining an average image resolution value and an average sound signal-to-noise ratio value corresponding to each group of original video data according to the image data information and the sound data information corresponding to each group of original video data;
step S103, if the average image resolution is smaller than a preset image resolution threshold, performing pixel detail restoration on the image data information, so as to enable the average image resolution to be greater than or equal to the preset image resolution threshold, or, if the average sound signal-to-noise ratio value is smaller than a preset signal-to-noise ratio threshold, performing background noise filtering processing on the sound data information, so as to enable the average signal-to-noise ratio value to be greater than or equal to the preset signal-to-noise ratio threshold;
step S104, recombining the image data information of which the average image resolution is greater than or equal to the preset image resolution threshold and the sound data information of which the average sound signal-to-noise ratio is greater than or equal to the preset signal-to-noise ratio threshold, thereby obtaining the preprocessed video data.
The beneficial effects of the above technical scheme are: corresponding video data information can be comprehensively acquired by shooting the preset teaching process from two different view angle directions which are mutually crossed, and quality restoration processing can be pertinently performed by separating image signals and sound signals in the video data information, so that the preprocessed video data are ensured to have good data quality on an image layer and a sound layer.
In one embodiment, in step S4, determining the video goodness of each of the M teaching spliced videos, determining the goodness ranking of the M teaching spliced videos according to the video goodness, and taking the teaching spliced video with the best video goodness as the final teaching video specifically includes:
step S401, executing the following steps S4011–S4013 for each teaching spliced video, so as to obtain the video goodness of each of the M teaching spliced videos:
step S4011, determining the degree of image picture transition consistency between the last image frame of the previous video subdata and the first image frame of the next video subdata, for every two adjacent video subdata in the current teaching spliced video;
step S4012, determining the degree of sound transition consistency between the last sound segment of the previous video subdata and the first sound segment of the next video subdata, for every two adjacent video subdata in the current teaching spliced video;
step S4013, determining the video goodness of the current teaching spliced video according to the degree of image picture transition consistency and the degree of sound transition consistency corresponding to every two adjacent video subdata in the current teaching spliced video;
and step S402, determining the goodness ranking of the M teaching spliced videos according to the video goodness, and taking the teaching spliced video with the best video goodness as the final teaching video.
In one embodiment, step S4011 — determining the degree of image picture transition consistency between the last image frame of the preceding video sub-data and the first image frame of the following video sub-data, for every two adjacent video sub-data in the current spliced teaching video — specifically includes:
Let m_ij and m_i(j+1) denote, respectively, the last image frame of the j-th video sub-data and the first image frame of the (j+1)-th video sub-data in the i-th spliced teaching video. Image frames m_ij and m_i(j+1) both have resolution p × q, where p is the maximum number of pixels of an image frame in the horizontal direction and q is the maximum number of pixels in the vertical direction. Let (x, y) denote the coordinates of any pixel point in m_ij and m_i(j+1), where x takes any integer value from 0 to p and y any integer value from 0 to q. Then m_ijs(x, y) and m_i(j+1)s(x, y) denote the normalized color values of the corresponding pixel point (x, y) in image frames m_ij and m_i(j+1), calculated as follows:
m_ijs(x, y) = 10^6·m_ijR(x, y) + 10^3·m_ijG(x, y) + m_ijB(x, y)   (1)
m_i(j+1)s(x, y) = 10^6·m_i(j+1)R(x, y) + 10^3·m_i(j+1)G(x, y) + m_i(j+1)B(x, y)   (2)
In the above formulas (1) and (2), m_ijR(x, y), m_ijG(x, y), and m_ijB(x, y) denote the red, green, and blue color values of pixel point (x, y) in image frame m_ij, while m_i(j+1)R(x, y), m_i(j+1)G(x, y), and m_i(j+1)B(x, y) denote the red, green, and blue color values of pixel point (x, y) in image frame m_i(j+1);
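A minimal sketch of the normalized color value of formulas (1)-(2). The 10^6 and 10^3 weights follow the source text; the unit weight for the blue channel is an assumed reading, since the patent's flat text is garbled at that term. With 8-bit channels the three values occupy disjoint decimal fields, so the scalar is directly comparable across frames:

```python
def normalized_color(r: int, g: int, b: int) -> int:
    """Pack 8-bit R, G, B channel values into a single comparable scalar,
    following the assumed reading of formulas (1)-(2):
    10^6 * R + 10^3 * G + B.
    Because each channel is at most 255 (< 1000), the channels never
    overlap in the decimal representation."""
    return 10**6 * r + 10**3 * g + b
```

For example, pure white (255, 255, 255) maps to 255255255 and pure black to 0, so equal scalars imply equal pixels.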
Then, the color difference value between image frames m_ij and m_i(j+1) at pixel point (x, y) is calculated according to the following formula (3):
Next, the image frame transition consistency evaluation value of image frames m_ij and m_i(j+1) is calculated according to the following formula (4):
The larger this evaluation value, the higher the degree of image picture transition consistency between image frames m_ij and m_i(j+1);
Then, the comprehensive evaluation value of image picture transition consistency corresponding to the i-th spliced teaching video is calculated according to the following formula (5):
where F_i denotes the comprehensive evaluation value of image picture transition consistency corresponding to the i-th spliced teaching video, and h denotes the total number of video sub-data in the i-th spliced teaching video.
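Formulas (3)-(5) appear in the source only as images, so the sketch below is one plausible reading rather than the patent's exact definitions: a per-pixel absolute difference of normalized color values for (3), a frame-pair score that grows as the mean difference shrinks for (4), and an average over the adjacent joins for F_i in (5):

```python
def color_difference(frame_a, frame_b):
    """Per-pixel |Δ| of the normalized color values (assumed form of (3)).
    Each frame is a p×q grid of normalized color scalars."""
    return [[abs(a - b) for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(frame_a, frame_b)]

def pair_consistency(frame_a, frame_b):
    """Frame-pair transition score (assumed form of (4)):
    1 for identical frames, approaching 0 as the mean difference grows."""
    diff = color_difference(frame_a, frame_b)
    p, q = len(diff), len(diff[0])
    mean_diff = sum(sum(row) for row in diff) / (p * q)
    return 1.0 / (1.0 + mean_diff)

def overall_image_consistency(frames):
    """F_i: mean pair score over the h-1 adjacent joins (assumed form of (5))."""
    scores = [pair_consistency(a, b) for a, b in zip(frames, frames[1:])]
    return sum(scores) / len(scores)
```

Any monotone mapping from mean color difference to a score in (0, 1] would serve the same ranking purpose; the reciprocal form here is only an illustrative choice.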
In one embodiment, step S4012 — determining the degree of sound transition consistency between the last sound segment of the preceding video sub-data and the first sound segment of the following video sub-data, for every two adjacent video sub-data in the current spliced teaching video — specifically includes:
Let n_ij and n_i(j+1) denote, respectively, the decibel level of the ending sound at the last image frame of the j-th video sub-data and the decibel level of the starting sound at the first image frame of the (j+1)-th video sub-data in the i-th spliced teaching video;
The degree of sound transition consistency between the j-th and (j+1)-th video sub-data in the i-th spliced teaching video is then calculated according to the following formula (6):
The larger this value, the higher the degree of sound transition consistency between the j-th and (j+1)-th video sub-data in the i-th spliced teaching video;
Then, the comprehensive evaluation value W_i of sound transition consistency corresponding to the i-th spliced teaching video is calculated according to the following formula (7):
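Formulas (6)-(7) are likewise images in the source. A natural reading compares the ending decibel level n_ij with the starting decibel level n_i(j+1), scoring 1 when the levels match and less as they diverge, with W_i averaging over the joins; the exact functional form below is an assumption:

```python
def sound_consistency(end_db: float, start_db: float) -> float:
    """Assumed form of (6): 1.0 when the ending and starting levels match,
    decreasing toward 0 as they diverge (relative difference)."""
    if end_db == start_db:
        return 1.0
    return 1.0 - abs(end_db - start_db) / max(abs(end_db), abs(start_db))

def overall_sound_consistency(joins):
    """W_i (assumed form of (7)): mean score over the (end_db, start_db)
    pairs measured at each join between adjacent video sub-data."""
    scores = [sound_consistency(end_db, start_db) for end_db, start_db in joins]
    return sum(scores) / len(scores)
```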
In one embodiment, step S4013 — determining the video goodness of the current spliced teaching video according to the degrees of image picture transition consistency and sound transition consistency corresponding to every two adjacent video sub-data in the current spliced teaching video — specifically includes:
calculating the video goodness evaluation value P_i of the i-th spliced teaching video according to the following formula (8):
P_i = α1·F_i + α2·W_i   (8)
where α1 denotes the preset weight of F_i, a value greater than 0 and less than 1; α2 denotes the preset weight of W_i, also a value greater than 0 and less than 1; and α1 + α2 = 1.
The larger the value of P_i, the better the video goodness of the i-th spliced teaching video.
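The weighted combination of formula (8), together with the ranking of steps S401-S402, can be sketched directly. The equal weighting α1 = α2 = 0.5 is an illustrative default, since the patent constrains the weights only to lie in (0, 1) and sum to 1:

```python
def video_goodness(f_i: float, w_i: float, alpha1: float = 0.5) -> float:
    """Formula (8): P_i = α1·F_i + α2·W_i, with α2 = 1 - α1."""
    alpha2 = 1.0 - alpha1
    return alpha1 * f_i + alpha2 * w_i

def best_video(candidates):
    """Steps S401-S402: score each of the M spliced videos and keep the best.

    `candidates` maps a video identifier to its (F_i, W_i) pair; the
    identifier scheme is hypothetical, not part of the patent."""
    scores = {vid: video_goodness(f, w) for vid, (f, w) in candidates.items()}
    return max(scores, key=scores.get), scores
```

For instance, with candidates `{"a": (0.9, 0.8), "b": (0.4, 0.5)}` the best video is "a" with P = 0.85.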
The beneficial effects of the above technical scheme are: by comprehensively evaluating each spliced teaching video with respect to both the degree of image picture transition consistency and the degree of sound transition consistency, the spliced teaching video of best quality can be identified comprehensively and accurately, improving the viewability of the final teaching video and ensuring its watching quality. Compared with manual editing in video software and with traditional methods, the calculation is markedly faster and substantially more accurate.
Corresponding to the teaching video processing method provided by the embodiment of the invention, the invention further provides a teaching video processing system, comprising:
a shooting processing module, configured to shoot a preset teaching process from two different orientations so as to obtain two corresponding original video data related to the preset teaching process, and to perform video quality preprocessing on each of the two original video data so as to obtain two corresponding preprocessed video data satisfying a preset image quality condition and/or a preset sound quality condition;
a video processing module, configured to obtain, from the two preprocessed video data and according to shooting orientation, two video sub-data sets corresponding to the two shooting orientations, where each video sub-data set corresponds to one shooting orientation and comprises N video sub-data that are consecutive in shooting time sequence, the shooting time periods corresponding to all video sub-data are of equal length, and the N video sub-data correspond to N shooting time periods respectively;
a video splicing module, configured to perform M splicing operations on the video sub-data according to the shooting time sequence so as to obtain M spliced teaching videos, where each splicing operation comprises: randomly selecting N1 video sub-data from one video sub-data set and N2 video sub-data from the other video sub-data set, with N1 + N2 = N; the shooting time periods corresponding to the N1 video sub-data and those corresponding to the N2 video sub-data jointly constitute the N shooting time periods; and splicing the N1 video sub-data and the N2 video sub-data in shooting time sequence, thereby obtaining the spliced teaching video corresponding to that splicing operation; and
a final determination module, configured to determine the video goodness of each of the M spliced teaching videos, determine the goodness ranking of the M spliced teaching videos according to the video goodness, and take the spliced teaching video with the best video goodness as the final teaching video.
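One splicing operation of the video splicing module (step S3) amounts to choosing, for each of the N shooting time periods, the clip from one of the two viewpoint sets while preserving time order, so that N1 + N2 = N automatically. A minimal sketch with a hypothetical data layout (each set is a list of clips indexed by shooting period):

```python
import random

def splice_once(set_a, set_b, rng=random):
    """One splicing operation: for each shooting period t, pick the clip for
    that period from either viewpoint set at random, keeping time order.
    Every period is covered exactly once, so the counts drawn from the two
    sets always sum to N."""
    n = len(set_a)
    assert len(set_b) == n, "both viewpoint sets must cover the same N periods"
    return [rng.choice((set_a[t], set_b[t])) for t in range(n)]
```

Running this M times yields the M candidate spliced teaching videos that the final determination module then ranks.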
As can be seen from the above embodiments, the teaching video processing method and system shoot a preset teaching process from two different orientations to obtain two corresponding original video data, and perform video quality preprocessing on each so as to obtain two preprocessed video data satisfying a preset image quality condition and/or a preset sound quality condition. The preprocessed video data are screened by shooting orientation into two video sub-data sets; the video sub-data in the sets are spliced in shooting time sequence to obtain spliced teaching videos; and the spliced teaching videos are ranked by quality according to their respective degrees of image picture transition consistency and sound transition consistency, so that the best spliced teaching video can be converted into a three-dimensional teaching video. The teaching video processing method and system thus ensure that the teaching video attains higher presentation quality in both image and sound, improving the user's immersion while watching and the watching quality of the teaching video.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.
Claims (7)
1. A teaching video processing method, characterized by comprising the following steps:
step S1, shooting a preset teaching process from two different orientations so as to obtain two corresponding original video data related to the preset teaching process, and performing video quality preprocessing on each of the two original video data so as to obtain two corresponding preprocessed video data satisfying a preset image quality condition and/or a preset sound quality condition;
step S2, obtaining, from the two preprocessed video data and according to shooting orientation, two video sub-data sets corresponding to the two shooting orientations, wherein each video sub-data set corresponds to one shooting orientation and comprises N video sub-data that are consecutive in shooting time sequence, the shooting time periods corresponding to all video sub-data are of equal length, and the N video sub-data correspond to N shooting time periods respectively;
step S3, performing M splicing operations on the video sub-data according to the shooting time sequence so as to obtain M spliced teaching videos, wherein each splicing operation comprises: randomly selecting N1 video sub-data from one video sub-data set and N2 video sub-data from the other video sub-data set, with N1 + N2 = N; the shooting time periods corresponding to the N1 video sub-data and those corresponding to the N2 video sub-data jointly constitute the N shooting time periods; and splicing the N1 video sub-data and the N2 video sub-data in shooting time sequence, thereby obtaining the spliced teaching video corresponding to that splicing operation; and
step S4, determining the video goodness of each of the M spliced teaching videos, determining the goodness ranking of the M spliced teaching videos according to the video goodness, and taking the spliced teaching video with the best video goodness as the final teaching video.
2. The teaching video processing method of claim 1, wherein:
in step S1, shooting the preset teaching process from two different orientations so as to obtain two corresponding original video data related to the preset teaching process, and performing video quality preprocessing on each of the two original video data so as to obtain two corresponding preprocessed video data satisfying the preset image quality condition and/or the preset sound quality condition, specifically includes:
step S101, shooting the preset teaching process from two mutually intersecting view-angle directions, thereby obtaining two groups of original video data in a parallax relationship related to the preset teaching process;
step S102, performing image and sound separation on each group of original video data to obtain the image data information and sound data information corresponding to each group, and determining from them the average image resolution and average sound signal-to-noise ratio corresponding to each group of original video data;
step S103, if the average image resolution is smaller than a preset image resolution threshold, performing pixel detail restoration on the image data information so that the average image resolution becomes greater than or equal to the preset image resolution threshold, or, if the average sound signal-to-noise ratio is smaller than a preset signal-to-noise ratio threshold, performing background noise filtering on the sound data information so that the average signal-to-noise ratio becomes greater than or equal to the preset signal-to-noise ratio threshold; and
step S104, recombining the image data information whose average image resolution is greater than or equal to the preset image resolution threshold with the sound data information whose average sound signal-to-noise ratio is greater than or equal to the preset signal-to-noise ratio threshold, thereby obtaining the preprocessed video data.
3. The teaching video processing method of claim 1, wherein:
step S4 — determining the video goodness of each of the M spliced teaching videos, determining the goodness ranking of the M spliced teaching videos according to the video goodness, and taking the spliced teaching video with the best video goodness as the final teaching video — specifically includes:
step S401, executing the following operation steps S4011-S4013 for each spliced teaching video, thereby obtaining the video goodness of each of the M spliced teaching videos:
step S4011, determining the degree of image picture transition consistency between the last image frame of the preceding video sub-data and the first image frame of the following video sub-data, for every two adjacent video sub-data in the current spliced teaching video;
step S4012, determining the degree of sound transition consistency between the last sound segment of the preceding video sub-data and the first sound segment of the following video sub-data, for every two adjacent video sub-data in the current spliced teaching video;
step S4013, determining the video goodness of the current spliced teaching video according to the degrees of image picture transition consistency and sound transition consistency corresponding to every two adjacent video sub-data in the current spliced teaching video; and
step S402, determining the goodness ranking of the M spliced teaching videos according to their video goodness, and taking the spliced teaching video with the best video goodness as the final teaching video.
4. The teaching video processing method of claim 3, wherein:
in step S4011, determining the degree of image picture transition consistency between the last image frame of the preceding video sub-data and the first image frame of the following video sub-data, for every two adjacent video sub-data in the current spliced teaching video, specifically includes:
Let m_ij and m_i(j+1) denote, respectively, the last image frame of the j-th video sub-data and the first image frame of the (j+1)-th video sub-data in the i-th spliced teaching video. Image frames m_ij and m_i(j+1) both have resolution p × q, where p is the maximum number of pixels of an image frame in the horizontal direction and q is the maximum number of pixels in the vertical direction. Let (x, y) denote the coordinates of any pixel point in m_ij and m_i(j+1), where x takes any integer value from 0 to p and y any integer value from 0 to q. Then m_ijs(x, y) and m_i(j+1)s(x, y) denote the normalized color values of the corresponding pixel point (x, y) in image frames m_ij and m_i(j+1), calculated as follows:
m_ijs(x, y) = 10^6·m_ijR(x, y) + 10^3·m_ijG(x, y) + m_ijB(x, y)   (1)
m_i(j+1)s(x, y) = 10^6·m_i(j+1)R(x, y) + 10^3·m_i(j+1)G(x, y) + m_i(j+1)B(x, y)   (2)
In the above formulas (1) and (2), m_ijR(x, y), m_ijG(x, y), and m_ijB(x, y) denote the red, green, and blue color values of pixel point (x, y) in image frame m_ij, while m_i(j+1)R(x, y), m_i(j+1)G(x, y), and m_i(j+1)B(x, y) denote the red, green, and blue color values of pixel point (x, y) in image frame m_i(j+1);
Then, the color difference value between image frames m_ij and m_i(j+1) at pixel point (x, y) is calculated according to the following formula (3):
Next, the image frame transition consistency evaluation value of image frames m_ij and m_i(j+1) is calculated according to the following formula (4):
The larger this evaluation value, the higher the degree of image picture transition consistency between image frames m_ij and m_i(j+1);
Then, the comprehensive evaluation value of image picture transition consistency corresponding to the i-th spliced teaching video is calculated according to the following formula (5):
where F_i denotes the comprehensive evaluation value of image picture transition consistency corresponding to the i-th spliced teaching video, and h denotes the total number of video sub-data in the i-th spliced teaching video.
5. The teaching video processing method of claim 4, wherein:
in step S4012, determining the degree of sound transition consistency between the last sound segment of the preceding video sub-data and the first sound segment of the following video sub-data, for every two adjacent video sub-data in the current spliced teaching video, specifically includes:
Let n_ij and n_i(j+1) denote, respectively, the decibel level of the ending sound at the last image frame of the j-th video sub-data and the decibel level of the starting sound at the first image frame of the (j+1)-th video sub-data in the i-th spliced teaching video;
The degree of sound transition consistency between the j-th and (j+1)-th video sub-data in the i-th spliced teaching video is then calculated according to the following formula (6):
The larger this value, the higher the degree of sound transition consistency between the j-th and (j+1)-th video sub-data in the i-th spliced teaching video;
Then, the comprehensive evaluation value W_i of sound transition consistency corresponding to the i-th spliced teaching video is calculated according to the following formula (7):
6. The teaching video processing method of claim 5, wherein:
in step S4013, determining the video goodness of the current spliced teaching video according to the degrees of image picture transition consistency and sound transition consistency corresponding to every two adjacent video sub-data in the current spliced teaching video, specifically includes:
calculating the video goodness evaluation value P_i of the i-th spliced teaching video according to the following formula (8):
P_i = α1·F_i + α2·W_i   (8)
wherein α1 denotes the preset weight of F_i, a value greater than 0 and less than 1; α2 denotes the preset weight of W_i, also a value greater than 0 and less than 1; and α1 + α2 = 1;
the larger the value of P_i, the better the video goodness of the i-th spliced teaching video.
7. A teaching video processing system, characterized by comprising:
a shooting processing module, configured to shoot a preset teaching process from two different orientations so as to obtain two corresponding original video data related to the preset teaching process, and to perform video quality preprocessing on each of the two original video data so as to obtain two corresponding preprocessed video data satisfying a preset image quality condition and/or a preset sound quality condition;
a video processing module, configured to obtain, from the two preprocessed video data and according to shooting orientation, two video sub-data sets corresponding to the two shooting orientations, wherein each video sub-data set corresponds to one shooting orientation and comprises N video sub-data that are consecutive in shooting time sequence, the shooting time periods corresponding to all video sub-data are of equal length, and the N video sub-data correspond to N shooting time periods respectively;
a video splicing module, configured to perform M splicing operations on the video sub-data according to the shooting time sequence so as to obtain M spliced teaching videos, wherein each splicing operation comprises: randomly selecting N1 video sub-data from one video sub-data set and N2 video sub-data from the other video sub-data set, with N1 + N2 = N; the shooting time periods corresponding to the N1 video sub-data and those corresponding to the N2 video sub-data jointly constitute the N shooting time periods; and splicing the N1 video sub-data and the N2 video sub-data in shooting time sequence, thereby obtaining the spliced teaching video corresponding to that splicing operation; and
a final determination module, configured to determine the video goodness of each of the M spliced teaching videos, determine the goodness ranking of the M spliced teaching videos according to the video goodness, and take the spliced teaching video with the best video goodness as the final teaching video.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010875240.1A CN111970538A (en) | 2020-08-27 | 2020-08-27 | Teaching video processing method and system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111970538A true CN111970538A (en) | 2020-11-20 |
Family
ID=73389906
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010875240.1A Pending CN111970538A (en) | 2020-08-27 | 2020-08-27 | Teaching video processing method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111970538A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114170619A (en) * | 2021-10-18 | 2022-03-11 | 中标慧安信息技术股份有限公司 | Data checking method and system based on edge calculation |
CN116226411A (en) * | 2023-05-06 | 2023-06-06 | 深圳市人马互动科技有限公司 | Interactive information processing method and device for interactive project based on animation |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100157050A1 (en) * | 2008-12-18 | 2010-06-24 | Honeywell International Inc. | Process of sequentially dubbing a camera for investigation and review |
CN102254467A (en) * | 2010-05-17 | 2011-11-23 | 上海量明科技发展有限公司 | Tourism resource education equipment with three-dimensional data acquisition and fusion and realization method thereof |
US20150085197A1 (en) * | 2013-09-26 | 2015-03-26 | Alps Electric Co., Ltd. | Video display device |
US20150281595A1 (en) * | 2014-03-27 | 2015-10-01 | Sony Corporation | Apparatus and method for video generation |
CN108521589A (en) * | 2018-04-25 | 2018-09-11 | 北京比特智学科技有限公司 | Method for processing video frequency and device |
CN110010164A (en) * | 2018-11-13 | 2019-07-12 | 成都依能科技股份有限公司 | Multi-channel sound video automated intelligent edit methods |
CN209524299U (en) * | 2019-02-18 | 2019-10-22 | 陕西师范大学 | A kind of multimedia interactive teaching video capture device |
CN111432141A (en) * | 2020-03-31 | 2020-07-17 | 北京字节跳动网络技术有限公司 | Method, device and equipment for determining mixed-cut video and storage medium |
CN111447462A (en) * | 2020-05-20 | 2020-07-24 | 上海科技大学 | Video live broadcast method, system, storage medium and terminal based on visual angle switching |
- 2020-08-27: filed in China as application CN202010875240.1A (publication CN111970538A), status Pending
Non-Patent Citations (1)
Title |
---|
Song Fan (宋凡): "Research on the Production Method of Teaching Videos in Top-Quality Resource-Sharing Courses", China Master's Theses Full-text Database (Social Sciences Series II) *
Legal Events
Date | Code | Title | Description
---|---|---|---
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| AD01 | Patent right deemed abandoned | Effective date of abandoning: 2022-10-18 |