CN107483843A - Audio frequency and video match clipping method and device - Google Patents
Audio frequency and video match clipping method and device
- Publication number
- CN107483843A CN201710701832.XA
- Authority
- CN
- China
- Prior art keywords
- video
- target
- target video
- music
- value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/222—Studio circuitry; Studio devices; Studio equipment
- H04N5/262—Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
- H04N5/265—Mixing
Abstract
The present invention provides an audio-video matching editing method and device, and relates to the field of multimedia data processing. The method and device obtain target music that has been marked in advance with multiple cut points, the cut points dividing the target music into multiple music segments; cut at least one obtained target video into multiple video segments according to the duration of the music segments; select a preset number of video segments from the multiple video segments as target video segments; and, using a filling algorithm, calculate the fill value of each target video segment from its weight and filling position and insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value. Based on this design, the method and device simplify audio-video editing while also improving the quality of the edited video.
Description
Technical field
The present invention relates to the field of multimedia data processing, and in particular to an audio-video matching editing method and device.
Background
At present, editing audio or video is generally done entirely by hand. For example, an operator uses video editing software to combine several video clips into one video, then edits the background music so that its duration matches that of the video, and finally loads the background music into the video to obtain a new video. In the prior art this editing operation is complicated and places high technical demands on the operator; otherwise the content of the edited video tends not to match the rhythm of the music, which degrades the quality of the video. How to provide a method and device that is simple to operate and improves the quality of the edited video has therefore become a technical problem that those skilled in the art urgently need to solve.
Summary of the invention
To overcome the above deficiencies of the prior art, the present invention provides an audio-video matching editing method and device to solve the above problems.
To achieve the above object, the technical solutions provided by the preferred embodiments of the present invention are as follows:
A preferred embodiment of the present invention provides an audio-video matching editing method, the method including:
obtaining target music marked in advance with multiple cut points, the target music being divided by the cut points into multiple music segments;
cutting at least one obtained target video into multiple video segments according to the duration of the music segments;
selecting a preset number of video segments from the multiple video segments as target video segments;
using a filling algorithm, calculating the fill value of each target video segment according to its weight and filling position, and, according to the fill value of each target video segment, inserting the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value.
In a preferred embodiment of the present invention, the overall value is the sum of the fill values of the target video segments, and the fill value of a target video segment includes a self-value and a discrete value;
the self-value is calculated by:
analyzing feature information of each target video segment, the feature information including face information, multi-person scene information, smile information in faces, and manually assigned label information;
according to the feature information, assigning the weight corresponding to each video segment as the self-value of that target video segment;
the discrete value is calculated as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
In a preferred embodiment of the present invention, the step of calculating the fill value of each target video segment according to its weight and filling position, and inserting the target video segments between the corresponding cut points of the target music according to the fill value of each target video segment, includes:
using a greedy approximation algorithm, iteratively calculating the self-value and discrete value of each video segment taken as a target video segment, to obtain the overall values of multiple corresponding video files;
selecting, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
In a preferred embodiment of the present invention, the video segments inserted between the corresponding cut points of the target music are filled in order of decreasing weight, subject to the discreteness condition.
In a preferred embodiment of the present invention, before the step of obtaining the target music marked in advance with multiple cut points, the method further includes:
extracting sound amplitude information in a preset frequency range from the target music;
selecting the time points at which the amplitude in the preset frequency range increases sharply as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
In a preferred embodiment of the present invention, the step of cutting at least one obtained target video into multiple video segments according to the duration of the music segments includes:
taking the duration of the longest music segment as the duration of the video segments to be cut.
In a preferred embodiment of the present invention, the number of video segments inserted into the target music is equal to the number of music segments of the target music.
In a preferred embodiment of the present invention, the step of inserting the target video segments between the corresponding cut points of the target music includes:
correcting the length of each target video segment so that it equals the length of the corresponding music segment in the target music.
A preferred embodiment of the present invention also provides an audio-video matching editing device, including:
an acquiring unit, configured to obtain target music marked in advance with multiple cut points, the target music being divided by the cut points into multiple music segments;
a cutting unit, configured to cut at least one obtained target video into multiple video segments according to the duration of the music segments;
a selecting unit, configured to select a preset number of video segments from the multiple video segments as target video segments;
a video composition unit, configured to use a filling algorithm to calculate the fill value of each target video segment according to its weight and filling position, and, according to the fill value of each target video segment, to insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value.
In a preferred embodiment of the present invention, the overall value is the sum of the fill values of the target video segments, and the fill value of a target video segment includes a self-value and a discrete value;
the video composition unit calculates the self-value by:
analyzing feature information of each target video segment, the feature information including face information, multi-person scene information, smile information in faces, and manually assigned label information;
according to the feature information, assigning the weight corresponding to each video segment as the self-value of that target video segment;
the video composition unit calculates the discrete value as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
Compared with the prior art, the audio-video matching editing method and device provided by the present invention have at least the following beneficial effects. The method inserts the multiple cut video segments into target music marked with multiple cut points to obtain video files, and selects the one with the maximum overall value as the new video file, which simplifies audio-video editing and also improves the quality of the edited video. Specifically, the method uses a filling algorithm to calculate the fill value of each video segment according to the weight and filling position of the target video segment, and, according to the fill value of each video segment, inserts the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value. The method and device make the video segments in the edited video file match the rhythm of the corresponding music in the target music, which improves the quality of the edited video and also helps improve the user experience.
To make the above objects, features, and advantages of the present invention more apparent, preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings required by the embodiments are briefly described below. It should be understood that the following drawings show only certain embodiments of the present invention and should therefore not be regarded as limiting its scope; those of ordinary skill in the art can obtain other related drawings from these drawings without creative effort.
Fig. 1 is a block diagram of the terminal device provided by a preferred embodiment of the present invention.
Fig. 2 is a schematic flowchart of the audio-video matching editing method provided by a preferred embodiment of the present invention.
Fig. 3 is a schematic flowchart of the sub-steps of step S240 shown in Fig. 2.
Fig. 4 is a block diagram of the audio-video matching editing device provided by a preferred embodiment of the present invention.
Reference numerals: 10 - terminal device; 11 - processor; 12 - memory; 13 - display unit; 100 - audio-video matching editing device; 110 - acquiring unit; 120 - cutting unit; 130 - selecting unit; 140 - video composition unit.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. The components of the embodiments of the present invention generally described and illustrated in the drawings herein can be arranged and designed in a variety of different configurations.
Therefore, the following detailed description of the embodiments of the present invention provided in the drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the present invention. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
It should be noted that similar reference numerals and letters denote similar items in the following drawings; therefore, once an item is defined in one drawing, it need not be further defined or explained in subsequent drawings.
Some embodiments of the present invention are described in detail below with reference to the drawings. Where no conflict arises, the following embodiments and the features in the embodiments may be combined with one another.
Refer to Fig. 1, which is a block diagram of the terminal device 10 provided by a preferred embodiment of the present invention. In this embodiment, the terminal device 10 can serve as an operating platform on which a user edits video and audio. The terminal device 10 may include a processor 11, a memory 12, and an audio-video matching editing device 100. The user can use the audio-video matching editing device 100 in the terminal device 10 to cut, edit, and synthesize audio and video, so as to obtain an edited video file with a simplified editing process.
Further, the terminal device 10 may also include other elements, such as a display unit 13. The processor 11, the memory 12, and the display unit 13 are electrically connected to one another, directly or indirectly, to enable the transmission and exchange of data. The audio-video matching editing device 100 includes at least one software function module that may be stored in the memory 12 in the form of software or firmware, or built into the operating system (OS) of the terminal device 10. The memory 12 can store data such as audio data and video data. The processor 11 is used to execute the executable modules stored in the memory 12, such as the software function modules and computer programs included in the audio-video matching editing device 100.
Further, the memory 12 may be, but is not limited to, random access memory (RAM), read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), and the like. The memory 12 is used to store a program, and the processor 11 executes the program after receiving an execution instruction. Access to the memory 12 by the processor 11 and other possible components may be carried out under the control of a storage controller.
The processor 11 may be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), and the like; it may also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. It can implement or execute the methods, steps, and logic block diagrams disclosed in the embodiments of the present invention. The general-purpose processor may be a microprocessor, or the processor 11 may be any conventional processor.
In this embodiment, the display unit 13 is used to play the video and audio (for example, the target music) edited on the terminal device 10. The display unit 13 can also be used to display the usage history of audio or video. In addition, the display unit 13 can display an editing toolbar that the user has set up according to personal editing habits, for ease of operation. The display unit 13 may be, but is not limited to, a touch screen, an ordinary liquid crystal display, and the like, and is not specifically limited here.
It can be understood that the structure shown in Fig. 1 is only one structural schematic of the terminal device 10; the terminal device 10 may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1. Each component shown in Fig. 1 can be implemented in hardware, software, or a combination thereof.
In this embodiment, the terminal device 10 may be, but is not limited to, a smartphone, a personal computer (PC), a tablet computer, a personal digital assistant (PDA), and the like; preferably, the terminal device 10 is a smartphone.
Refer to Fig. 2, which is a schematic flowchart of the audio-video matching editing method provided by a preferred embodiment of the present invention. In this embodiment, the audio-video matching editing method is applied to the terminal device 10 shown in Fig. 1. The method fills the cut video segments into target music marked with cut points to form a new video file, thereby simplifying the steps of editing audio and video. The specific flow and steps of the audio-video matching editing method shown in Fig. 2 are described in detail below.
In the embodiments of the present invention, the audio-video matching editing method includes the following steps:
Step S210: obtain target music marked in advance with multiple cut points, the target music being divided by the cut points into multiple music segments.
In this embodiment, the acquired target music is provided in advance with multiple cut points, which serve as the insertion points for the video segments, so that each inserted video segment matches its corresponding music segment. In addition, the user can set the number of cut points according to the specific situation, which is not specifically limited here.
Before step S210, the memory 12 may store one or more songs in advance, and the user can select one of them, according to personal preference, as the background music of the video to be edited, that is, the target music. Of course, one piece of music may also be selected at random as the target music. Cut points can then be set for the target music according to the musical features extracted from it. Understandably, the cut points divide the target music into multiple music segments, and each cut video segment is matched with a corresponding music segment; that is, the matched video segment exactly fills the corresponding music segment.
In this embodiment, the musical features include rhythm features, and the rhythm features include the sound amplitude information of the target music. The above step of setting cut points for the target music according to the extracted musical features can be understood as: extracting sound amplitude information in a preset frequency range from the target music; and selecting the time points at which the amplitude in the preset frequency range increases sharply as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
Generally, music contains a harmonic component and a rhythm component. Understandably, the harmonic component is the music played by pitched instruments, such as orchestral instruments. The rhythm component is the music played by unpitched instruments, such as drums. The musical features can be the beat information in the rhythm component, such as the points at which the amplitude suddenly increases. When extracting the musical features of the target music, the harmonic component and the rhythm component of the target music can be separated to obtain the rhythm component. The sound amplitude information is then extracted from the rhythm component as the musical features of the target music.
Further, if the sound corresponding to the rhythm component is rendered as a spectrogram, a time point at which the amplitude increases sharply can be understood as a time inflection point, within the preset frequency range, at which the amplitude changes from decreasing to increasing. Optionally, the amplitude corresponding to the inflection point is not less than a preset amplitude threshold.
Further, the cut points work together with the cut video segments and serve as the points at which the video segments are loaded. The interval between adjacent cut points exceeds a preset duration, which prevents the interval between adjacent cut points from being too short; a too-short interval would make the inserted video segment too short as well and would in turn impair the playback of the edited video.
In this embodiment, the amplitude threshold, the preset duration, and the preset frequency range can be set according to the specific situation and are not specifically limited here.
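The cut-point rule described above can be sketched as follows. This is a minimal illustration, assuming the rhythm component has already been reduced to a per-frame amplitude envelope for the preset frequency range; the names `pick_cut_points`, `frame_sec`, `min_amplitude`, and `min_gap_sec` are hypothetical and do not come from the patent.

```python
from typing import List, Sequence


def pick_cut_points(amplitude: Sequence[float], frame_sec: float,
                    min_amplitude: float, min_gap_sec: float) -> List[float]:
    """Return cut-point times in seconds from a per-frame amplitude envelope.

    Assumed reading of the text: a cut point is a frame where the amplitude
    turns from decreasing to increasing, where the amplitude is not below the
    preset threshold, and which lies more than the preset duration after the
    previously accepted cut point.
    """
    cuts: List[float] = []
    for i in range(1, len(amplitude) - 1):
        falling_before = amplitude[i] < amplitude[i - 1]
        rising_after = amplitude[i + 1] > amplitude[i]
        if not (falling_before and rising_after):
            continue  # not an inflection from decreasing to increasing
        if amplitude[i] < min_amplitude:
            continue  # below the preset amplitude threshold
        t = i * frame_sec
        if cuts and t - cuts[-1] <= min_gap_sec:
            continue  # too close to the previous cut point
        cuts.append(t)
    return cuts


envelope = [5, 3, 6, 2, 7, 4, 1, 8]
print(pick_cut_points(envelope, frame_sec=0.5, min_amplitude=2, min_gap_sec=0.6))
```

Raising `min_gap_sec` thins out the cut points, which is the lever the text uses to keep inserted video segments from becoming too short.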
Step S220: cut at least one obtained target video into multiple video segments according to the duration of the music segments.
In this embodiment, the method may take the duration of the longest music segment as the duration of the video segments to be cut, and then cut the one or more target videos obtained in advance into multiple video segments of that same fixed duration. Understandably, the fixed duration is the longest interval between adjacent marked points, so that a video segment can fill the gap between any two adjacent marked points in the target music, which avoids the situation in which the edited video plays music with no video content displayed.
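A minimal sketch of step S220 follows, under two assumptions that the text does not state: the start and end of the music act as implicit segment boundaries, and any video tail shorter than the fixed duration is discarded. The function names and the `(video_id, index, start_time)` tuple layout are illustrative only.

```python
from typing import List, Sequence, Tuple


def music_segment_durations(cut_points: Sequence[float],
                            music_length: float) -> List[float]:
    """Durations of the music segments delimited by the cut points."""
    bounds = [0.0] + sorted(cut_points) + [music_length]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]


def split_videos(video_lengths: Sequence[float],
                 segment_len: float) -> List[Tuple[int, int, float]]:
    """Cut every target video into fixed-length segments.

    Returns (video_id, index_within_video, start_time) for each segment.
    """
    segments = []
    for video_id, length in enumerate(video_lengths):
        count = int(length // segment_len)  # whole segments that fit
        for k in range(count):
            segments.append((video_id, k, k * segment_len))
    return segments


durations = music_segment_durations([2.0, 5.0, 6.0], music_length=10.0)
fixed_len = max(durations)  # the longest music segment sets the segment length
print(fixed_len, split_videos([9.0, 4.0, 3.0], fixed_len))
```

Because every video segment is as long as the longest music segment, any segment can later be shortened to fit any slot, which is why no slot is left without video.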
Step S230: select a preset number of video segments from the multiple video segments as target video segments.
In this embodiment, the number of selected target video segments may be equal to the number of music segments into which the music is divided, so that each target video segment corresponds one-to-one with a music segment.
Step S240: using a filling algorithm, calculate the fill value of each target video segment according to its weight and filling position, and, according to the fill value of each target video segment, insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value.
In this embodiment, the filling algorithm is an editing filling algorithm. It should be understood that the editing filling algorithm seeks a best-fit strategy for inserting the target video segments between the marked points of the target music so that the overall value is maximized. The overall value can be understood as the quality of the edited video, for example the degree of matching between the video segments and the music segments and the continuity between video segments.
Further, the overall value can be the sum of the fill values of the target video segments, and the fill value of a target video segment includes a self-value and a discrete value. The self-value is associated with the weight of the corresponding target video segment. Specifically, the self-value may be calculated by analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, smile information in faces, and manually assigned label information, and then, according to the feature information, assigning the weight corresponding to each video segment as the self-value of that target video segment.
In this embodiment, each kind of feature information is given a corresponding weight before weights are assigned. For example, if analysis of a video segment finds smile information in a face, the pre-stored weight corresponding to smile information is retrieved and used as the weight of that video segment. Of course, in other embodiments the weight of a video segment can also be set manually. The size of the weights can be set according to the actual situation and is not specifically limited here.
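The weight lookup described above might look like the following sketch. The weight values in `FEATURE_WEIGHTS` are placeholders, and summing the weights when a segment shows several features is an assumption; the text only says that the pre-stored weight of a detected feature is used, and that a weight may instead be set manually.

```python
from typing import Iterable, Optional

# Hypothetical pre-stored weights, one per kind of feature information.
FEATURE_WEIGHTS = {
    "face": 2.0,
    "multi_person": 3.0,
    "smile": 4.0,
    "manual_tag": 5.0,
}


def self_value(features: Iterable[str],
               manual_weight: Optional[float] = None) -> float:
    """Self-value of one video segment.

    A manually set weight, if given, overrides the feature-based weight.
    Otherwise the weights of all detected features are summed (assumption).
    Unknown feature names contribute nothing.
    """
    if manual_weight is not None:
        return manual_weight
    return sum(FEATURE_WEIGHTS.get(name, 0.0) for name in set(features))


print(self_value(["face", "smile"]))
```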
Further, the feature information can also include other information to enrich the types of video segments. For example, the feature information can also include animal image information, such as image information of cats, dogs, and other animals, which is not described in detail here.
In this embodiment, the discrete value takes one of two forms depending on the situation. If at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments. If the target video segments do not come from the same target video, the discrete value is a preset value. The preset value can be set according to the specific situation and is not specifically limited here.
Optionally, the minimum discrete value of target video segments that are not in the same target video can be set greater than the maximum self-value, so as to avoid two target video segments that fill adjacent music segments also being adjacent in the original target video. Based on this design, the visual effect of the resulting video file can be improved.
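One possible reading of the discrete value is sketched below. The text only says that the value is "associated with" the distance between segments of the same target video, so the concrete formula here (distance in segment indices, zero for directly adjacent segments, capped at the preset value) is an assumption, as are the names `discrete_value`, `preset`, and `per_step`.

```python
from typing import Iterable, Tuple

Segment = Tuple[int, int]  # (video_id, index_within_video)


def discrete_value(candidate: Segment, chosen: Iterable[Segment],
                   preset: float = 10.0, per_step: float = 1.0) -> float:
    """Discrete value of a candidate target video segment.

    If no already-chosen segment comes from the same target video, the preset
    value is returned. Otherwise the value grows with the distance to the
    nearest chosen segment from the same video, and is 0 when they are
    directly adjacent in the source video.
    """
    video_id, index = candidate
    gaps = [abs(index - j) for v, j in chosen if v == video_id]
    if not gaps:
        return preset
    return min(preset, per_step * (min(gaps) - 1))


print(discrete_value((0, 5), [(0, 2), (0, 9)]))
```

Choosing `preset` larger than the largest self-value mirrors the optional design in the text: a segment from a different video then always outranks one that sits next to an already-used segment.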
Further, step S240 can also include one or more sub-steps. For example, refer to Fig. 3, which is a schematic flowchart of the sub-steps of step S240 shown in Fig. 2. In this embodiment, step S240 can include sub-step S241 and sub-step S242.
Sub-step S241: using a greedy approximation algorithm, iteratively calculate the self-value and discrete value of each video segment taken as a target video segment, to obtain the overall values of multiple corresponding video files.
Sub-step S242: select, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
In this embodiment, a dynamic programming algorithm for solving the 0-1 knapsack problem can be used to obtain the filling mode with the maximum overall value. A greedy approximation algorithm can then be used: the calculation is iterated, each result is compared with the previous one, the video file with the smaller overall value is discarded, and the video file with the larger overall value is kept as the result of the current iteration. Through this iterative calculation, the filling mode corresponding to the final maximum overall value can be selected, and the video file with the maximum overall value obtained.
Further, the video segments inserted between the corresponding cut points of the target music are filled in order of decreasing weight, subject to the discreteness condition. Understandably, provided that video segments filling adjacent positions in the target music are not adjacent in the original target video, the target video segments can be filled in order of decreasing weight.
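The filling order described above can be illustrated with a simple greedy pass. This is a sketch under stated assumptions rather than the patented algorithm: it fills the slots left to right, always taking the highest-weight unused segment that is not adjacent in its source video to the segment in the previous slot, and falls back to the highest-weight segment when no candidate satisfies the condition. It does not implement the dynamic-programming or iterative comparison steps; `Segment`, `adjacent_in_source`, and `greedy_fill` are hypothetical names.

```python
from typing import List, NamedTuple, Sequence


class Segment(NamedTuple):
    video_id: int   # which target video the segment was cut from
    index: int      # position of the segment within that video
    weight: float   # weight used as the segment's self-value


def adjacent_in_source(a: Segment, b: Segment) -> bool:
    """True when two segments are neighbours in the same target video."""
    return a.video_id == b.video_id and abs(a.index - b.index) == 1


def greedy_fill(candidates: Sequence[Segment], slot_count: int) -> List[Segment]:
    """Fill music slots in decreasing weight order under the discreteness condition."""
    remaining = sorted(candidates, key=lambda s: s.weight, reverse=True)
    filled: List[Segment] = []
    while remaining and len(filled) < slot_count:
        prev = filled[-1] if filled else None
        pick = next(
            (s for s in remaining
             if prev is None or not adjacent_in_source(prev, s)),
            None,
        )
        if pick is None:
            pick = remaining[0]  # fallback: no candidate meets the condition
        remaining.remove(pick)
        filled.append(pick)
    return filled


clips = [Segment(0, 0, 5.0), Segment(0, 1, 4.0), Segment(1, 0, 3.0), Segment(0, 2, 2.0)]
print(greedy_fill(clips, slot_count=3))
```

In the example, the second-heaviest clip is skipped for the second slot because it directly follows the first clip in the same source video, and it is placed in the third slot instead.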
Further, before the target video segments are inserted, the method can also include correcting the length of each target video segment so that it equals the length of the corresponding music segment in the target music.
Further, a target video segment can be trimmed so that its length equals the length of the music segment between the corresponding adjacent marked points. Alternatively, the target video segment can be sped up or slowed down so that its length equals the length of the music segment. Based on this design, the video file obtained after editing is continuous over the target music and the video playback is also continuous, which improves the experience of users watching the video file and thus the quality of the video file.
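The two length corrections mentioned above reduce to simple arithmetic, sketched here with hypothetical helper names. Trimming keeps the start of the segment (an assumption; the text does not say which part is kept), and the speed factor is expressed so that a value above 1 means faster playback.

```python
from typing import Tuple


def fit_by_trim(start: float, end: float, slot_len: float) -> Tuple[float, float]:
    """Trim a segment [start, end) so it lasts at most slot_len seconds."""
    return (start, min(end, start + slot_len))


def speed_factor(segment_len: float, slot_len: float) -> float:
    """Playback speed that makes a segment last exactly slot_len seconds.

    Values above 1 speed the segment up; values below 1 slow it down.
    """
    if segment_len <= 0 or slot_len <= 0:
        raise ValueError("durations must be positive")
    return segment_len / slot_len


print(fit_by_trim(4.0, 8.0, 2.5), speed_factor(4.0, 2.0))
```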
Refer to Fig. 4, which is a block diagram of the audio-video matching editing device 100 provided by a preferred embodiment of the present invention. A preferred embodiment of the present invention also provides an audio-video matching editing device 100, which can include an acquiring unit 110, a cutting unit 120, a selecting unit 130, and a video composition unit 140.
The acquiring unit 110 is configured to obtain target music marked in advance with multiple cut points, the target music being divided by the cut points into multiple music segments. Specifically, the acquiring unit 110 can be used to perform step S210 shown in Fig. 2; for the specific operation, refer to the detailed description of step S210.
The cutting unit 120 is configured to cut at least one obtained target video into multiple video segments according to the duration of the music segments. Specifically, the cutting unit 120 can be used to perform step S220 shown in Fig. 2; for the specific operation, refer to the detailed description of step S220.
The selecting unit 130 is configured to select a preset number of video segments from the multiple video segments as target video segments. Specifically, the selecting unit 130 can be used to perform step S230 shown in Fig. 2; for the specific operating method, refer to the detailed description of step S230.
The video synthesis unit 140 is configured to use a filling algorithm to calculate the fill value of each target video segment according to the weight and filling position of the target video segment and, according to the fill value of each target video segment, to insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments, matched with the music segments, form a new video file with the maximum overall value. Specifically, the video synthesis unit 140 can be used to perform step S240 shown in Fig. 2; for the specific operating method, refer to the detailed description of step S240.
Further, the video synthesis unit 140 can also be used to perform sub-step S241 and sub-step S242 shown in Fig. 3; for the specific operating method, refer to the detailed descriptions of sub-step S241 and sub-step S242, which are not repeated here.
In summary, the present invention provides an audio-video matching editing method and device. The method inserts the multiple video segments obtained by cutting into target music marked with multiple cut points to obtain video files, and selects the one with the maximum overall value as the new video file, which simplifies audio-video editing and also improves the quality of the edited video. The method calculates the fill value of each video segment according to the weight and filling position of the target video segment and, based on the fill value of each video segment, chooses the filling scheme with the maximum overall value, so that the target video segments are inserted between the corresponding cut points of the target music and, matched with the music segments, form the new video file with the maximum overall value. The method and device match the video segments in the edited video file to the rhythm of the corresponding target music, which improves the quality of the edited video and also helps improve the user experience.
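The filling step summarized above can be sketched as a simple greedy loop. This is an illustrative reading under stated assumptions rather than the algorithm of the embodiment: the `Segment` fields, the constant `PRESET_DISCRETE`, and the particular distance formula inside `discrete_value` are hypothetical. The source states only that the fill value combines a self-value (the weight) with a discrete value that depends on the distance between segments of the same target video, or equals a preset value otherwise.

```python
from dataclasses import dataclass

PRESET_DISCRETE = 1.0  # assumed preset value for segments of different videos


@dataclass(frozen=True)
class Segment:
    video_id: str   # which target video the segment was cut from
    index: int      # position of the segment inside that video
    weight: float   # self-value derived from the feature information


def discrete_value(seg, chosen):
    """Assumed form: grows with the distance to the nearest already-chosen
    segment of the same video, capped at the preset value. Directly
    adjacent segments score 0."""
    gaps = [abs(seg.index - c.index) for c in chosen
            if c.video_id == seg.video_id]
    if not gaps:
        return PRESET_DISCRETE
    return min(PRESET_DISCRETE, (min(gaps) - 1) / 4.0)


def greedy_fill(candidates, slot_count):
    """Fill slot_count music segments one at a time, each time taking the
    unused candidate with the highest fill value (self-value plus discrete
    value). Returns the chosen segments and the overall value."""
    chosen, total = [], 0.0
    remaining = list(candidates)
    for _ in range(min(slot_count, len(remaining))):
        best = max(remaining,
                   key=lambda s: s.weight + discrete_value(s, chosen))
        total += best.weight + discrete_value(best, chosen)
        chosen.append(best)
        remaining.remove(best)
    return chosen, total
```

With candidates `a0` (weight 0.9), `a1` (0.8), `a5` (0.6) from one video and `b0` (0.5) from another, three slots are filled with `a0`, `a5` and `b0`: the heavier `a1` is skipped because it sits directly next to `a0`. The sketch produces a single arrangement; comparing the overall values of several arrangements and keeping the largest would follow the same scoring.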
The foregoing is merely a preferred embodiment of the present invention and is not intended to limit the invention. For those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent substitution, improvement and the like made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.
Claims (10)
1. An audio-video matching editing method, characterized in that the method comprises:
obtaining target music pre-marked with multiple cut points, the target music being divided into multiple music segments by the cut points;
cutting at least one obtained target video into multiple video segments according to the duration of the music segments;
selecting a preset number of video segments from the multiple video segments as target video segments;
using a filling algorithm, calculating the fill value of each target video segment according to the weight and filling position of the target video segment, and, according to the fill value of each target video segment, inserting the target video segments between the corresponding cut points of the target music, so that the inserted target video segments, matched with the music segments, form a new video file with the maximum overall value.
2. The method according to claim 1, characterized in that the overall value is the sum of the fill values of the target video segments, and the fill value of a target video segment includes a self-value and a discrete value;
the self-value is calculated by:
analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, facial smile information and manual annotation information;
assigning, according to the feature information, a weight to each video segment as the self-value of the corresponding target video segment;
the discrete value is calculated as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
3. The method according to claim 2, characterized in that the step of calculating the fill value of each target video segment according to the weight and filling position of the target video segment, and inserting the target video segments between the corresponding cut points of the target music according to the fill value of each target video segment, comprises:
using a greedy approximation algorithm, iteratively calculating the self-value and the discrete value of each video segment taken as a target video segment, to obtain the overall values of multiple corresponding video files;
selecting, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
4. The method according to claim 3, characterized in that the video segments inserted between the corresponding cut points of the target music are filled in order of decreasing weight, provided that the discreteness condition is satisfied.
5. The method according to claim 1, characterized in that, before the step of obtaining the target music pre-marked with multiple cut points, the method further comprises:
extracting sound amplitude information of a preset frequency band from the target music;
selecting the time points at which the amplitude in the preset frequency band rises sharply as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
6. The method according to claim 1, characterized in that the step of cutting at least one obtained target video into multiple video segments according to the duration of the music segments comprises:
selecting the duration of the longest music segment as the duration of the cut video segments.
7. The method according to claim 1, characterized in that the number of video segments inserted into the target music equals the number of music segments of the target music.
8. The method according to claim 1, characterized in that the step of inserting the target video segments between the corresponding cut points of the target music comprises:
correcting the length of each target video segment, so that the length of the target video segment equals the length of the corresponding music segment in the target music.
9. An audio-video matching editing device, characterized by comprising:
an acquiring unit, configured to obtain target music pre-marked with multiple cut points, the target music being divided into multiple music segments by the cut points;
a cutting unit, configured to cut at least one obtained target video into multiple video segments according to the duration of the music segments;
a selecting unit, configured to select a preset number of video segments from the multiple video segments as target video segments;
a video synthesis unit, configured to use a filling algorithm to calculate the fill value of each target video segment according to the weight and filling position of the target video segment and, according to the fill value of each target video segment, to insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments, matched with the music segments, form a new video file with the maximum overall value.
10. The device according to claim 9, characterized in that the overall value is the sum of the fill values of the target video segments, and the fill value of a target video segment includes a self-value and a discrete value;
the video synthesis unit calculates the self-value by:
analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, facial smile information and manual annotation information;
assigning, according to the feature information, a weight to each video segment as the self-value of the corresponding target video segment;
the video synthesis unit calculates the discrete value as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
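As an illustration of claims 5 and 6, the sketch below picks cut points from a band-limited amplitude envelope and then cuts a video into pieces as long as the longest music segment. It is a rough sketch under stated assumptions: the function names, the `rise_ratio` threshold used to decide what counts as a sharp rise, and the handling of the leftover tail are hypothetical, and the amplitude envelope is assumed to have been extracted beforehand.

```python
def pick_cut_points(times, amplitudes, rise_ratio=1.5, min_gap=1.0):
    """Return the time points where the band amplitude rises sharply.

    times, amplitudes: equal-length sequences sampled from the preset band.
    rise_ratio: assumed threshold; a sample counts as a sharp rise when its
        amplitude is at least rise_ratio times the previous sample's.
    min_gap: preset duration that the interval between adjacent cut points
        must exceed, in seconds.
    """
    if len(times) != len(amplitudes):
        raise ValueError("times and amplitudes must have the same length")
    cuts = []
    for i in range(1, len(times)):
        prev, cur = amplitudes[i - 1], amplitudes[i]
        sharp = cur > 0 and cur >= rise_ratio * prev
        far_enough = not cuts or times[i] - cuts[-1] > min_gap
        if sharp and far_enough:
            cuts.append(times[i])
    return cuts


def split_video(video_len, cuts, music_len):
    """Cut a video into pieces whose duration equals the longest music
    segment. A tail shorter than that duration is dropped (an assumption)."""
    bounds = [0.0, *cuts, music_len]
    longest = max(b - a for a, b in zip(bounds, bounds[1:]))
    if longest <= 0:
        raise ValueError("music segments must have positive duration")
    count = int(video_len // longest)
    return [(k * longest, (k + 1) * longest) for k in range(count)]
```

Cutting every video to the longest music segment means any piece can later be trimmed to fit whichever slot it is assigned to.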
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710701832.XA CN107483843B (en) | 2017-08-16 | 2017-08-16 | Audio-video matches clipping method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710701832.XA CN107483843B (en) | 2017-08-16 | 2017-08-16 | Audio-video matches clipping method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107483843A (en) | 2017-12-15 |
CN107483843B (en) | 2019-11-15 |
Family
ID=60600510
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710701832.XA Active CN107483843B (en) | 2017-08-16 | 2017-08-16 | Audio-video matches clipping method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107483843B (en) |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108830208A (en) * | 2018-06-08 | 2018-11-16 | Oppo广东移动通信有限公司 | Method for processing video frequency and device, electronic equipment, computer readable storage medium |
CN108986056A (en) * | 2018-08-24 | 2018-12-11 | 潘小亮 | Content requirements judge system |
CN109167934A (en) * | 2018-09-03 | 2019-01-08 | 咪咕视讯科技有限公司 | A kind of method for processing video frequency, device and computer readable storage medium |
CN109168084A (en) * | 2018-10-24 | 2019-01-08 | 麒麟合盛网络技术股份有限公司 | A kind of method and apparatus of video clipping |
CN110233976A (en) * | 2019-06-21 | 2019-09-13 | 广州酷狗计算机科技有限公司 | The method and device of Video Composition |
CN110336960A (en) * | 2019-07-17 | 2019-10-15 | 广州酷狗计算机科技有限公司 | Method, apparatus, terminal and the storage medium of Video Composition |
CN110545476A (en) * | 2019-09-23 | 2019-12-06 | 广州酷狗计算机科技有限公司 | Video synthesis method and device, computer equipment and storage medium |
CN110650368A (en) * | 2019-09-25 | 2020-01-03 | 新东方教育科技集团有限公司 | Video processing method and device and electronic equipment |
CN110955786A (en) * | 2019-11-29 | 2020-04-03 | 网易(杭州)网络有限公司 | Dance action data generation method and device |
CN111008287A (en) * | 2019-12-19 | 2020-04-14 | Oppo(重庆)智能科技有限公司 | Audio and video processing method and device, server and storage medium |
CN111064992A (en) * | 2019-12-10 | 2020-04-24 | 懂频智能科技(上海)有限公司 | Method for automatically switching video contents according to music beats |
CN111541946A (en) * | 2020-07-10 | 2020-08-14 | 成都品果科技有限公司 | Automatic video generation method and system for resource matching based on materials |
CN111556254A (en) * | 2020-04-10 | 2020-08-18 | 早安科技(广州)有限公司 | Method, system, medium and intelligent device for video cutting by using video content |
CN111683209A (en) * | 2020-06-10 | 2020-09-18 | 北京奇艺世纪科技有限公司 | Mixed-cut video generation method and device, electronic equipment and computer-readable storage medium |
CN112188307A (en) * | 2019-07-03 | 2021-01-05 | 腾讯科技(深圳)有限公司 | Video resource synthesis method and device, storage medium and electronic device |
CN112235631A (en) * | 2019-07-15 | 2021-01-15 | 北京字节跳动网络技术有限公司 | Video processing method and device, electronic equipment and storage medium |
CN112449231A (en) * | 2019-08-30 | 2021-03-05 | 腾讯科技(深圳)有限公司 | Multimedia file material processing method and device, electronic equipment and storage medium |
WO2021088830A1 (en) * | 2019-11-04 | 2021-05-14 | 北京字节跳动网络技术有限公司 | Method and apparatus for displaying music points, and electronic device and medium |
WO2021121023A1 (en) * | 2019-12-17 | 2021-06-24 | Oppo广东移动通信有限公司 | Video editing method, video editing apparatus, terminal, and readable storage medium |
CN113077470A (en) * | 2021-03-26 | 2021-07-06 | 天翼爱音乐文化科技有限公司 | Method, system, device and medium for cutting horizontal and vertical screen conversion picture |
CN114390367A (en) * | 2020-10-16 | 2022-04-22 | 上海哔哩哔哩科技有限公司 | Audio and video processing method and device |
CN114390352A (en) * | 2020-10-16 | 2022-04-22 | 上海哔哩哔哩科技有限公司 | Audio and video processing method and device |
WO2022152064A1 (en) * | 2021-01-15 | 2022-07-21 | 北京字跳网络技术有限公司 | Video generation method and apparatus, electronic device, and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030160944A1 (en) * | 2002-02-28 | 2003-08-28 | Jonathan Foote | Method for automatically producing music videos |
CN101107667A (en) * | 2004-12-17 | 2008-01-16 | 诺基亚公司 | Method and apparatus for video editing on small screen with minimal input device |
CN101640057A (en) * | 2009-05-31 | 2010-02-03 | 北京中星微电子有限公司 | Audio and video matching method and device therefor |
EP2993668A1 (en) * | 2014-09-08 | 2016-03-09 | Thomson Licensing | Method for editing an audiovisual segment and corresponding device and computer program product |
CN105530440A (en) * | 2014-09-29 | 2016-04-27 | 北京金山安全软件有限公司 | Video production method and device |
-
2017
- 2017-08-16 CN CN201710701832.XA patent/CN107483843B/en active Active
Cited By (36)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108830208A (en) * | 2018-06-08 | 2018-11-16 | Oppo广东移动通信有限公司 | Method for processing video frequency and device, electronic equipment, computer readable storage medium |
CN108986056A (en) * | 2018-08-24 | 2018-12-11 | 潘小亮 | Content requirements judge system |
CN109167934A (en) * | 2018-09-03 | 2019-01-08 | 咪咕视讯科技有限公司 | A kind of method for processing video frequency, device and computer readable storage medium |
CN109167934B (en) * | 2018-09-03 | 2020-12-22 | 咪咕视讯科技有限公司 | Video processing method and device and computer readable storage medium |
CN109168084A (en) * | 2018-10-24 | 2019-01-08 | 麒麟合盛网络技术股份有限公司 | A kind of method and apparatus of video clipping |
CN110233976B (en) * | 2019-06-21 | 2022-09-09 | 广州酷狗计算机科技有限公司 | Video synthesis method and device |
CN110233976A (en) * | 2019-06-21 | 2019-09-13 | 广州酷狗计算机科技有限公司 | The method and device of Video Composition |
CN112188307A (en) * | 2019-07-03 | 2021-01-05 | 腾讯科技(深圳)有限公司 | Video resource synthesis method and device, storage medium and electronic device |
GB2600309B (en) * | 2019-07-15 | 2024-01-31 | Beijing Bytedance Network Tech Co Ltd | Video processing method and apparatus, and electronic device and storage medium |
WO2021008394A1 (en) * | 2019-07-15 | 2021-01-21 | 北京字节跳动网络技术有限公司 | Video processing method and apparatus, and electronic device and storage medium |
GB2600309A (en) * | 2019-07-15 | 2022-04-27 | Beijing Bytedance Network Tech Co Ltd | Video processing method and apparatus, and electronic device and storage medium |
CN112235631A (en) * | 2019-07-15 | 2021-01-15 | 北京字节跳动网络技术有限公司 | Video processing method and device, electronic equipment and storage medium |
CN110336960A (en) * | 2019-07-17 | 2019-10-15 | 广州酷狗计算机科技有限公司 | Method, apparatus, terminal and the storage medium of Video Composition |
CN110336960B (en) * | 2019-07-17 | 2021-12-10 | 广州酷狗计算机科技有限公司 | Video synthesis method, device, terminal and storage medium |
CN112449231A (en) * | 2019-08-30 | 2021-03-05 | 腾讯科技(深圳)有限公司 | Multimedia file material processing method and device, electronic equipment and storage medium |
CN110545476A (en) * | 2019-09-23 | 2019-12-06 | 广州酷狗计算机科技有限公司 | Video synthesis method and device, computer equipment and storage medium |
CN110545476B (en) * | 2019-09-23 | 2022-03-25 | 广州酷狗计算机科技有限公司 | Video synthesis method and device, computer equipment and storage medium |
CN110650368B (en) * | 2019-09-25 | 2022-04-26 | 新东方教育科技集团有限公司 | Video processing method and device and electronic equipment |
CN110650368A (en) * | 2019-09-25 | 2020-01-03 | 新东方教育科技集团有限公司 | Video processing method and device and electronic equipment |
US11335379B2 (en) | 2019-09-25 | 2022-05-17 | New Oriental Education & Technology Group Inc. | Video processing method, device and electronic equipment |
WO2021088830A1 (en) * | 2019-11-04 | 2021-05-14 | 北京字节跳动网络技术有限公司 | Method and apparatus for displaying music points, and electronic device and medium |
US11587593B2 (en) | 2019-11-04 | 2023-02-21 | Beijing Bytedance Network Technology Co., Ltd. | Method and apparatus for displaying music points, and electronic device and medium |
CN110955786B (en) * | 2019-11-29 | 2023-10-27 | 网易(杭州)网络有限公司 | Dance action data generation method and device |
CN110955786A (en) * | 2019-11-29 | 2020-04-03 | 网易(杭州)网络有限公司 | Dance action data generation method and device |
CN111064992A (en) * | 2019-12-10 | 2020-04-24 | 懂频智能科技(上海)有限公司 | Method for automatically switching video contents according to music beats |
WO2021121023A1 (en) * | 2019-12-17 | 2021-06-24 | Oppo广东移动通信有限公司 | Video editing method, video editing apparatus, terminal, and readable storage medium |
CN111008287B (en) * | 2019-12-19 | 2023-08-04 | Oppo(重庆)智能科技有限公司 | Audio and video processing method and device, server and storage medium |
CN111008287A (en) * | 2019-12-19 | 2020-04-14 | Oppo(重庆)智能科技有限公司 | Audio and video processing method and device, server and storage medium |
CN111556254B (en) * | 2020-04-10 | 2021-04-02 | 早安科技(广州)有限公司 | Method, system, medium and intelligent device for video cutting by using video content |
CN111556254A (en) * | 2020-04-10 | 2020-08-18 | 早安科技(广州)有限公司 | Method, system, medium and intelligent device for video cutting by using video content |
CN111683209A (en) * | 2020-06-10 | 2020-09-18 | 北京奇艺世纪科技有限公司 | Mixed-cut video generation method and device, electronic equipment and computer-readable storage medium |
CN111541946A (en) * | 2020-07-10 | 2020-08-14 | 成都品果科技有限公司 | Automatic video generation method and system for resource matching based on materials |
CN114390352A (en) * | 2020-10-16 | 2022-04-22 | 上海哔哩哔哩科技有限公司 | Audio and video processing method and device |
CN114390367A (en) * | 2020-10-16 | 2022-04-22 | 上海哔哩哔哩科技有限公司 | Audio and video processing method and device |
WO2022152064A1 (en) * | 2021-01-15 | 2022-07-21 | 北京字跳网络技术有限公司 | Video generation method and apparatus, electronic device, and storage medium |
CN113077470A (en) * | 2021-03-26 | 2021-07-06 | 天翼爱音乐文化科技有限公司 | Method, system, device and medium for cutting horizontal and vertical screen conversion picture |
Also Published As
Publication number | Publication date |
---|---|
CN107483843B (en) | 2019-11-15 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN107483843A (en) | Audio frequency and video match clipping method and device | |
CN107393569B (en) | Audio-video clipping method and device | |
US8921678B2 (en) | Generating tones by combining sound materials | |
CN110415723B (en) | Method, device, server and computer readable storage medium for audio segmentation | |
CN106652997A (en) | Audio synthesis method and terminal | |
CN108877753B (en) | Music synthesis method and system, terminal and computer readable storage medium | |
CN109741425B (en) | Banner picture generation method and device, storage medium and computer equipment | |
CN110519638A (en) | Processing method, processing unit, electronic device and storage medium | |
CN107978310B (en) | Audio processing method and device | |
US11511200B2 (en) | Game playing method and system based on a multimedia file | |
CN106468987B (en) | Information processing method and client | |
CN104038473A (en) | Method of audio ad insertion, device, equipment and system | |
CN110377212B (en) | Method, apparatus, computer device and storage medium for triggering display through audio | |
CN106775568A (en) | A kind of effect adjusting method, device and mobile terminal | |
US20130263720A1 (en) | Music piece order determination device, music piece order determination method, and music piece order determination program | |
CN112269898A (en) | Background music obtaining method and device, electronic equipment and readable storage medium | |
CN105183853A (en) | Method and device used for presenting label page | |
CN107481739B (en) | Audio cutting method and device | |
CN109859739B (en) | Melody generation method and device based on voice synthesis and terminal equipment | |
CN106448713A (en) | Audio frequency playing method and audio frequency playing device | |
CN105118081A (en) | Processing method and device for picture synthesis video | |
CN109327731B (en) | Method and system for synthesizing DIY video in real time based on karaoke | |
US7612279B1 (en) | Methods and apparatus for structuring audio data | |
CN107688661B (en) | Lyric similarity calculation method, terminal device and computer-readable storage medium | |
CN113674725B (en) | Audio mixing method, device, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||