CN107483843A

CN107483843A - Audio frequency and video match clipping method and device

Info

Publication number: CN107483843A
Application number: CN201710701832.XA
Authority: CN
Inventors: 陈杰; 徐滢
Original assignee: Chengdu Pinguo Technology Co Ltd
Current assignee: Chengdu Pinguo Technology Co Ltd
Priority date: 2017-08-16
Filing date: 2017-08-16
Publication date: 2017-12-15
Anticipated expiration: 2037-08-16
Also published as: CN107483843B

Abstract

The present invention provides a kind of audio frequency and video matching clipping method and device, is related to multimedia data processing field.Methods described and device are labeled with the target music of multiple cut points by obtaining in advance, and the target music is labeled as multiple snatch of musics by the cut point；It is multiple video segments by least one target video cutting obtained according to the duration of the snatch of music；The video segment of predetermined number is chosen from multiple video segments as target video fragment；Using filling algorithm, the filling that the video segment is calculated according to the weight of the target video fragment and filling position is worth, between the target video fragment is inserted into the corresponding cut point of the target music, make the new video file for the target video fragment and integral Maximum Value of snatch of music match group inserted.Based on above-mentioned design, methods described and device simplify the editing operation of audio frequency and video, while also improve the quality of the video of editing.

Description

Audio frequency and video match clipping method and device

Technical field

The present invention relates to multimedia data processing field, and clipping method is matched in particular to a kind of audio frequency and video And device.

Background technology

As generally when carrying out editing to audio or video, whole process needs to have been manually done.Specific operation process, such as, Operating personnel are using video clipping software by one video of multiple sections of Video Compositions, and then the background music of the editing video, makes The duration of background music is identical with the duration of the video, and finally the background music is loaded into video, obtains new video.Existing Have in technology, editing operation is complicated, high to the technical requirements of the operating personnel of editing Voice ＆ Video, and what not so editing obtained regards Easily there is the situation that video content and music rhythm are not taken in frequency, and influences the quality of video.Therefore, how a kind of operation letter is provided It is single and the method and device of the quality of editing video can be improved, it has also become the technical problem of those skilled in the art's urgent need to resolve.

The content of the invention

In order to overcome above-mentioned deficiency of the prior art, the present invention provides a kind of audio frequency and video matching clipping method and device, To solve the above problems.

To achieve these goals, the technical scheme that present pre-ferred embodiments are provided is as follows：

Present pre-ferred embodiments provide a kind of audio frequency and video matching clipping method, and methods described includes：

The target music for being labeled with multiple cut points in advance is obtained, the target music is by the cut point labeled as multiple Snatch of music；

It is multiple video segments by least one target video cutting obtained according to the duration of the snatch of music；

The video segment of predetermined number is chosen from multiple video segments as target video fragment；

Using filling algorithm, the target video fragment is calculated according to the weight of the target video fragment and filling position Filling value, and be worth according to the filling of each target video fragment, the target video fragment is inserted into the target music Between corresponding cut point, the target video fragment and the new of the integral Maximum Value of snatch of music match group that make to insert regard Frequency file.

In the preferred embodiment, above-mentioned overall value is worth sum for the filling of each target video fragment, The filling value of the target video fragment includes self-value and discrete value；

The mode of the self-value is calculated, including：

The characteristic information of each target video fragment is analyzed, the characteristic information includes face information, more people's scene informations, people Smile information and artificial label information in face；

According to the characteristic information, weights corresponding to each video segment are assigned as the target video fragment from the personal value Value；

The mode of the discrete value is calculated, including：

If during in the presence of at least two target video fragments in same target video, the discrete value regards with the target Between the corresponding video segment and other video segments of target video fragment in target video in target video of frequency fragment Distance be associated；

If whether target video fragment when or not in same target video, the discrete value is preset value.

In the preferred embodiment, above-mentioned calculated according to the weight and filling position of the target video fragment should The filling value of target video fragment, and be worth according to the filling of each target video fragment, the target video fragment is filled out The step of entering between the corresponding cut point of the target music, including：

Using greedy approximate data, iterate to calculate each video segment as the self-value of the target video fragment and Discrete value, to obtain the overall value of multiple corresponding video files；

The video file corresponding to maximum overall value is chosen from multiple overall values as new video file.

In the preferred embodiment, the video segment between the corresponding cut point of target music described above exists Under conditions of meeting discreteness, the order successively decreased according to weights is filled.

In the preferred embodiment, above-mentioned acquisition be labeled with advance the step of target music of multiple cut points it Before, methods described also includes：

The acoustic amplitudes information of frequency domain is preset from the target extraction of music；

Time point that amplitude in the default frequency domain increases sharply is chosen as the cut point, between making between adjacent cut point Exceed preset duration every duration.

In the preferred embodiment, the above-mentioned duration according to the snatch of music, it is at least one by what is obtained The step of target video cutting is multiple video segments, including：

Choose the duration of duration is most long in the snatch of music period as the video segment of cutting.

In the preferred embodiment, the piece of video hop count for inserting the target music is equal to the target music Musical film hop count.

It is in the preferred embodiment, above-mentioned that the target video fragment is inserted into the target music is corresponding The step of between cut point, including：

The length of every section of target video fragment is corrected, so that the length of target video fragment is right equal in the target music The length for the snatch of music answered.

Presently preferred embodiments of the present invention also provides a kind of audio frequency and video matching editing device, including：

Acquiring unit, the target music of multiple cut points is labeled with for obtaining in advance, and the target music is cut by described Cutpoint is labeled as multiple snatch of musics；

Cut cells, for the duration according to the snatch of music, it is by least one target video cutting obtained Multiple video segments；

Unit is chosen, for choosing the video segment of predetermined number from multiple video segments as target video fragment；

Video Composition unit, for using filling algorithm, according to the weight and filling position meter of the target video fragment The filling value of the target video fragment is calculated, and is worth according to the filling of each target video fragment, by the target video piece Section is inserted between the corresponding cut point of the target music, and it is whole that the target video fragment for making to insert matches composition with snatch of music The new video file of body Maximum Value.

The Video Composition unit calculates the mode of the self-value, including：

The Video Composition unit calculates the mode of the discrete value, including：

In terms of existing technologies, audio frequency and video matching clipping method and device provided by the invention at least have with following Beneficial effect：Methods described is labeled with the target music of multiple cut points by the way that multiple video segments of cutting are inserted, and is obtained To video file, and select it is overall take Maximum Value as new video file, simplify the editing operation of audio frequency and video, also simultaneously Improve the quality of the video of editing.Specifically, this method uses filling algorithm, according to the weight of target video fragment and filling Position calculates the filling value of the video segment, and is worth according to the filling of each video segment, by the target video fragment Insert between the corresponding cut point of the target music, make the target video fragment inserted and snatch of music match group integral The new video file of Maximum Value.Methods described and device can make video segment and target music in the video file of editing Corresponding music rhythm matches, and while the quality of editing video is improved, additionally aids the experience sense of lifting user.

To enable the above objects, features and advantages of the present invention to become apparent, present pre-ferred embodiments cited below particularly, And accompanying drawing appended by coordinating, it is described in detail below.

Brief description of the drawings

In order to illustrate the technical solution of the embodiments of the present invention more clearly, below by embodiment it is required use it is attached Figure is briefly described.It should be appreciated that the following drawings illustrate only certain embodiments of the present invention, therefore it is not construed as pair The restriction of scope, for those of ordinary skill in the art, on the premise of not paying creative work, can also be according to this A little accompanying drawings obtain other related accompanying drawings.

Fig. 1 is the block diagram for the terminal device that present pre-ferred embodiments provide.

Fig. 2 is the schematic flow sheet that the audio frequency and video that present pre-ferred embodiments provide match clipping method.

Fig. 3 is the schematic flow sheet of step S240 sub-step shown in Fig. 2.

Fig. 4 is the block diagram that the audio frequency and video that present pre-ferred embodiments provide match editing device.

Icon：10- terminal devices；11- processors；12- memories；13- display units；100- audio frequency and video matching editing dress Put；110- acquiring units；120- cut cellses；130- chooses unit；140- Video Composition units.

Embodiment

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation describes.Obviously, described embodiment is only the part of the embodiment of the present invention, rather than whole embodiments.It is logical The component for the embodiment of the present invention being often described and illustrated herein in the accompanying drawings can be configured to arrange and design with a variety of.

Therefore, below the detailed description of the embodiments of the invention to providing in the accompanying drawings be not intended to limit it is claimed The scope of the present invention, but be merely representative of the present invention selected embodiment.Based on embodiments of the invention, people in the art The every other embodiment that member is obtained on the premise of creative work is not made, belongs to the scope of protection of the invention.

It should be noted that：Similar label and letter represents similar terms in following accompanying drawing, therefore, once a certain Xiang Yi It is defined, then it further need not be defined and explained in subsequent accompanying drawing in individual accompanying drawing.

Below in conjunction with the accompanying drawings, some embodiments of the present invention are elaborated.It is following in the case where not conflicting Feature in embodiment and embodiment can be mutually combined.

Fig. 1 is refer to, is the block diagram for the terminal device 10 that present pre-ferred embodiments provide.In the present embodiment In, the terminal device 10 can be as editing video and the operating platform of audio, with for users to use.The terminal device 10 Processor 11, memory 12 and audio frequency and video matching editing device 100 can be included.User can utilize the sound in terminal device 10 Video matching editing device 100, realize that shearing, editor and synthesis of Voice ＆ Video etc. operate, to obtain regarding after editing Frequency file, simplify the operating process of editing audio frequency and video.

Further, the overview of terminal device 10 is with including other elements, such as display unit 13.The processor 11st, directly or indirectly it is electrically connected between memory 12 and each element of modern unit, to realize the transmission of data and friendship Mutually.The audio frequency and video matching editing device 100 can be stored in including at least one in the form of software or firmware (firmware) In the memory 12 or the software work(that is solidificated in the operating system (operating system, OS) of the terminal device 10 Can module.The memory 12 can store the data such as voice data, video data.The processor 11 is used to perform described deposit The executable module stored in reservoir 12, such as the software function module included by audio frequency and video matching editing device 100 and calculating Machine program etc..

Further, the memory 12 may be, but not limited to, random access memory (Random Access Memory, RAM), read-only storage (Read Only Memory, ROM), programmable read only memory (Programmable Read-Only Memory, PROM), erasable read-only memory (Erasable Programmable Read-Only Memory, EPROM), electricallyerasable ROM (EEROM) (Electric Erasable Programmable Read-Only Memory, EEPROM) etc..Wherein, memory 12 is used for storage program, and the processor 11 is held after execute instruction is received Row described program.The access of the processor 11 and other possible components to memory 12 can be in the storage control Control is lower to be carried out.

The processor 11 can be general processor, including central processing unit (Central Processing Unit, CPU), network processing unit (Network Processor, NP) etc.；It can also be digital signal processor (DSP), special integrated Circuit (ASIC), ready-made programmable gate array (FPGA) either other PLDs, discrete gate or transistor logic Device, discrete hardware components, it is possible to achieve or disclosed each method, step and box in the execution embodiment of the present invention Figure.General processor can be microprocessor or the processor 11 can also be any conventional processor etc..

In the present embodiment, the display unit 13 is used to play the video of the editing of terminal device 10, audio (ratio Such as, target music).The display unit 13 can be also used for showing the history usage record of audio or video.It is in addition, described aobvious Show that unit 13 can also show the editing toolbar that user is accustomed to and set according to the editing of oneself, use convenient for the user to operate. The display unit 13 may be, but not limited to, touching display screen, common liquid crystals display screen etc., be not especially limited here.

It is understood that the structure shown in Fig. 1 is only a kind of structural representation of terminal device 10, the terminal device 10 may also include more either less components than shown in Fig. 1 or have the configuration different from shown in Fig. 1.Shown in Fig. 1 Each component can use hardware, software or its combination realize.

In the present embodiment, the terminal device 10 may be, but not limited to, smart mobile phone, PC (Personal Computer, PC), tablet personal computer, personal digital assistant (Personal DigitalAssistant, PDA) etc., it is preferable that institute It is smart mobile phone to state terminal device 10.

Fig. 2 is refer to, is the schematic flow sheet for the audio frequency and video matching clipping method that present pre-ferred embodiments provide.At this In embodiment, the audio frequency and video matching clipping method is applied to the terminal device 10 shown in Fig. 1.Methods described will be by that will shear Video segment be filled in the target music for being marked with cut point, to form new video file, and then simplify editing sound The operating procedure of frequency and video.The audio frequency and video described in Fig. 2 are matched with the idiographic flow of clipping method below and step is carried out in detail It is thin to illustrate.

In embodiments of the present invention, the audio frequency and video matching clipping method comprises the following steps：

Step S210, the target music for being labeled with multiple cut points in advance is obtained, the target music is by the cut point Labeled as multiple snatch of musics.

In the present embodiment, acquired target music is previously provided with multiple cut points, and its cut point, which is used to be used as, to be filled out Enter inserting a little for video segment, so that the video segment and corresponding snatch of music inserted are engaged.In addition, user can be according to tool Body situation sets the quantity of cut point, is not especially limited here.

Before step S210, memory 12 can be previously stored with a first or more songs, and user can like according to itself The good first background music as video to be clipped of selection wherein one, that is, the target music.It is of course also possible to randomly select Wherein a piece of music is as the target music.Then target music can be set according to the musical features of the target music of extraction Cut point.Understandably, target music is cut a point mark and is divided into multiple snatch of musics, the video segment of each cutting with it is right The snatch of music answered is engaged, i.e. the video segment being engaged just is inserted in corresponding snatch of music.

In this embodiment, the musical features include tempo characteristic, and the tempo characteristic includes the sound of the target music Sound amplitude information.The step of musical features of the above-mentioned target music according to extraction set cut point to target music is appreciated that For：The acoustic amplitudes information of frequency domain is preset from the target extraction of music；Choose that amplitude in the default frequency domain increases sharply when Between point be used as the cut point, the interval duration between adjacent cut point is exceeded preset duration.

Usually, harmony component and rhythm component are included in music.Understandably, harmony component is the musical instrument for having tone The music played, for example, orchestra.The music that rhythm component is played by the musical instrument of no tone, for example, drum class pleasure Device.The musical features can be the beat information in rhythm component, such as the nodal information that amplitude increases suddenly.In extraction mesh During the musical features of mark with phonetic symbols pleasure, the harmony component in target music and rhythm component can be separated, to obtain rhythm component. Then musical features of the acoustic amplitudes information as the target music are extracted from rhythm component.

Further, if sound corresponding to rhythm component is separated into sonograph, the time point that the amplitude increases sharply can be with It is interpreted as in default frequency domain, amplitude is from the time flex point for being reduced to increase.Alternatively, amplitude corresponding to the flex point is not less than pre- If amplitude threshold.

Further, the cut point coordinates the video segment of cutting, is used as being loaded into the incision of video segment Point.And the interval duration between adjacent cut point exceedes preset duration, to avoid the interval duration between adjacent cut point too short, And make to insert video segment also section, and then influence the result of broadcast of the video after editing.

In the present embodiment, the amplitude threshold, preset duration, default frequency domain can be configured as the case may be, Here it is not especially limited.

Step S220, it is multiple by least one target video cutting obtained according to the duration of the snatch of music Video segment.

In the present embodiment, methods described can choose period the regarding as cutting that duration is most long in the snatch of music The duration of frequency fragment, then by the one or more target video cuttings being obtained ahead of time be same fixed duration multiple piece of video Section.Understandably, interval duration most long period when described fixed between a length of adjacent marker point, so as in target music Two neighboring mark point can fill up the video segment, when avoiding the appearance broadcasting music of the video after editing, no video content exhibition Existing situation occurs.

Step S230, the video segment of predetermined number is chosen from multiple video segments as target video fragment.

In the present embodiment, the target video segments of selection can be with equal, so that often with the musical film hop count being divided Section target video fragment corresponds with snatch of music.

Step S240, using filling algorithm, the target is calculated according to the weight of the target video fragment and filling position The filling value of video segment, and be worth according to the filling of each target video fragment, the target video fragment is inserted into institute Between stating the corresponding cut point of target music, make the target video fragment inserted with the integral value of snatch of music match group most Big new video file.

In the present embodiment, the filling algorithm is editing filling algorithm.It is to be appreciated that the editing filling algorithm is Seek a kind of best fit strategy, target video fragment is inserted between the mark point of target music, so that overall value is maximum. Wherein, the overall value can be regarded as the quality of the video obtained by editing, such as, between video segment and snatch of music Continuity between matching degree, video segment etc..

Further, the overall value can be that the filling of each target video fragment is worth sum, and the target regards The filling value of frequency fragment includes self-value and discrete value.The weights of the self-value and corresponding target video fragment It is associated.Specifically, the characteristic information that the mode of the self-value can include analyzing each target video fragment is calculated, it is described Characteristic information includes face information, more people's scene informations, the smile information in face and artificial label information；According to the spy Reference ceases, and assigns self-value of the weights as the target video fragment corresponding to each video segment.

In the present embodiment, before weights are assigned, each characteristic information is previously provided with corresponding weights.Specifically, For example, if analysis video segment obtains the smile information in face, weights corresponding to the smile information prestored are called, with Weights as the video segment.Certainly, in other embodiments, the weights of video segment can also be artificially set.Weights Size can be configured according to actual conditions, the size of weights is not especially limited here.

Further, the characteristic information can also include other information, to enrich the type of video segment.Such as institute Animal painting information can also be included by stating characteristic information, such as, the image information of the animal such as cat, dog, no longer it is specifically described here.

In the present embodiment, the discrete value can include the value under two kinds of different situations.If for example, in the presence of at least When two target video fragments are in same target video, the discrete value is corresponding with the target video fragment to be regarded in target Video segment in frequency is associated with the distance between other video segments of target video fragment in target video；If target When video segment is not in same target video, the discrete value is preset value.Its preset value can enter as the case may be Row is set, and is not especially limited here.

It is more than most it is alternatively possible to be worth the smallest discrete of target video fragment of the setting not in same target video Big self-value, to avoid being filled in two target video fragments adjacent in snatch of music also phase in former target video It is adjacent.Namely based on above-mentioned design, the visual effect of the video file of formation can be improved.

Further, step S240 can also include one or more sub-steps.For example, Fig. 3 is refer to as shown in Fig. 2 The schematic flow sheet of step S240 sub-step.In the present embodiment, the step S240 can include sub-step S241 and son Step S242.

Sub-step S241, using greedy approximate data, each video segment is iterated to calculate as the target video fragment Self-value and discrete value, with obtain it is multiple corresponding to video files overall value.

Sub-step S242, the video file corresponding to maximum overall value is chosen from multiple overall values and is regarded as new Frequency file.

In the present embodiment, the dynamic programming algorithm for solving 0-1 knapsack problems can be used to obtain filling out for overall value maximum Fill mode；Then greedy approximate data can be used, iterate to calculate, and with last computation results contrast, it is smaller to cast out overall value , and the result calculated using the larger video file of overall value as this.By way of iterative calculation, it can select final Filling mode corresponding to maximum overall value, and obtain the maximum video file of overall value.

Further, the video segment inserted between the corresponding cut point of the target music is meeting the bar of discreteness Under part, the order successively decreased according to weights is filled.Understandably, meeting to be filled in video segment adjacent in target music in original In target video fragment it is non-conterminous under the conditions of, can successively decrease according to the weights of target video fragment order filling target video.

Further, before target video fragment is inserted, methods described can also include every section of target video piece of amendment The length of section, so that the length of target video fragment is equal to the length of corresponding snatch of music in the target music.

Further, can be corresponded to by being sheared to target video fragment so that the length of target video fragment is equal to Adjacent marker point between snatch of music length.Can also by carrying out quick or slow processes to target video fragment, So that the target video fragment length is equal to the length of the snatch of music.Based on above-mentioned design, it can make what is obtained after institute's editing Video file is continuous simultaneously in target music, moreover it is possible to makes the broadcasting of video have continuity, the use of lifting viewing video file The experience sense at family, also just improve the quality of the video file.

Fig. 4 is refer to, is the block diagram for the audio frequency and video matching editing device 100 that present pre-ferred embodiments provide. Present pre-ferred embodiments also provide a kind of audio frequency and video matching editing device 100, described device can include acquiring unit 110, Cut cells 120, choose unit 130 and Video Composition unit 140.

The acquiring unit 110, it is labeled with the target music of multiple cut points, the target music quilt in advance for obtaining The cut point is labeled as multiple snatch of musics.Specifically, the acquiring unit 110 can be used for performing the step shown in Fig. 2 Rapid S210, specific operating method can refer to the detailed description to step S210.

The cut cells 120, for the duration according to the snatch of music, at least one target video that will be obtained Cutting is multiple video segments.Specifically, the cut cells 120 can be used for performing the step S220 shown in Fig. 2, specifically Operating method can refer to the detailed description to step S220.

The selection unit 130, the video segment of predetermined number is chosen from multiple video segments as target video piece Section.Specifically, the unit 130 of choosing can be used for performing the step S230 shown in Fig. 2, and specific operating method can refer to pair Step S230 detailed description.

The Video Composition unit 140, for using filling algorithm, according to the weight of the target video fragment and filling Position calculates the filling value of the target video fragment, and is worth according to the filling of each target video fragment, by the target Video segment is inserted between the corresponding cut point of the target music, and the target video fragment for making to insert matches with snatch of music Form the maximum new video file of overall value.Specifically, the Video Composition unit 140 can be used for performing shown in Fig. 2 Step S240, specific operating method can refer to the detailed description to step S240.

Further, the Video Composition unit 140 can be also used for performing the sub-step S241 and sub-step shown in Fig. 3 Rapid S242, specific operating method can refer to sub-paragraphs S241 and sub-step S242 detailed description, repeat no more here.

In summary, the present invention provides a kind of audio frequency and video matching clipping method and device.Methods described is by by cutting Multiple video segments, which are inserted, to be labeled with the target music of multiple cut points, obtains video file, and selects entirety to fix the price value most It is big as new video file, simplify the editing operation of audio frequency and video, while also improve the quality of the video of editing.It is described The filling that method calculates the video segment according to the weight and filling position of target video fragment is worth, and according to each piece of video The filling value of section, the maximum filling mode of overall value is chosen so that the target video fragment is inserted into the target music phase Between corresponding cut point, make the new video for the target video fragment and integral Maximum Value of snatch of music match group inserted File.Methods described and device can make music rhythm phase corresponding to video segment and target music in the video file of editing Match somebody with somebody, while the quality of editing video is improved, additionally aid the experience sense of lifting user.

The preferred embodiments of the present invention are the foregoing is only, are not intended to limit the invention, for the skill of this area For art personnel, the present invention can have various modifications and variations.Within the spirit and principles of the invention, that is made any repaiies Change, equivalent substitution, improvement etc., should be included in the scope of the protection.

Claims

1. a kind of audio frequency and video match clipping method, it is characterised in that methods described includes：

The target music for being labeled with multiple cut points in advance is obtained, the target music is labeled as multiple music by the cut point Fragment；

Using filling algorithm, the filling of the target video fragment is calculated according to the weight of the target video fragment and filling position Value, and be worth according to the filling of each target video fragment, it is relative that the target video fragment is inserted into the target music Between the cut point answered, make the new video text for the target video fragment and integral Maximum Value of snatch of music match group inserted Part.

2. according to the method for claim 1, it is characterised in that the overall value is the filling of each target video fragment Sum is worth, the filling value of the target video fragment includes self-value and discrete value；

The mode of the self-value is calculated, including：

The characteristic information of each target video fragment is analyzed, the characteristic information is included in face information, more people's scene informations, face Smile information and artificial label information；

According to the characteristic information, self-value of the weights as the target video fragment corresponding to each video segment is assigned；

The mode of the discrete value is calculated, including：

If during in the presence of at least two target video fragments in same target video, the discrete value and the target video piece Between the corresponding video segment and other video segments of target video fragment in target video in target video of section away from From associated；

3. according to the method for claim 2, it is characterised in that the weight and filling according to the target video fragment Position calculates the filling value of the target video fragment, and is worth according to the filling of each target video fragment, by the target The step of video segment is inserted between the corresponding cut point of the target music, including：

Using greedy approximate data, iterate to calculate each video segment and be used as the self-value of the target video fragment and discrete Value, to obtain the overall value of multiple corresponding video files；

4. according to the method for claim 3, it is characterised in that insert between the corresponding cut point of the target music Video segment under conditions of discreteness is met, fill by the order successively decreased according to weights.

5. according to the method for claim 1, it is characterised in that described to obtain the target sound for being labeled with multiple cut points in advance Before happy step, methods described also includes：

Time point that amplitude in the default frequency domain increases sharply is chosen as the cut point, when making the interval between adjacent cut point Length exceedes preset duration.

6. according to the method for claim 1, it is characterised in that the duration according to the snatch of music, will obtain At least one target video cutting the step of being multiple video segments, including：

7. according to the method for claim 1, it is characterised in that insert the piece of video hop count of the target music equal to described The musical film hop count of target music.

8. according to the method for claim 1, it is characterised in that described that the target video fragment is inserted into the target sound The step of between happy corresponding cut point, including：

The length of every section of target video fragment is corrected, so that the length of target video fragment is equal to corresponding in the target music The length of snatch of music.

9. a kind of audio frequency and video match editing device, it is characterised in that including：

Acquiring unit, it is labeled with the target music of multiple cut points in advance for obtaining, the target music is by the cut point Labeled as multiple snatch of musics；

Cut cells, it is multiple by least one target video cutting obtained for the duration according to the snatch of music Video segment；

Video Composition unit, for using filling algorithm, being calculated according to the weight of the target video fragment and filling position should The filling value of target video fragment, and be worth according to the filling of each target video fragment, the target video fragment is filled out Between entering the corresponding cut point of the target music, make target video fragment and the integral valency of snatch of music match group inserted It is worth maximum new video file.

10. device according to claim 9, it is characterised in that the overall value is filled out for each target video fragment Value sum is filled, the filling value of the target video fragment includes self-value and discrete value；

The Video Composition unit calculates the mode of the self-value, including：