CN107483843A - Audio-video matching editing method and device - Google Patents

Audio-video matching editing method and device

Publication number
CN107483843A
Authority
CN
China
Prior art keywords
video
target
target video
music
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710701832.XA
Other languages
Chinese (zh)
Other versions
CN107483843B (en)
Inventor
陈杰
徐滢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Pinguo Technology Co Ltd
Original Assignee
Chengdu Pinguo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Pinguo Technology Co Ltd
Priority to CN201710701832.XA
Publication of CN107483843A
Application granted
Publication of CN107483843B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/265 Mixing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data

Abstract

The present invention provides an audio-video matching editing method and device, and relates to the field of multimedia data processing. The method and device obtain target music that has been marked in advance with multiple cut points, the cut points dividing the target music into multiple music segments; cut at least one obtained target video into multiple video segments according to the durations of the music segments; select a preset number of video segments from the multiple video segments as target video segments; and, using a filling algorithm, calculate the filling value of each target video segment from its weight and filling position, then insert the target video segments between the corresponding cut points of the target music so that the inserted target video segments and the music segments together form a new video file with the maximum overall value. With this design, the method and device simplify audio and video editing while also improving the quality of the edited video.

Description

Audio-video matching editing method and device
Technical field
The present invention relates to the field of multimedia data processing, and in particular to an audio-video matching editing method and device.
Background art
At present, editing audio or video is usually an entirely manual process. For example, an operator uses video editing software to combine several video clips into one video, then edits the background music for that video so that its duration equals the duration of the video, and finally loads the background music into the video to obtain a new video. In the prior art, the editing operation is complicated and demands considerable skill from the operator; otherwise the video content in the edited video tends not to match the rhythm of the music, which degrades the quality of the video. Therefore, providing a method and device that is simple to operate and improves the quality of the edited video has become a technical problem that those skilled in the art urgently need to solve.
Summary of the invention
To overcome the above deficiencies of the prior art, the present invention provides an audio-video matching editing method and device to solve the above problems.
To achieve this objective, the technical solutions provided by the preferred embodiments of the present invention are as follows:
A preferred embodiment of the present invention provides an audio-video matching editing method, the method comprising:
obtaining target music marked in advance with multiple cut points, the cut points dividing the target music into multiple music segments;
cutting at least one obtained target video into multiple video segments according to the durations of the music segments;
selecting a preset number of video segments from the multiple video segments as target video segments;
using a filling algorithm, calculating the filling value of each target video segment from the weight and filling position of that target video segment, and, according to the filling value of each target video segment, inserting the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value.
In a preferred embodiment of the present invention, the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self value and a discrete value;
the self value is calculated by:
analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, smile information in faces, and manually assigned label information;
according to the feature information, assigning the weight corresponding to each video segment as the self value of that target video segment;
the discrete value is calculated as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
In a preferred embodiment of the present invention, the step of calculating the filling value of each target video segment from the weight and filling position of that target video segment and, according to the filling value of each target video segment, inserting the target video segments between the corresponding cut points of the target music comprises:
using a greedy approximation algorithm, iteratively calculating the self value and discrete value of each video segment taken as a target video segment, to obtain the overall values of multiple corresponding video files;
selecting, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
In a preferred embodiment of the present invention, the video segments inserted between the corresponding cut points of the target music are filled in order of decreasing weight, provided the discreteness condition is satisfied.
In a preferred embodiment of the present invention, before the step of obtaining target music marked in advance with multiple cut points, the method further comprises:
extracting sound amplitude information of a preset frequency range from the target music;
selecting the time points at which the amplitude in the preset frequency range increases sharply as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
In a preferred embodiment of the present invention, the step of cutting at least one obtained target video into multiple video segments according to the durations of the music segments comprises:
selecting the duration of the longest music segment as the duration of the cut video segments.
In a preferred embodiment of the present invention, the number of video segments inserted into the target music equals the number of music segments of the target music.
In a preferred embodiment of the present invention, the step of inserting the target video segments between the corresponding cut points of the target music comprises:
correcting the length of each target video segment so that the length of the target video segment equals the length of the corresponding music segment in the target music.
A preferred embodiment of the present invention also provides an audio-video matching editing device, comprising:
an acquiring unit, configured to obtain target music marked in advance with multiple cut points, the cut points dividing the target music into multiple music segments;
a cutting unit, configured to cut at least one obtained target video into multiple video segments according to the durations of the music segments;
a selecting unit, configured to select a preset number of video segments from the multiple video segments as target video segments;
a video composition unit, configured to use a filling algorithm to calculate the filling value of each target video segment from the weight and filling position of that target video segment and, according to the filling value of each target video segment, insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value.
In a preferred embodiment of the present invention, the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self value and a discrete value;
the video composition unit calculates the self value by:
analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, smile information in faces, and manually assigned label information;
according to the feature information, assigning the weight corresponding to each video segment as the self value of that target video segment;
the video composition unit calculates the discrete value as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
Compared with the prior art, the audio-video matching editing method and device provided by the present invention have at least the following beneficial effects. The method inserts the cut video segments into the target music marked with multiple cut points to obtain video files, and selects the one with the maximum overall value as the new video file, which simplifies audio and video editing and also improves the quality of the edited video. Specifically, the method uses a filling algorithm to calculate the filling value of each target video segment from its weight and filling position and, according to the filling value of each video segment, inserts the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value. The method and device make the video segments in the edited video file match the rhythm of the corresponding part of the target music, which improves the quality of the edited video and also helps improve the user experience.
To make the above objects, features, and advantages of the present invention more apparent, preferred embodiments are described in detail below with reference to the accompanying drawings.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings required by the embodiments are briefly described below. It should be understood that the following drawings show only certain embodiments of the present invention and should therefore not be regarded as limiting its scope; those of ordinary skill in the art can obtain other related drawings from these drawings without creative effort.
Fig. 1 is a block diagram of the terminal device provided by a preferred embodiment of the present invention.
Fig. 2 is a flowchart of the audio-video matching editing method provided by a preferred embodiment of the present invention.
Fig. 3 is a flowchart of the sub-steps of step S240 shown in Fig. 2.
Fig. 4 is a block diagram of the audio-video matching editing device provided by a preferred embodiment of the present invention.
Reference numerals: 10: terminal device; 11: processor; 12: memory; 13: display unit; 100: audio-video matching editing device; 110: acquiring unit; 120: cutting unit; 130: selecting unit; 140: video composition unit.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. The components of the embodiments of the present invention, as generally described and illustrated in the drawings herein, can be arranged and designed in a variety of different configurations.
Therefore, the following detailed description of the embodiments provided in the drawings is not intended to limit the scope of the claimed invention but merely represents selected embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
It should be noted that similar reference numerals and letters denote similar items in the following drawings; therefore, once an item is defined in one drawing, it need not be further defined or explained in subsequent drawings.
Some embodiments of the present invention are described in detail below with reference to the drawings. Provided they do not conflict, the following embodiments and the features in them may be combined with one another.
Referring to Fig. 1, a block diagram of the terminal device 10 provided by a preferred embodiment of the present invention is shown. In this embodiment, the terminal device 10 can serve as an operating platform on which the user edits video and audio. The terminal device 10 may include a processor 11, a memory 12, and an audio-video matching editing device 100. The user can use the audio-video matching editing device 100 in the terminal device 10 to cut, edit, and compose audio and video to obtain the edited video file, which simplifies the audio and video editing workflow.
Further, the terminal device 10 may also include other elements, such as a display unit 13. The processor 11, the memory 12, and the display unit 13 are electrically connected to one another, directly or indirectly, to transmit and exchange data. The audio-video matching editing device 100 includes at least one software function module that can be stored in the memory 12 in the form of software or firmware, or built into the operating system (OS) of the terminal device 10. The memory 12 can store data such as audio data and video data. The processor 11 is configured to execute the executable modules stored in the memory 12, such as the software function modules and computer programs included in the audio-video matching editing device 100.
Further, the memory 12 may be, but is not limited to, random access memory (RAM), read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), and the like. The memory 12 is used to store programs, and the processor 11 executes a program after receiving an execution instruction. Access to the memory 12 by the processor 11 and other possible components can take place under the control of a memory controller.
The processor 11 may be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), and the like; it may also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. It can implement or execute the methods, steps, and logic block diagrams disclosed in the embodiments of the present invention. The general-purpose processor may be a microprocessor, or the processor 11 may be any conventional processor.
In this embodiment, the display unit 13 is used to play the video and audio (for example, the target music) edited on the terminal device 10. The display unit 13 can also display the usage history of audio or video. In addition, the display unit 13 can display an editing toolbar that the user has configured according to their own editing habits, for ease of operation. The display unit 13 may be, but is not limited to, a touch screen, an ordinary liquid crystal display, and the like; no specific limitation is imposed here.
It can be understood that the structure shown in Fig. 1 is only a schematic structure of the terminal device 10; the terminal device 10 may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1. Each component shown in Fig. 1 can be implemented in hardware, software, or a combination thereof.
In this embodiment, the terminal device 10 may be, but is not limited to, a smartphone, a personal computer (PC), a tablet computer, a personal digital assistant (PDA), and the like; preferably, the terminal device 10 is a smartphone.
Referring to Fig. 2, a flowchart of the audio-video matching editing method provided by a preferred embodiment of the present invention is shown. In this embodiment, the method is applied to the terminal device 10 shown in Fig. 1. The method fills cut video segments into target music marked with cut points to form a new video file, thereby simplifying the steps of editing audio and video. The specific flow and steps of the method shown in Fig. 2 are described in detail below.
In an embodiment of the present invention, the audio-video matching editing method comprises the following steps:
Step S210: obtain target music marked in advance with multiple cut points, the cut points dividing the target music into multiple music segments.
In this embodiment, the obtained target music is provided in advance with multiple cut points, which serve as the insertion points for the video segments, so that each inserted video segment matches its corresponding music segment. The user can set the number of cut points as required; no specific limitation is imposed here.
Before step S210, the memory 12 may store one or more songs in advance, and the user can select one of them, according to personal preference, as the background music of the video to be edited, that is, the target music. Alternatively, one piece of music may be selected at random as the target music. The cut points of the target music can then be set according to the musical features extracted from it. Understandably, the cut points divide the target music into multiple music segments, and each cut video segment matches a corresponding music segment; that is, each video segment exactly fills its corresponding music segment.
In this embodiment, the musical features include rhythm features, and the rhythm features include the sound amplitude information of the target music. The above step of setting cut points for the target music according to its extracted musical features can be understood as: extracting sound amplitude information of a preset frequency range from the target music; and selecting the time points at which the amplitude in the preset frequency range increases sharply as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
Generally, music contains a harmonic component and a rhythmic component. The harmonic component is the music played by pitched instruments, for example orchestral instruments. The rhythmic component is the music played by unpitched instruments, for example drums. The musical features may be the beat information in the rhythmic component, such as the points at which the amplitude suddenly increases. When extracting the musical features of the target music, the harmonic component and the rhythmic component can be separated to obtain the rhythmic component. The sound amplitude information is then extracted from the rhythmic component as the musical feature of the target music.
Further, if the sound corresponding to the rhythmic component is represented as a spectrogram, a time point at which the amplitude increases sharply can be understood as a turning point in time, within the preset frequency range, at which the amplitude changes from decreasing to increasing. Optionally, the amplitude at the turning point is not less than a preset amplitude threshold.
Further, the cut points work together with the cut video segments, serving as the entry points at which video segments are loaded. The interval between adjacent cut points exceeds the preset duration so that the interval is not too short, which would make the inserted video segments too short as well and degrade the playback of the edited video.
In this embodiment, the amplitude threshold, the preset duration, and the preset frequency range can be set as required; no specific limitation is imposed here.
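The cut-point rule described above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the patented implementation: it assumes the amplitude envelope of the preset frequency range has already been extracted as evenly spaced samples, and the function name `find_cut_points` and its parameters are hypothetical.

```python
def find_cut_points(amplitudes, frame_seconds, amp_threshold, min_gap_seconds):
    """Return cut-point times (seconds) from an amplitude envelope.

    A cut point is a sample where the amplitude turns from decreasing to
    increasing, the amplitude is not less than amp_threshold, and the time
    since the previous cut point exceeds min_gap_seconds.
    """
    cut_points = []
    for i in range(1, len(amplitudes) - 1):
        was_falling = amplitudes[i] < amplitudes[i - 1]
        now_rising = amplitudes[i + 1] > amplitudes[i]
        if was_falling and now_rising and amplitudes[i] >= amp_threshold:
            t = i * frame_seconds
            # Enforce the minimum interval between adjacent cut points.
            if not cut_points or t - cut_points[-1] > min_gap_seconds:
                cut_points.append(t)
    return cut_points
```

Raising `min_gap_seconds` thins out the cut points, which in turn lengthens the music segments that the video segments must fill.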
Step S220: cut at least one obtained target video into multiple video segments according to the durations of the music segments.
In this embodiment, the method can take the duration of the longest music segment as the duration of the cut video segments, and then cut the one or more target videos obtained in advance into multiple video segments of the same fixed duration. Understandably, the fixed duration is the longest interval between adjacent cut points, so that a video segment can fill the span between any two adjacent cut points in the target music; this prevents the edited video from having stretches where music plays but no video content is shown.
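A minimal sketch of step S220 follows. The names `music_segment_durations` and `split_videos` are hypothetical, and dropping a trailing remainder shorter than the fixed duration is an assumption; the text does not say how leftovers are handled.

```python
def music_segment_durations(cut_points):
    """Durations of the music segments between consecutive cut points."""
    return [b - a for a, b in zip(cut_points, cut_points[1:])]


def split_videos(video_durations, cut_points):
    """Cut each video into fixed-length segments.

    The fixed length is the longest music segment. Each result entry is
    (video_id, segment_index, start, end). A remainder shorter than the
    fixed length is dropped (an assumption, not stated in the source).
    """
    seg_len = max(music_segment_durations(cut_points))
    segments = []
    for video_id, duration in enumerate(video_durations):
        count = int(duration // seg_len)
        for k in range(count):
            segments.append((video_id, k, k * seg_len, (k + 1) * seg_len))
    return seg_len, segments
```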
Step S230: select a preset number of video segments from the multiple video segments as target video segments.
In this embodiment, the number of selected target video segments can equal the number of music segments, so that the target video segments correspond one-to-one with the music segments.
Step S240: using a filling algorithm, calculate the filling value of each target video segment from the weight and filling position of that target video segment and, according to the filling value of each target video segment, insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments together form a new video file with the maximum overall value.
In this embodiment, the filling algorithm is an editing filling algorithm. It can be understood as seeking a best-fit strategy for inserting the target video segments between the cut points of the target music so that the overall value is maximized. The overall value can be understood as the quality of the edited video, for example the degree of matching between video segments and music segments and the continuity between video segments.
Further, the overall value can be the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self value and a discrete value. The self value is associated with the weight of the corresponding target video segment. Specifically, the self value can be calculated by analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, smile information in faces, and manually assigned label information; and, according to the feature information, assigning the weight corresponding to each video segment as the self value of that target video segment.
In this embodiment, each kind of feature information is given a corresponding weight before weights are assigned. For example, if analysis of a video segment yields smile information in a face, the pre-stored weight corresponding to smile information is retrieved and used as the weight of that video segment. In other embodiments, the weight of a video segment can also be set manually. The magnitudes of the weights can be set according to the actual situation; no specific limitation is imposed here.
Further, the feature information can also include other information to enrich the types of video segments. For example, the feature information can include animal image information, such as images of cats or dogs, which is not described in detail here.
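The weight lookup described above might look like the following sketch. The weight table values are invented for illustration, and summing the weights when several features are detected is an assumption; the text only describes retrieving the pre-stored weight for a detected feature.

```python
# Hypothetical per-feature weights; the source leaves the actual values open.
FEATURE_WEIGHTS = {
    "face": 2,
    "multi_person": 3,
    "smile": 4,
    "manual_label": 5,
    "animal": 1,
}


def self_value(detected_features, manual_weight=None):
    """Self value of a video segment from its detected features.

    A manually set weight, if given, takes precedence. Otherwise the
    pre-stored weights of all detected features are summed (an assumption).
    """
    if manual_weight is not None:
        return manual_weight
    return sum(FEATURE_WEIGHTS.get(name, 0) for name in detected_features)
```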
In this embodiment, the discrete value takes one of two forms. If at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments; if the target video segments do not come from the same target video, the discrete value is a preset value. The preset value can be set as required; no specific limitation is imposed here.
Optionally, the minimum discrete value of target video segments that are not in the same target video can be set greater than the maximum self value, so that two target video segments placed in adjacent music segments are not also adjacent in the original target video. This design improves the visual effect of the resulting video file.
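One way to read the two cases is sketched below. The text says only that the discrete value is "associated with" the distance inside the source video; using the smallest index distance to any other selected segment from the same video is an assumption, as are the function and parameter names.

```python
def discrete_value(segment, other_targets, preset_value):
    """Discrete value of one target segment.

    segment and each entry of other_targets are (video_id, segment_index).
    If no other selected segment shares the source video, the preset value
    is returned. Otherwise the value is the smallest index distance to a
    selected segment from the same video (an assumed distance measure).
    """
    video_id, index = segment
    same_source = [i for v, i in other_targets if v == video_id]
    if not same_source:
        return preset_value
    return min(abs(index - i) for i in same_source)
```

Choosing `preset_value` larger than the largest self value, as the optional design above suggests, makes segments from different source videos consistently more attractive neighbours than segments from the same video.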
Further, step S240 can include one or more sub-steps. For example, refer to Fig. 3, a flowchart of the sub-steps of step S240 shown in Fig. 2. In this embodiment, step S240 can include sub-step S241 and sub-step S242.
Sub-step S241: using a greedy approximation algorithm, iteratively calculate the self value and discrete value of each video segment taken as a target video segment, to obtain the overall values of multiple corresponding video files.
Sub-step S242: select, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
In this embodiment, the dynamic programming algorithm used to solve the 0-1 knapsack problem can be used to obtain the filling arrangement with the maximum overall value. A greedy approximation algorithm can then be applied iteratively: each result is compared with the previous one, the arrangement with the smaller overall value is discarded, and the video file with the larger overall value is kept as the result of the current iteration. Through this iteration, the filling arrangement with the maximum overall value is finally selected, yielding the video file with the maximum overall value.
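The greedy part of this procedure can be sketched as follows. This is a simplified illustration, not the knapsack dynamic program mentioned above: it assumes the discrete value of a segment depends only on the segment placed immediately before it, and `PRESET_DISCRETE`, `fill_value`, and `greedy_fill` are hypothetical names.

```python
PRESET_DISCRETE = 10  # hypothetical preset, chosen above the largest weight


def fill_value(segment, previous):
    """Filling value = self value (weight) + discrete value."""
    video_id, index, weight = segment
    if previous is None or previous[0] != video_id:
        discrete = PRESET_DISCRETE
    else:
        # Same source video: farther apart in the source scores higher.
        discrete = abs(index - previous[1])
    return weight + discrete


def greedy_fill(candidates, slot_count):
    """Fill slots one by one with the best remaining segment.

    candidates are (video_id, segment_index, weight) tuples. Returns the
    chosen order and the overall value (sum of filling values).
    """
    remaining = list(candidates)
    placed, overall, previous = [], 0, None
    for _ in range(min(slot_count, len(remaining))):
        best = max(remaining, key=lambda s: fill_value(s, previous))
        overall += fill_value(best, previous)
        placed.append(best)
        remaining.remove(best)
        previous = best
    return placed, overall
```

Running this from several starting points and keeping the arrangement with the largest overall value would mirror the compare-and-discard iteration described above.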
Further, the video segments inserted between the corresponding cut points of the target music are filled in order of decreasing weight, provided the discreteness condition is satisfied. That is, provided that video segments placed adjacently in the target music are not adjacent in the original target video, the target video segments can be filled in order of decreasing weight.
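A small sketch of this ordering rule, with hypothetical names: segments are taken in decreasing weight, skipping any candidate that was adjacent to the previously placed segment in the same source video. Falling back to the heaviest remaining segment when every candidate violates the condition is an assumption.

```python
def adjacent_in_source(a, b):
    """True if two (video_id, index, weight) segments are source neighbours."""
    return a[0] == b[0] and abs(a[1] - b[1]) == 1


def order_by_weight(segments):
    """Order segments by decreasing weight under the discreteness condition."""
    remaining = sorted(segments, key=lambda s: s[2], reverse=True)
    ordered = []
    while remaining:
        pick = next(
            (s for s in remaining
             if not ordered or not adjacent_in_source(ordered[-1], s)),
            remaining[0],  # fallback when no candidate satisfies the condition
        )
        ordered.append(pick)
        remaining.remove(pick)
    return ordered
```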
Further, before the target video segments are inserted, the method can also include correcting the length of each target video segment so that it equals the length of the corresponding music segment in the target music.
Further, the target video segment can be trimmed so that its length equals the length of the music segment between the corresponding adjacent cut points. Alternatively, the target video segment can be sped up or slowed down so that its length equals the length of the music segment. With this design, the edited video file stays continuous with the target music while the video itself plays continuously, which improves the viewing experience and thus the quality of the video file.
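The two correction options can be expressed as a small planning function. The name `fit_segment` and the returned dictionary layout are hypothetical; actually applying the trim or speed change would be left to a video processing tool.

```python
def fit_segment(segment_duration, slot_duration, mode="trim"):
    """Plan how to make a video segment match its music segment.

    mode="trim":  keep the first slot_duration seconds at normal speed.
    mode="speed": keep the whole segment and change the playback speed
                  (speed > 1 plays faster, speed < 1 plays slower).
    """
    if mode == "trim":
        return {"start": 0.0,
                "end": min(segment_duration, slot_duration),
                "speed": 1.0}
    if mode == "speed":
        return {"start": 0.0,
                "end": segment_duration,
                "speed": segment_duration / slot_duration}
    raise ValueError(f"unknown mode: {mode}")
```

Because step S220 cuts every video segment to the longest music segment, a segment is never shorter than its slot, so trimming alone is always sufficient; the speed option keeps all of the segment's content instead.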
Please refer to Fig. 4, which is a block diagram of the audio-video matching editing device 100 provided by a preferred embodiment of the present invention. The preferred embodiment of the present invention also provides an audio-video matching editing device 100, which may include an acquiring unit 110, a cutting unit 120, a selecting unit 130 and a video synthesis unit 140.
The acquiring unit 110 is used to obtain target music pre-marked with multiple cut points; the cut points divide the target music into multiple music segments. Specifically, the acquiring unit 110 can be used to perform step S210 shown in Fig. 2; for the specific operation, refer to the detailed description of step S210.
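Claim 5 describes how such cut points may be marked in advance: time points where the amplitude in a preset frequency band rises sharply are chosen, with adjacent cut points kept more than a preset duration apart. A minimal sketch of that selection follows. It assumes the band amplitude has already been extracted into an evenly sampled envelope. The function name, the parameter names, and the use of a frame-to-frame difference as the measure of a sharp rise are all assumptions.

```python
from typing import List, Sequence


def find_cut_points(
    envelope: Sequence[float],  # band amplitude per frame (assumed precomputed)
    frame_sec: float,           # duration of one frame in seconds
    jump: float,                # minimum frame-to-frame rise counted as "sharp"
    min_gap: float,             # preset duration adjacent cut points must exceed
) -> List[float]:
    """Return cut-point times in seconds."""
    cuts: List[float] = []
    for i in range(1, len(envelope)):
        t = i * frame_sec
        rose_sharply = envelope[i] - envelope[i - 1] >= jump
        far_enough = not cuts or t - cuts[-1] > min_gap
        if rose_sharply and far_enough:
            cuts.append(t)
    return cuts
```

A rise that follows the previous cut point too closely is skipped, which keeps every music segment longer than the preset duration.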
The cutting unit 120 is used to cut at least one obtained target video into multiple video segments according to the durations of the music segments. Specifically, the cutting unit 120 can be used to perform step S220 shown in Fig. 2; for the specific operation, refer to the detailed description of step S220.
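Combined with claim 6, which uses the duration of the longest music segment as the duration of each cut video segment, the cutting step might look like the sketch below. The function name is hypothetical, and dropping a trailing remainder shorter than one segment is an assumption not stated in the source.

```python
from typing import List, Sequence, Tuple


def cut_video(video_len: float, music_segment_lens: Sequence[float]) -> List[Tuple[float, float]]:
    """Split a video of video_len seconds into equal (start, end) segments
    whose duration equals the longest music segment."""
    seg_len = max(music_segment_lens)
    count = int(video_len // seg_len)  # assumption: discard the short tail
    return [(i * seg_len, (i + 1) * seg_len) for i in range(count)]
```

Cutting to the longest music segment means every video segment is at least as long as any slot it may later fill, so the later length correction only needs to shorten segments.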
The selecting unit 130 is used to select a preset number of video segments from the multiple video segments as target video segments. Specifically, the selecting unit 130 can be used to perform step S230 shown in Fig. 2; for the specific operation, refer to the detailed description of step S230.
The video synthesis unit 140 is used to apply a filling algorithm: it calculates the filling value of each target video segment from the segment's weight and filling position and, according to the filling value of each target video segment, inserts the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments are matched and combined into the new video file with the maximum overall value. Specifically, the video synthesis unit 140 can be used to perform step S240 shown in Fig. 2; for the specific operation, refer to the detailed description of step S240.
Further, the video synthesis unit 140 can also be used to perform sub-step S241 and sub-step S242 shown in Fig. 3; for the specific operation, refer to the detailed description of sub-steps S241 and S242, which is not repeated here.
In summary, the present invention provides an audio-video matching editing method and device. The method inserts the multiple video segments obtained by cutting into target music marked with multiple cut points to obtain video files, and selects the one with the maximum overall value as the new video file, which simplifies audio-video editing and also improves the quality of the edited video. The method calculates the filling value of each video segment from the weight and filling position of the target video segment and, according to the filling value of each video segment, chooses the filling mode with the maximum overall value to insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments are matched and combined into the new video file with the maximum overall value. The method and device make the video segments in the edited video file match the rhythm of the corresponding music in the target music, which improves the quality of the edited video and also helps improve the user experience.
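The value calculation summarized above can be illustrated with a short sketch. Per claim 2, the overall value is the sum of the filling values, and each filling value includes a self value derived from feature information and a discrete value. Treating the filling value as the plain sum of the two components is an assumption, as are the function names and the numeric feature weights, since the source names the features but gives no numbers.

```python
from typing import Iterable, Tuple

# Hypothetical weights for the features named in the claims.
FEATURE_WEIGHTS = {
    "face": 1.0,
    "multi_person": 1.5,
    "smile": 2.0,
    "manual_mark": 3.0,
}


def self_value(features: Iterable[str]) -> float:
    """Self value: total weight of the features detected in a segment."""
    return sum(FEATURE_WEIGHTS.get(name, 0.0) for name in features)


def filling_value(features: Iterable[str], discrete: float) -> float:
    """Filling value, assumed here to be self value plus discrete value."""
    return self_value(features) + discrete


def overall_value(segments: Iterable[Tuple[Iterable[str], float]]) -> float:
    """Overall value: sum of the filling values of all placed segments.
    Each item is a (features, discrete value) pair."""
    return sum(filling_value(features, discrete) for features, discrete in segments)
```

The discrete value is passed in as a number here; how it depends on the distance between segments from the same target video is left to the caller.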
The foregoing is merely a preferred embodiment of the present invention and is not intended to limit it. For those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent substitution, improvement and the like made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (10)

1. An audio-video matching editing method, characterized in that the method comprises:
obtaining target music pre-marked with multiple cut points, the cut points dividing the target music into multiple music segments;
cutting at least one obtained target video into multiple video segments according to the durations of the music segments;
selecting a preset number of video segments from the multiple video segments as target video segments;
using a filling algorithm, calculating the filling value of each target video segment from the weight and filling position of the target video segment, and, according to the filling value of each target video segment, inserting the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments are matched and combined into a new video file with the maximum overall value.
2. The method according to claim 1, characterized in that the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment includes a self value and a discrete value;
the self value is calculated by:
analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, facial smile information and manual mark information;
assigning, according to the feature information, a weight to each video segment as the self value of the target video segment;
the discrete value is calculated as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
3. The method according to claim 2, characterized in that the step of calculating the filling value of each target video segment from the weight and filling position of the target video segment and, according to the filling value of each target video segment, inserting the target video segments between the corresponding cut points of the target music comprises:
using a greedy approximation algorithm, iteratively calculating the self value and discrete value of each video segment when it serves as a target video segment, to obtain the overall values of multiple corresponding video files;
selecting, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
4. The method according to claim 3, characterized in that the video segments inserted between the corresponding cut points of the target music are filled in order of decreasing weight, provided the discreteness condition is met.
5. The method according to claim 1, characterized in that, before the step of obtaining the target music pre-marked with multiple cut points, the method further comprises:
extracting sound amplitude information of a preset frequency band from the target music;
selecting time points at which the amplitude in the preset frequency band rises sharply as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
6. The method according to claim 1, characterized in that the step of cutting at least one obtained target video into multiple video segments according to the durations of the music segments comprises:
selecting the duration of the longest music segment as the duration of the video segments to be cut.
7. The method according to claim 1, characterized in that the number of video segments inserted into the target music equals the number of music segments of the target music.
8. The method according to claim 1, characterized in that the step of inserting the target video segments between the corresponding cut points of the target music comprises:
correcting the length of each target video segment so that it equals the length of the corresponding music segment in the target music.
9. An audio-video matching editing device, characterized in that it comprises:
an acquiring unit, used to obtain target music pre-marked with multiple cut points, the cut points dividing the target music into multiple music segments;
a cutting unit, used to cut at least one obtained target video into multiple video segments according to the durations of the music segments;
a selecting unit, used to select a preset number of video segments from the multiple video segments as target video segments;
a video synthesis unit, used to apply a filling algorithm, calculate the filling value of each target video segment from the weight and filling position of the target video segment, and, according to the filling value of each target video segment, insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments are matched and combined into a new video file with the maximum overall value.
10. The device according to claim 9, characterized in that the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment includes a self value and a discrete value;
the video synthesis unit calculates the self value by:
analyzing the feature information of each target video segment, the feature information including face information, multi-person scene information, facial smile information and manual mark information;
assigning, according to the feature information, a weight to each video segment as the self value of the target video segment;
the video synthesis unit calculates the discrete value as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
CN201710701832.XA 2017-08-16 2017-08-16 Audio-video matches clipping method and device Active CN107483843B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710701832.XA CN107483843B (en) 2017-08-16 2017-08-16 Audio-video matches clipping method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710701832.XA CN107483843B (en) 2017-08-16 2017-08-16 Audio-video matches clipping method and device

Publications (2)

Publication Number Publication Date
CN107483843A true CN107483843A (en) 2017-12-15
CN107483843B CN107483843B (en) 2019-11-15

Family

ID=60600510

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710701832.XA Active CN107483843B (en) 2017-08-16 2017-08-16 Audio-video matches clipping method and device

Country Status (1)

Country Link
CN (1) CN107483843B (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108830208A (en) * 2018-06-08 2018-11-16 Oppo广东移动通信有限公司 Method for processing video frequency and device, electronic equipment, computer readable storage medium
CN108986056A (en) * 2018-08-24 2018-12-11 潘小亮 Content requirements judge system
CN109167934A (en) * 2018-09-03 2019-01-08 咪咕视讯科技有限公司 A kind of method for processing video frequency, device and computer readable storage medium
CN109168084A (en) * 2018-10-24 2019-01-08 麒麟合盛网络技术股份有限公司 A kind of method and apparatus of video clipping
CN110233976A (en) * 2019-06-21 2019-09-13 广州酷狗计算机科技有限公司 The method and device of Video Composition
CN110336960A (en) * 2019-07-17 2019-10-15 广州酷狗计算机科技有限公司 Method, apparatus, terminal and the storage medium of Video Composition
CN110545476A (en) * 2019-09-23 2019-12-06 广州酷狗计算机科技有限公司 Video synthesis method and device, computer equipment and storage medium
CN110650368A (en) * 2019-09-25 2020-01-03 新东方教育科技集团有限公司 Video processing method and device and electronic equipment
CN110955786A (en) * 2019-11-29 2020-04-03 网易(杭州)网络有限公司 Dance action data generation method and device
CN111008287A (en) * 2019-12-19 2020-04-14 Oppo(重庆)智能科技有限公司 Audio and video processing method and device, server and storage medium
CN111064992A (en) * 2019-12-10 2020-04-24 懂频智能科技(上海)有限公司 Method for automatically switching video contents according to music beats
CN111541946A (en) * 2020-07-10 2020-08-14 成都品果科技有限公司 Automatic video generation method and system for resource matching based on materials
CN111556254A (en) * 2020-04-10 2020-08-18 早安科技(广州)有限公司 Method, system, medium and intelligent device for video cutting by using video content
CN111683209A (en) * 2020-06-10 2020-09-18 北京奇艺世纪科技有限公司 Mixed-cut video generation method and device, electronic equipment and computer-readable storage medium
CN112188307A (en) * 2019-07-03 2021-01-05 腾讯科技(深圳)有限公司 Video resource synthesis method and device, storage medium and electronic device
CN112235631A (en) * 2019-07-15 2021-01-15 北京字节跳动网络技术有限公司 Video processing method and device, electronic equipment and storage medium
CN112449231A (en) * 2019-08-30 2021-03-05 腾讯科技(深圳)有限公司 Multimedia file material processing method and device, electronic equipment and storage medium
WO2021088830A1 (en) * 2019-11-04 2021-05-14 北京字节跳动网络技术有限公司 Method and apparatus for displaying music points, and electronic device and medium
WO2021121023A1 (en) * 2019-12-17 2021-06-24 Oppo广东移动通信有限公司 Video editing method, video editing apparatus, terminal, and readable storage medium
CN113077470A (en) * 2021-03-26 2021-07-06 天翼爱音乐文化科技有限公司 Method, system, device and medium for cutting horizontal and vertical screen conversion picture
CN114390367A (en) * 2020-10-16 2022-04-22 上海哔哩哔哩科技有限公司 Audio and video processing method and device
CN114390352A (en) * 2020-10-16 2022-04-22 上海哔哩哔哩科技有限公司 Audio and video processing method and device
WO2022152064A1 (en) * 2021-01-15 2022-07-21 北京字跳网络技术有限公司 Video generation method and apparatus, electronic device, and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030160944A1 (en) * 2002-02-28 2003-08-28 Jonathan Foote Method for automatically producing music videos
CN101107667A (en) * 2004-12-17 2008-01-16 诺基亚公司 Method and apparatus for video editing on small screen with minimal input device
CN101640057A (en) * 2009-05-31 2010-02-03 北京中星微电子有限公司 Audio and video matching method and device therefor
EP2993668A1 (en) * 2014-09-08 2016-03-09 Thomson Licensing Method for editing an audiovisual segment and corresponding device and computer program product
CN105530440A (en) * 2014-09-29 2016-04-27 北京金山安全软件有限公司 Video production method and device

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108830208A (en) * 2018-06-08 2018-11-16 Oppo广东移动通信有限公司 Method for processing video frequency and device, electronic equipment, computer readable storage medium
CN108986056A (en) * 2018-08-24 2018-12-11 潘小亮 Content requirements judge system
CN109167934A (en) * 2018-09-03 2019-01-08 咪咕视讯科技有限公司 A kind of method for processing video frequency, device and computer readable storage medium
CN109167934B (en) * 2018-09-03 2020-12-22 咪咕视讯科技有限公司 Video processing method and device and computer readable storage medium
CN109168084A (en) * 2018-10-24 2019-01-08 麒麟合盛网络技术股份有限公司 A kind of method and apparatus of video clipping
CN110233976B (en) * 2019-06-21 2022-09-09 广州酷狗计算机科技有限公司 Video synthesis method and device
CN110233976A (en) * 2019-06-21 2019-09-13 广州酷狗计算机科技有限公司 The method and device of Video Composition
CN112188307A (en) * 2019-07-03 2021-01-05 腾讯科技(深圳)有限公司 Video resource synthesis method and device, storage medium and electronic device
GB2600309B (en) * 2019-07-15 2024-01-31 Beijing Bytedance Network Tech Co Ltd Video processing method and apparatus, and electronic device and storage medium
WO2021008394A1 (en) * 2019-07-15 2021-01-21 北京字节跳动网络技术有限公司 Video processing method and apparatus, and electronic device and storage medium
GB2600309A (en) * 2019-07-15 2022-04-27 Beijing Bytedance Network Tech Co Ltd Video processing method and apparatus, and electronic device and storage medium
CN112235631A (en) * 2019-07-15 2021-01-15 北京字节跳动网络技术有限公司 Video processing method and device, electronic equipment and storage medium
CN110336960A (en) * 2019-07-17 2019-10-15 广州酷狗计算机科技有限公司 Method, apparatus, terminal and the storage medium of Video Composition
CN110336960B (en) * 2019-07-17 2021-12-10 广州酷狗计算机科技有限公司 Video synthesis method, device, terminal and storage medium
CN112449231A (en) * 2019-08-30 2021-03-05 腾讯科技(深圳)有限公司 Multimedia file material processing method and device, electronic equipment and storage medium
CN110545476A (en) * 2019-09-23 2019-12-06 广州酷狗计算机科技有限公司 Video synthesis method and device, computer equipment and storage medium
CN110545476B (en) * 2019-09-23 2022-03-25 广州酷狗计算机科技有限公司 Video synthesis method and device, computer equipment and storage medium
CN110650368B (en) * 2019-09-25 2022-04-26 新东方教育科技集团有限公司 Video processing method and device and electronic equipment
CN110650368A (en) * 2019-09-25 2020-01-03 新东方教育科技集团有限公司 Video processing method and device and electronic equipment
US11335379B2 (en) 2019-09-25 2022-05-17 New Oriental Education & Technology Group Inc. Video processing method, device and electronic equipment
WO2021088830A1 (en) * 2019-11-04 2021-05-14 北京字节跳动网络技术有限公司 Method and apparatus for displaying music points, and electronic device and medium
US11587593B2 (en) 2019-11-04 2023-02-21 Beijing Bytedance Network Technology Co., Ltd. Method and apparatus for displaying music points, and electronic device and medium
CN110955786B (en) * 2019-11-29 2023-10-27 网易(杭州)网络有限公司 Dance action data generation method and device
CN110955786A (en) * 2019-11-29 2020-04-03 网易(杭州)网络有限公司 Dance action data generation method and device
CN111064992A (en) * 2019-12-10 2020-04-24 懂频智能科技(上海)有限公司 Method for automatically switching video contents according to music beats
WO2021121023A1 (en) * 2019-12-17 2021-06-24 Oppo广东移动通信有限公司 Video editing method, video editing apparatus, terminal, and readable storage medium
CN111008287B (en) * 2019-12-19 2023-08-04 Oppo(重庆)智能科技有限公司 Audio and video processing method and device, server and storage medium
CN111008287A (en) * 2019-12-19 2020-04-14 Oppo(重庆)智能科技有限公司 Audio and video processing method and device, server and storage medium
CN111556254B (en) * 2020-04-10 2021-04-02 早安科技(广州)有限公司 Method, system, medium and intelligent device for video cutting by using video content
CN111556254A (en) * 2020-04-10 2020-08-18 早安科技(广州)有限公司 Method, system, medium and intelligent device for video cutting by using video content
CN111683209A (en) * 2020-06-10 2020-09-18 北京奇艺世纪科技有限公司 Mixed-cut video generation method and device, electronic equipment and computer-readable storage medium
CN111541946A (en) * 2020-07-10 2020-08-14 成都品果科技有限公司 Automatic video generation method and system for resource matching based on materials
CN114390352A (en) * 2020-10-16 2022-04-22 上海哔哩哔哩科技有限公司 Audio and video processing method and device
CN114390367A (en) * 2020-10-16 2022-04-22 上海哔哩哔哩科技有限公司 Audio and video processing method and device
WO2022152064A1 (en) * 2021-01-15 2022-07-21 北京字跳网络技术有限公司 Video generation method and apparatus, electronic device, and storage medium
CN113077470A (en) * 2021-03-26 2021-07-06 天翼爱音乐文化科技有限公司 Method, system, device and medium for cutting horizontal and vertical screen conversion picture

Also Published As

Publication number Publication date
CN107483843B (en) 2019-11-15

Similar Documents

Publication Publication Date Title
CN107483843A (en) Audio frequency and video match clipping method and device
CN107393569B (en) Audio-video clipping method and device
US8921678B2 (en) Generating tones by combining sound materials
CN110415723B (en) Method, device, server and computer readable storage medium for audio segmentation
CN106652997A (en) Audio synthesis method and terminal
CN108877753B (en) Music synthesis method and system, terminal and computer readable storage medium
CN109741425B (en) Banner picture generation method and device, storage medium and computer equipment
CN110519638A (en) Processing method, processing unit, electronic device and storage medium
CN107978310B (en) Audio processing method and device
US11511200B2 (en) Game playing method and system based on a multimedia file
CN106468987B (en) Information processing method and client
CN104038473A (en) Method of audio ad insertion, device, equipment and system
CN110377212B (en) Method, apparatus, computer device and storage medium for triggering display through audio
CN106775568A (en) A kind of effect adjusting method, device and mobile terminal
US20130263720A1 (en) Music piece order determination device, music piece order determination method, and music piece order determination program
CN112269898A (en) Background music obtaining method and device, electronic equipment and readable storage medium
CN105183853A (en) Method and device used for presenting label page
CN107481739B (en) Audio cutting method and device
CN109859739B (en) Melody generation method and device based on voice synthesis and terminal equipment
CN106448713A (en) Audio frequency playing method and audio frequency playing device
CN105118081A (en) Processing method and device for picture synthesis video
CN109327731B (en) Method and system for synthesizing DIY video in real time based on karaoke
US7612279B1 (en) Methods and apparatus for structuring audio data
CN107688661B (en) Lyric similarity calculation method, terminal device and computer-readable storage medium
CN113674725B (en) Audio mixing method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant