CN107483843B - Audio-video matching editing method and device

Publication number
CN107483843B
Authority
CN
China
Prior art keywords
video
target
target video
music
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710701832.XA
Other languages
Chinese (zh)
Other versions
CN107483843A (en)
Inventor
陈杰
徐滢
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Pinguo Technology Co Ltd
Original Assignee
Chengdu Pinguo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Pinguo Technology Co Ltd
Priority to CN201710701832.XA
Publication of CN107483843A
Application granted
Publication of CN107483843B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/222 Studio circuitry; Studio devices; Studio equipment
    • H04N5/262 Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects; Cameras specially adapted for the electronic generation of special effects
    • H04N5/265 Mixing

Abstract

The present invention provides an audio-video matching editing method and device, relating to the field of multimedia data processing. The method and device obtain target music pre-marked with multiple cut points, the cut points dividing the target music into multiple music segments; cut at least one obtained target video into multiple video segments according to the durations of the music segments; select a preset number of video segments from the multiple video segments as target video segments; and, using a filling algorithm, calculate the filling value of each target video segment according to its weight and filling position and fill the target video segments between the corresponding cut points of the target music, so that the filled target video segments and the music segments are matched to form a new video file with the maximum overall value. Based on this design, the method and device simplify audio-video editing while also improving the quality of the edited video.

Description

Audio-video matching editing method and device
Technical field
The present invention relates to the field of multimedia data processing, and in particular to an audio-video matching editing method and device.
Background art
At present, editing audio or video is usually done entirely by hand. For example, an operator uses video editing software to combine several video segments into one video, then edits the background music so that its duration equals the duration of the video, and finally loads the background music into the video to obtain a new video. In the prior art, the editing operation is complicated and places high technical demands on the operator; otherwise the edited video is prone to a mismatch between the video content and the rhythm of the music, which degrades the quality of the video. How to provide a method and device that are simple to operate and improve the quality of the edited video has therefore become a technical problem that those skilled in the art urgently need to solve.
Summary of the invention
To overcome the above deficiencies of the prior art, the present invention provides an audio-video matching editing method and device that address the above problems.
To achieve the above object, the technical solution provided by the preferred embodiments of the present invention is as follows:
A preferred embodiment of the present invention provides an audio-video matching editing method, the method comprising:
obtaining target music pre-marked with multiple cut points, the target music being divided by the cut points into multiple music segments;
cutting at least one obtained target video into multiple video segments according to the durations of the music segments;
selecting a preset number of video segments from the multiple video segments as target video segments;
using a filling algorithm, calculating the filling value of each target video segment according to its weight and filling position, and, according to the filling value of each target video segment, filling the target video segments between the corresponding cut points of the target music, so that the filled target video segments and the music segments are matched to form a new video file with the maximum overall value.
In a preferred embodiment of the present invention, the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self-value and a discrete value;
the self-value is calculated by:
analyzing the feature information of each target video segment, the feature information comprising face information, multi-person scene information, smile information in faces, and manual marking information;
assigning, according to the feature information, a corresponding weight to each video segment as the self-value of that target video segment;
the discrete value is calculated as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
In a preferred embodiment of the present invention, the step of calculating the filling value of each target video segment according to its weight and filling position and, according to the filling value of each target video segment, filling the target video segments between the corresponding cut points of the target music comprises:
using a greedy approximation algorithm, iteratively calculating the self-value and discrete value of each video segment taken as a target video segment, so as to obtain the overall values of multiple corresponding video files;
selecting, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
In a preferred embodiment of the present invention, the video segments between the corresponding cut points of the target music are filled in order of decreasing weight, subject to the discreteness condition being met.
In a preferred embodiment of the present invention, before the step of obtaining the target music pre-marked with multiple cut points, the method further comprises:
extracting sound amplitude information within a preset frequency range from the target music;
selecting the time points at which the amplitude surges within the preset frequency range as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
In a preferred embodiment of the present invention, the step of cutting at least one obtained target video into multiple video segments according to the durations of the music segments comprises:
selecting the duration of the longest music segment as the duration of each cut video segment.
In a preferred embodiment of the present invention, the number of video segments filled into the target music equals the number of music segments of the target music.
In a preferred embodiment of the present invention, the step of filling the target video segments between the corresponding cut points of the target music comprises:
correcting the length of each target video segment so that it equals the length of the corresponding music segment in the target music.
A preferred embodiment of the present invention also provides an audio-video matching editing device, comprising:
an acquiring unit, configured to obtain target music pre-marked with multiple cut points, the target music being divided by the cut points into multiple music segments;
a cutting unit, configured to cut at least one obtained target video into multiple video segments according to the durations of the music segments;
a selection unit, configured to select a preset number of video segments from the multiple video segments as target video segments;
a video composition unit, configured to use a filling algorithm to calculate the filling value of each target video segment according to its weight and filling position and, according to the filling value of each target video segment, to fill the target video segments between the corresponding cut points of the target music, so that the filled target video segments and the music segments are matched to form a new video file with the maximum overall value.
In a preferred embodiment of the present invention, the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self-value and a discrete value;
the video composition unit calculates the self-value by:
analyzing the feature information of each target video segment, the feature information comprising face information, multi-person scene information, smile information in faces, and manual marking information;
assigning, according to the feature information, a corresponding weight to each video segment as the self-value of that target video segment;
the video composition unit calculates the discrete value as follows:
if at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments;
if the target video segments do not come from the same target video, the discrete value is a preset value.
Compared with the prior art, the audio-video matching editing method and device provided by the present invention have at least the following beneficial effects: the method fills multiple cut video segments into target music marked with multiple cut points to obtain video files, and selects the one with the maximum overall value as the new video file, which simplifies audio-video editing while also improving the quality of the edited video. Specifically, the method uses a filling algorithm to calculate the filling value of each video segment according to the weight and filling position of the target video segment and, according to the filling value of each video segment, fills the target video segments between the corresponding cut points of the target music, so that the filled target video segments and the music segments are matched to form a new video file with the maximum overall value. The method and device make the video segments in the edited video file match the rhythm of the corresponding target music, which improves the quality of the edited video and also helps improve the user experience.
To make the above objects, features, and advantages of the present invention clearer and easier to understand, preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention more clearly, the drawings needed in the embodiments are briefly described below. It should be understood that the following drawings show only certain embodiments of the present invention and therefore should not be regarded as limiting its scope; those of ordinary skill in the art can obtain other related drawings from these drawings without creative effort.
Fig. 1 is a block diagram of the terminal device provided by a preferred embodiment of the present invention.
Fig. 2 is a flow diagram of the audio-video matching editing method provided by a preferred embodiment of the present invention.
Fig. 3 is a flow diagram of the sub-steps of step S240 shown in Fig. 2.
Fig. 4 is a block diagram of the audio-video matching editing device provided by a preferred embodiment of the present invention.
Reference numerals: 10 - terminal device; 11 - processor; 12 - memory; 13 - display unit; 100 - audio-video matching editing device; 110 - acquiring unit; 120 - cutting unit; 130 - selection unit; 140 - video composition unit.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. The components of the embodiments of the present invention, as generally described and illustrated in the drawings herein, can be arranged and designed in a variety of different configurations.
Therefore, the following detailed description of the embodiments of the present invention provided in the drawings is not intended to limit the scope of the claimed invention, but merely represents selected embodiments of the invention. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
It should also be noted that similar reference numerals and letters denote similar items in the following drawings; therefore, once an item is defined in one drawing, it does not need to be further defined or explained in subsequent drawings.
Some embodiments of the present invention are described in detail below with reference to the drawings. Provided there is no conflict, the following embodiments and the features in them can be combined with one another.
Referring to Fig. 1, which is a block diagram of the terminal device 10 provided by a preferred embodiment of the present invention. In this embodiment, the terminal device 10 can serve as an operating platform on which a user edits video and audio. The terminal device 10 may include a processor 11, a memory 12, and an audio-video matching editing device 100. The user can use the audio-video matching editing device 100 in the terminal device 10 to cut, edit, and compose audio and video, so as to obtain the edited video file, which simplifies the audio-video editing workflow.
Further, the terminal device 10 may also include other elements, such as a display unit 13. The processor 11, the memory 12, and the display unit 13 are electrically connected to one another, directly or indirectly, to enable data transmission and interaction. The audio-video matching editing device 100 includes at least one software function module that can be stored in the memory 12 in the form of software or firmware, or built into the operating system (OS) of the terminal device 10. The memory 12 can store data such as audio data and video data. The processor 11 is configured to execute the executable modules stored in the memory 12, such as the software function modules and computer programs included in the audio-video matching editing device 100.
Further, the memory 12 may be, but is not limited to, random access memory (RAM), read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), and so on. The memory 12 is used to store programs, and the processor 11 executes a program after receiving an execution instruction. Access to the memory 12 by the processor 11 and other possible components can be carried out under the control of a memory controller.
The processor 11 may be a general-purpose processor, including a central processing unit (CPU), a network processor (NP), and so on; it may also be a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, or a discrete hardware component. It can implement or execute the methods, steps, and logic block diagrams disclosed in the embodiments of the present invention. The general-purpose processor may be a microprocessor, or the processor 11 may be any conventional processor.
In this embodiment, the display unit 13 is used to play the video and audio (for example, the target music) edited on the terminal device 10. The display unit 13 can also be used to display the usage history of audio or video. In addition, the display unit 13 can display an editing toolbar that the user has arranged according to their own editing habits, for ease of operation. The display unit 13 may be, but is not limited to, a touch display screen, an ordinary liquid crystal display screen, and so on; no specific limitation is imposed here.
It can be understood that the structure shown in Fig. 1 is only one schematic structure of the terminal device 10; the terminal device 10 may include more or fewer components than shown in Fig. 1, or have a configuration different from that shown in Fig. 1. Each component shown in Fig. 1 can be implemented in hardware, software, or a combination of the two.
In this embodiment, the terminal device 10 may be, but is not limited to, a smartphone, a personal computer (PC), a tablet computer, a personal digital assistant (PDA), and so on. Preferably, the terminal device 10 is a smartphone.
Referring to Fig. 2, which is a flow diagram of the audio-video matching editing method provided by a preferred embodiment of the present invention. In this embodiment, the audio-video matching editing method is applied to the terminal device 10 shown in Fig. 1. The method fills cut video segments into target music marked with cut points to form a new video file, thereby simplifying the steps of editing audio and video. The detailed flow and steps of the audio-video matching editing method shown in Fig. 2 are described below.
In the embodiments of the present invention, the audio-video matching editing method comprises the following steps:
Step S210: obtain target music pre-marked with multiple cut points, the target music being divided by the cut points into multiple music segments.
In this embodiment, the obtained target music has multiple cut points set in advance. The cut points serve as the points at which video segments are filled in, so that each filled video segment matches its corresponding music segment. In addition, the user can set the number of cut points according to the specific situation; no specific limitation is imposed here.
Before step S210, the memory 12 may store one or more songs in advance, and the user can select one of them according to personal preference as the background music of the video to be edited, that is, the target music. Of course, one piece of music can also be selected at random as the target music. Cut points can then be set for the target music according to the musical features extracted from it. Understandably, the target music is divided into multiple music segments by the marked cut points, and each cut video segment is matched with its corresponding music segment, that is, the matched video segment exactly fills the corresponding music segment.
In this embodiment, the musical features include a rhythm feature, and the rhythm feature includes the sound amplitude information of the target music. The step of setting cut points for the target music according to the extracted musical features can be understood as: extracting sound amplitude information within a preset frequency range from the target music; and selecting the time points at which the amplitude surges within the preset frequency range as the cut points, such that the interval between adjacent cut points exceeds a preset duration.
Generally, music contains a harmonic component and a rhythm component. Understandably, the harmonic component is the music played by pitched instruments, for example, orchestral instruments. The rhythm component is the music played by unpitched instruments, for example, drums. The musical features may be the beat information in the rhythm component, such as the points at which the amplitude suddenly increases. When extracting the musical features of the target music, the harmonic component and the rhythm component of the target music can be separated to obtain the rhythm component. The sound amplitude information is then extracted from the rhythm component as the musical feature of the target music.
Further, if the sound corresponding to the rhythm component is converted into a spectrogram, a time point of amplitude surge can be understood as a turning point in time, within the preset frequency range, at which the amplitude changes from decreasing to increasing. Optionally, the amplitude at the turning point is not less than a preset amplitude threshold.
Further, the cut points serve as the entry points at which the cut video segments are loaded. The interval between adjacent cut points exceeds the preset duration, which prevents the interval from being too short; an overly short interval would make the filled video segment too short as well and would degrade the playback of the edited video.
In this embodiment, the amplitude threshold, the preset duration, and the preset frequency range can be set according to the specific situation; no specific limitation is imposed here.
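The cut-point selection described for step S210 can be sketched as follows. This is a minimal illustration in Python, not the patented implementation: it assumes the amplitude envelope of the rhythm component within the preset frequency range has already been extracted as a list of samples, and the names `find_cut_points`, `min_interval`, and `amp_threshold` are hypothetical.

```python
from typing import List, Sequence


def find_cut_points(
    times: Sequence[float],
    amplitudes: Sequence[float],
    min_interval: float,
    amp_threshold: float = 0.0,
) -> List[float]:
    """Return the times at which the amplitude turns from falling to rising.

    times[i] is the timestamp (seconds) of amplitudes[i]. A turning point is
    kept only if its amplitude is not below amp_threshold and it lies more
    than min_interval after the previously kept cut point.
    """
    cut_points: List[float] = []
    for i in range(1, len(amplitudes) - 1):
        falling_before = amplitudes[i] < amplitudes[i - 1]
        rising_after = amplitudes[i + 1] > amplitudes[i]
        if not (falling_before and rising_after):
            continue  # not a decrease-to-increase turning point
        if amplitudes[i] < amp_threshold:
            continue  # optional amplitude threshold from the description
        if cut_points and times[i] - cut_points[-1] <= min_interval:
            continue  # interval to the previous cut point is too short
        cut_points.append(times[i])
    return cut_points
```

Raising `min_interval` thins out the cut points, which is the mechanism the description relies on to avoid overly short video segments.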
Step S220: cut at least one obtained target video into multiple video segments according to the durations of the music segments.
In this embodiment, the method may take the duration of the longest music segment as the duration of the cut video segments, and then cut the one or more target videos obtained in advance into multiple video segments of the same fixed duration. Understandably, the fixed duration is the longest interval between adjacent cut points, so that a video segment can fill the span between any two adjacent cut points in the target music. This avoids the situation in which music plays in the edited video while no video content is shown.
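Step S220 can be illustrated with the short sketch below. It is only an assumption-laden example: it treats the start and end of the music as segment boundaries, discards any trailing remainder of a video that is shorter than the fixed duration, and uses hypothetical names (`music_segment_durations`, `cut_videos`).

```python
from typing import List, Sequence, Tuple


def music_segment_durations(
    cut_points: Sequence[float], music_duration: float
) -> List[float]:
    """Durations of the music segments delimited by the cut points."""
    bounds = [0.0] + sorted(cut_points) + [music_duration]
    return [b - a for a, b in zip(bounds, bounds[1:]) if b > a]


def cut_videos(
    video_durations: Sequence[float], segment_duration: float
) -> List[Tuple[int, float, float]]:
    """Cut each video into (video_index, start, end) pieces of fixed length."""
    segments: List[Tuple[int, float, float]] = []
    for video_index, duration in enumerate(video_durations):
        count = int(duration // segment_duration)  # whole segments only
        for k in range(count):
            start = k * segment_duration
            segments.append((video_index, start, start + segment_duration))
    return segments


# The fixed duration is the longest music segment, as in the description.
durations = music_segment_durations([2.0, 5.0, 6.0], music_duration=8.0)
fixed = max(durations)
pieces = cut_videos([7.0, 3.0], fixed)
```

Because every piece is as long as the longest music segment, any piece can cover any music segment once it is trimmed or retimed.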
Step S230: select a preset number of video segments from the multiple video segments as target video segments.
In this embodiment, the number of target video segments selected may equal the number of music segments into which the music is divided, so that the target video segments and the music segments correspond one to one.
Step S240: using a filling algorithm, calculate the filling value of each target video segment according to its weight and filling position and, according to the filling value of each target video segment, fill the target video segments between the corresponding cut points of the target music, so that the filled target video segments and the music segments are matched to form a new video file with the maximum overall value.
In this embodiment, the filling algorithm is an editing filling algorithm. It can be understood that the editing filling algorithm seeks a best-fit strategy for filling the target video segments between the cut points of the target music so that the overall value is maximized. The overall value can be understood as the quality of the edited video, for example, the degree of matching between video segments and music segments, the continuity between video segments, and so on.
Further, the overall value may be the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self-value and a discrete value. The self-value is associated with the weight of the corresponding target video segment. Specifically, the self-value may be calculated by analyzing the feature information of each target video segment, the feature information comprising face information, multi-person scene information, smile information in faces, and manual marking information; and assigning, according to the feature information, a corresponding weight to each video segment as the self-value of that target video segment.
In this embodiment, a corresponding weight is set in advance for each type of feature information before weights are assigned. For example, if analysis of a video segment finds smile information in a face, the pre-stored weight for smile information is retrieved and used as the weight of that video segment. Of course, in other embodiments, the weight of a video segment can also be set manually. The size of the weight can be set according to the actual situation and is not specifically limited here.
Further, the feature information may also include other information, so as to enrich the types of video segments. For example, the feature information may also include animal image information, such as images of cats, dogs, and other animals, which is not described in detail here.
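The weight lookup described above can be sketched as a table of preset feature weights. The weights, the feature labels, and the rule for combining several detected features (a plain sum) are all assumptions made for illustration; the description only says that each type of feature information has a preset weight. Feature detection itself (face or smile recognition) is out of scope here.

```python
from typing import Dict, Iterable

# Hypothetical preset weights, one per type of feature information.
FEATURE_WEIGHTS: Dict[str, float] = {
    "face": 1.0,
    "multi_person": 2.0,
    "smile": 3.0,
    "manual_mark": 5.0,
}


def self_value(
    features: Iterable[str], weights: Dict[str, float] = FEATURE_WEIGHTS
) -> float:
    """Self-value of a segment: sum of the preset weights of its features.

    Unknown feature labels contribute nothing; duplicates count once.
    """
    return float(sum(weights.get(name, 0.0) for name in set(features)))
```

For example, a segment in which a face and a smile were detected would receive `self_value(["face", "smile"])`, which is 4.0 with the hypothetical table above.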
In this embodiment, the discrete value may take its value in one of two ways. If at least two target video segments come from the same target video, the discrete value is associated with the distance, within that target video, between the video segment corresponding to the target video segment and the video segments corresponding to the other target video segments. If the target video segments do not come from the same target video, the discrete value is a preset value. The preset value can be set according to the specific situation and is not specifically limited here.
Optionally, the minimum discrete value for target video segments that are not in the same target video may be set greater than the maximum self-value, so as to avoid two target video segments that are filled into adjacent music segments also being adjacent in the original target video. Based on this design, the visual effect of the resulting video file can be improved.
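One possible reading of the discrete value and the overall value is sketched below. The description does not give a formula, so the specifics are assumptions: a segment is identified by a hypothetical pair (source video index, segment index within that video), the "distance" is the number of segments lying between two segments of the same video, only the neighbours in filling order are considered, and the preset value also caps the distance-based value.

```python
from typing import Dict, List, Sequence, Tuple

Segment = Tuple[int, int]  # (source video index, segment index in that video)


def discrete_value(
    segment: Segment, neighbours: Sequence[Segment], preset: float = 10.0
) -> float:
    """Discrete value of a segment relative to its neighbours in fill order."""
    same_video = [n for n in neighbours if n[0] == segment[0]]
    if not same_video:
        return preset  # no neighbour from the same target video
    nearest = min(abs(segment[1] - n[1]) for n in same_video)
    gap = max(nearest - 1, 0)  # 0 when the segments were adjacent originally
    return min(float(gap), preset)


def overall_value(
    order: List[Segment], self_values: Dict[Segment, float], preset: float = 10.0
) -> float:
    """Sum of filling values (self-value + discrete value) over all slots."""
    total = 0.0
    for i, seg in enumerate(order):
        neighbours = order[max(i - 1, 0):i] + order[i + 1:i + 2]
        total += self_values[seg] + discrete_value(seg, neighbours, preset)
    return total
```

With a preset value larger than every self-value, an order that separates segments of the same source video scores higher than one that keeps them side by side, which matches the optional design described above.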
Further, step S240 may also include one or more sub-steps. For example, refer to Fig. 3, which is a flow diagram of the sub-steps of step S240 shown in Fig. 2. In this embodiment, step S240 may include sub-step S241 and sub-step S242.
Sub-step S241: using a greedy approximation algorithm, iteratively calculate the self-value and discrete value of each video segment taken as a target video segment, so as to obtain the overall values of multiple corresponding video files.
Sub-step S242: select, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
In this embodiment, a dynamic programming algorithm for solving the 0-1 knapsack problem may be used to obtain the filling mode with the maximum overall value. A greedy approximation algorithm may then be used: the calculation is iterated, each result is compared with the previous one, the result with the smaller overall value is discarded, and the video file with the larger overall value is taken as the result of the current calculation. Through this iterative calculation, the filling mode corresponding to the maximum overall value can finally be selected, and the video file with the maximum overall value is obtained.
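The iterate-and-keep-the-larger idea of sub-steps S241 and S242 can be sketched with a simple greedy search. This is not the algorithm of the patent: the scoring is a simplified assumption in which each adjacent pair in filling order contributes one discrete value, and `pair_value`, `greedy_fill`, and `best_fill` are hypothetical names. It does not implement the 0-1 knapsack dynamic programming mentioned above.

```python
from typing import Dict, List, Tuple

Segment = Tuple[int, int]  # (source video index, segment index in that video)


def pair_value(a: Segment, b: Segment, preset: float) -> float:
    """Discrete value between two segments adjacent in filling order."""
    if a[0] != b[0]:
        return preset  # different source videos
    return min(float(max(abs(a[1] - b[1]) - 1, 0)), preset)


def greedy_fill(
    self_values: Dict[Segment, float], slots: int, start: Segment, preset: float
) -> Tuple[List[Segment], float]:
    """Fill the slots one by one, always taking the best next segment."""
    order, total = [start], self_values[start]
    remaining = set(self_values) - {start}
    while len(order) < slots and remaining:
        prev = order[-1]
        best = max(
            sorted(remaining),  # sorted for a deterministic tie-break
            key=lambda s: self_values[s] + pair_value(prev, s, preset),
        )
        total += self_values[best] + pair_value(prev, best, preset)
        order.append(best)
        remaining.remove(best)
    return order, total


def best_fill(
    self_values: Dict[Segment, float], slots: int, preset: float = 10.0
) -> Tuple[List[Segment], float]:
    """Try every starting segment; keep the run with the larger overall value."""
    best_order: List[Segment] = []
    best_total = float("-inf")
    for start in sorted(self_values):
        order, total = greedy_fill(self_values, slots, start, preset)
        if total > best_total:  # discard the smaller result
            best_order, best_total = order, total
    return best_order, best_total
```

A greedy search of this kind is fast but not guaranteed to find the true maximum, which is why the description frames it as an approximation.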
Further, the video segments between the corresponding cut points of the target music are filled in order of decreasing weight, subject to the discreteness condition being met. Understandably, provided that video segments filled into adjacent positions in the target music are not adjacent in the original target video, the target video segments can be filled in order of decreasing weight.
Further, before the target video segments are filled in, the method may also include correcting the length of each target video segment so that it equals the length of the corresponding music segment in the target music.
Further, a target video segment can be trimmed so that its length equals the length of the music segment between the corresponding adjacent cut points. A target video segment can also be sped up or slowed down so that its length equals the length of the music segment. Based on this design, the target music in the resulting video file is continuous and the video playback is continuous as well, which improves the experience of users watching the video file and thus the quality of the video file.
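The two length corrections mentioned above, trimming and changing the playback speed, reduce to simple arithmetic. The sketch below is illustrative only; the function names are hypothetical and the trimming variant assumes the start of the segment is kept.

```python
from typing import Tuple


def fit_by_trimming(start: float, end: float, target: float) -> Tuple[float, float]:
    """Trim the segment [start, end) to at most `target` seconds, keeping its start."""
    return start, min(end, start + target)


def speed_factor(segment_length: float, target: float) -> float:
    """Playback-rate multiplier that makes the segment last `target` seconds.

    A value above 1 speeds the segment up; a value below 1 slows it down.
    """
    if segment_length <= 0 or target <= 0:
        raise ValueError("lengths must be positive")
    return segment_length / target
```

For instance, a 3-second segment that must cover a 2-second music segment can either be trimmed to its first 2 seconds or played at 1.5 times normal speed.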
Fig. 4 is a block diagram of the audio-video matching editing device 100 provided by a preferred embodiment of the present invention. The preferred embodiment also provides an audio-video matching editing device 100, which may include an acquiring unit 110, a cutting unit 120, a selection unit 130 and a video composition unit 140.
The acquiring unit 110 is configured to obtain target music labeled in advance with multiple cut points, the target music being divided into multiple music segments by the cut points. Specifically, the acquiring unit 110 may be used to execute step S210 shown in Fig. 2; for the specific operation, refer to the detailed description of step S210.
The cutting unit 120 is configured to cut at least one obtained target video into multiple video clips according to the durations of the music segments. Specifically, the cutting unit 120 may be used to execute step S220 shown in Fig. 2; for the specific operation, refer to the detailed description of step S220.
The selection unit 130 is configured to select a predetermined number of video clips from the multiple video clips as target video segments. Specifically, the selection unit 130 may be used to execute step S230 shown in Fig. 2; for the specific operation, refer to the detailed description of step S230.
The video composition unit 140 is configured to use a filling algorithm to calculate the filling value of each target video segment according to its weight and filling position and, according to the filling value of each target video segment, to insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments are matched to form the new video file with the maximum overall value. Specifically, the video composition unit 140 may be used to execute step S240 shown in Fig. 2; for the specific operation, refer to the detailed description of step S240.
Further, the video composition unit 140 may also be used to execute sub-step S241 and sub-step S242 shown in Fig. 3; for the specific operation, refer to the detailed description of sub-step S241 and sub-step S242, which is not repeated here.
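As a rough illustration of what the video composition unit does in sub-steps S241 and S242, the sketch below builds several candidate arrangements greedily and keeps the one with the largest overall value. The patent states only that the discrete value is "associated with" the distance between clips of the same target video; the concrete formula here (nearest index distance, capped at a preset value), the constant `PRESET_DISCRETE` and the `Clip` fields are assumptions chosen for the example.

```python
from dataclasses import dataclass

PRESET_DISCRETE = 3.0  # hypothetical preset discrete value


@dataclass(frozen=True)
class Clip:
    source: str        # which target video the clip came from
    index: int         # position of the clip inside that video
    self_value: float  # weight derived from the clip's feature information


def discrete_value(clip, others):
    """Assumed form: distance to the nearest other chosen clip from the same
    video, capped at the preset; the preset itself if there is none."""
    gaps = [abs(clip.index - o.index)
            for o in others if o.source == clip.source and o != clip]
    if not gaps:
        return PRESET_DISCRETE
    return float(min(min(gaps), PRESET_DISCRETE))


def overall_value(chosen):
    """Overall value: sum of filling values (self-value + discrete value)."""
    return sum(c.self_value + discrete_value(c, chosen) for c in chosen)


def greedy_candidate(clips, slot_count, start):
    """S241 (sketch): grow one arrangement by always adding the clip with
    the highest filling value relative to the clips already chosen."""
    chosen = [start]
    pool = [c for c in clips if c != start]
    while len(chosen) < slot_count and pool:
        best = max(pool, key=lambda c: c.self_value + discrete_value(c, chosen))
        chosen.append(best)
        pool.remove(best)
    return chosen


def best_video(clips, slot_count):
    """S242 (sketch): keep the candidate with the maximum overall value."""
    candidates = [greedy_candidate(clips, slot_count, s) for s in clips]
    return max(candidates, key=overall_value)
```

Starting one greedy run from every clip is only one way to obtain "multiple corresponding video files"; it keeps the search cheap compared with trying every permutation, at the cost of not guaranteeing the true optimum.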
In conclusion the present invention provides a kind of audio-video matching clipping method and device.The method is by by cutting Multiple video clip filling are labeled in the target music of multiple cut points, obtain video file, and entirety is selected to fix the price value most It is big as new video file, simplify the editing operation of audio-video, while also improving the quality of the video of editing.It is described Method is worth according to the filling that the weight and filling position of target video segment calculate the video clip, and according to each piece of video The filling value of section chooses the maximum filling mode of overall value so that the target video segment is inserted the target music phase Between corresponding cut point, make the target video segment of filling and the new video of the integral Maximum Value of snatch of music match group File.The method and device can make video clip and the corresponding music rhythm phase of target music in the video file of editing Match, while improving the quality of editing video, additionally aids the experience sense for promoting user.
The foregoing is only a preferred embodiment of the present invention and is not intended to limit the invention. For those skilled in the art, the invention may be modified and varied in various ways. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within the protection scope of the present invention.

Claims (8)

1. An audio-video matching editing method, characterized in that the method comprises:
obtaining target music labeled in advance with multiple cut points, the target music being divided into multiple music segments by the cut points;
cutting at least one obtained target video into multiple video clips according to the durations of the music segments;
selecting a predetermined number of video clips from the multiple video clips as target video segments;
using a filling algorithm, calculating the filling value of each target video segment according to the weight and filling position of the target video segment and, according to the filling value of each target video segment, inserting the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments are matched to form a new video file with the maximum overall value;
wherein the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self-value and a discrete value;
the self-value is calculated by:
analyzing the characteristic information of each target video segment, the characteristic information comprising face information, multi-person scene information, smile information in faces and manual annotation information;
assigning, according to the characteristic information, a corresponding weight to each video clip as the self-value of the target video segment;
the discrete value is calculated as follows:
if at least two target video segments are in the same target video, the discrete value is associated with the distance, within the target video, between the video clip corresponding to the target video segment and the video clips corresponding to the other target video segments;
if the target video segments are not in the same target video, the discrete value is a preset value.
2. The method according to claim 1, characterized in that the step of calculating the filling value of each target video segment according to the weight and filling position of the target video segment and, according to the filling value of each target video segment, inserting the target video segments between the corresponding cut points of the target music comprises:
using a greedy approximation algorithm, iteratively calculating the self-value and the discrete value of each video clip taken as a target video segment, to obtain the overall values of multiple corresponding video files;
selecting, from the multiple overall values, the video file corresponding to the maximum overall value as the new video file.
3. The method according to claim 2, characterized in that the video clips inserted between the corresponding cut points of the target music are filled in descending order of weight, subject to the discreteness condition.
4. The method according to claim 1, characterized in that, before the step of obtaining the target music labeled in advance with multiple cut points, the method further comprises:
extracting sound amplitude information in a preset frequency domain from the target music;
selecting time points at which the amplitude surges in the preset frequency domain as the cut points, such that the interval duration between adjacent cut points exceeds a preset duration.
5. The method according to claim 1, characterized in that the step of cutting at least one obtained target video into multiple video clips according to the durations of the music segments comprises:
selecting the duration of the longest music segment as the duration of the cut video clips.
6. The method according to claim 1, characterized in that the number of video segments inserted into the target music equals the number of music segments of the target music.
7. The method according to claim 1, characterized in that the step of inserting the target video segments between the corresponding cut points of the target music comprises:
correcting the length of each target video segment so that the length of the target video segment equals the length of the corresponding music segment in the target music.
8. An audio-video matching editing device, characterized by comprising:
an acquiring unit, configured to obtain target music labeled in advance with multiple cut points, the target music being divided into multiple music segments by the cut points;
a cutting unit, configured to cut at least one obtained target video into multiple video clips according to the durations of the music segments;
a selection unit, configured to select a predetermined number of video clips from the multiple video clips as target video segments;
a video composition unit, configured to use a filling algorithm to calculate the filling value of each target video segment according to the weight and filling position of the target video segment and, according to the filling value of each target video segment, to insert the target video segments between the corresponding cut points of the target music, so that the inserted target video segments and the music segments are matched to form a new video file with the maximum overall value;
wherein the overall value is the sum of the filling values of the target video segments, and the filling value of a target video segment comprises a self-value and a discrete value;
the video composition unit calculates the self-value by:
analyzing the characteristic information of each target video segment, the characteristic information comprising face information, multi-person scene information, smile information in faces and manual annotation information;
assigning, according to the characteristic information, a corresponding weight to each video clip as the self-value of the target video segment;
the video composition unit calculates the discrete value as follows:
if at least two target video segments are in the same target video, the discrete value is associated with the distance, within the target video, between the video clip corresponding to the target video segment and the video clips corresponding to the other target video segments;
if the target video segments are not in the same target video, the discrete value is a preset value.
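The cut-point selection recited in claim 4 (amplitude surges in a preset frequency domain, with a minimum interval between cut points) can be sketched as below. The input is assumed to be an already extracted amplitude envelope for the preset band; the surge criterion (a ratio to the previous sample), the parameter names and the default thresholds are illustrative assumptions and do not come from the patent.

```python
def pick_cut_points(times, amplitudes, surge_ratio=1.5, min_gap=1.0):
    """Return the time points where the amplitude surges, keeping only those
    that lie more than min_gap seconds after the previously kept cut point.

    times       -- sample times in seconds, ascending
    amplitudes  -- amplitude of the preset frequency band at each time
    surge_ratio -- assumed surge test: current > previous * surge_ratio
    min_gap     -- preset duration that adjacent cut points must exceed
    """
    cut_points = []
    for i in range(1, len(amplitudes)):
        prev, cur = amplitudes[i - 1], amplitudes[i]
        is_surge = cur > 0 and cur > prev * surge_ratio
        far_enough = not cut_points or times[i] - cut_points[-1] > min_gap
        if is_surge and far_enough:
            cut_points.append(times[i])
    return cut_points
```

Enforcing the minimum gap while scanning left to right keeps the earliest surge in each burst, which is one simple way to satisfy the interval condition.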
CN201710701832.XA 2017-08-16 2017-08-16 Audio-video matches clipping method and device Active CN107483843B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710701832.XA CN107483843B (en) 2017-08-16 2017-08-16 Audio-video matches clipping method and device

Publications (2)

Publication Number Publication Date
CN107483843A CN107483843A (en) 2017-12-15
CN107483843B true CN107483843B (en) 2019-11-15

Family

ID=60600510

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710701832.XA Active CN107483843B (en) 2017-08-16 2017-08-16 Audio-video matches clipping method and device

Country Status (1)

Country Link
CN (1) CN107483843B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101107667A (en) * 2004-12-17 2008-01-16 诺基亚公司 Method and apparatus for video editing on small screen with minimal input device
CN101640057A (en) * 2009-05-31 2010-02-03 北京中星微电子有限公司 Audio and video matching method and device therefor
EP2993668A1 (en) * 2014-09-08 2016-03-09 Thomson Licensing Method for editing an audiovisual segment and corresponding device and computer program product
CN105530440A (en) * 2014-09-29 2016-04-27 北京金山安全软件有限公司 Video production method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7027124B2 (en) * 2002-02-28 2006-04-11 Fuji Xerox Co., Ltd. Method for automatically producing music videos

Also Published As

Publication number Publication date
CN107483843A (en) 2017-12-15

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant