WO2021098670A1 - Video generation method and apparatus, electronic device, and computer-readable medium - Google Patents

Video generation method and apparatus, electronic device, and computer-readable medium

Info

Publication number
WO2021098670A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
image
segment
sequence
music
Prior art date
Application number
PCT/CN2020/129284
Other languages
English (en)
French (fr)
Inventor
王亚
姜维
郑起凡
付平非
Original Assignee
北京字节跳动网络技术有限公司
Priority date
Filing date
Publication date
Application filed by 北京字节跳动网络技术有限公司
Priority to KR1020227016781A (published as KR20220103112A)
Priority to JP2022528542A (published as JP7457804B2)
Priority to BR112022009608A (published as BR112022009608A2)
Priority to EP20889011.1A (published as EP4047943A4)
Publication of WO2021098670A1
Priority to US17/744,671 (published as US11636879B2)

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • G11B27/036Insert-editing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7834Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using audio features
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • G10L21/055Time compression or expansion for synchronising with other signals, e.g. video signals
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • G11B27/034Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/414Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/41407Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44016Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8106Monomedia components thereof involving special audio data, e.g. different tracks for different languages
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/85406Content authoring involving a specific file format, e.g. MP4 format

Definitions

  • The embodiments of the present disclosure relate to the field of computer technology, and in particular to video generation methods, apparatuses, electronic devices, and computer-readable media.
  • Video processing technology is also advancing rapidly, and more and more people use video to convey information or share moments of their lives.
  • Video processing software, as commonly used software on terminals, has been widely applied in various scenarios. In many cases, users need to edit video, music, and other materials to create a video.
  • Some embodiments of the present disclosure propose video generation methods, apparatuses, electronic devices, and computer-readable media to solve the technical problems mentioned in the background section above.
  • In a first aspect, some embodiments of the present disclosure provide a method for generating a video, the method including: acquiring a first image collection and an audio material, the first image collection including a plurality of image materials; determining a first music point of the audio material according to the number of image materials in the first image collection, where the first music point is used to divide the audio material into a plurality of first music fragments, and the number of first music fragments is the same as the number of image materials in the first image collection; generating, according to the order of the image materials in the first image collection, a video segment for each first music fragment in the audio material using one image material, to obtain a first video sequence, where each first music fragment and its corresponding video segment have the same duration; in response to detecting an editing operation performed on a video segment in the first video sequence, adjusting the video segments in the first video sequence to obtain a second video sequence; and splicing the video segments in the second video sequence together and adding the audio material as the video sound track to obtain a synthesized video.
  • In a second aspect, some embodiments of the present disclosure provide a video generation apparatus, the apparatus including: an acquisition unit configured to acquire a first image collection and an audio material, the first image collection including a plurality of image materials; a determining unit configured to determine a first music point of the audio material according to the number of image materials in the first image collection, where the first music point is used to divide the audio material into a plurality of first music fragments, and the number of first music fragments is the same as the number of image materials in the first image collection; a generating unit configured to generate, according to the order of the image materials in the first image collection, a video segment for each first music fragment in the audio material using one image material, to obtain a first video sequence, where each first music fragment and its corresponding video segment have the same duration; an adjusting unit configured to adjust the video segments in the first video sequence, in response to detecting an editing operation performed on a video segment in the first video sequence, to obtain a second video sequence; and a splicing unit configured to splice the video segments in the second video sequence together and add the audio material as the video sound track to obtain a synthesized video.
  • In a third aspect, an embodiment of the present application provides an electronic device, including: one or more processors; and a storage device storing one or more programs which, when executed by the one or more processors, cause the one or more processors to implement the method described in any implementation of the first aspect.
  • an embodiment of the present application provides a computer-readable medium on which a computer program is stored, and when the computer program is executed by a processor, the method as described in any of the implementation manners in the first aspect is implemented.
  • In a further aspect, some embodiments of the present application provide a computer program including program code which, when executed, implements the method described in any one of the above aspects.
  • One of the above embodiments of the present disclosure has the following beneficial effects: by dividing the audio material at music points, the duration of each video segment in the synthesized video can be determined, so that the image materials can be fitted one by one into the video segments of the synthesized video. This reduces the time users spend processing image and audio materials and makes editing easier.
  • In addition, allowing each video segment of the synthesized video to be adjusted reduces the difficulty of adjustment for the user.
  • FIGS. 1A-1B are schematic diagrams of an application scenario of a video generation method according to some embodiments of the present disclosure;
  • FIG. 2 is a flowchart of some embodiments of a video generation method according to the present disclosure;
  • FIG. 3A is a schematic diagram of a rotation operation on a video clip according to some embodiments of the present disclosure;
  • FIG. 3B is a schematic diagram of adjusting the arrangement order of video clips according to some embodiments of the present disclosure;
  • FIG. 4 is a schematic diagram of a deletion adjustment of a video clip according to some embodiments of the present disclosure;
  • FIG. 5 is a schematic diagram of an addition adjustment of a video clip according to some embodiments of the present disclosure;
  • FIG. 6A is a schematic diagram of an automatic optimization adjustment operation on a video clip according to some embodiments of the present disclosure;
  • FIG. 6B is a schematic diagram of a manual optimization adjustment operation on a video clip according to some embodiments of the present disclosure;
  • FIG. 7 is a schematic diagram of a rotation adjustment operation on a video clip according to some embodiments of the present disclosure;
  • FIG. 8 is a schematic structural diagram of some embodiments of a video generation apparatus according to the present disclosure;
  • FIG. 9 is a schematic structural diagram of an electronic device suitable for implementing some embodiments of the present disclosure.
  • FIGS. 1A-1B are schematic diagrams of an application scenario of a video generation method according to some embodiments of the present application.
  • the user can select multiple image materials on the upload page 1017 of the terminal device 101. For example, upload the image material 1011-1014 shown on page 1017.
  • the user clicks the position shown in the selection box 1015 to select the image materials 1011-1013.
  • The audio material 106 is divided into music fragments A, B, and C according to the music point 107 and the music point 108.
  • The image materials 1011-1013 are respectively processed according to the durations of the obtained music fragments A-C.
  • A first video sequence 109 composed of the video segments 1031, 1041, and 1051 is obtained.
  • The terminal device 101 then splices the video clips 1031, 1041, and 1061 in the second video sequence 110 according to the times at which the music fragments A-C appear in the audio material 106, and adds the audio material 106 as the audio track of the spliced video to obtain the synthesized video 111.
  • The video generation method may be executed by the terminal device 101, by a server, or by various software programs.
  • the terminal device 101 may be, for example, various electronic devices with a display screen, including but not limited to smart phones, tablet computers, e-book readers, laptop computers, desktop computers, and so on.
  • the execution subject may also be embodied as a server, software, and so on.
  • When the execution subject is software, it can be installed in the electronic devices listed above. It can be implemented, for example, as multiple pieces of software or software modules for providing distributed services, or as a single piece of software or a software module. No specific limitation is imposed here.
  • The numbers of terminal devices and servers in FIG. 1 are merely illustrative; there may be any number of terminal devices and servers according to implementation needs.
  • the video generation method includes the following steps:
  • Step 201 Obtain a first image collection and audio materials.
  • the execution subject of the video generation method may obtain the image material and the audio material through a wired connection or a wireless connection.
  • the above-mentioned first image set includes a plurality of image materials.
  • the aforementioned image material may be a video or picture stored locally by the user, or may be a video or picture downloaded by the user from the Internet.
  • the above audio material may be music stored locally by the user, or music on the Internet.
  • Step 202 Determine the first music point of the audio material according to the number of image materials in the first image collection.
  • the above-mentioned execution subject may first determine the first music point of the audio material.
  • the music point may be a point in the audio material that satisfies the set tempo change condition. Then, the above-mentioned execution subject can select a target number of music points from the candidate music points that have been obtained.
  • the above-mentioned target quantity is usually determined according to the quantity of the above-mentioned image materials acquired. As an example, when 10 image materials are acquired, 9 music points can be determined.
  • the music point is a position in the audio material that meets the set musicality change.
  • the position where the musicality changes may include the position where the tempo changes and the position where the melody changes.
  • The music points can be determined in the following way: the execution subject analyzes the audio material to determine beat points and note onsets, where a beat point is a position where the beat changes and a note onset is a position where the melody changes.
  • On the one hand, a deep-learning-based beat analysis algorithm can be used to analyze the audio material to obtain the beat points in the audio material and the timestamps at which they are located; on the other hand, a short-time spectral analysis is performed on the audio material to obtain the note onsets in the audio material, which can be obtained by an onset detector. Then the beat points and note onsets obtained by the two methods are unified, merged, and de-duplicated to obtain the candidate music points.
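  • As an illustration only, the candidate-point computation described above could be sketched in Python with the librosa audio analysis library; the helper names, the 0.1-second de-duplication resolution, and the even-spacing selection policy below are assumptions rather than part of the disclosure:

```python
# Hedged sketch: beat tracking plus onset detection give candidate music points;
# N image materials then need N - 1 of them (e.g. 10 materials -> 9 points).
import numpy as np
import librosa

def candidate_music_points(audio_path):
    y, sr = librosa.load(audio_path)
    _, beat_frames = librosa.beat.beat_track(y=y, sr=sr)    # beat points
    onset_frames = librosa.onset.onset_detect(y=y, sr=sr)   # note onsets (melody changes)
    times = np.concatenate([librosa.frames_to_time(beat_frames, sr=sr),
                            librosa.frames_to_time(onset_frames, sr=sr)])
    # Unify, merge, and de-duplicate the two kinds of points (0.1 s resolution assumed).
    return np.unique(np.round(times, 1))

def select_music_points(candidates, num_materials, total_duration):
    # Assumed policy: keep the candidates closest to an even spacing of the audio.
    ideal = np.linspace(0.0, total_duration, num_materials + 1)[1:-1]
    chosen = {candidates[np.abs(candidates - t).argmin()] for t in ideal}
    return sorted(chosen)
```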
  • Step 203 According to the sequence of the image materials in the first image collection, a video segment is generated for each first music segment in the audio material by using one image material to obtain a first video sequence.
  • In some embodiments, the execution subject may, according to the order of the image materials in the first image collection, generate for each music fragment a video segment with the same duration as that music fragment, thereby obtaining the first video sequence.
  • the corresponding first music segment and video segment have the same duration.
  • the first video sequence usually refers to a sequence composed of generated video segments.
  • The order of the above sequence may be the order in which the user uploads the image materials, or it may be an order randomly assigned by the execution subject.
  • As an example, if the durations of the 3 music fragments are 1 second, 2 seconds, and 3 seconds respectively, the durations of the corresponding video fragments can also be 1 second, 2 seconds, and 3 seconds respectively.
  • The first segment can correspond to the first image material in the first image collection, and the second segment can correspond to the second image material in the first image collection. The order of the image materials in the first image collection may be, for example, the order selected by the user.
  • When the duration of a video material is greater than the duration of the first music fragment, a video clip with the same duration as the first music fragment is intercepted from the video material; when the duration of the video material is less than the duration of the first music fragment, the original video material is subjected to variable-speed processing to lengthen its duration, and the speed-changed video material is then used as the video clip, so that the duration of the video clip equals the duration of the music fragment.
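  • A minimal sketch of this duration-matching step, assuming moviepy 1.x; the fixed interception start point and the slow-down policy are assumptions:

```python
# Hedged sketch: trim a long material, or slow a short one down, so that the
# resulting clip's duration equals the duration of its first music fragment.
from moviepy.editor import VideoFileClip
from moviepy.video.fx.all import speedx

def clip_for_fragment(material_path, fragment_duration, start=0.0):
    material = VideoFileClip(material_path)
    if material.duration >= fragment_duration:
        # Material is longer: intercept a sub-clip of the same duration.
        return material.subclip(start, start + fragment_duration)
    # Material is shorter: variable-speed processing lengthens it (factor < 1 slows playback).
    return speedx(material, factor=material.duration / fragment_duration)
```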
  • For a picture material among the image materials, a variety of implementations can be used to generate a video clip from the picture material.
  • For example, the video clip may be generated by adding a motion effect to the picture material.
  • The motion effect may be a foreground motion effect randomly added to the picture material.
  • The foreground motion effect can be a dynamic animation effect added to the picture, for example, an animation effect of rain added to a picture. Adding dynamic effects to the picture material can make it more visually appealing and improve the user's visual experience.
  • Step 204 In response to detecting an editing operation performed on the video segment in the first video sequence, adjust the video segment in the first video sequence to obtain a second video sequence.
  • When the executing agent detects that the user edits a video segment in the first video sequence, it can adjust the corresponding video segment according to the detected editing operation to obtain the second video sequence.
  • the foregoing adjustment operation may be a video editing operation such as an adjustment operation of the playback speed of a video clip in the first video sequence.
  • the foregoing playback speed adjustment operation may be a click operation on a preset playback speed control or an adjustment operation on the playback speed of the video clip by triggering a preset gesture.
  • the aforementioned playback speed adjustment may be to play the video clip at a high-speed playback speed, or to play the video clip at a low-speed playback speed.
  • When the executing agent detects that the user performs a rotation operation on the video segment 2 in the first video sequence, it may rotate the video segment 2 to obtain a rotated video segment.
  • the above-mentioned rotation operation may be an operation of clicking a preset video clip rotation control or dragging or rotating the video clip.
  • As an example, the rotation operation may be the drag rotation operation 3042 on the video clip 3041 in the interface 3040 displayed by the electronic device 303 as shown in FIG. 3A, after which the rotated video clip 3051 is obtained in the interface 3050 displayed by the electronic device 303.
  • the execution subject may replace the video segment 2 in the first video sequence with the rotated video segment to obtain the second video sequence.
  • In response to detecting an adjustment operation on the arrangement order of the video clips in the first video sequence, the execution subject may adjust the arrangement order of the image materials in the first image collection according to the adjusted arrangement order of the video clips, to obtain a second image collection.
  • The execution subject may then, according to the order of the image materials in the second image collection, generate a video segment for each first music fragment in the audio material using one image material, to obtain the second video sequence, where each first music fragment and its corresponding video segment have the same duration.
  • As an example, the execution subject cuts out video clips a, b, and c of 1 s, 2 s, and 3 s according to the order of the image materials A, B, and C. When a sequence adjustment operation that changes the order of the video clips to b, c, a is detected, the arrangement order of the corresponding image materials is adjusted to B, C, A. Video clips b', c', and a' of 1 s, 2 s, and 3 s are then cut out respectively, and the second video sequence composed of the video clips b', c', and a' is obtained (see the sketch below).
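  • A hedged sketch of that reorder example, reusing the clip_for_fragment helper assumed above; the file names are illustrative only:

```python
# Hedged sketch: the fragment durations stay attached to their positions (1 s, 2 s, 3 s),
# so reordering the materials from A-B-C to B-C-A re-cuts clips b', c', a'.
def rebuild_sequence(materials_in_order, fragment_durations):
    return [clip_for_fragment(path, d) for path, d in zip(materials_in_order, fragment_durations)]

first_video_sequence = rebuild_sequence(["A.mp4", "B.mp4", "C.mp4"], [1, 2, 3])   # a, b, c
second_video_sequence = rebuild_sequence(["B.mp4", "C.mp4", "A.mp4"], [1, 2, 3])  # b', c', a'
```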
  • The execution subject may control an electronic device with a display function to display a first display interface of the first video sequence. In response to detecting, in the first display interface, an adjustment operation that moves the identifier of a video segment in the first video sequence from a first sorting position to a second sorting position, the video segment is moved to the second sorting position, thereby determining the order of the adjusted video segments; the arrangement order of the image materials in the first image collection is then adjusted according to the order of the adjusted video segments.
  • the above-mentioned identifiers may be identifiers such as preview images, screenshots, text descriptions, etc. of the video clips.
  • the above-mentioned first display page may be the page 3010 displayed by the electronic device 301 as shown in FIG. 3B.
  • The execution subject may move the video clip 3023 from the first sorting position C to the second sorting position B and determine the order of the adjusted video clips. After that, the execution subject may adjust the arrangement order of the image materials 3031-3033 in the first image collection 303 according to the order of the adjusted video clips, to obtain the second image collection 304.
  • In response to detecting a deletion operation for a first video segment in the first video sequence, the first image material is deleted from the first image collection to obtain a third image collection, where the first video segment is generated based on the first image material; a second music point of the audio material is determined according to the number of image materials in the third image collection, where the second music point is used to divide the audio material into a plurality of second music fragments, and the number of second music fragments is the same as the number of image materials in the third image collection; according to the order of the image materials in the third image collection, a video segment is generated for each second music fragment in the audio material using one image material, to obtain the second video sequence, where each second music fragment and its corresponding video segment have the same duration.
  • When the execution subject detects a deletion operation for the first video segment b in the first video sequence abc, it deletes the image material B used to generate the first video segment b from the first image collection ABC, obtaining the third image collection AC. The execution subject then determines the second music point of the audio material based on the number (2) of image materials in the third image collection AC, and divides the audio material into two second music fragments. Finally, according to the order of the image material A and the image material C in the third image collection AC, a video segment d and a video segment e are generated for the two second music fragments, and the second video sequence de is obtained.
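  • One possible shape of that recomputation, reusing the select_music_points and clip_for_fragment helpers sketched earlier; the names and the carried-over variables candidates and total_duration are assumptions. The same helper also serves the addition case described further below.

```python
# Hedged sketch: deleting material B leaves the third image collection {A, C};
# the music points are re-determined for 2 materials and clips d, e are generated.
import numpy as np

def recompute_sequence(materials, candidates, total_duration):
    points = select_music_points(candidates, len(materials), total_duration)
    bounds = [0.0] + list(points) + [total_duration]
    durations = np.diff(bounds)                      # one music fragment per material
    return [clip_for_fragment(m, d) for m, d in zip(materials, durations)]

second_video_sequence = recompute_sequence(["A.mp4", "C.mp4"], candidates, total_duration)
```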
  • The execution subject may control an electronic device with a display function to display a second display interface of the first video sequence. In response to detecting, in the second display interface, a selection operation on the first video segment in the first video sequence, a first interception operation interface for intercepting a video segment from the first image material is displayed; in response to detecting a deletion operation in the first interception operation interface, the first image material is deleted.
  • the above-mentioned selection operation may be an operation of clicking the first video segment, or an operation such as a long press.
  • the aforementioned interception operation interface may be an operation interface used by a user to intercept the aforementioned first image material.
  • the foregoing deletion operation may be clicking a preset control, or may be related operations such as dragging the foregoing first image material.
  • the above-mentioned second display page may be the page 4010 displayed by the electronic device 401 as shown in FIG. 4.
  • The first interception operation interface 4020 for intercepting a video segment from the first image material 4023 is displayed, and when a click on the deletion control 4022 in the first interception operation interface 4020 is detected, the first image material 4023 is deleted.
  • In response to detecting an adding operation for a video segment in the first video sequence, a second image material is obtained and added to the first image collection to obtain a fourth image collection.
  • A third music point of the audio material is determined according to the number of image materials in the fourth image collection, where the third music point is used to divide the audio material into a plurality of third music fragments, and the number of third music fragments is the same as the number of image materials in the fourth image collection. According to the order of the image materials in the fourth image collection, a video segment is generated for each third music fragment in the audio material using one image material, to obtain the second video sequence, where each third music fragment and its corresponding video segment have the same duration.
  • As an example, a second image material D is obtained and added to the first image collection ABC to obtain the fourth image collection ABCD. The third music point of the audio material is determined according to the number (4) of image materials in the fourth image collection ABCD, and the audio material is divided into four third music fragments. A video segment is then generated for each third music fragment using one image material, to obtain the second video sequence.
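  • Under the same assumptions, the addition case can reuse the recompute_sequence helper from the deletion sketch:

```python
# Hedged sketch: adding material D gives the fourth image collection {A, B, C, D},
# so the audio is re-divided into four third music fragments before re-cutting.
second_video_sequence = recompute_sequence(["A.mp4", "B.mp4", "C.mp4", "D.mp4"],
                                           candidates, total_duration)
```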
  • The execution subject may display a third display interface of the first video sequence. In response to detecting, in the third display interface, an operation of adding a video clip to the first video sequence, a browsing interface of image materials is displayed; in response to detecting a selection operation on the second image material in the browsing interface, the second image material is acquired and added to the first image collection to obtain the fourth image collection.
  • the foregoing third display page may be a page 5010 displayed by the electronic device 501 as shown in FIG. 5.
  • the browsing interface 5020 of the image material is displayed.
  • When the selection operation on the second image material 5023 in the browsing interface 5020 is detected, the second image material is acquired and added to the first image collection to obtain the fourth image collection.
  • the video clips in the first video sequence are cropped from the image materials in the first image set at a preset starting point position.
  • In response to detecting an automatic optimization operation for the second video segment in the first video sequence, multiple initial video clips with different starting point positions are cropped from a third image material for the music fragment corresponding to the second video segment, where the music fragment corresponding to the second video segment has the same duration as each of the initial video clips, and the second video segment is generated based on the third image material.
  • Frames are extracted from the multiple initial video clips, and the quality of the multiple initial video clips is analyzed according to the extracted frames.
  • The third video segment with the highest quality is selected from the multiple initial video clips, and the second video segment is replaced with the third video segment in the first video sequence to obtain the second video sequence.
  • the above-mentioned automatic optimization operation usually refers to the operation of the user clicking a preset automatic optimization control.
  • The quality usually refers to a comprehensive score obtained by scoring the frames extracted from the video clip.
  • The comprehensive score may be the average or the highest of the scores of the extracted frames.
  • The scoring may be based on motion information (such as jitter), aesthetics (such as composition), or attributes (such as light and color) of the frames.
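  • A minimal sketch of one way this frame-extraction scoring could be realized, using OpenCV and a Laplacian-variance sharpness measure as a stand-in for the comprehensive score; real jitter, composition, and light/color scoring would need dedicated models, so everything below is an assumption:

```python
# Hedged sketch: sample a few frames from each candidate clip, score each frame,
# and pick the candidate whose average score is highest as the "third video segment".
import cv2
import numpy as np

def frame_scores(video_path, num_frames=5):
    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    scores = []
    for idx in np.linspace(0, max(total - 1, 0), num_frames, dtype=int):
        cap.set(cv2.CAP_PROP_POS_FRAMES, int(idx))
        ok, frame = cap.read()
        if not ok:
            continue
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        scores.append(cv2.Laplacian(gray, cv2.CV_64F).var())  # sharpness proxy
    cap.release()
    return scores

def best_candidate(candidate_paths):
    # Comprehensive score = average of the frame scores (the maximum could be used instead).
    return max(candidate_paths, key=lambda p: np.mean(frame_scores(p) or [0.0]))
```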
  • The execution subject may display a fourth display interface of the second video segment in the first video sequence, where the fourth display interface includes an automatic optimization control used to trigger automatic optimization. In response to detecting an automatic optimization operation on the automatic optimization control in the fourth display interface, multiple initial video clips with different starting point positions are cropped from the third image material for the music fragment corresponding to the second video segment.
  • the aforementioned automatic optimization control may be a button or a preset control for triggering gestures.
  • the fourth display page that displays the second video segment 6011 in the first video sequence may be the page 6010 displayed by the electronic device 601 as shown in FIG. 6A.
  • For the music fragment 6015 corresponding to the second video segment 6011, multiple initial video clips 6016-6018 with different starting point positions are cropped from the third image material 6014.
  • When the execution subject detects a manual optimization operation for the fourth video segment in the first video sequence, it can determine the cropping interval selected by the manual optimization operation in the fourth image material. Afterwards, according to the cropping interval, a fifth video segment is cropped from the fourth image material. Finally, in the first video sequence, the fourth video segment is replaced with the fifth video segment to obtain the second video sequence.
  • The execution subject may display a fifth display interface of the first video sequence. In response to detecting, in the fifth display interface, a selection operation on the fourth video segment, a second interception operation interface for intercepting a video segment from the fourth image material is displayed; in response to detecting a selection operation on an interception interval in the second interception operation interface, the cropping interval selected in the fourth image material is determined.
  • the fifth display page for displaying the first video sequence 6021 may be the page 6020 displayed by the electronic device 602 as shown in FIG. 6B.
  • The second interception operation interface 6030 for intercepting a video segment from the fourth image material 6033 is displayed, and when a selection operation on the interception interval 6034-6035 in the second interception operation interface is detected, the cropping interval 6034-6035 selected in the fourth image material is determined.
  • The execution subject may, in response to detecting a rotation operation for the sixth video segment in the first video sequence, rotate the sixth video segment to obtain a seventh video segment. In the first video sequence, the sixth video segment is replaced with the seventh video segment to obtain the second video sequence.
  • The execution subject may display a sixth display interface of the first video sequence. In response to detecting, in the sixth display interface, a selection operation on the sixth video segment, a rotation operation interface of the sixth video segment is displayed; in response to detecting a rotation operation for the sixth video segment in the rotation operation interface, the sixth video segment is rotated to obtain the seventh video segment.
  • the sixth display interface may be a page 7010 displayed on the electronic device 701.
  • the rotation operation interface 7020 of the sixth video segment 7014 is displayed.
  • When a click operation on the rotation control 7022 of the sixth video segment 7014 in the rotation operation interface 7020 is detected, the sixth video segment 7014 is rotated to obtain the seventh video segment.
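  • Under the same moviepy 1.x assumption, the rotation adjustment itself could be as small as the following; the variable names are illustrative:

```python
# Hedged sketch: rotating the sixth video segment yields the seventh video segment.
from moviepy.video.fx.all import rotate

seventh_video_segment = rotate(sixth_video_segment, 90)  # sixth_video_segment: a VideoClip; angle in degrees
```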
  • Step 205 Splice the video clips in the second video sequence together, and add the audio material as the video sound track to obtain a composite video.
  • The execution subject of the video generation method may splice the video fragments in the second video sequence corresponding to the music fragments together in the order in which the music fragments appear in the audio material, and add the audio material to the audio track of the spliced video to obtain the synthesized video.
  • As an example, the audio material can be divided into 3 segments in order according to the music points: segment A from 0 to 2 seconds, segment B from 2 to 5 seconds, and segment C from 5 to 10 seconds.
  • The corresponding video segments in the second video sequence are segment a, segment b, and segment c, respectively, so the spliced video can be expressed as abc.
  • The audio material is then added to the audio track of the spliced video abc to obtain the synthesized video (a sketch follows below).
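  • The splicing step could be sketched as follows, again assuming moviepy 1.x and illustrative names:

```python
# Hedged sketch: splice clips a, b, c in the order their music fragments appear,
# then attach the audio material as the sound track of the spliced video abc.
from moviepy.editor import AudioFileClip, concatenate_videoclips

def synthesize(ordered_clips, audio_path, output_path="composite.mp4"):
    spliced = concatenate_videoclips(ordered_clips)                  # abc
    audio = AudioFileClip(audio_path).subclip(0, spliced.duration)   # 0-10 s in the example
    spliced.set_audio(audio).write_videofile(output_path)
```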
  • the above-mentioned mode generally refers to a mode in which a composite video is generated from a collection of audio materials and images, or a mode in which a composite video is obtained by splicing image materials in the image collection.
  • By dividing the audio material at music points, the duration of each video segment in the synthesized video can be determined, so that the image materials can be processed into the video segments of the synthesized video. This reduces the time users spend processing image and audio materials and makes editing easier.
  • In addition, allowing each video segment of the synthesized video to be adjusted reduces the difficulty of adjustment for the user.
  • The present disclosure provides some embodiments of a video generation apparatus. These apparatus embodiments correspond to the method embodiments shown in FIG. 2, and the apparatus can be applied to various electronic devices.
  • The video generation apparatus 800 of some embodiments includes: an acquiring unit 801, a determining unit 802, a generating unit 803, an adjusting unit 804, and a splicing unit 805.
  • the acquiring unit 801 is configured to acquire a first image set and audio materials, and the first image set includes multiple image materials;
  • The determining unit 802 is configured to determine the first music point of the audio material according to the number of image materials in the first image collection, where the first music point is used to divide the audio material into a plurality of first music fragments, and the number of first music fragments is the same as the number of image materials in the first image collection;
  • The generating unit 803 is configured to generate, according to the order of the image materials in the first image collection, a video segment for each first music fragment in the audio material using one image material, to obtain a first video sequence, where each first music fragment and its corresponding video segment have the same duration;
  • The adjusting unit 804 is configured to adjust the video segments in the first video sequence, in response to detecting an editing operation performed on a video segment in the first video sequence, to obtain a second video sequence;
  • the splicing unit 805 is configured to splice the video segments in the second video sequence together, and add the audio material as a video sound track to obtain a synthesized video.
  • In some optional implementations, the adjusting unit 804 of the video generation apparatus 800 is further configured to: in response to detecting an adjustment operation on the arrangement order of the video clips in the first video sequence, adjust the arrangement order of the image materials in the first image collection according to the adjusted arrangement order of the video clips to obtain a second image collection; and, according to the order of the image materials in the second image collection, generate a video segment for each first music fragment in the audio material using one image material, to obtain the second video sequence, where each first music fragment and its corresponding video segment have the same duration.
  • In some optional implementations, the video generation apparatus 800 may further include a first subunit configured to: display a first display interface of the first video sequence; in response to detecting, in the first display interface, an adjustment operation of moving the identifier of a video clip in the first video sequence from a first sorting position to a second sorting position, move the video clip to the second sorting position and determine the order of the adjusted video clips; and adjust the arrangement order of the image materials in the first image collection according to the order of the adjusted video clips.
  • In some optional implementations, the adjusting unit 804 of the video generation apparatus 800 is further configured to: in response to detecting a deletion operation for the first video segment in the first video sequence, delete the first image material from the first image collection to obtain a third image collection, where the first video segment is generated based on the first image material; determine a second music point of the audio material according to the number of image materials in the third image collection, where the second music point is used to divide the audio material into a plurality of second music fragments, and the number of second music fragments is the same as the number of image materials in the third image collection; and, according to the order of the image materials in the third image collection, generate a video segment for each second music fragment in the audio material using one image material, to obtain the second video sequence, where each second music fragment and its corresponding video segment have the same duration.
  • In some optional implementations, the video generation apparatus 800 may further include a second subunit configured to: display a second display interface of the first video sequence; in response to detecting, in the second display interface, a selection operation on the first video segment in the first video sequence, display a first interception operation interface for intercepting a video segment from the first image material; and, in response to detecting a deletion operation in the first interception operation interface, delete the first image material.
  • In some optional implementations, the adjusting unit 804 of the video generation apparatus 800 is further configured to: in response to detecting an adding operation for a video segment in the first video sequence, obtain a second image material and add it to the first image collection to obtain a fourth image collection; determine a third music point of the audio material according to the number of image materials in the fourth image collection, where the third music point is used to divide the audio material into a plurality of third music fragments, and the number of third music fragments is the same as the number of image materials in the fourth image collection; and, according to the order of the image materials in the fourth image collection, generate a video segment for each third music fragment in the audio material using one image material, to obtain the second video sequence, where each third music fragment and its corresponding video segment have the same duration.
  • In some optional implementations, the video generation apparatus 800 may further include a third subunit configured to: display a third display interface of the first video sequence; in response to detecting, in the third display interface, an operation of adding a video clip to the first video sequence, display a browsing interface of image materials; and, in response to detecting a selection operation on the second image material in the browsing interface, acquire the second image material and add it to the first image collection to obtain the fourth image collection.
  • the video clips in the first video sequence are cropped from the image materials in the first image set at a preset starting point position.
  • In some optional implementations, the adjusting unit 804 of the video generation apparatus 800 is further configured to: in response to detecting an automatic optimization operation for the second video segment in the first video sequence, crop multiple initial video clips with different starting point positions from the third image material for the music fragment corresponding to the second video segment, where the music fragment corresponding to the second video segment has the same duration as each of the initial video clips, and the second video segment is generated based on the third image material; extract frames from the multiple initial video clips respectively, and analyze the quality of the multiple initial video clips according to the extracted frames; select the third video segment with the highest quality from the multiple initial video clips; and replace the second video segment with the third video segment in the first video sequence to obtain the second video sequence.
  • In some optional implementations, the video generation apparatus 800 may further include a fourth subunit configured to: display a fourth display interface of the second video segment in the first video sequence, where the fourth display interface includes an automatic optimization control used to trigger automatic optimization; and, in response to detecting an automatic optimization operation on the automatic optimization control in the fourth display interface, crop multiple initial video clips with different starting point positions from the third image material for the music fragment corresponding to the second video segment.
  • In some optional implementations, the adjusting unit 804 of the video generation apparatus 800 is further configured to: in response to detecting a manual optimization operation for the fourth video segment in the first video sequence, determine the cropping interval selected by the manual optimization operation in the fourth image material; crop a fifth video segment from the fourth image material according to the cropping interval; and replace the fourth video segment with the fifth video segment in the first video sequence to obtain the second video sequence.
  • In some optional implementations, the video generation apparatus 800 may further include a fifth subunit configured to: display a fifth display interface of the first video sequence; in response to detecting, in the fifth display interface, a selection operation on the fourth video segment, display a second interception operation interface for intercepting a video segment from the fourth image material; and, in response to detecting a selection operation on an interception interval in the second interception operation interface, determine the cropping interval selected in the fourth image material.
  • In some optional implementations, the adjusting unit 804 of the video generation apparatus 800 is further configured to: in response to detecting a rotation operation for the sixth video segment in the first video sequence, rotate the sixth video segment to obtain a seventh video segment; and replace the sixth video segment with the seventh video segment in the first video sequence to obtain the second video sequence.
  • In some optional implementations, the video generation apparatus 800 may further include a sixth subunit configured to: display a sixth display interface of the first video sequence; in response to detecting, in the sixth display interface, a selection operation on the sixth video segment, display a rotation operation interface of the sixth video segment; and, in response to detecting a rotation operation for the sixth video segment in the rotation operation interface, rotate the sixth video segment to obtain the seventh video segment.
  • The video generation apparatus disclosed in some embodiments of the present disclosure can determine the duration of each video segment in the synthesized video by dividing the audio material at music points, so that the image materials can be processed into the video segments of the synthesized video. This reduces the time users spend processing image and audio materials and makes editing easier.
  • In addition, allowing each video segment of the synthesized video to be adjusted reduces the difficulty of adjustment for the user. Generating video clips from the image materials and then generating the synthesized video directly aligns each video clip with the beats of the audio material for the user, giving users diversified choices and improving the user experience.
  • FIG. 9 shows a schematic structural diagram of an electronic device (for example, the terminal device in FIG. 1) 900 suitable for implementing some embodiments of the present disclosure.
  • the electronic device shown in FIG. 9 is only an example, and should not bring any limitation to the function and scope of use of the embodiments of the present disclosure.
  • The electronic device 900 may include a processing device (such as a central processing unit, a graphics processor, etc.) 901, which can perform various appropriate actions and processing according to a program stored in a read-only memory (ROM) 902 or a program loaded from a storage device 908 into a random access memory (RAM) 903.
  • In the RAM 903, various programs and data required for the operation of the electronic device 900 are also stored.
  • the processing device 901, the ROM 902, and the RAM 903 are connected to each other through a bus 904.
  • An input/output (I/O) interface 905 is also connected to the bus 904.
  • The following devices can be connected to the I/O interface 905: an input device 906 including, for example, a touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; an output device 907 including, for example, a liquid crystal display (LCD), a speaker, a vibrator, etc.; and a communication device 909.
  • the communication device 909 may allow the electronic device 900 to perform wireless or wired communication with other devices to exchange data.
  • Although FIG. 9 shows an electronic device 900 having various devices, it should be understood that it is not required to implement or have all of the illustrated devices; more or fewer devices may alternatively be implemented or provided. Each block shown in FIG. 9 may represent one device, or may represent multiple devices as needed.
  • the process described above with reference to the flowchart may be implemented as a computer software program.
  • some embodiments of the present disclosure include a computer program product, which includes a computer program carried on a computer-readable medium, and the computer program contains program code for executing the method shown in the flowchart.
  • the computer program may be downloaded and installed from the network through the communication device 909, or installed from the storage device 908, or installed from the ROM 902.
  • When the computer program is executed by the processing device 901, the above-mentioned functions defined in the methods of some embodiments of the present disclosure are executed.
  • the aforementioned computer-readable medium in some embodiments of the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the two.
  • the computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or a combination of any of the above.
  • Computer-readable storage media may include, but are not limited to: an electrical connection with one or more wires, a portable computer disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
  • the computer-readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried.
  • This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • the computer-readable signal medium may also be any computer-readable medium other than the computer-readable storage medium.
  • the computer-readable signal medium may send, propagate, or transmit the program for use by or in combination with the instruction execution system, apparatus, or device.
  • the program code contained on the computer-readable medium can be transmitted by any suitable medium, including but not limited to: wire, optical cable, RF (Radio Frequency), etc., or any suitable combination of the above.
  • in some implementations, the client and the server may communicate using any currently known or future-developed network protocol such as HTTP (HyperText Transfer Protocol), and may be interconnected with digital data communication in any form or medium (for example, a communication network).
  • examples of communication networks include a local area network ("LAN"), a wide area network ("WAN"), an internetwork (for example, the Internet), and a peer-to-peer network (for example, an ad hoc peer-to-peer network), as well as any currently known or future-developed network.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or it may exist alone without being assembled into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs. When the one or more programs are executed by the electronic device, the electronic device is caused to: acquire a first image collection and an audio material, the first image collection including a plurality of image materials; determine, according to the number of image materials in the first image collection, first music points of the audio material, wherein the first music points are used to divide the audio material into a plurality of first music segments, and the number of first music segments is the same as the number of image materials in the first image collection; generate, according to the arrangement order of the image materials in the first image collection, one video segment for each first music segment in the audio material using one image material, to obtain a first video sequence, wherein a corresponding first music segment and video segment have the same duration; in response to detecting an editing operation performed on a video segment in the first video sequence, adjust the video segments in the first video sequence to obtain a second video sequence; and splice the video segments in the second video sequence together and add the audio material as a video soundtrack to obtain a composite video.
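To make the data flow concrete, the following is a minimal Python sketch of how the music points could partition the audio material into first music segments, and how each image material could be paired with a segment of matching duration. It is an illustration only: the helper names, the dataclass, and the example timings (a 10-second audio material with music points at 2 s and 5 s) are assumptions, not the claimed implementation.

```python
from dataclasses import dataclass

@dataclass
class MusicSegment:
    start: float  # seconds into the audio material
    end: float

    @property
    def duration(self) -> float:
        return self.end - self.start

def split_audio_by_music_points(audio_duration: float, music_points: list[float]) -> list[MusicSegment]:
    """Split [0, audio_duration] at the given music points (assumed sorted and strictly inside the audio)."""
    bounds = [0.0, *sorted(music_points), audio_duration]
    return [MusicSegment(s, e) for s, e in zip(bounds, bounds[1:])]

def pair_materials_with_segments(image_materials: list[str], audio_duration: float,
                                 music_points: list[float]) -> list[tuple[str, MusicSegment]]:
    """One video segment per image material; with (len(materials) - 1) music points,
    the number of first music segments equals the number of image materials."""
    segments = split_audio_by_music_points(audio_duration, music_points)
    assert len(segments) == len(image_materials)
    return list(zip(image_materials, segments))

# Example: 3 image materials, a 10 s audio material, music points at 2 s and 5 s.
plan = pair_materials_with_segments(["a.mp4", "b.jpg", "c.mp4"], 10.0, [2.0, 5.0])
for material, seg in plan:
    print(material, f"-> clip of {seg.duration:.1f} s, aligned to audio at {seg.start:.1f} s")
```

Each clip cut (or generated, in the case of a picture material) from an image material would then be constrained to the duration printed above, which is what keeps every video segment aligned with its first music segment.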
  • the computer program code used to perform the operations of some embodiments of the present disclosure can be written in one or more programming languages or a combination thereof.
  • the above-mentioned programming languages include object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages.
  • the program code may be executed entirely on the user's computer, partly on the user's computer, as a standalone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
  • each block in the flowchart or block diagram may represent a module, program segment, or part of code, and the module, program segment, or part of code contains one or more executable instructions for realizing the specified logical function.
  • the functions marked in the block may also occur in a different order from the order marked in the drawings. For example, two blocks shown in succession can actually be executed substantially in parallel, and they can sometimes be executed in the reverse order, depending on the functions involved.
  • each block in the block diagram and/or flowchart, and combinations of blocks in the block diagram and/or flowchart, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
  • the units described in some embodiments of the present disclosure may be implemented in software or hardware.
  • the described units may also be provided in a processor. For example, a processor may be described as including an acquisition unit, a determination unit, a generation unit, an adjustment unit, and a splicing unit.
  • the names of these units do not, under certain circumstances, constitute a limitation on the units themselves; for example, the acquisition unit may also be described as "a unit for acquiring the first image collection and the audio material".
  • exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGA), Application Specific Integrated Circuits (ASIC), Application Specific Standard Products (ASSP), Systems on Chip (SOC), Complex Programmable Logic Devices (CPLD), and so on.
  • according to one or more embodiments of the present disclosure, a video generation method is provided, including: acquiring a first image collection and an audio material, the first image collection including a plurality of image materials; determining, according to the number of image materials in the first image collection, first music points of the audio material, wherein the first music points are used to divide the audio material into a plurality of first music segments, and the number of first music segments is the same as the number of image materials in the first image collection; generating, according to the arrangement order of the image materials in the first image collection, one video segment for each first music segment in the audio material using one image material, to obtain a first video sequence, wherein a corresponding first music segment and video segment have the same duration; in response to detecting an editing operation performed on a video segment in the first video sequence, adjusting the video segments in the first video sequence to obtain a second video sequence; and splicing the video segments in the second video sequence together and adding the audio material as a video soundtrack to obtain a composite video.
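As a hedged sketch of the final splicing step, the already generated clips could be concatenated in order and the audio material attached as the sound track, here with ffmpeg's concat demuxer. The file names, the codec flags, and the choice of ffmpeg itself are assumptions; the disclosure does not prescribe a particular tool.

```python
import os
import subprocess
import tempfile

def splice_with_audio(clip_paths: list[str], audio_path: str, out_path: str) -> None:
    """Concatenate the clips of the second video sequence and use the audio material as the sound track."""
    # Write a concat list file for ffmpeg's concat demuxer.
    with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
        for p in clip_paths:
            f.write(f"file '{os.path.abspath(p)}'\n")
        list_path = f.name
    try:
        subprocess.run(
            ["ffmpeg", "-y",
             "-f", "concat", "-safe", "0", "-i", list_path,  # input 0: spliced video clips
             "-i", audio_path,                               # input 1: the audio material
             "-map", "0:v:0", "-map", "1:a:0",               # take video from input 0, audio from input 1
             "-c:v", "libx264", "-c:a", "aac",
             "-shortest", out_path],
            check=True)
    finally:
        os.remove(list_path)

# splice_with_audio(["seg_a.mp4", "seg_b.mp4", "seg_c.mp4"], "music.mp3", "composite.mp4")
```

Because every clip was generated to match the duration of its music segment, concatenating them in segment order keeps the cuts of the composite video on the music points.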
  • in some embodiments, adjusting the video segments in the first video sequence to obtain the second video sequence includes: in response to detecting an adjustment operation on the arrangement order of the video segments in the first video sequence, adjusting the arrangement order of the image materials in the first image collection according to the adjusted arrangement order of the video segments, to obtain a second image collection; and generating, according to the arrangement order of the image materials in the second image collection, one video segment for each first music segment in the audio material using one image material, to obtain the second video sequence, wherein a corresponding first music segment and video segment have the same duration.
  • in some embodiments, adjusting the arrangement order of the image materials in the first image collection to obtain the second image collection includes: displaying a first display interface of the first video sequence; in response to detecting, in the first display interface, an adjustment operation that moves the identifier of a video segment in the first video sequence from a first sorting position to a second sorting position, moving the video segment to the second sorting position and determining the adjusted arrangement order of the video segments; and adjusting the arrangement order of the image materials in the first image collection according to the adjusted arrangement order of the video segments.
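A minimal sketch (names assumed) of how a drag-to-reorder adjustment on the clip identifiers could be mirrored back onto the image materials to obtain the second image collection; the second video sequence would then be regenerated from this reordered collection against the same first music segments, keeping the durations matched.

```python
def move_item(order: list[int], src: int, dst: int) -> list[int]:
    """Move the element at index src to index dst, i.e. dragging one clip identifier to a new sorting position."""
    order = list(order)
    order.insert(dst, order.pop(src))
    return order

def reorder_materials(image_materials: list[str], src: int, dst: int) -> list[str]:
    """Mirror the clip reordering onto the first image collection to obtain the second image collection."""
    indices = move_item(list(range(len(image_materials))), src, dst)
    return [image_materials[i] for i in indices]

# Dragging the third clip (index 2) in front of the second clip (index 1):
second_collection = reorder_materials(["A.mp4", "B.jpg", "C.mp4"], src=2, dst=1)
print(second_collection)  # ['A.mp4', 'C.mp4', 'B.jpg']
# The second video sequence is then regenerated from second_collection,
# one clip per first music segment, as in the earlier pairing sketch.
```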
  • in some embodiments, adjusting the video segments in the first video sequence to obtain the second video sequence includes: in response to detecting a deletion operation on a first video segment in the first video sequence, deleting a first image material from the first image collection to obtain a third image collection, wherein the first video segment is generated based on the first image material; determining, according to the number of image materials in the third image collection, second music points of the audio material, wherein the second music points are used to divide the audio material into a plurality of second music segments, and the number of second music segments is the same as the number of image materials in the second image collection; and generating, according to the arrangement order of the image materials in the third image collection, one video segment for each second music segment in the audio material using one image material, to obtain the second video sequence, wherein a corresponding second music segment and video segment have the same duration.
  • in some embodiments, deleting the first image material from the first image collection includes: displaying a second display interface of the first video sequence; in response to detecting, in the second display interface of the first video sequence, a selection operation on the first video segment in the first video sequence, displaying a first capture operation interface for capturing a video segment from the first image material; and in response to detecting a deletion operation in the first capture operation interface, deleting the first image material.
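One plausible way to re-derive the music points after a material is deleted (or, symmetrically, after one is added as described next) is to re-select division points for the new material count from a pool of candidate points. This sketch assumes each candidate carries a salience score; the disclosure leaves the selection criteria open, so both the scoring and the helper name are assumptions.

```python
def select_music_points(candidates: list[tuple[float, float]], n_materials: int) -> list[float]:
    """Pick (n_materials - 1) division points from (time_seconds, salience) candidates,
    favouring the most salient ones, returned in time order."""
    need = max(n_materials - 1, 0)
    top = sorted(candidates, key=lambda c: c[1], reverse=True)[:need]
    return sorted(t for t, _ in top)

# After deleting one of four image materials, only two division points are needed:
candidates = [(1.9, 0.4), (2.0, 0.9), (5.0, 0.8), (7.5, 0.6)]
print(select_music_points(candidates, n_materials=3))  # [2.0, 5.0]
```

The second (or third) music segments then follow from these points exactly as in the earlier partitioning sketch, and the remaining materials are paired with them in order.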
  • in some embodiments, adjusting the video segments in the first video sequence to obtain the second video sequence includes: in response to detecting an addition operation for the video segments in the first video sequence, acquiring a second image material and adding it to the first image collection to obtain a fourth image collection; determining, according to the number of image materials in the fourth image collection, third music points of the audio material, wherein the third music points are used to divide the audio material into a plurality of third music segments, and the number of third music segments is the same as the number of image materials in the fourth image collection; and generating, according to the arrangement order of the image materials in the fourth image collection, one video segment for each third music segment in the audio material using one image material, to obtain the second video sequence, wherein a corresponding third music segment and video segment have the same duration.
  • in some embodiments, acquiring the second image material and adding it to the first image collection to obtain the fourth image collection includes: displaying a third display interface of the first video sequence; in response to detecting, in the third display interface, an addition operation for the video segments in the first video sequence, displaying a browsing interface of image materials; and in response to detecting, in the browsing interface, a selection operation on the second image material, acquiring the second image material and adding it to the first image collection to obtain the fourth image collection.
  • in some embodiments, the video segments in the first video sequence are cropped from the image materials in the first image collection at a preset starting point position.
  • in some embodiments, adjusting the video segments in the first video sequence to obtain the second video sequence includes: in response to detecting an automatic optimization operation on a second video segment in the first video sequence, cropping, from a third image material, multiple initial video segments with different starting point positions for the music segment corresponding to the second video segment, wherein the music segment corresponding to the second video segment has the same duration as each of the initial video segments, and the second video segment is generated based on the third image material; extracting frames from each of the multiple initial video segments and analyzing the quality of the multiple initial video segments according to the extracted frames; selecting, from the multiple initial video segments, a third video segment with the highest quality; and replacing the second video segment with the third video segment in the first video sequence to obtain the second video sequence.
  • in some embodiments, cropping, from the third image material, the multiple initial video segments with different starting point positions for the music segment corresponding to the second video segment includes: displaying a fourth display interface of the second video segment in the first video sequence, wherein the fourth display interface includes an automatic optimization control used to trigger automatic optimization; and in response to detecting, in the fourth display interface, an automatic optimization operation on the automatic optimization control, cropping, from the third image material, the multiple initial video segments with different starting point positions for the music segment corresponding to the second video segment.
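A hedged sketch of the automatic optimization path: several candidate clips are taken from the third image material at different starting points, frames are sampled from each, and the best-scoring candidate wins. OpenCV is an assumed dependency, and variance-of-Laplacian sharpness is only one plausible quality proxy; the disclosure mentions motion, aesthetic, and attribute criteria without fixing a formula.

```python
import cv2  # assumed dependency: pip install opencv-python

def clip_sharpness(path: str, start: float, duration: float, samples: int = 5) -> float:
    """Average variance-of-Laplacian over frames sampled from [start, start + duration] seconds."""
    cap = cv2.VideoCapture(path)
    scores = []
    for i in range(samples):
        t = start + duration * i / max(samples - 1, 1)
        cap.set(cv2.CAP_PROP_POS_MSEC, t * 1000.0)
        ok, frame = cap.read()
        if not ok:
            continue
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        scores.append(cv2.Laplacian(gray, cv2.CV_64F).var())
    cap.release()
    return sum(scores) / len(scores) if scores else 0.0

def best_start_point(material_path: str, material_duration: float,
                     segment_duration: float, n_candidates: int = 4) -> float:
    """Try several starting point positions within the third image material and keep the sharpest one.
    Every candidate keeps the same duration as the corresponding music segment."""
    slack = max(material_duration - segment_duration, 0.0)
    candidates = [slack * i / max(n_candidates - 1, 1) for i in range(n_candidates)]
    return max(candidates, key=lambda s: clip_sharpness(material_path, s, segment_duration))
```

The clip cropped at the returned starting point would play the role of the third video segment that replaces the second video segment in the sequence.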
  • in some embodiments, adjusting the video segments in the first video sequence to obtain the second video sequence includes: in response to detecting a manual optimization operation on a fourth video segment in the first video sequence, determining a cropping interval selected in a fourth image material by the manual optimization operation; cropping a fifth video segment from the fourth image material according to the cropping interval; and replacing the fourth video segment with the fifth video segment in the first video sequence to obtain the second video sequence.
  • in some embodiments, determining the cropping interval selected in the fourth image material by the manual optimization operation includes: displaying a fifth display interface of the first video sequence; in response to detecting, in the fifth display interface, a selection operation on the fourth video segment, displaying a second capture operation interface for capturing a video segment from the fourth image material; and in response to detecting, in the second capture operation interface, a selection operation on a capture interval, determining the cropping interval selected in the fourth image material.
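For the manual optimization path, once the cropping interval selected in the fourth image material is known, cutting the fifth video segment is a plain trim; the sketch below uses ffmpeg, but the tool choice, file names, and codec flags are assumptions. Replacing the fourth video segment in the sequence is then a list substitution, as in the reordering sketch above.

```python
import subprocess

def cut_segment(material_path: str, start: float, end: float, out_path: str) -> None:
    """Cut the selected [start, end] interval (seconds) from the fourth image material
    to obtain the fifth video segment."""
    subprocess.run(
        ["ffmpeg", "-y",
         "-ss", f"{start:.3f}",       # seek to the start of the selected cropping interval
         "-i", material_path,
         "-t", f"{end - start:.3f}",  # keep exactly the selected duration
         "-c:v", "libx264", "-c:a", "aac",
         out_path],
        check=True)

# cut_segment("material_4.mp4", start=3.2, end=6.2, out_path="segment_5.mp4")
```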
  • in some embodiments, adjusting the video segments in the first video sequence to obtain the second video sequence includes: in response to detecting a rotation operation on a sixth video segment in the first video sequence, rotating the sixth video segment to obtain a seventh video segment; and replacing the sixth video segment with the seventh video segment in the first video sequence to obtain the second video sequence.
  • in some embodiments, rotating the sixth video segment to obtain the seventh video segment includes: displaying a sixth display interface of the first video sequence; in response to detecting, in the sixth display interface, a selection operation on the sixth video segment, displaying a rotation operation interface of the sixth video segment; and in response to detecting, in the rotation operation interface, a rotation operation on the sixth video segment, rotating the sixth video segment to obtain the seventh video segment.
  • according to one or more embodiments of the present disclosure, a video generation apparatus is provided, including: an acquisition unit configured to acquire a first image collection and an audio material, the first image collection including a plurality of image materials; a determination unit configured to determine, according to the number of image materials in the first image collection, first music points of the audio material, wherein the first music points are used to divide the audio material into a plurality of first music segments, and the number of first music segments is the same as the number of image materials in the first image collection; a generation unit configured to generate, according to the arrangement order of the image materials in the first image collection, one video segment for each first music segment in the audio material using one image material, to obtain a first video sequence, wherein a corresponding first music segment and video segment have the same duration; an adjustment unit configured to, in response to detecting an editing operation performed on a video segment in the first video sequence, adjust the video segments in the first video sequence to obtain a second video sequence; and a splicing unit configured to splice the video segments in the second video sequence together and add the audio material as a video soundtrack to obtain a composite video.
  • according to one or more embodiments of the present disclosure, an electronic device is provided, including: one or more processors; and a storage device on which one or more programs are stored, wherein, when the one or more programs are executed by the one or more processors, the one or more processors are caused to implement the method described in any of the foregoing embodiments.
  • according to one or more embodiments of the present disclosure, a computer-readable medium is provided, on which a computer program is stored, wherein, when the program is executed by a processor, the method described in any of the above embodiments is implemented.
  • according to one or more embodiments of the present application, a computer program is provided, including program code; when a computer runs the computer program, the program code executes the method described in any of the above embodiments.

Abstract

Embodiments of the present disclosure disclose a video generation method and apparatus, an electronic device, and a computer-readable medium. A specific implementation of the method includes: acquiring a first image collection and an audio material; determining first music points of the audio material according to the number of image materials in the first image collection; generating, according to the arrangement order of the image materials in the first image collection, one video segment for each first music segment in the audio material using one image material, to obtain a first video sequence; in response to detecting an editing operation performed on a video segment in the first video sequence, adjusting the video segments in the first video sequence to obtain a second video sequence; and splicing the video segments in the second video sequence together and adding the audio material as a video soundtrack to obtain a composite video. This implementation enriches the ways in which a user can adjust a video and lowers the difficulty of adjustment.

Description

视频生成方法、装置、电子设备和计算机可读介质
本申请要求于2019年11月18日提交中国专利局、申请号为201911129727.9、申请名称为“视频生成方法、装置、电子设备和计算机可读介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本公开的实施例涉及计算机技术领域,具体涉及视频生成方法、装置、电子设备和计算机可读介质。
背景技术
随着多媒体技术的飞速发展,视频处理技术也在快速进步,越来越多的人通过视频来传输信息或是分享生活片段。同时,视频处理软件已经作为终端上的一种常用软件,广泛应用于各种场景。在许多情况下,用户往往需要使用视频、音乐等素材剪辑制作出一个视频。
但目前,用户在使用视频软件剪辑视频时往往需要花费大量的精力和时间来处理各种素材。可见,目前的视频剪辑方式对用户来说是不够简便的。
发明内容
本公开的内容部分用于以简要的形式介绍构思,这些构思将在后面的具体实施方式部分被详细描述。本公开的内容部分并不旨在标识要求保护的技术方案的关键特征或必要特征,也不旨在用于限制所要求的保护的技术方案的范围。
本公开的一些实施例提出了用于生成视频的方法、装置、电子设备计算机可读介质,来解决以上背景技术部分提到的技术问题。
第一方面,本公开的一些实施例提供了一种用于生成视频的方法,该方法包括:获取第一影像集合和音频素材,上述第一影像集合中包括多个影像素材;按照上述第一影像集合中影像素材的数量,确定上述音频素材的第一音乐点,其中,上述第一音乐点用于将上述音频素材划分成多个第一音乐片段,上述第一音乐片段的数量与上述第一影像集合中影像素材的数量相同;按照上述第一影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列;将上述第二视频序列中的视频片段拼接在一起,并添加上述音频素材作为视频音轨,得到合成视频。
第二方面,本公开的一些实施例提供了一种视频生成装置,装置包括:获取单元,被配置成获取第一影像集合和音频素材,上述第一影像集合中包括多个影像素材;确定单元,被配置成按照上述第一影像集合中影像素材的数量,确定上述音频素材的第一音乐点,其中,上述第一音乐点用于将上述音频素材划分成多个第一音乐片段,上 述第一音乐片段的数量与上述第一影像集合中影像素材的数量相同;生成单元,被配置成按照上述第一影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;调整单元,被配置成响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列;拼接单元,被配置成将上述第二视频序列中的视频片段拼接在一起,并添加上述音频素材作为视频音轨,得到合成视频。
第三方面,本申请实施例提供了一种电子设备,该网络设备包括:一个或多个处理器;存储装置,用于存储一个或多个程序;当一个或多个程序被一个或多个处理器执行,使得一个或多个处理器实现如第一方面中任一实现方式描述的方法。
第四方面,本申请实施例提供了一种计算机可读介质,其上存储有计算机程序,该计算机程序被处理器执行时实现如第一方面中任一实现方式描述的方法。
第五方面,本申请的一些实施例提供了一种计算机程序,包括程序代码,当计算机运行所述计算机程序时,所述程序代码执行实现如第一、二方面中任一的方法。
本公开的上述各个实施例中的一个实施例具有如下有益效果:通过对音频素材进行音乐点的划分,能够确定合成视频中的一个个视频片段的时长,从而使得影像素材能够被处理成合成视频中的一个个视频片段,这样就减少了用户处理影像素材和音频素材的时间,使得剪辑更简便。而通过对合成视频的一个个视频片段进行调整可以降低用户的调整难度。利用影像素材生成一个个视频片段,进而生成合成视频,为用户直接实现一个个视频片段与音频素材的卡点,给用户多样化的选择,进而提升用户体验。
附图说明
结合附图并参考以下具体实施方式,本公开各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同或相似的附图标记表示相同或相似的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。
图1A-1B是根据本公开的一些实施例的视频生成方法的一个应用场景的示意图;
图2是根据本公开的视频生成方法的一些实施例的流程图;
图3A是根据本公开的一些实施例的针对视频片段的旋转操作的示意图;
图3B是根据本公开的一些实施例的对视频片段的排列顺序进行调整操作的示意图;
图4是根据本公开的一些实施例的对视频片段进行删除调整操作的示意图;
图5是根据本公开的一些实施例的对视频片段进行增加调整操作的示意图;
图6A是根据本公开的一些实施例的对视频片段进行自动优化调整操作的示意图;
图6B是根据本公开的一些实施例的对视频片段进行手动优化调整操作的示意图;
图7是根据本公开的一些实施例的对视频片段进行旋转调整操作的示意图;
图8是根据本公开的视频生成装置的一些实施例的结构示意图;
图9是适于用来实现本公开的一些实施例的电子设备的结构示意图。
具体实施方式
下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施例,然而应当理解的是,本公开可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例。相反,提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是,本公开的附图及实施例仅用于示例性作用,并非用于限制本公开的保护范围。
另外还需要说明的是,为了便于描述,附图中仅示出了与有关发明相关的部分。在不冲突的情况下,本公开中的实施例及实施例中的特征可以相互组合。
需要注意,本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。
需要注意,本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。
本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。
下面将参考附图并结合实施例来详细说明本申请。
图1A-1B是根据本申请一些实施例的视频生成方法的一个应用场景的示意图。
在如图1A的应用场景中所示,首先,用户可以在终端设备101的上传页面1017上选择多条影像素材。例如,上传页面1017中所示的影像素材1011-1014。用户单击选择框1015所示的位置,选中影像素材1011-1013。用户点击“下一步”按键1016,上述终端设备101根据选中的影像素材1011-1013的数量(图中影像素材数量102示出为3),确定获取到的音频素材106中的音乐点107和音乐点108。根据音乐点107和音乐点108将音频素材106划分成音乐片段A、B和C。根据得到的音乐片段A-C的时长分别对影像素材1011-1013进行处理。得到由视频片段1031、1041和1051组成的第一视频序列。
之后,跳转到如图1B所示的终端设备101的视频编辑页面1018中进行视频编辑。例如,在视频编辑页面1018中示出了由视频片段1031、视频片段1041和视频片段1051组成的第一视频序列109。用户对视频编辑页面1018中所示的第一视频序列109中视频片段1051的播放速度调整按键1019的点击操作。之后,用户点击“下一步”按键1020,上述终端设备101对第一视频序列109中的视频片段1051进行播放速度调整,得到视频片段1061,再将视频片段1031、1041和1061组成第二视频序列110。最后,上述终端设备101将第二视频序列110中的视频片段1031、1041和1061按照音乐片段A-C在音频素材106中出现的时间进行拼接,并添加音频素材106作为拼接后视频的音轨,得到合成视频111。
可以理解的是,视频生成方法可以是由终端设备101来执行,或者也可以是由服务器来执行,或者还可以是各种软件程序来执行。其中,终端设备101例如可以是具有显示屏的各种电子设备,包括但不限于智能手机、平板电脑、电子书阅读器、膝上型便携计算机和台式计算机等等。此外,执行主体也可以体现为服务器、软件等。当执行主体为软件时,可以安装在上述所列举的电子设备中。其可以实现成例如用来提 供分布式服务的多个软件或软件模块,也可以实现成单个软件或软件模块。在此不做具体限定。
应该理解,图1中的终端设备、服务器的数目仅仅是示意性的。根据实现需要,可以具有任意数目的终端设备和服务器。
继续参考图2,示出了根据本公开的视频生成方法的一些实施例的流程200。该视频生成方法,包括以下步骤:
步骤201,获取第一影像集合和音频素材。
在一些实施例中,视频生成方法的执行主体(例如,图1所示的终端设备101)可以通过有线连接方式或者无线连接方式,获取影像素材和音频素材。其中,上述第一影像集合包括多个影像素材。
作为示例,上述影像素材可以是用户存储在本地的视频或图片,还可以是用户从网上下载的视频或图片。上述音频素材可以是用户存储在本地的音乐,也可以是网络上的音乐。
步骤202,按照第一影像集合中影像素材的数量,确定音频素材的第一音乐点。
在一些实施例中,上述执行主体可以首先确定音频素材的第一音乐点。在这里,音乐点可以是音频素材中满足设定的节拍变换条件的点。然后,上述执行主体可以从已经得到的各个候选音乐点中选取出目标数量的音乐点。上述目标数量通常是根据获取的上述影像素材的数量来确定。作为示例,当获取到10个影像素材,可以确定9个音乐点。
作为又一示例,当音乐点为音频素材中满足设定的音乐性发生变换的位置。上述音乐性发生变换的位置可以包括节拍发生变换的位置和旋律发生变换的位置。基于此,音乐点可以通过如下方式来确定:上述执行主体可以对上述音频素材进行分析,确定其中的节拍点和音符起始点,其中,节拍点为节拍发生变换的位置,音符起始点为旋律发生变换的位置。具体地,一方面可以采用基于深度学习的节拍分析算法对音频素材进行分析,得到音频素材中的节拍点以及节拍点所在的时间戳,另一方面对音频素材进行短时频谱分析,得到音频素材中的音符起始点以及音符起始点所在的时间戳。在这里,音符起始点可以是通过起始点检测器(onset detector)得到。然后,统一通过两种方式得到的节拍点和音符起始点,对节拍点和音符起始点进行合并及去重,从而得到候选音乐点。
步骤203,按照第一影像集合中影像素材的排列顺序,为音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列。
在一些实施例中,针对音频素材中的每一个第一音乐片段,上述执行主体可以按照上述第一影像集合中影像素材的排列顺序,为该音乐片段生成一个与该音乐片段时长相同的视频片段,得到第一视频序列。其中,相对应的第一音乐片段和视频片段具有相同的时长。在这里,第一视频序列通常是指由生成的视频片段构成的序列。上述序列的排列顺序可以是用户上传影像素材的顺序,也可以是由上述执行主体随机排序。
作为示例,假设音乐素材被划分成3个音乐片段,3个音乐片段的时长分别是1秒、2秒和3秒时,那么与上述音乐片段相对应的视频片段的时长也可以分别是1秒、 2秒和3秒。
作为另一示例,如果音乐素材被划分为第一段和第二段,那么第一段可以对应第一影像集合中的第一个影像素材,第二段可以对应第一影像集合中的第二个影像素材。其中,第一影像集合中影像素材的顺序例如可以是用户选择的顺序。
作为又一示例,当影像素材的时长大于第一音乐片段的时长时,在该影像素材中截取与第一音乐片段的时长相等的视频片段,而在该影像素材的时长小于第一音乐片段的时长时,则对该原影像素材进行变速处理来加长时长,再将变速后的影像素材作为视频片段,使视频片段的时长与音乐片段的时长相等。
可以理解的是,对于影像素材中的图片素材,多种实现方式可以用于将图片素材生成视频片段。例如,可以是对图片素材添加动效后生成视频片段。上述动效可以是给图片素材随机添加的前景动效。前景动效可以是在图片上添加的动态的动画效果。例如,给图片添加下雨的动画效果。其中,给图片素材增加动效可以让图片素材在视觉上更加优美,提高用户的视觉效果。
步骤204,响应于检测到针对第一视频序列中的视频片段进行的编辑操作,对第一视频序列中的视频片段进行调整,得到第二视频序列。
在一些实施例中,当执行主体检测到用户对第一视频序列中的视频片段的编辑操作时,可以根据检测到的编辑操作对相应的视频片段进行调整并得到第二视频序列。
作为示例,上述调整操作可以是对第一视频序列中的视频片段的播放速度调整操作等视频编辑操作。在这里,上述播放速度调整操作可以是对预设的播放速度控件的点击操作或通过触发预设的手势对视频片段播放速度的调整操作等。上述播放速度调整可以是将视频片段以高倍速的播放速度进行播放也可以是将视频片段以低倍速的播放速度进行播放等方式进行播放速度调整。
作为另一示例,当执行主体检测到用户对第一视频序列中的视频片段2进行的旋转操作,上述执行主体可以将上述视频片段2进行旋转得到旋转后的视频片段。在这里,上述旋转操作可以是点击预设的视频片段旋转控件操作或对视频片段的拖动、旋转等操作。
作为又一示例,上述旋转操作可以是针对如图3A所示的电子设备303所显示的界面3040中视频片段3041的拖动旋转操作3042,之后,得到电子设备303中所显示的界面3050中的旋转后的视频片段3051。
而后,执行主体可以将第一视频序列中的视频片段2替换为旋转后的视频片段,得到第二视频序列。
在一些实施例的一些可选的实现方式中,响应于检测到针对上述第一视频序列中视频片段的排列顺序的调整操作,上述执行主体可以按照调整后视频片段的排列顺序,对上述第一影像集合中影像素材的排列顺序进行调整,得到第二影像集合。而后,上述执行主体可以按照上述第二影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长。
作为示例,上述执行主体按照影像素材ABC的排列顺序分别剪出1s、2s和3s的视频片段abc后,当检测到调整视频片段排列顺序为bca的排列顺序调整操作,将对 应的影像素材的排列顺序调整为影像素材BCA的排列顺序。之后,按照影像素材BCA的排列顺序分别剪出1s、2s和3s的视频片段b`c`a`,得到由视频片段b`c`a`组成的第二视频片段。
在一些实施例的一些可选的实现方式中,执行主体可以控制具有显示功能的电子设备显示上述第一视频序列的第一展示界面,响应于检测到在上述第一展示界面中针对上述第一视频序列中视频片段的标识从第一排序位置移动至第二排序位置的调整操作,将上述视频片段移动至第二排序位置,进而确定调整后视频片段的排列顺序。而后,按照上述调整后视频片段的排列顺序,对上述第一影像集合中影像素材的排列顺序进行调整。在这里,上述标识可以是视频片段的预览影像、截图、文本说明等标识。
作为示例,上述第一展示页面可以是如图3B所示的电子设备301显示的页面3010。当检测到用户针对第一视频序列302中视频片段3023从第一排序位置C移动至第二排序位置B的调整操作3011时,上述执行主体可以将视频片段3023从第一排序位置C移动到第二排序位置B,并确定调整后的视频片段的排列顺序。之后上述执行主体可以按照调整后的视频片段的排列顺序,对第一影像集合303中的影像素材3031-3033的排列顺序进行调整,得到第二影像集合304。
在一些实施例的一些可选的实现方式中,响应于检测到针对上述第一视频序列中的第一视频片段的删除操作,在上述第一影像集合中删除第一影像素材,得到第三影像集合,其中,上述第一视频片段是基于上述第一影像素材生成的;按照上述第三影像集合中影像素材的数量,确定上述音频素材的第二音乐点,其中,上述第二音乐点用于将上述音频素材划分成多个第二音乐片段,上述第二音乐片段的数量与上述第二影像集合中影像素材的数量相同;按照上述第三影像集合中影像素材的排列顺序,为上述音频素材中的每个第二音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第二音乐片段和视频片段具有相同的时长。
作为示例,上述执行主体检测到针对上述第一视频序列abc中的第一视频片段b的删除操作时,在第一影像素材集合ABC中删除用于生成第一视频片段b的影像素材B,得到第三影像集合AC。而后,执行主体基于第三影像集合AC中影像素材的数量2确定上述音频素材的第二音乐点,进而将上述音频素材划分为2个第二音乐片段。最后按照第三影像集合AC中影像素材A和影像素材C的排列顺序为2个第二音乐片段生成视频片段d和视频片段e,得到第二视频序列de。
在一些实施例的一些可选的实现方式中,执行主体可以控制具有显示功能的电子设备显示上述第一视频序列的第二展示界面。响应于检测到在上述第一视频序列的第二展示界面中针对上述第一视频序列中的第一视频片段的选择操作,显示在上述第一影像素材中截取视频片段的第一截取操作界面。响应于检测到在上述第一截取操作界面中的删除操作,删除上述第一影像素材。在这里,上述选择操作可以是点击第一视频片段的操作,也可以是长按等操作。上述截取操作界面可以是用户用于截取上述第一影像素材的操作界面。上述删除操作可以是点击预先设置的控件,也可以是将上述第一影像素材拖动等相关操作。
作为示例,上述第二展示页面可以是如图4所示的电子设备401显示的页面4010。当检测到在上述第二展示页面4010针对上述第一视频序列4011中第一视频片段4014 的选择操作时,显示在上述第一影像素材4023中截取视频片段的第一截取操作界面4020,当检测到在上述第一截取操作界面4020中的对删除控件4022的点击操作时,删除上述第一影像素材4023。
在一些实施例的一些可选的实现方式中,响应于检测到针对上述第一视频序列中视频片段的增加操作,获取第二影像素材并添加到上述第一影像集合中,得到第四影像集合。按照上述第四影像集合中影像素材的数量,确定上述音频素材的第三音乐点。其中,上述第三音乐点用于将上述音频素材划分成多个第三音乐片段。上述第三音乐片段的数量与上述第四影像集合中影像素材的数量相同。按照上述第四影像集合中影像素材的排列顺序,为上述音频素材中的每个第三音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列。其中,相对应的第三音乐片段和视频片段具有相同的时长。
作为示例,上述执行主体检测到针对上述第一视频序列abc中视频片段的增加操作时,获取第二影像素材D并添加到第一影像集合ABC中,得到第四影像集合ABCD。然后,按照第四影像集合ABCD中影像素材的数量4确定上述音频素材的第三音乐点,进而将上述音频素材划分为四个第三音乐片段。之后,按照上述第四影像集合ABCD中影像素材A、B、C和D的排列顺序,为音频素材中的每个第三音乐片段分别利用一个影像素材生成一个视频片段,得到第二视频序列。
在一些实施例的一些可选的实现方式中,上述执行主体可以显示上述第一视频序列的第三展示界面。响应于检测到在上述第三展示界面中针对上述第一视频序列中视频片段的增加操作,显示影像素材的浏览界面。响应于检测到在上述浏览界面中针对上述第二影像素材的选择操作,获取上述第二影像素材并添加到上述第一影像集合中,得到第四影像集合。
作为示例,上述第三展示页面可以是如图5所示的电子设备501显示的页面5010。当检测到在上述第三展示页面5010中针对第一视频序列5011中视频片段的增加控件5015的点击操作时,显示影像素材的浏览界面5020。当检测到在上述浏览界面5020中针对上述第二影像素材5023的选择操作时,获取上述第二影像素材并添加到第一影像集合中,得到第二影像集合。
在一些实施例的一些可选的实现方式中,上述第一视频序列中的视频片段是从上述第一影像集合中的影像素材中以预设起始点位置裁剪出的。
在一些实施例的一些可选的实现方式中,响应于检测到针对上述第一视频序列中的第二视频片段的自动优化操作,为上述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段。其中,上述第二视频片段对应的音乐片段与每个上述初始视频片段具有相同的时长,上述第二视频片段是基于上述第三影像素材生成的。分别对上述多个初始视频片段进行抽帧,并按照抽帧得到的图像分析上述多个初始视频片段的质量。从上述多个初始视频片段中选取质量最高的第三视频片段。在上述第一视频序列中将上述第二视频片段替换为上述第三视频片段,得到上述第二视频序列。在这里,上述自动优化操作通常是指用户点击预设的自动优化控件的操作等。上述质量通常是指在视频片段中将抽帧得到的图像进行评分所得到的综合分数。上述综合分数可以是抽帧得到的图像评分的平均分或最高分等。上述评分 方式可以是根据帧中的运动信息(如抖动等)、美学(构图等)或属性(光色等)进行评分。
在一些实施例的一些可选的实现方式中,上述执行主体可以显示上述第一视频序列中第二视频片段的第四展示界面。其中,上述第四展示界面包括自动优化控件,上述自动优化控件用于触发自动优化。响应于检测到在上述第四展示界面中针对上述自动优化控件的自动优化操作,为上述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段。在这里,上述自动优化控件可以是一个按键或预先设定的用于触发手势的控件等。
作为示例,上述显示第一视频序列中第二视频片段6011的第四展示页面可以是如图6A所示的电子设备601显示的页面6010。当检测到在上述第四展示界面6010中针对自动优化控件6013的点击操作,为上述第二视频片段6011对应的音乐片段6015利用第三影像素材6014以不同的起始点位置裁剪多个初始视频片段6016-6018。
在一些实施例的一些可选的实现方式中,当执行主体检测到针对上述第一视频序列中的第四视频片段的手动优化操作时,可以确定出上述手动优化操作在上述第四影像素材中选中的裁剪区间。之后,按照上述裁剪区间,从上述第四影像素材中裁剪出第五视频片段。最后,在上述第一视频序列中将上述第四视频片段替换为上述第五视频片段,得到上述第二视频序列。
在一些实施例的一些可选的实现方式中,上述执行主体可以显示上述第一视频序列的第五展示界面。响应于检测到在上述第五展示界面中针对上述第四视频片段的选择操作,显示在上述第四影像素材中截取视频片段的第二截取操作界面。响应于检测到在上述第二截取操作界面中截取区间的选择操作,确定在上述第四影像素材中选中的裁剪区间。
作为示例,上述显示第一视频序列6021第五展示页面可以是如图6B所示的电子设备602显示的页面6020。当检测到在上述第五展示界面6020中针对上述第四视频片段6024的选择操作时,显示在上述第四影像素材6033中截取视频片段的第二截取操作界面6030,当检测到在上述第二截取操作界面中截取区间6035至6034的选择操作时,确定在上述第四影像素材中选中的裁剪区间6034-6035。
在一些实施例的一些可选的实现方式中,上述执行主体可以响应于检测到针对上述第一视频序列中第六视频片段的旋转操作,将上述第六视频片段进行旋转,得到第七视频片段。在上述第一视频序列中将上述第六视频片段替换为上述第七视频片段,得到上述第二视频序列。
在一些实施例的一些可选的实现方式中,上述执行主体可以显示上述第一视频序列的第六展示界面。响应于检测到在上述第六展示界面中针对第六视频片段的选择操作,显示上述第六视频片段的旋转操作界面。响应于检测到在上述旋转操作界面中针对上述第六视频片段的旋转操作,将上述第六视频片段进行旋转,得到第七视频片段。
作为示例,上述第六展示界面可以是在电子设备701显示的页面7010,当检测到针对第六视频片段7014的选择操作时,显示上述第六视频片段7014的旋转操作界面7020。当检测到在上述选择操作界面7020中针对上述第六视频片段7014的旋转控件7022的点击操作时,将上述第六视频片段7014进行旋转,得到第七视频片段。
步骤205,将第二视频序列中的视频片段拼接在一起,并添加音频素材作为视频音轨,得到合成视频
视频生成方法的执行主体可以根据上述音频素材中上述音乐片段出现的顺序将上述与上述音乐片段对应的第二视频序列中的视频片段依次拼接在一起,并且在拼接而成的视频的音轨中添加上述音频素材,得到合成视频。
作为示例,可以根据音乐点将上述音频素材按照顺序划分成3段,例如,A段可以是从0秒到2秒,B段可以是从2秒到5秒,C段可以是从5秒到10秒。对应的第二视频序列中的视频片段分别是a段,b段,c段。那么拼接而成的视频可以表示为abc。将上述音频素材添加到拼接而成的视频abc的音轨中,得到合成视频。
需要说明的是,上述针对上述第一视频序列中的视频片段进行的编辑,在转换模式时,上述针对上述第一视频序列中的视频片段进行的编辑会进行保留。
作为示例,若对视频序列中视频片段a、b和c进行了调整顺序操作,得到视频片段d、e和f,再由视频片段d、e和f生成合成视频A,切换模式后,生成的合成视频A还是由d、e、和f生成的。在这里,上述模式通常是指由音频素材和影像集合生成合成视频的模式或由影像集合中的影像素材进行拼接得到合成视频的模式。
本公开的一些实施例公开的视频生成方法,通过对音频素材进行音乐点的划分,能够确定合成视频中的一个个视频片段的时长,从而使得影像素材能够被处理成合成视频中的一个个视频片段,这样就减少了用户处理影像素材和音频素材的时间,使得剪辑更简便。而通过对合成视频的一个个视频片段进行调整可以降低用户的调整难度。利用影像素材生成视频片段,进而生成合成视频,为用户直接实现一个个视频片段与音频素材的卡点,给用户多样化的选择,进而提升用户体验。
进一步参考图8,作为对上述各图所示方法的实现,本公开提供了一种网页生成装置的一些实施例,这些装置实施例与图2所示的那些方法实施例相对应,该装置具体可以应用于各种电子设备中。
如图8所示,一些实施例的网页生成装置800包括:获取单元801、确定单元802、生成单元803、调整单元804和拼接单元805。其中,获取单元801配置用于获取第一影像集合和音频素材,上述第一影像集合中包括多个影像素材;确定单元802配置用于按照上述第一影像集合中影像素材的数量,确定上述音频素材的第一音乐点,其中,上述第一音乐点用于将上述音频素材划分成多个第一音乐片段,上述第一音乐片段的数量与上述第一影像集合中影像素材的数量相同;生成单元803配置用于按照上述第一影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;调整单元804配置用于响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列;而拼接单元805配置用于将上述第二视频序列中的视频片段拼接在一起,并添加上述音频素材作为视频音轨,得到合成视频。
在一些实施例的可选实现方式中,视频生成装置800的调整单元801被进一步被配置成:响应于检测到针对上述第一视频序列中视频片段的排列顺序的调整操作,按 照调整后视频片段的排列顺序,对上述第一影像集合中影像素材的排列顺序进行调整,得到第二影像集合;按照上述第二影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长。
在一些实施例的可选实现方式中,视频生成装置800还可以包括第一子单元,被配置成:显示上述第一视频序列的第一展示界面;响应于检测到在上述第一展示界面中针对上述第一视频序列中视频片段的标识从第一排序位置移动至第二排序位置的调整操作,将上述视频片段移动至第二排序位置,确定调整后视频片段的排列顺序;按照上述调整后视频片段的排列顺序,对上述第一影像集合中影像素材的排列顺序进行调整。
在一些实施例的可选实现方式中,视频生成装置800的调整单元801被进一步被配置成:响应于检测到针对上述第一视频序列中的第一视频片段的删除操作,在上述第一影像集合中删除第一影像素材,得到第三影像集合,其中,上述第一视频片段是基于上述第一影像素材生成的;按照上述第三影像集合中影像素材的数量,确定上述音频素材的第二音乐点,其中,上述第二音乐点用于将上述音频素材划分成多个第二音乐片段,上述第二音乐片段的数量与上述第二影像集合中影像素材的数量相同;按照上述第三影像集合中影像素材的排列顺序,为上述音频素材中的每个第二音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第二音乐片段和视频片段具有相同的时长。
在一些实施例的可选实现方式中,视频生成装置800还可以包括第二子单元,被配置成:显示上述第一视频序列的第二展示界面;响应于检测到在上述第一视频序列的第二展示界面中针对上述第一视频序列中的第一视频片段的选择操作,显示在上述第一影像素材中截取视频片段的第一截取操作界面;响应于检测到在上述第一截取操作界面中的删除操作,删除上述第一影像素材。
在一些实施例的可选实现方式中,视频生成装置800的调整单元801被进一步被配置成:响应于检测到针对上述第一视频序列中视频片段的增加操作,获取第二影像素材并添加到上述第一影像集合中,得到第四影像集合;按照上述第四影像集合中影像素材的数量,确定上述音频素材的第三音乐点,其中,上述第三音乐点用于将上述音频素材划分成多个第三音乐片段,上述第三音乐片段的数量与上述第四影像集合中影像素材的数量相同;按照上述第四影像集合中影像素材的排列顺序,为上述音频素材中的每个第三音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第三音乐片段和视频片段具有相同的时长。
在一些实施例的可选实现方式中,视频生成装置800还可以包括第三子单元,被配置成:显示上述第一视频序列的第三展示界面;响应于检测到在上述第三展示界面中针对上述第一视频序列中视频片段的增加操作,显示影像素材的浏览界面;响应于检测到在上述浏览界面中针对上述第二影像素材的选择操作,获取上述第二影像素材并添加到上述第一影像集合中,得到第四影像集合。
在一些实施例的可选实现方式中,上述第一视频序列中的视频片段是从上述第一影像集合中的影像素材中以预设起始点位置裁剪出的。
在一些实施例的可选实现方式中,视频生成装置800的调整单元801被进一步被配置成:响应于检测到针对上述第一视频序列中的第二视频片段的自动优化操作,为上述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段,其中,上述第二视频片段对应的音乐片段与每个上述初始视频片段具有相同的时长,上述第二视频片段是基于上述第三影像素材生成的;分别对上述多个初始视频片段进行抽帧,并按照抽帧得到的图像分析上述多个初始视频片段的质量;从上述多个初始视频片段中选取质量最高的第三视频片段;在上述第一视频序列中将上述第二视频片段替换为上述第三视频片段,得到上述第二视频序列。
在一些实施例的可选实现方式中,视频生成装置800还可以包括第四子单元,被配置成:显示上述第一视频序列中第二视频片段的第四展示界面,其中,上述第四展示界面包括自动优化控件,上述自动优化控件用于触发自动优化;响应于检测到在上述第四展示界面中针对上述自动优化控件的自动优化操作,为上述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段。
在一些实施例的可选实现方式中,视频生成装置800的调整单元801被进一步被配置成:响应于检测到针对上述第一视频序列中的第四视频片段的手动优化操作,确定上述手动优化操作在上述第四影像素材中选中的裁剪区间;按照上述裁剪区间,从上述第四影像素材中裁剪出第五视频片段;在上述第一视频序列中将上述第四视频片段替换为上述第五视频片段,得到上述第二视频序列。
在一些实施例的可选实现方式中,视频生成装置800还可以包括第五子单元,被配置成:显示上述第一视频序列的第五展示界面;响应于检测到在上述第五展示界面中针对上述第四视频片段的选择操作,显示在上述第四影像素材中截取视频片段的第二截取操作界面;响应于检测到在上述第二截取操作界面中截取区间的选择操作,确定在上述第四影像素材中选中的裁剪区间。
在一些实施例的可选实现方式中,视频生成装置800的调整单元801被进一步被配置成:响应于检测到针对上述第一视频序列中第六视频片段的旋转操作,将上述第六视频片段进行旋转,得到第七视频片段;在上述第一视频序列中将上述第六视频片段替换为上述第七视频片段,得到上述第二视频序列。
在一些实施例的可选实现方式中,视频生成装置800还可以包括第六子单元,被配置成:显示上述第一视频序列的第六展示界面;响应于检测到在上述第六展示界面中针对第六视频片段的选择操作,显示上述第六视频片段的旋转操作界面;响应于检测到在上述旋转操作界面中针对上述第六视频片段的旋转操作,将上述第六视频片段进行旋转,得到第七视频片段。
本公开的一些实施例公开的视频生成装置,通过对音频素材进行音乐点的划分,能够确定合成视频中的一个个视频片段的时长,从而使得影像素材能够被处理成合成视频中的一个个视频片段,这样就减少了用户处理影像素材和音频素材的时间,使得剪辑更简便。而通过对合成视频的一个个视频片段进行调整可以降低用户的调整难度。利用影像素材生成视频片段,进而生成合成视频,为用户直接实现一个个视频片段与音频素材的卡点,给用户多样化的选择,进而提升用户体验。
下面参考图9,其示出了适于用来实现本公开的一些实施例的电子设备(例如图1中的终端设备)900的结构示意图。图9示出的电子设备仅仅是一个示例,不应对本公开的实施例的功能和使用范围带来任何限制。
如图9所示,电子设备900可以包括处理装置(例如中央处理器、图形处理器等)901,其可以根据存储在只读存储器(ROM)902中的程序或者从存储装置908加载到随机访问存储器(RAM)903中的程序而执行各种适当的动作和处理。在RAM 903中,还存储有电子设备900操作所需的各种程序和数据。处理装置901、ROM 902以及RAM 903通过总线904彼此相连。输入/输出(I/O)接口905也连接至总线904。
通常,以下装置可以连接至I/O接口905:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置906;包括例如液晶显示器(LCD)、扬声器、振动器等的输出装置907;以及通信装置909。通信装置909可以允许电子设备900与其他设备进行无线或有线通信以交换数据。虽然图9示出了具有各种装置的电子设备900,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。图9中示出的每个方框可以代表一个装置,也可以根据需要代表多个装置。
特别地,根据本公开的一些实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的一些实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的一些实施例中,该计算机程序可以通过通信装置909从网络上被下载和安装,或者从存储装置908被安装,或者从ROM902被安装。在该计算机程序被处理装置901执行时,执行本公开的一些实施例的方法中限定的上述功能。
需要说明的是,本公开的一些实施例上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开的一些实施例中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开的一些实施例中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。
在一些实施方式中,客户端、服务器可以利用诸如HTTP(HyperText Transfer Protocol,超文本传输协议)之类的任何当前已知或未来研发的网络协议进行通信, 并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(“LAN”),广域网(“WAN”),网际网(例如,互联网)以及端对端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:获取第一影像集合和音频素材,上述第一影像集合中包括多个影像素材;按照上述第一影像集合中影像素材的数量,确定上述音频素材的第一音乐点,其中,上述第一音乐点用于将上述音频素材划分成多个第一音乐片段,上述第一音乐片段的数量与上述第一影像集合中影像素材的数量相同;按照上述第一影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列;将上述第二视频序列中的视频片段拼接在一起,并添加上述音频素材作为视频音轨,得到合成视频
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的一些实施例的操作的计算机程序代码,上述程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)——连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。
描述于本公开的一些实施例中的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。所描述的单元也可以设置在处理器中,例如,可以描述为:一种处理器包括获取单元、确定单元、生成单元、调整单元和拼接单元。其中,这些单元的名称在某种情况下并不构成对该单元本身的限定,例如,获取单元还可以被描述为“获取第一影像集合和音频素材的单元”。
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、 专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等等。
根据本公开的一个或多个实施例,提供了一种视频生成方法,包括:获取第一影像集合和音频素材,上述第一影像集合中包括多个影像素材;按照上述第一影像集合中影像素材的数量,确定上述音频素材的第一音乐点,其中,上述第一音乐点用于将上述音频素材划分成多个第一音乐片段,上述第一音乐片段的数量与上述第一影像集合中影像素材的数量相同;按照上述第一影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列;将上述第二视频序列中的视频片段拼接在一起,并添加上述音频素材作为视频音轨,得到合成视频。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:响应于检测到针对上述第一视频序列中视频片段的排列顺序的调整操作,按照调整后视频片段的排列顺序,对上述第一影像集合中影像素材的排列顺序进行调整,得到第二影像集合;按照上述第二影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中视频片段的排列顺序的调整操作,按照调整后视频片段的排列顺序,对上述第一影像集合中影像素材的排列顺序进行调整,得到第二影像集合,包括:显示上述第一视频序列的第一展示界面;响应于检测到在上述第一展示界面中针对上述第一视频序列中视频片段的标识从第一排序位置移动至第二排序位置的调整操作,将上述视频片段移动至第二排序位置,确定调整后视频片段的排列顺序;按照上述调整后视频片段的排列顺序,对上述第一影像集合中影像素材的排列顺序进行调整。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:响应于检测到针对上述第一视频序列中的第一视频片段的删除操作,在上述第一影像集合中删除第一影像素材,得到第三影像集合,其中,上述第一视频片段是基于上述第一影像素材生成的;按照上述第三影像集合中影像素材的数量,确定上述音频素材的第二音乐点,其中,上述第二音乐点用于将上述音频素材划分成多个第二音乐片段,上述第二音乐片段的数量与上述第二影像集合中影像素材的数量相同;按照上述第三影像集合中影像素材的排列顺序,为上述音频素材中的每个第二音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第二音乐片段和视频片段具有相同的时长。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的第一视频片段的删除操作,在上述第一影像集合中删除第一影像素材,包括:显示上述第一视频序列的第二展示界面;响应于检测到在上述第一视频序列的第二展示界面中针 对上述第一视频序列中的第一视频片段的选择操作,显示在上述第一影像素材中截取视频片段的第一截取操作界面;响应于检测到在上述第一截取操作界面中的删除操作,删除上述第一影像素材。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:响应于检测到针对上述第一视频序列中视频片段的增加操作,获取第二影像素材并添加到上述第一影像集合中,得到第四影像集合;按照上述第四影像集合中影像素材的数量,确定上述音频素材的第三音乐点,其中,上述第三音乐点用于将上述音频素材划分成多个第三音乐片段,上述第三音乐片段的数量与上述第四影像集合中影像素材的数量相同;按照上述第四影像集合中影像素材的排列顺序,为上述音频素材中的每个第三音乐片段分别利用一个影像素材生成一个视频片段,得到上述第二视频序列,其中,相对应的第三音乐片段和视频片段具有相同的时长。
根据本公开的一个或多个实施例,响应于检测到上述第一视频序列中视频片段的增加操作,获取第二影像素材并添加到上述第一影像集合中,得到第四影像集合,包括:显示上述第一视频序列的第三展示界面;响应于检测到在上述第三展示界面中针对上述第一视频序列中视频片段的增加操作,显示影像素材的浏览界面;响应于检测到在上述浏览界面中针对上述第二影像素材的选择操作,获取上述第二影像素材并添加到上述第一影像集合中,得到第四影像集合。
根据本公开的一个或多个实施例,第一视频序列中的视频片段是从上述第一影像集合中的影像素材中以预设起始点位置裁剪出的。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:响应于检测到针对上述第一视频序列中的第二视频片段的自动优化操作,为上述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段,其中,上述第二视频片段对应的音乐片段与每个上述初始视频片段具有相同的时长,上述第二视频片段是基于上述第三影像素材生成的;分别对上述多个初始视频片段进行抽帧,并按照抽帧得到的图像分析上述多个初始视频片段的质量;从上述多个初始视频片段中选取质量最高的第三视频片段;在上述第一视频序列中将上述第二视频片段替换为上述第三视频片段,得到上述第二视频序列。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的第二视频片段的自动优化操作,为上述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段,包括:显示上述第一视频序列中第二视频片段的第四展示界面,其中,上述第四展示界面包括自动优化控件,上述自动优化控件用于触发自动优化;响应于检测到在上述第四展示界面中针对上述自动优化控件的自动优化操作,为上述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:响应于检测到针对上述第一视频序列中的第四视频片段的手动优化操作, 确定上述手动优化操作在上述第四影像素材中选中的裁剪区间;按照上述裁剪区间,从上述第四影像素材中裁剪出第五视频片段;在上述第一视频序列中将上述第四视频片段替换为上述第五视频片段,得到上述第二视频序列。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的第四视频片段的手动优化操作,确定上述手动优化操作在上述第四影像素材中选中的裁剪区间,包括:显示上述第一视频序列的第五展示界面;响应于检测到在上述第五展示界面中针对上述第四视频片段的选择操作,显示在上述第四影像素材中截取视频片段的第二截取操作界面;响应于检测到在上述第二截取操作界面中截取区间的选择操作,确定在上述第四影像素材中选中的裁剪区间。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:响应于检测到针对上述第一视频序列中第六视频片段的旋转操作,将上述第六视频片段进行旋转,得到第七视频片段;在上述第一视频序列中将上述第六视频片段替换为上述第七视频片段,得到上述第二视频序列。
根据本公开的一个或多个实施例,响应于检测到针对上述第一视频序列中第六视频片段的旋转操作,将上述第六视频片段进行旋转,得到第七视频片段,包括:显示上述第一视频序列的第六展示界面;响应于检测到在上述第六展示界面中针对第六视频片段的选择操作,显示上述第六视频片段的旋转操作界面;响应于检测到在上述旋转操作界面中针对上述第六视频片段的旋转操作,将上述第六视频片段进行旋转,得到第七视频片段。
根据本公开的一个或多个实施例,视频生成装置包括:获取单元,被配置成获取第一影像集合和音频素材,上述第一影像集合中包括多个影像素材;确定单元,被配置成按照上述第一影像集合中影像素材的数量,确定上述音频素材的第一音乐点,其中,上述第一音乐点用于将上述音频素材划分成多个第一音乐片段,上述第一音乐片段的数量与上述第一影像集合中影像素材的数量相同;生成单元,被配置成按照上述第一影像集合中影像素材的排列顺序,为上述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;调整单元,被配置成响应于检测到针对上述第一视频序列中的视频片段进行的编辑操作,对上述第一视频序列中的视频片段进行调整,得到第二视频序列;拼接单元,被配置成将上述第二视频序列中的视频片段拼接在一起,并添加上述音频素材作为视频音轨,得到合成视频。
根据本公开的一个或多个实施例,提供了一种电子设备,包括:一个或多个处理器;存储装置,其上存储有一个或多个程序,当一个或多个程序被一个或多个处理器执行,使得一个或多个处理器实现如上述任一实施例描述的方法。
根据本公开的一个或多个实施例,提供了一种计算机可读介质,其上存储有计算机程序,其中,程序被处理器执行时实现如上述任一实施例描述的方法。
根据本申请的一个或多个实施例,提供了一种计算机程序,包括程序代码,当计算机运行所述计算机程序时,所述程序代码执行上述任一实施例描述的方法。
以上描述仅为本公开的一些较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开的实施例中所涉及的发明范围,并不限于上述技术特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述发明构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开的实施例中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。

Claims (18)

  1. 一种视频生成方法,包括:
    获取第一影像集合和音频素材,所述第一影像集合中包括多个影像素材;
    按照所述第一影像集合中影像素材的数量,确定所述音频素材的第一音乐点,其中,所述第一音乐点用于将所述音频素材划分成多个第一音乐片段,所述第一音乐片段的数量与所述第一影像集合中影像素材的数量相同;
    按照所述第一影像集合中影像素材的排列顺序,为所述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;
    响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列;
    将所述第二视频序列中的视频片段拼接在一起,并添加所述音频素材作为视频音轨,得到合成视频。
  2. 根据权利要求1所述的方法,其中,所述响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:
    响应于检测到针对所述第一视频序列中视频片段的排列顺序的调整操作,按照调整后视频片段的排列顺序,对所述第一影像集合中影像素材的排列顺序进行调整,得到第二影像集合;
    按照所述第二影像集合中影像素材的排列顺序,为所述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到所述第二视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长。
  3. 根据权利要求2所述的方法,其中,所述响应于检测到针对所述第一视频序列中视频片段的排列顺序的调整操作,按照调整后视频片段的排列顺序,对所述第一影像集合中影像素材的排列顺序进行调整,得到第二影像集合,包括:
    显示所述第一视频序列的第一展示界面;
    响应于检测到在所述第一展示界面中针对所述第一视频序列中视频片段的标识从第一排序位置移动至第二排序位置的调整操作,将所述视频片段移动至第二排序位置,确定调整后视频片段的排列顺序;
    按照所述调整后视频片段的排列顺序,对所述第一影像集合中影像素材的排列顺序进行调整,得到第二影像集合。
  4. 根据权利要求1-3任一项所述的方法,其中,所述响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:
    响应于检测到针对所述第一视频序列中的第一视频片段的删除操作,在所述第一影像集合中删除第一影像素材,得到第三影像集合,其中,所述第一视频片段是基于 所述第一影像素材生成的;
    按照所述第三影像集合中影像素材的数量,确定所述音频素材的第二音乐点,其中,所述第二音乐点用于将所述音频素材划分成多个第二音乐片段,所述第二音乐片段的数量与所述第二影像集合中影像素材的数量相同;
    按照所述第三影像集合中影像素材的排列顺序,为所述音频素材中的每个第二音乐片段分别利用一个影像素材生成一个视频片段,得到所述第二视频序列,其中,相对应的第二音乐片段和视频片段具有相同的时长。
  5. 根据权利要求4所述的方法,其中,所述响应于检测到针对所述第一视频序列中的第一视频片段的删除操作,在所述第一影像集合中删除第一影像素材,包括:
    显示所述第一视频序列的第二展示界面;
    响应于检测到在所述第一视频序列的第二展示界面中针对所述第一视频序列中的第一视频片段的选择操作,显示在所述第一影像素材中截取视频片段的第一截取操作界面;
    响应于检测到在所述第一截取操作界面中的删除操作,删除所述第一影像素材。
  6. 根据权利要求1-5任一项所述的方法,其中,所述响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:
    响应于检测到针对所述第一视频序列中视频片段的增加操作,获取第二影像素材并添加到所述第一影像集合中,得到第四影像集合;
    按照所述第四影像集合中影像素材的数量,确定所述音频素材的第三音乐点,其中,所述第三音乐点用于将所述音频素材划分成多个第三音乐片段,所述第三音乐片段的数量与所述第四影像集合中影像素材的数量相同;
    按照所述第四影像集合中影像素材的排列顺序,为所述音频素材中的每个第三音乐片段分别利用一个影像素材生成一个视频片段,得到所述第二视频序列,其中,相对应的第三音乐片段和视频片段具有相同的时长。
  7. 根据权利要求6所述的方法,其中,所述响应于检测到所述第一视频序列中视频片段的增加操作,获取第二影像素材并添加到所述第一影像集合中,得到第四影像集合,包括:
    显示所述第一视频序列的第三展示界面;
    响应于检测到在所述第三展示界面中针对所述第一视频序列中视频片段的增加操作,显示影像素材的浏览界面;
    响应于检测到在所述浏览界面中针对所述第二影像素材的选择操作,获取所述第二影像素材并添加到所述第一影像集合中,得到第四影像集合。
  8. 根据权利要求1-7任一项所述的方法,其中,所述第一视频序列中的视频片段是从所述第一影像集合中的影像素材中以预设起始点位置裁剪出的。
  9. 根据权利要求8所述的方法,其中,所述响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:
    响应于检测到针对所述第一视频序列中的第二视频片段的自动优化操作,为所述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段,其中,所述第二视频片段对应的音乐片段与每个所述初始视频片段具有相同的时长,所述第二视频片段是基于所述第三影像素材生成的;
    分别对所述多个初始视频片段进行抽帧,并按照抽帧得到的图像分析所述多个初始视频片段的质量;
    从所述多个初始视频片段中选取质量最高的第三视频片段;
    在所述第一视频序列中将所述第二视频片段替换为所述第三视频片段,得到所述第二视频序列。
  10. 根据权利要求9所述的方法,其中,所述响应于检测到针对所述第一视频序列中的第二视频片段的自动优化操作,为所述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段,包括:
    显示所述第一视频序列中第二视频片段的第四展示界面,其中,所述第四展示界面包括自动优化控件,所述自动优化控件用于触发自动优化;
    响应于检测到在所述第四展示界面中针对所述自动优化控件的自动优化操作,为所述第二视频片段对应的音乐片段利用第三影像素材以不同的起始点位置裁剪出多个初始视频片段。
  11. 根据权利要求8所述的方法,其中,所述响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:
    响应于检测到针对所述第一视频序列中的第四视频片段的手动优化操作,确定所述手动优化操作在所述第四影像素材中选中的裁剪区间;
    按照所述裁剪区间,从所述第四影像素材中裁剪出第五视频片段;
    在所述第一视频序列中将所述第四视频片段替换为所述第五视频片段,得到所述第二视频序列。
  12. 根据权利要求11所述的方法,其中,所述响应于检测到针对所述第一视频序列中的第四视频片段的手动优化操作,确定所述手动优化操作在所述第四影像素材中选中的裁剪区间,包括:
    显示所述第一视频序列的第五展示界面;
    响应于检测到在所述第五展示界面中针对所述第四视频片段的选择操作,显示在所述第四影像素材中截取视频片段的第二截取操作界面;
    响应于检测到在所述第二截取操作界面中截取区间的选择操作,确定在所述第四 影像素材中选中的裁剪区间。
  13. 根据权利要求1-12任一项所述的方法,其中,所述响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列,包括:
    响应于检测到针对所述第一视频序列中第六视频片段的旋转操作,将所述第六视频片段进行旋转,得到第七视频片段;
    在所述第一视频序列中将所述第六视频片段替换为所述第七视频片段,得到所述第二视频序列。
  14. 根据权利要求13所述的方法,其中,所述响应于检测到针对所述第一视频序列中第六视频片段的旋转操作,将所述第六视频片段进行旋转,得到第七视频片段,包括:
    显示所述第一视频序列的第六展示界面;
    响应于检测到在所述第六展示界面中针对第六视频片段的选择操作,显示所述第六视频片段的旋转操作界面;
    响应于检测到在所述旋转操作界面中针对所述第六视频片段的旋转操作,将所述第六视频片段进行旋转,得到第七视频片段。
  15. 一种用于视频生成的装置,包括:
    获取单元,被配置成获取第一影像集合和音频素材,所述第一影像集合中包括多个影像素材;
    确定单元,被配置成按照所述第一影像集合中影像素材的数量,确定所述音频素材的第一音乐点,其中,所述第一音乐点用于将所述音频素材划分成多个第一音乐片段,所述第一音乐片段的数量与所述第一影像集合中影像素材的数量相同;
    生成单元,被配置成按照所述第一影像集合中影像素材的排列顺序,为所述音频素材中的每个第一音乐片段分别利用一个影像素材生成一个视频片段,得到第一视频序列,其中,相对应的第一音乐片段和视频片段具有相同的时长;
    调整单元,被配置成响应于检测到针对所述第一视频序列中的视频片段进行的编辑操作,对所述第一视频序列中的视频片段进行调整,得到第二视频序列;
    拼接单元,被配置成将所述第二视频序列中的视频片段拼接在一起,并添加所述音频素材作为视频音轨,得到合成视频。
  16. 一种电子设备,包括:
    一个或多个处理器;
    存储装置,其上存储有一个或多个程序,
    当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现如权利要求1-14中任一所述的方法。
  17. 一种计算机可读介质,其上存储有计算机程序,其中,所述程序被处理器执行时实现如权利要求1-14中任一所述的方法。
  18. 一种计算机程序,其特征在于,包括程序代码,当计算机运行所述计算机程序时,所述程序代码执行如权利要求1-14任一所述的方法。
PCT/CN2020/129284 2019-11-18 2020-11-17 视频生成方法、装置、电子设备和计算机可读介质 WO2021098670A1 (zh)

Priority Applications (5)

Application Number Priority Date Filing Date Title
KR1020227016781A KR20220103112A (ko) 2019-11-18 2020-11-17 비디오 생성 방법 및 장치, 전자 장치, 및 컴퓨터 판독가능 매체
JP2022528542A JP7457804B2 (ja) 2019-11-18 2020-11-17 ビデオ生成方法および装置、電子装置、およびコンピュータ読み取り可能媒体
BR112022009608A BR112022009608A2 (pt) 2019-11-18 2020-11-17 Método e aparelho de geração de vídeo, dispositivo eletrônico e meio legível por computador
EP20889011.1A EP4047943A4 (en) 2019-11-18 2020-11-17 VIDEO GENERATING METHOD AND APPARATUS, ELECTRONIC DEVICE AND COMPUTER READABLE MEDIA
US17/744,671 US11636879B2 (en) 2019-11-18 2022-05-14 Video generating method, apparatus, electronic device, and computer-readable medium

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201911129727.9A CN112822541B (zh) 2019-11-18 2019-11-18 视频生成方法、装置、电子设备和计算机可读介质
CN201911129727.9 2019-11-18

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US17/744,671 Continuation US11636879B2 (en) 2019-11-18 2022-05-14 Video generating method, apparatus, electronic device, and computer-readable medium

Publications (1)

Publication Number Publication Date
WO2021098670A1 true WO2021098670A1 (zh) 2021-05-27

Family

ID=75852677

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/129284 WO2021098670A1 (zh) 2019-11-18 2020-11-17 视频生成方法、装置、电子设备和计算机可读介质

Country Status (7)

Country Link
US (1) US11636879B2 (zh)
EP (1) EP4047943A4 (zh)
JP (1) JP7457804B2 (zh)
KR (1) KR20220103112A (zh)
CN (1) CN112822541B (zh)
BR (1) BR112022009608A2 (zh)
WO (1) WO2021098670A1 (zh)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113473204A (zh) * 2021-05-31 2021-10-01 北京达佳互联信息技术有限公司 一种信息展示方法、装置、电子设备及存储介质
CN113613067A (zh) * 2021-08-03 2021-11-05 北京字跳网络技术有限公司 视频处理方法、装置、设备及存储介质
CN113676671A (zh) * 2021-09-27 2021-11-19 北京达佳互联信息技术有限公司 视频剪辑方法、装置、电子设备及存储介质
CN113891113A (zh) * 2021-09-29 2022-01-04 阿里巴巴(中国)有限公司 视频剪辑合成方法及电子设备

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115065840A (zh) * 2022-06-07 2022-09-16 北京达佳互联信息技术有限公司 一种信息处理方法、装置、电子设备及存储介质
CN116506694B (zh) * 2023-06-26 2023-10-27 北京达佳互联信息技术有限公司 视频剪辑方法、装置、电子设备及存储介质

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105530440A (zh) * 2014-09-29 2016-04-27 北京金山安全软件有限公司 一种视频的制作方法及装置
US20160172000A1 (en) * 2013-07-24 2016-06-16 Prompt, Inc. An apparatus of providing a user interface for playing and editing moving pictures and the method thereof
CN108419035A (zh) * 2018-02-28 2018-08-17 北京小米移动软件有限公司 图片视频的合成方法及装置
CN109168084A (zh) * 2018-10-24 2019-01-08 麒麟合盛网络技术股份有限公司 一种视频剪辑的方法和装置
CN109379643A (zh) * 2018-11-21 2019-02-22 北京达佳互联信息技术有限公司 视频合成方法、装置、终端及存储介质
CN110265057A (zh) * 2019-07-10 2019-09-20 腾讯科技(深圳)有限公司 生成多媒体的方法及装置、电子设备、存储介质
CN110278388A (zh) * 2019-06-19 2019-09-24 北京字节跳动网络技术有限公司 展示视频的生成方法、装置、设备及存储介质

Family Cites Families (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3325809B2 (ja) * 1997-08-15 2002-09-17 日本電信電話株式会社 映像制作方法及び装置及びこの方法を記録した記録媒体
US7027124B2 (en) * 2002-02-28 2006-04-11 Fuji Xerox Co., Ltd. Method for automatically producing music videos
JP4285287B2 (ja) * 2004-03-17 2009-06-24 セイコーエプソン株式会社 画像処理装置、画像処理方法およびそのプログラム、記録媒体
US7512886B1 (en) * 2004-04-15 2009-03-31 Magix Ag System and method of automatically aligning video scenes with an audio track
JP2007066399A (ja) * 2005-08-30 2007-03-15 Ricoh Co Ltd 映像音声編集システム
JP4487958B2 (ja) * 2006-03-16 2010-06-23 ソニー株式会社 メタデータ付与方法及び装置
WO2008024486A2 (en) * 2006-08-24 2008-02-28 Fliptrack, Inc. Beat and text based editing and composing systems and methods
US7877690B2 (en) * 2006-09-20 2011-01-25 Adobe Systems Incorporated Media system with integrated clip views
US7569761B1 (en) * 2007-09-21 2009-08-04 Adobe Systems Inc. Video editing matched to musical beats
US8737815B2 (en) * 2009-01-23 2014-05-27 The Talk Market, Inc. Computer device, method, and graphical user interface for automating the digital transformation, enhancement, and editing of personal and professional videos
US20110142420A1 (en) * 2009-01-23 2011-06-16 Matthew Benjamin Singer Computer device, method, and graphical user interface for automating the digital tranformation, enhancement, and editing of personal and professional videos
US20120195573A1 (en) * 2011-01-28 2012-08-02 Apple Inc. Video Defect Replacement
US20130163963A1 (en) * 2011-12-21 2013-06-27 Cory Crosland System and method for generating music videos from synchronized user-video recorded content
US20150058709A1 (en) * 2012-01-26 2015-02-26 Michael Edward Zaletel Method of creating a media composition and apparatus therefore
US20140355960A1 (en) * 2013-05-31 2014-12-04 Microsoft Corporation Touch optimized design for video editing
CN105933773A (zh) 2016-05-12 2016-09-07 青岛海信传媒网络技术有限公司 视频编辑方法及系统
CN105959579B (zh) * 2016-07-18 2020-04-17 杭州当虹科技股份有限公司 一种360度全景视频局部内容更换装置
CN106992004B (zh) 2017-03-06 2020-06-26 华为技术有限公司 一种调整视频的方法及终端
US11915722B2 (en) * 2017-03-30 2024-02-27 Gracenote, Inc. Generating a video presentation to accompany audio
US20180295427A1 (en) * 2017-04-07 2018-10-11 David Leiberman Systems and methods for creating composite videos
CN107770626B (zh) * 2017-11-06 2020-03-17 腾讯科技(深圳)有限公司 视频素材的处理方法、视频合成方法、装置及存储介质
AU2018271424A1 (en) * 2017-12-13 2019-06-27 Playable Pty Ltd System and Method for Algorithmic Editing of Video Content
CN109257545B (zh) * 2018-08-27 2021-04-13 咪咕文化科技有限公司 一种多源视频剪辑方法、装置及存储介质
CN109275028B (zh) * 2018-09-30 2021-02-26 北京微播视界科技有限公司 视频获取方法、装置、终端和介质
CN109660867A (zh) 2019-01-09 2019-04-19 深圳慧聚智能科技有限公司 一种动态视频调整方法及其设备
CN110233976B (zh) * 2019-06-21 2022-09-09 广州酷狗计算机科技有限公司 视频合成的方法及装置
CN110336960B (zh) * 2019-07-17 2021-12-10 广州酷狗计算机科技有限公司 视频合成的方法、装置、终端及存储介质
CN110769309B (zh) * 2019-11-04 2023-03-31 北京字节跳动网络技术有限公司 用于展示音乐点的方法、装置、电子设备和介质

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160172000A1 (en) * 2013-07-24 2016-06-16 Prompt, Inc. An apparatus of providing a user interface for playing and editing moving pictures and the method thereof
CN105530440A (zh) * 2014-09-29 2016-04-27 北京金山安全软件有限公司 一种视频的制作方法及装置
CN108419035A (zh) * 2018-02-28 2018-08-17 北京小米移动软件有限公司 图片视频的合成方法及装置
CN109168084A (zh) * 2018-10-24 2019-01-08 麒麟合盛网络技术股份有限公司 一种视频剪辑的方法和装置
CN109379643A (zh) * 2018-11-21 2019-02-22 北京达佳互联信息技术有限公司 视频合成方法、装置、终端及存储介质
CN110278388A (zh) * 2019-06-19 2019-09-24 北京字节跳动网络技术有限公司 展示视频的生成方法、装置、设备及存储介质
CN110265057A (zh) * 2019-07-10 2019-09-20 腾讯科技(深圳)有限公司 生成多媒体的方法及装置、电子设备、存储介质

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP4047943A4

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113473204A (zh) * 2021-05-31 2021-10-01 北京达佳互联信息技术有限公司 一种信息展示方法、装置、电子设备及存储介质
CN113473204B (zh) * 2021-05-31 2023-10-13 北京达佳互联信息技术有限公司 一种信息展示方法、装置、电子设备及存储介质
CN113613067A (zh) * 2021-08-03 2021-11-05 北京字跳网络技术有限公司 视频处理方法、装置、设备及存储介质
CN113613067B (zh) * 2021-08-03 2023-08-22 北京字跳网络技术有限公司 视频处理方法、装置、设备及存储介质
CN113676671A (zh) * 2021-09-27 2021-11-19 北京达佳互联信息技术有限公司 视频剪辑方法、装置、电子设备及存储介质
CN113676671B (zh) * 2021-09-27 2023-06-23 北京达佳互联信息技术有限公司 视频剪辑方法、装置、电子设备及存储介质
CN113891113A (zh) * 2021-09-29 2022-01-04 阿里巴巴(中国)有限公司 视频剪辑合成方法及电子设备
CN113891113B (zh) * 2021-09-29 2024-03-12 阿里巴巴(中国)有限公司 视频剪辑合成方法及电子设备

Also Published As

Publication number Publication date
KR20220103112A (ko) 2022-07-21
JP7457804B2 (ja) 2024-03-28
CN112822541B (zh) 2022-05-20
EP4047943A1 (en) 2022-08-24
JP2023501813A (ja) 2023-01-19
EP4047943A4 (en) 2022-11-30
US20220277775A1 (en) 2022-09-01
BR112022009608A2 (pt) 2022-08-16
US11636879B2 (en) 2023-04-25
CN112822541A (zh) 2021-05-18

Similar Documents

Publication Publication Date Title
WO2021098670A1 (zh) 视频生成方法、装置、电子设备和计算机可读介质
US11887630B2 (en) Multimedia data processing method, multimedia data generation method, and related device
WO2021088830A1 (zh) 用于展示音乐点的方法、装置、电子设备和介质
WO2021093737A1 (zh) 生成视频的方法、装置、电子设备和计算机可读介质
JP7387891B2 (ja) 動画ファイルの生成方法、装置、端末及び記憶媒体
CN106303723B (zh) 视频处理方法和装置
US20240107127A1 (en) Video display method and apparatus, video processing method, apparatus, and system, device, and medium
US20190130185A1 (en) Visualization of Tagging Relevance to Video
WO2021057740A1 (zh) 视频生成方法、装置、电子设备和计算机可读介质
JP7267434B2 (ja) ドキュメント入力内容の処理方法、装置、電子機器及び記憶媒体
CN113365134B (zh) 音频分享方法、装置、设备及介质
KR20190132360A (ko) 멀티미디어 리소스를 처리하는 방법 및 디바이스
EP4333439A1 (en) Video sharing method and apparatus, device, and medium
WO2023005831A1 (zh) 一种资源播放方法、装置、电子设备和存储介质
CN110968314B (zh) 一种页面生成方法及装置
JP7240505B2 (ja) 音声パケット推薦方法、装置、電子機器およびプログラム
JP2023549903A (ja) マルチメディアのインタラクション方法、情報インタラクション方法、装置、機器及び媒体
WO2017008646A1 (zh) 一种在触控终端上选择多个目标的方法和设备
WO2023078395A1 (zh) 页面展示方法及装置、电子设备、存储介质和程序产品
CN114520928B (zh) 显示信息生成方法、信息显示方法、装置和电子设备
CN113553466A (zh) 页面展示方法、装置、介质和计算设备
CN112153439A (zh) 互动视频处理方法、装置、设备及可读存储介质
WO2021160141A1 (zh) 视频处理方法、装置、可读介质和电子设备
CN114816144B (zh) 信息显示方法、装置和电子设备
WO2022252916A1 (zh) 特效配置文件的生成方法、装置、设备及介质

Legal Events

121 Ep: the epo has been informed by wipo that ep was designated in this application (Ref document number: 20889011; Country of ref document: EP; Kind code of ref document: A1)
ENP Entry into the national phase (Ref document number: 2022528542; Country of ref document: JP; Kind code of ref document: A)
ENP Entry into the national phase (Ref document number: 2020889011; Country of ref document: EP; Effective date: 20220517)
REG Reference to national code (Ref country code: BR; Ref legal event code: B01A; Ref document number: 112022009608; Country of ref document: BR)
NENP Non-entry into the national phase (Ref country code: DE)
ENP Entry into the national phase (Ref document number: 112022009608; Country of ref document: BR; Kind code of ref document: A2; Effective date: 20220517)