WO2016177296A1 - Video generation method and apparatus - Google Patents

Video generation method and apparatus Download PDF

Info

Publication number
WO2016177296A1
WO2016177296A1 PCT/CN2016/080666 CN2016080666W WO2016177296A1 WO 2016177296 A1 WO2016177296 A1 WO 2016177296A1 CN 2016080666 W CN2016080666 W CN 2016080666W WO 2016177296 A1 WO2016177296 A1 WO 2016177296A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
video
song
video image
processed
Prior art date
Application number
PCT/CN2016/080666
Other languages
French (fr)
Chinese (zh)
Inventor
王超
李纯
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Publication of WO2016177296A1 publication Critical patent/WO2016177296A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/272Means for inserting a foreground image in a background image, i.e. inlay, outlay
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • H04N21/4334Recording operations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/91Television signal processing therefor

Definitions

  • the embodiments of the present invention relate to the field of computer technologies, and in particular, to a method and an apparatus for generating a video.
  • the singing application (or karaoke application) is a very popular entertainment application.
  • the video provided in the song is generally a simple presentation of the content of the camera, and the flexibility of recording the video of the song is poor.
  • an embodiment of the present invention provides a method and apparatus for generating a video.
  • the technical solution is as follows:
  • a method of generating a video comprising:
  • the video effect combination includes at least one filter and/or at least one Foreground video
  • the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • an apparatus for generating a video comprising:
  • a playing module configured to play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording;
  • a display module configured to display an option of pre-stored at least one video special effect combination after the capturing of the video image and the recording of the audio, and display the composite button; wherein the video special effect combination includes at least one filter And/or at least one foreground video;
  • a preview module configured to perform combined special effect processing on the captured video image according to the first video special effect combination when receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, and processing The subsequent video image is displayed while playing the accompaniment audio and the recorded audio;
  • a synthesizing module configured to synthesize the accompaniment audio, the recorded audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • a terminal where the terminal includes:
  • One or more processors are One or more processors.
  • the memory stores one or more programs, the one or more programs being configured to be executed by the one or more processors, the one or more programs including instructions for:
  • the video effect combination includes at least one filter and/or at least one Foreground video
  • the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • FIG. 1 is a flowchart of a method for generating a video according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of interface display according to an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of an interface display according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of interface display according to an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram of a video generating apparatus according to an embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • An embodiment of the present invention provides a method for generating a video. As shown in FIG. 1 , the processing procedure of the method may include the following steps:
  • Step 101 Play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording.
  • Step 102 After the shooting of the video image and the recording of the audio are finished, displaying an option of the pre-stored combination of the at least one video special effect, and displaying the composite button; wherein the video special effect combination includes at least one filter and/or at least one foreground video. .
  • Step 103 When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image. Display, playing accompaniment audio and recorded audio simultaneously.
  • Step 104 When receiving a click command for the composite button, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • the embodiment of the invention provides a method for generating a video, and the execution body of the method is a terminal.
  • the terminal may be any terminal with a video capture function, such as a mobile phone with a camera, a tablet, and the like, and an application for song and video recording may be installed on the terminal.
  • the terminal may be provided with a processor and a memory, and the processor may be used for processing video images and audio, and the memory may be used for storing data required in the following processing and generated data. It can be equipped with input and output devices such as camera, microphone, screen, audio output device, camera can be used for video image shooting, microphone can be used for audio recording, screen can be used for video, lyrics subtitles, etc. Type of screen, audio output device can be used for audio playback, can be headphones or speakers.
  • the terminal is a mobile phone as an example, and a detailed description of the solution is performed. Other situations are similar, and the embodiment is not described in detail.
  • Step 101 Play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording.
  • the song to be recorded may be a song or a song fragment that the user wants to make a K song.
  • the user can install the above application on the terminal and operate the application, and then trigger the terminal to display the main interface of the application, in the main interface, a little song button can be displayed, and the user clicks the song button.
  • the terminal can be triggered to switch to the corresponding song selection interface of the application, and the song selection interface can display a list of songs in which the user can select the song he likes (ie, the target song mentioned later). Songs with accompaniment files stored locally can be displayed in the song list, and songs with accompaniment files stored on the network side can also be displayed.
  • the terminal can use the song as the song to be recorded, and can display the recording interface of the song.
  • the recording interface can display a recording button.
  • the terminal can trigger the terminal to retrieve the song.
  • the accompaniment file of the song, and the accompaniment file is run to play the accompaniment audio of the song, and the lyrics subtitle corresponding to the song is displayed on the recording interface, and the terminal can start capturing the video image through the front camera of the terminal and recording through the microphone. Audio.
  • the video image captured by the front camera of the terminal can be transmitted to the screen for display in real time, and the user can adjust the position of the terminal based on the image displayed on the screen.
  • the user can select a favorite clip as the song to be recorded in the target song.
  • the following processing may be performed: determining the song segment selected by the user in the target song.
  • the accompaniment audio of the song segment can be played.
  • the user can select a song segment to be recorded in the song, and further, the terminal can obtain the song segment selected by the user in the target song.
  • the user can select a song segment in the target song in a variety of ways, and several alternative treatments are given below:
  • the lyrics list of the target song is displayed; the starting point and the ending point of the song segment set by the user in the lyrics list are obtained; and the song segment selected by the user in the target song is determined according to the starting point and the ending point.
  • the user can drag the start line and the end line displayed in the lyrics, and the song piece corresponding to the lyric content between the start line and the end line is the song piece selected by the user.
  • the lyric list of the target song may be displayed in the recording interface, and the start line and the end line may be displayed in the lyric list, and the user may drag the above start line and end line up and down.
  • the lyrics below the starting line are the starting lyrics (starting sentence) of the song segment selected by the user
  • the lyrics above the ending line are the ending lyrics (terminating sentence) of the song segment selected by the user
  • the user can click the record button, and the terminal can be triggered to obtain the start time point of the start lyrics and the end time point of terminating the lyrics as the start point and the end point of the song segment, and can be determined in the target song according to the start point and the end point.
  • Selected song clips Further, the terminal can play the accompaniment audio of the song segment.
  • the playing time axis of the target song is displayed; the starting point and the ending point of the song segment set by the user in the playing time axis are acquired; and the song segment selected by the user in the target song is determined according to the starting point and the ending point.
  • the recording interface can display the playing time axis of the target song and the recording button, and the two lines located at different positions can also be displayed on the displayed playing time axis, and the user can select the favorite by dragging the two lines.
  • the user can click the recording button in the recording interface, and the terminal can be triggered to obtain the playing time point of the two lines, and the two playing time points are the starting point of the song segment selected by the user.
  • the terminal can acquire a song segment between the start point and the end point in the target song, and then play the accompaniment audio of the song segment.
  • a variety of filter options can also be displayed in the recording interface, as shown in Figure 3, where the user can select a filter for real-time processing of the captured video image. .
  • the user can click the record button to start recording, and the application can perform corresponding image processing on each image frame captured according to the filter selected by the user, and output the filtered video image to the screen. Display and encode it and save it to a file in real time.
  • Step 102 After the shooting of the video image and the recording of the audio are finished, displaying an option of the at least one video special effect combination stored in advance, and displaying the composite button; wherein the video special effect combination includes at least one filter and/or at least one foreground video.
  • the video special effect combination is a plurality of video special effects for combining video images
  • the video special effects may be a filter, a foreground video, etc.
  • the filter may be used to adjust pixel values of each pixel in the video image to achieve Tools for a specific visual effect, such as black and white filters, quaint filters, etc.
  • the scene video can be a video that is hovering over the top of the video image.
  • the preview interface may display an option of one or more combinations of video effects pre-stored locally, wherein the video effect combination may use different different time segments of the video image. Filters and/or foreground video can also use different filters and/or foreground video for the same time period.
  • the processing information corresponding to each video special effect combination may be recorded in the above application, and the processing information may include each filter in the video special effect combination, a start time point and an end time point of the foreground video.
  • a composite button for synthesizing the video image and the audio may be displayed in the preview interface described above.
  • Step 103 When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image. Display, playing accompaniment audio and recorded audio simultaneously.
  • the user can select a video effect combination that he or she likes according to the option of the video special effect combination displayed on the preview interface.
  • the terminal will receive the video special effect combination (ie, The selection instruction of the first video special effect combination, at this time, the terminal can acquire the processing information of the video special effect combination stored therein, according to each filter in the processing information, the start time point and the end time point of the foreground video, for each The video frames involved in the filter and foreground video are processed, and the processed video images are output to the screen for display in real time.
  • the start time point of a black and white filter in the first video special effect combination is the 5th second
  • the end time point is the 13th second.
  • the accompaniment audio and the recorded audio can be acquired, and the audio and video are played synchronously according to the time of each video frame and the time of each audio frame in the accompaniment audio and the recorded audio.
  • the preview interface displays multiple video effects combination options, after the user selects one of the video effects combination previews, the user can also select other video effects combinations for preview, and finally select a favorite video effect combination.
  • Step 104 When receiving a click command for the composite button, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video.
  • the terminal will receive the click command for the above composite button, and the terminal can acquire the video image processed by the video special effect and perform ffmpeg (a set can be used for recording, converting digital audio, video, and can It is converted into a stream of open source computer programs) encoding, in addition, the terminal can also acquire the accompaniment audio and recorded audio, and audio-encode it to obtain the encoded audio, and then the encoded video image and the encoded audio. Ffmpeg synthesis, get a composite video.
  • ffmpeg a set can be used for recording, converting digital audio, video, and can It is converted into a stream of open source computer programs
  • the user may also perform special effects on the recorded audio when recording the song, and correspondingly, the following processing may be performed: displaying an option of at least one audio special effect, when receiving the first audio special effect in the at least one audio special effect
  • the processing of step 104 may be as follows: when receiving a click command for the composite button, synthesizing the processed audio, the accompaniment audio, and the processed video image to obtain a composite video.
  • the above preview interface can also display one or more audio effects options, such as doll sounds, phonographs and other audio effects, the user can select an audio effect that he likes in the options of the displayed audio effects (ie, the first An audio effect), at this time, the terminal will receive a selection instruction for the audio effect, perform special effects processing on the recorded audio according to the selected audio special effect, and play the processed audio on the preview interface for the user. Preview it.
  • the user can click the composite button displayed in the preview interface to trigger the terminal to obtain the encoded video image.
  • the terminal can also acquire the accompaniment audio and the processed audio and encode it to obtain the coded image.
  • the audio is then ffmpeg synthesized by the encoded video image and the encoded audio to obtain a composite video.
  • the recorded audio may also be edited, and correspondingly, the following processing may be performed: displaying at least one option of audio editing processing; when receiving the first in at least one audio editing process When an audio editing process selects an instruction, the first audio editing process is performed on the recorded audio, and the processed audio is played.
  • the processing of step 104 may be as follows: when receiving a click command for the composite button, synthesizing the processed audio, the accompaniment audio, and the processed video image to obtain a composite video.
  • the above preview interface may also display one or more options for audio editing processing, such as adjusting the volume, turning down the volume, moving the vocal forward, moving the vocal backwards and denoising, etc., the user may Select one of the audio editing processing options (ie, the first audio editing processing) in one or more of the displayed audio editing processing options.
  • the preview interface may display options such as volume adjustment, vocal movement, and denoising when the user needs When you adjust the volume of the recorded audio, you can click on the volume The whole option, at this time, will trigger the terminal to display the volume adjustment axis (the volume gradually increases from left to right, or the volume gradually increases from bottom to top), and the volume adjustment line can also be displayed on the displayed volume adjustment axis.
  • the user can adjust the volume by moving the position of the volume adjustment line.
  • the terminal When the user clicks the vocal movement option, the terminal will receive the user's selection instruction for the vocal movement option, and the time axis can be displayed, and the vocal moving line at the middle position of the time axis can also be displayed on the displayed time axis.
  • the user can adjust the size of the movement by moving the position of the vocal moving line, that is, when the vocal moving line moves forward, the terminal can move the recorded audio forward, and when the vocal moving line moves backward, the terminal can The recorded audio moves backwards, which avoids re-recording the audio when the recorded audio is misaligned.
  • the terminal When the user clicks the denoising option, the terminal will receive a selection instruction for the denoising option, and then the denoised processing of the recorded audio can be performed.
  • the preview button in the preview interface can be clicked, and the terminal can be triggered to perform the first audio editing process on the recorded audio, and play the processed audio.
  • the synthesized button displayed in the preview interface can be clicked to trigger the terminal to obtain the encoded video image.
  • the terminal can also acquire the accompaniment audio and the processed audio and encode the code to obtain the code.
  • the encoded video image is then ffmpeg synthesized with the encoded audio to obtain a composite video.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • At least one video effect combination option and displaying a composite button wherein the video effect combination includes at least one filter and/or at least one foreground video, when receiving a selection of a first video effect combination in the at least one video effect combination
  • the combined video effect is performed on the captured video image, and the processed video image is displayed, and the accompaniment audio and the recorded audio are played simultaneously, when receiving the click command on the composite button , the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • the video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is processed by the video special effect, thereby enhancing the flexibility of the video recording of the song.
  • an embodiment of the present invention further provides a device for generating a video. As shown in FIG. 5, the device includes:
  • the playing module 510 is configured to play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image capturing and audio recording;
  • a display module 520 configured to display after the shooting of the video image and the recording of the audio
  • An option of pre-stored at least one video effect combination is displayed, and a composite button is displayed; wherein the video effect combination includes at least one filter and/or at least one foreground video;
  • the preview module 530 is configured to: when receiving the selection instruction for the first video special effect combination in the at least one video special effect combination, perform combined special effect processing on the captured video image according to the first video special effect combination, and The processed video image is displayed while playing the accompaniment audio and the recorded audio;
  • the synthesizing module 540 is configured to synthesize the accompaniment audio, the recorded audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • the method further includes a determining module, configured to determine a song segment selected by the user in the target song before playing the accompaniment audio of the song to be recorded;
  • the playing module 510 is configured to: play the accompaniment audio of the song segment.
  • the determining module is configured to:
  • the determining module is configured to:
  • the display module 520 is further configured to display an option of at least one audio effect
  • the preview module 530 is further configured to perform special effect processing on the recorded audio according to the first audio special effect when receiving a selection instruction for the first audio special effect in the at least one audio special effect, and The processed audio is played;
  • the synthesizing module 540 is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • the display module 520 is further configured to display an option of at least one audio editing process
  • the preview module 530 is further configured to: when the selection instruction of the first audio editing process in the at least one audio editing process is received, perform the first audio editing on the recorded audio And play the processed audio;
  • the synthesizing module 540 is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • At least one video effect combination option and displaying a composite button wherein the video effect combination includes at least one filter and/or at least one foreground video, when receiving a selection of a first video effect combination in the at least one video effect combination
  • the combined video effect is performed on the captured video image, and the processed video image is displayed, and the accompaniment audio and the recorded audio are played simultaneously, when receiving the click command on the composite button , the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • the video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is processed by the video special effect, thereby enhancing the flexibility of the video recording of the song.
  • the device for generating a video provided by the foregoing embodiment is only illustrated by dividing the foregoing functional modules when generating a video. In an actual application, the function allocation may be completed by different functional modules as needed. The internal structure of the device is divided into different functional modules to complete all or part of the functions described above.
  • the device for generating a video provided by the foregoing embodiment is the same as the method for generating a video. The specific implementation process is described in detail in the method embodiment, and details are not described herein again.
  • FIG. 6 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • the terminal may be used to implement the method for generating video provided in the foregoing embodiment. Specifically:
  • the terminal 900 may include an RF (Radio Frequency) circuit 110, a memory 120 including one or more computer readable storage media, an input unit 130, a display unit 140, a sensor 150, an audio circuit 160, and a WiFi (wireless fidelity, wireless).
  • the fidelity module 170 includes a processor 180 having one or more processing cores, and a power supply 190 and the like. It will be understood by those skilled in the art that the terminal structure shown in FIG. 6 does not constitute a limitation to the terminal, and may include more or less components than those illustrated, or a combination of certain components, or different component arrangements. among them:
  • the RF circuit 110 can be used for transmitting and receiving information or during a call, receiving and transmitting signals, and in particular, receiving downlink information of the base station and then processing it by one or more processors 180; The data related to the uplink is sent to the base station.
  • the RF circuit 110 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a Subscriber Identity Module (SIM) card, a transceiver, a coupler, an LNA (Low Noise Amplifier). , duplexer, etc.
  • SIM Subscriber Identity Module
  • RF circuitry 110 can also communicate with the network and other devices via wireless communication.
  • the wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System of Mobile communication), GPRS (General Packet Radio Service), CDMA (Code Division Multiple Access). , Code Division Multiple Access), WCDMA (Wideband Code Division Multiple Access), LTE (Long Term Evolution), e-mail, SMS (Short Messaging Service), and the like.
  • GSM Global System of Mobile communication
  • GPRS General Packet Radio Service
  • CDMA Code Division Multiple Access
  • WCDMA Wideband Code Division Multiple Access
  • LTE Long Term Evolution
  • e-mail Short Messaging Service
  • the memory 120 can be used to store software programs and modules, and the processor 180 executes various functional applications and data processing by running software programs and modules stored in the memory 120.
  • the memory 120 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may be stored according to The data created by the use of the terminal 900 (such as audio data, phone book, etc.) and the like.
  • memory 120 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device. Accordingly, memory 120 may also include a memory controller to provide access to memory 120 by processor 180 and input unit 130.
  • the input unit 130 can be configured to receive input numeric or character information and to generate keyboard, mouse, joystick, optical or trackball signal inputs related to user settings and function controls.
  • input unit 130 can include touch-sensitive surface 131 as well as other input devices 132.
  • Touch-sensitive surface 131 also referred to as a touch display or trackpad, can collect touch operations on or near the user (such as a user using a finger, stylus, etc., on any suitable object or accessory on touch-sensitive surface 131 or The operation near the touch-sensitive surface 131) and driving the corresponding connecting device according to a preset program.
  • the touch-sensitive surface 131 can include two portions of a touch detection device and a touch controller.
  • the touch detection device detects the touch orientation of the user, and detects a signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts the touch information into contact coordinates, and sends the touch information.
  • the processor 180 is provided and can receive commands from the processor 180 and execute them.
  • the touch-sensitive surface 131 can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic waves.
  • the input unit 130 can also include other input devices 132.
  • other input devices 132 may include, but is not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like.
  • the display unit 140 can be used to display information entered by the user or information provided to the user and various graphical user interfaces of the terminal 900, which can be composed of graphics, text, icons, video, and any combination thereof.
  • the display unit 140 may include a display panel 141.
  • the display panel 141 may be configured in the form of an LCD (Liquid Crystal Display), an OLED (Organic Light-Emitting Diode), or the like.
  • the touch-sensitive surface 131 may cover the display panel 141, and when the touch-sensitive surface 131 detects a touch operation thereon or nearby, it is transmitted to the processor 180 to determine the type of the touch event, and then the processor 180 according to the touch event The type provides a corresponding visual output on display panel 141.
  • touch-sensitive surface 131 and display panel 141 are implemented as two separate components to implement input and input functions, in some embodiments, touch-sensitive surface 131 can be integrated with display panel 141 for input. And output function.
  • Terminal 900 can also include at least one type of sensor 150, such as a light sensor, motion sensor, and other sensors.
  • the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display panel 141 according to the brightness of the ambient light, and the proximity sensor may close the display panel 141 when the terminal 900 moves to the ear. / or backlight.
  • the gravity acceleration sensor can detect the magnitude of acceleration in all directions (usually three axes). When it is stationary, it can detect the magnitude and direction of gravity.
  • the terminal 900 can also be configured with gyroscopes, barometers, hygrometers, thermometers, infrared sensors and other sensors, here Let me repeat.
  • the audio circuit 160, the speaker 161, and the microphone 162 can provide an audio interface between the user and the terminal 900.
  • the audio circuit 160 can transmit the converted electrical data of the received audio data to the speaker 161 for conversion to the sound signal output by the speaker 161; on the other hand, the microphone 162 converts the collected sound signal into an electrical signal by the audio circuit 160. After receiving, it is converted into audio data, and then processed by the audio data output processor 180, transmitted to the terminal, for example, via the RF circuit 110, or outputted to the memory 120 for further processing.
  • the audio circuit 160 may also include an earbud jack to provide communication of the peripheral earphones with the terminal 900.
  • WiFi is a short-range wireless transmission technology
  • the terminal 900 can help users to send and receive emails, browse web pages, and access streaming media through the WiFi module 170, which provides wireless broadband Internet access for users.
  • FIG. 6 shows the WiFi module 170, it can be understood that it does not belong to the end.
  • the necessary configuration of the end 900 can be omitted as needed within the scope of not changing the essence of the invention.
  • the processor 180 is a control center of the terminal 900 that connects various portions of the entire handset using various interfaces and lines, by running or executing software programs and/or modules stored in the memory 120, and recalling data stored in the memory 120, The various functions and processing data of the terminal 900 are performed to perform overall monitoring of the mobile phone.
  • the processor 180 may include one or more processing cores; preferably, the processor 180 may integrate an application processor and a modem processor, where the application processor mainly processes an operating system, a user interface, an application, and the like.
  • the modem processor primarily handles wireless communications. It can be understood that the above modem processor may not be integrated into the processor 180.
  • the terminal 900 also includes a power source 190 (such as a battery) that supplies power to the various components.
  • the power source can be logically coupled to the processor 180 through a power management system to manage functions such as charging, discharging, and power management through the power management system.
  • Power supply 190 may also include any one or more of a DC or AC power source, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and the like.
  • the terminal 900 may further include a camera, a Bluetooth module, and the like, and details are not described herein again.
  • the display unit of the terminal 900 is a touch screen display
  • the terminal 900 further includes a memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be one or one
  • the above processor executes one or more programs to perform the method of generating a video as described in the various embodiments above.
  • non-transitory computer readable storage medium comprising instructions, such as a memory comprising instructions executable by a processor of a mobile terminal to perform the method of generating video as described above.
  • the non-transitory computer readable storage medium may be a ROM (Read-Only Memory), a RAM (Random-Access Memory), or a CD-ROM (Compact Disc Read-Only Memory, CD-ROM, tape, floppy disk and optical data storage devices.
  • a person skilled in the art may understand that all or part of the steps of implementing the above embodiments may be completed by hardware, or may be instructed by a program to execute related hardware, and the program may be stored in a computer readable storage medium.
  • the storage medium mentioned may be a read only memory, a magnetic disk or an optical disk or the like.

Abstract

Disclosed are a video generation method and apparatus, which belong to the technical field of computers. The method comprises: playing an accompanying audio of a song to be recorded, displaying a lyric subtitle corresponding to the accompanying audio, and performing video image shooting and audio recording; after video image shooting and audio recording are completed, displaying options of at least one pre-stored video special-effect combination, and displaying a synthetic key; when a selection instruction for a first video special-effect combination of at least one video special-effect combination is received, according to the first video special-effect combination, performing special-effect combination processing on a shot video image and displaying the processed video image, and at the same time, playing the accompanying audio and the recorded audio; and when a click instruction for the synthetic key is received, synthesizing the accompanying audio, the recorded audio and the processed video image to obtain a synthesized video. By means of the present invention, the flexibility of song video recording can be enhanced.

Description

一种生成视频的方法和装置Method and device for generating video
本申请要求于2015年5月4日提交中国专利局、申请号为201510221018.9、发明名称为“一种生成视频的方法和装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。The present application claims priority to Chinese Patent Application No. 201510221018.9, entitled "A Method and Apparatus for Generating Video", filed on May 4, 2015, the entire contents of in.
技术领域Technical field
本发明实施例涉及计算机技术领域,特别涉及一种生成视频的方法和装置。The embodiments of the present invention relate to the field of computer technologies, and in particular, to a method and an apparatus for generating a video.
背景技术Background technique
随着计算机技术的发展,手机、计算机等终端得到了广泛的应用,相应的终端上的应用程序的种类越来越多、功能越来越丰富。歌唱类应用程序(或称K歌类应用程序)是一种很常用的娱乐应用程序。With the development of computer technology, mobile phones, computers and other terminals have been widely used, and the types of applications on the corresponding terminals are more and more and more functions are becoming more and more abundant. The singing application (or karaoke application) is a very popular entertainment application.
用户可以通过歌唱类应用程序进行歌曲的录制,在录制歌曲的同时,还可以进行视频图像的拍摄,得到配有视频的歌曲。Users can record songs through the singing application. While recording songs, they can also take video images and get songs with videos.
在实现本发明实施例的过程中,发明人发现相关技术至少存在以下问题:In the process of implementing the embodiments of the present invention, the inventors have found that the related art has at least the following problems:
基于上述录制歌曲的过程,歌曲中配有的视频一般只是对摄像头拍摄内容的简单呈现,进行歌曲视频录制的灵活性较差。Based on the above process of recording a song, the video provided in the song is generally a simple presentation of the content of the camera, and the flexibility of recording the video of the song is poor.
发明内容Summary of the invention
为了解决相关技术的问题,本发明实施例提供了一种生成视频的方法和装置。所述技术方案如下:In order to solve the problems of the related art, an embodiment of the present invention provides a method and apparatus for generating a video. The technical solution is as follows:
第一方面,提供了一种生成视频的方法,所述方法包括:In a first aspect, a method of generating a video is provided, the method comprising:
播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;Playing the accompaniment audio of the song to be recorded, displaying the lyrics subtitle corresponding to the accompaniment audio, and performing video image shooting and audio recording;
在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;After the capturing of the video image and the recording of the audio are ended, displaying an option of the at least one video effect combination stored in advance, and displaying the composite button; wherein the video effect combination includes at least one filter and/or at least one Foreground video
当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指 令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;Selecting a selection finger for the first video effect combination in the at least one video effect combination Timing, according to the first video special effect combination, performing combined special effect processing on the captured video image, and displaying the processed video image while playing the accompaniment audio and the recorded audio;
当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。When the click command for the composite button is received, the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
第二方面,提供了一种生成视频的装置,所述装置包括:In a second aspect, an apparatus for generating a video is provided, the apparatus comprising:
播放模块,用于播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;a playing module, configured to play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording;
显示模块,用于在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;a display module, configured to display an option of pre-stored at least one video special effect combination after the capturing of the video image and the recording of the audio, and display the composite button; wherein the video special effect combination includes at least one filter And/or at least one foreground video;
预览模块,用于当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;a preview module, configured to perform combined special effect processing on the captured video image according to the first video special effect combination when receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, and processing The subsequent video image is displayed while playing the accompaniment audio and the recorded audio;
合成模块,用于当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。And a synthesizing module, configured to synthesize the accompaniment audio, the recorded audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
第三方面,提供了一种终端,所述终端包括:In a third aspect, a terminal is provided, where the terminal includes:
一个或多个处理器;和One or more processors; and
存储器;Memory
所述存储器存储有一个或多个程序,所述一个或多个程序被配置成由所述一个或多个处理器执行,所述一个或多个程序包含用于进行以下操作的指令:The memory stores one or more programs, the one or more programs being configured to be executed by the one or more processors, the one or more programs including instructions for:
播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;Playing the accompaniment audio of the song to be recorded, displaying the lyrics subtitle corresponding to the accompaniment audio, and performing video image shooting and audio recording;
在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;After the capturing of the video image and the recording of the audio are ended, displaying an option of the at least one video effect combination stored in advance, and displaying the composite button; wherein the video effect combination includes at least one filter and/or at least one Foreground video
当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;And when receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image Displaying, playing the accompaniment audio and recorded audio simultaneously;
当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。 When the click command for the composite button is received, the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
本发明实施例提供的技术方案带来的有益效果是:The beneficial effects brought by the technical solutions provided by the embodiments of the present invention are:
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频。当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效组合处理,从而可以增强歌曲视频录制的灵活性。In the embodiment of the present invention, the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed. An option of at least one video effect combination and displaying a composite button, wherein the video effect combination includes at least one filter and/or at least one foreground video. When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, combining the special effect processing on the captured video image according to the first video special effect combination, and displaying the processed video image while simultaneously displaying The accompaniment audio and the recorded audio are played, and when the click command for the composite button is received, the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video. The video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is subjected to a combination of video effects, thereby enhancing the flexibility of video recording of the song.
附图说明DRAWINGS
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly described below. It is obvious that the drawings in the following description are only some embodiments of the present invention. Other drawings may also be obtained from those of ordinary skill in the art in light of the inventive work.
图1是本发明实施例提供的一种生成视频的方法流程图;FIG. 1 is a flowchart of a method for generating a video according to an embodiment of the present invention;
图2是本发明实施例提供的界面显示示意图;2 is a schematic diagram of interface display according to an embodiment of the present invention;
图3是本发明实施例提供的界面显示示意图;3 is a schematic diagram of an interface display according to an embodiment of the present invention;
图4是本发明实施例提供的界面显示示意图;4 is a schematic diagram of interface display according to an embodiment of the present invention;
图5是本发明实施例提供的一种生成视频装置的结构示意图;FIG. 5 is a schematic structural diagram of a video generating apparatus according to an embodiment of the present invention;
图6是本发明实施例提供的一种终端的结构示意图。FIG. 6 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
具体实施方式detailed description
为使本发明的目的、技术方案和优点更加清楚,下面将结合附图对本发明实施方式作进一步地详细描述。The embodiments of the present invention will be further described in detail below with reference to the accompanying drawings.
本发明实施例提供了一种生成视频的方法,如图1所示,该方法的处理流程可以包括如下的步骤:An embodiment of the present invention provides a method for generating a video. As shown in FIG. 1 , the processing procedure of the method may include the following steps:
步骤101,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制。 Step 101: Play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording.
步骤102,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频。Step 102: After the shooting of the video image and the recording of the audio are finished, displaying an option of the pre-stored combination of the at least one video special effect, and displaying the composite button; wherein the video special effect combination includes at least one filter and/or at least one foreground video. .
步骤103,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频。Step 103: When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image. Display, playing accompaniment audio and recorded audio simultaneously.
步骤104,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。Step 104: When receiving a click command for the composite button, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video.
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频。当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效组合处理,从而可以增强歌曲视频录制的灵活性。In the embodiment of the present invention, the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed. An option of at least one video effect combination and displaying a composite button, wherein the video effect combination includes at least one filter and/or at least one foreground video. When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, combining the special effect processing on the captured video image according to the first video special effect combination, and displaying the processed video image while simultaneously displaying The accompaniment audio and the recorded audio are played, and when the click command for the composite button is received, the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video. The video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is subjected to a combination of video effects, thereby enhancing the flexibility of video recording of the song.
本发明实施例提供了一种生成视频的方法,该方法的执行主体为终端。其中,终端可以是具有视频拍摄功能的任意终端,比如带有摄像头的手机、平板电脑等移动终端,终端上可以安装有用于歌曲和视频录制的应用程序。该终端中可以设置有处理器、存储器,处理器可以用于对视频图像和音频进行处理,存储器可以用于存储下述处理过程中需要的数据以及产生的数据。可以设置有摄像头、麦克风、屏幕、音频输出设备等输入输出设备,摄像头可以用于视频图像的拍摄,麦克风可以用于音频的录制,屏幕可以用于视频、歌词字幕等的显示,可以是触控式的屏幕,音频输出设备可以用于音频的播放,可以是耳机或喇叭等。本实施例中,以终端为手机为例,进行方案的详细说明,其它情况与之类似,本实施例不再累述。The embodiment of the invention provides a method for generating a video, and the execution body of the method is a terminal. The terminal may be any terminal with a video capture function, such as a mobile phone with a camera, a tablet, and the like, and an application for song and video recording may be installed on the terminal. The terminal may be provided with a processor and a memory, and the processor may be used for processing video images and audio, and the memory may be used for storing data required in the following processing and generated data. It can be equipped with input and output devices such as camera, microphone, screen, audio output device, camera can be used for video image shooting, microphone can be used for audio recording, screen can be used for video, lyrics subtitles, etc. Type of screen, audio output device can be used for audio playback, can be headphones or speakers. In this embodiment, the terminal is a mobile phone as an example, and a detailed description of the solution is performed. Other situations are similar, and the embodiment is not described in detail.
下面将结合具体实施方式,对图1所示的处理流程进行详细的说明,内容可以如下: The processing flow shown in FIG. 1 will be described in detail below with reference to specific implementations, and the content can be as follows:
步骤101,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制。Step 101: Play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording.
其中,待录制歌曲可以是用户想要进行K歌的歌曲或歌曲片段。The song to be recorded may be a song or a song fragment that the user wants to make a K song.
在实施中,用户可以在其终端上安装上述应用程序,并操作启动该应用程序,此时可以触发终端显示应用程序的主界面,在主界面中可以显示有点歌按键,用户点击点歌按键后,可以触发终端切换至应用程序相应的歌曲选择界面,在歌曲选择界面中可以显示歌曲列表,用户可以在其中选择自己喜欢的歌曲(即后面提到的目标歌曲)。歌曲列表中可以显示本地存储有伴奏文件的歌曲,还可以显示网络侧存储有伴奏文件的歌曲。用户在歌曲列表中选择某歌曲之后,终端可以将该歌曲作为待录制歌曲,并可以显示该歌曲的录制界面,录制界面中可以显示有录制按键,用户点击录制按键,则可以触发终端调取该歌曲的伴奏文件,并运行该伴奏文件,以播放该歌曲的伴奏音频,并在录制界面显示该歌曲对应的歌词字幕,同时,终端可以开始通过终端的前置摄像头拍摄视频图像,并通过麦克风录制音频。此外,通过终端的前置摄像头拍摄的视频图像可以实时传输至屏幕进行显示,用户可以基于屏幕显示的图像对终端的位置进行调节。In the implementation, the user can install the above application on the terminal and operate the application, and then trigger the terminal to display the main interface of the application, in the main interface, a little song button can be displayed, and the user clicks the song button. After that, the terminal can be triggered to switch to the corresponding song selection interface of the application, and the song selection interface can display a list of songs in which the user can select the song he likes (ie, the target song mentioned later). Songs with accompaniment files stored locally can be displayed in the song list, and songs with accompaniment files stored on the network side can also be displayed. After the user selects a song in the song list, the terminal can use the song as the song to be recorded, and can display the recording interface of the song. The recording interface can display a recording button. When the user clicks the recording button, the terminal can trigger the terminal to retrieve the song. The accompaniment file of the song, and the accompaniment file is run to play the accompaniment audio of the song, and the lyrics subtitle corresponding to the song is displayed on the recording interface, and the terminal can start capturing the video image through the front camera of the terminal and recording through the microphone. Audio. In addition, the video image captured by the front camera of the terminal can be transmitted to the screen for display in real time, and the user can adjust the position of the terminal based on the image displayed on the screen.
可选的,用户可以在目标歌曲中选择自己喜欢的片段作为待录制歌曲,相应的,在步骤101之前可以进行如下处理:确定用户在目标歌曲中选取的歌曲片段。相应的在步骤101中,可以播放该歌曲片段的伴奏音频。Optionally, the user can select a favorite clip as the song to be recorded in the target song. Correspondingly, before step 101, the following processing may be performed: determining the song segment selected by the user in the target song. Correspondingly in step 101, the accompaniment audio of the song segment can be played.
在实施中,用户在上述歌曲列表中选择某歌曲(即目标歌曲)之后,用户可以在该歌曲中选取想要录制的歌曲片段,进而,终端可以获得用户在目标歌曲中选取的歌曲片段。In the implementation, after the user selects a certain song (ie, the target song) in the song list, the user can select a song segment to be recorded in the song, and further, the terminal can obtain the song segment selected by the user in the target song.
可选的,用户在目标歌曲中选取歌曲片段的处理方式可以多种多样,以下给出了几种可选的处理方式:Optionally, the user can select a song segment in the target song in a variety of ways, and several alternative treatments are given below:
方式一,显示目标歌曲的歌词列表;获取用户在歌词列表中设置的歌曲片段的起始点和终止点;根据起始点和终止点,确定用户在目标歌曲中选取的歌曲片段。In the first mode, the lyrics list of the target song is displayed; the starting point and the ending point of the song segment set by the user in the lyrics list are obtained; and the song segment selected by the user in the target song is determined according to the starting point and the ending point.
在实施中,用户可以拖动歌词中显示的起始线和终止线,起始线和终止线之间的歌词内容对应的歌曲片段即为用户选取的歌曲片段。具体的,如图2所示,录制界面中可以显示目标歌曲的歌词列表,还可以在歌词列表中显示有起始线和终止线,用户可以通过上下拖动上述的起始线和终止线在歌词列表中选 取自己喜欢的歌曲片段,起始线下方的歌词为用户所选取歌曲片段的起始歌词(起始句),终止线上方的歌词为用户所选取歌曲片段的终止歌词(终止句),然后,用户可以点击录制按键,此时可以触发终端获取上述的起始歌词的开始时间点和终止歌词的结束时间点,作为歌曲片段起始点和终止点,根据起始点和终止点可以确定在目标歌曲中选取的歌曲片段。进而,终端可以播放该歌曲片段的伴奏音频。In the implementation, the user can drag the start line and the end line displayed in the lyrics, and the song piece corresponding to the lyric content between the start line and the end line is the song piece selected by the user. Specifically, as shown in FIG. 2, the lyric list of the target song may be displayed in the recording interface, and the start line and the end line may be displayed in the lyric list, and the user may drag the above start line and end line up and down. Selected from the list of lyrics Take the song fragment you like, the lyrics below the starting line are the starting lyrics (starting sentence) of the song segment selected by the user, and the lyrics above the ending line are the ending lyrics (terminating sentence) of the song segment selected by the user, and then The user can click the record button, and the terminal can be triggered to obtain the start time point of the start lyrics and the end time point of terminating the lyrics as the start point and the end point of the song segment, and can be determined in the target song according to the start point and the end point. Selected song clips. Further, the terminal can play the accompaniment audio of the song segment.
另外,在应用程序中,还可以预先设置歌曲片段的时长上限,如30秒。如果上述起始点与终止点的时间差大于30秒,则可以设置录制界面中的录制按键进入无法点击状态。此外,在应用程序中还可以预先设置歌曲片段的时长下限,如10秒。如果上述起始点与终止点的时间差小于10秒,则可以设置录制界面中的录制按键进入无法点击状态。In addition, in the application, you can also set the upper limit of the duration of the song clip, such as 30 seconds. If the time difference between the above starting point and the ending point is greater than 30 seconds, the recording button in the recording interface can be set to enter the unclickable state. In addition, the minimum length of the song segment, such as 10 seconds, can be set in advance in the application. If the time difference between the above starting point and the ending point is less than 10 seconds, the recording button in the recording interface can be set to enter the unclickable state.
方式二,显示目标歌曲的播放时间轴;获取用户在播放时间轴中设置的歌曲片段的起始点和终止点;根据起始点和终止点,确定用户在目标歌曲中选取的歌曲片段。In the second mode, the playing time axis of the target song is displayed; the starting point and the ending point of the song segment set by the user in the playing time axis are acquired; and the song segment selected by the user in the target song is determined according to the starting point and the ending point.
在实施中,录制界面中可以显示目标歌曲的播放时间轴和录制按键,在显示的播放时间轴上还可以显示位于不同位置的两条线,用户可以通过拖动这两条线来选取自己喜欢的歌曲片段,选择后,用户可以点击录制界面中的录制按键,此时可以触发终端获取上述两条线所在的播放时间点,这两个播放时间点即为用户选取的歌曲片段的起始点和终止点,终端可以在目标歌曲中获取起始点和终止点之间的歌曲片段,进而,播放该歌曲片段的伴奏音频。In the implementation, the recording interface can display the playing time axis of the target song and the recording button, and the two lines located at different positions can also be displayed on the displayed playing time axis, and the user can select the favorite by dragging the two lines. After the selection, the user can click the recording button in the recording interface, and the terminal can be triggered to obtain the playing time point of the two lines, and the two playing time points are the starting point of the song segment selected by the user. At the termination point, the terminal can acquire a song segment between the start point and the end point in the target song, and then play the accompaniment audio of the song segment.
可选的,在开始录制之前,在录制界面中还可以显示有多种滤镜的选项,如图3所示,用户可以在其中选择一种滤镜,用于对拍摄的视频图像进行实时处理。用户在选择滤镜后,可以点击录制按键开始录制,应用程序则可以根据用户选取的滤镜对拍摄到的每一个图像帧进行相应的图像处理,将滤镜处理后的视频图像输出到屏幕进行显示,并对其进行编码,实时保存到文件中。Optionally, before starting recording, a variety of filter options can also be displayed in the recording interface, as shown in Figure 3, where the user can select a filter for real-time processing of the captured video image. . After selecting the filter, the user can click the record button to start recording, and the application can perform corresponding image processing on each image frame captured according to the filter selected by the user, and output the filtered video image to the screen. Display and encode it and save it to a file in real time.
步骤102,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,视频特效组合包括至少一个滤镜和/或至少一个前景视频。Step 102: After the shooting of the video image and the recording of the audio are finished, displaying an option of the at least one video special effect combination stored in advance, and displaying the composite button; wherein the video special effect combination includes at least one filter and/or at least one foreground video.
其中,视频特效组合是用于对视频图像进行组合处理的多种视频特效,视频特效可以是滤镜、前景视频等,滤镜可以是用于对视频图像中各像素的像素值进行调整以达到某种特定视觉效果的工具,如黑白滤镜、古朴色滤镜等,前 景视频可以是悬浮显示在视频图像上层的视频。The video special effect combination is a plurality of video special effects for combining video images, the video special effects may be a filter, a foreground video, etc., and the filter may be used to adjust pixel values of each pixel in the video image to achieve Tools for a specific visual effect, such as black and white filters, quaint filters, etc. The scene video can be a video that is hovering over the top of the video image.
在实施中,当待录制歌曲的伴奏音频的最后一个音频帧播放完毕时,或者用户在伴奏音频播放过程中点击录制界面中的结束按键时,视频图像的拍摄和音频的录制结束,终端可以相应的切换至应用程序的预览界面,如图4所示,预览界面中可以显示本地预先存储的一个或多个视频特效组合的选项,其中的视频特效组合可以在视频图像的不同时间段使用不同的滤镜和/或前景视频,也可以在相同的时间段使用不同的滤镜和/或前景视频。上述应用程序中可以记录有每种视频特效组合对应的处理信息,处理信息中可以包括视频特效组合中每个滤镜、前景视频的开始时间点和结束时间点。另外,在上述的预览界面中可以显示有合成按键,用于对视频图像、音频进行合成。In the implementation, when the last audio frame of the accompaniment audio of the song to be recorded is finished playing, or when the user clicks the end button in the recording interface during the accompaniment audio playing, the shooting of the video image and the recording of the audio are ended, and the terminal can correspondingly Switching to the preview interface of the application, as shown in FIG. 4, the preview interface may display an option of one or more combinations of video effects pre-stored locally, wherein the video effect combination may use different different time segments of the video image. Filters and/or foreground video can also use different filters and/or foreground video for the same time period. The processing information corresponding to each video special effect combination may be recorded in the above application, and the processing information may include each filter in the video special effect combination, a start time point and an end time point of the foreground video. In addition, a composite button for synthesizing the video image and the audio may be displayed in the preview interface described above.
步骤103,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频。Step 103: When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image. Display, playing accompaniment audio and recorded audio simultaneously.
在实施中,用户根据预览界面显示的视频特效组合的选项,可以选择自己喜欢的一个视频特效组合,用户点击其中的一个视频特效组合的选项时,终端将会接收到对该视频特效组合(即第一视频特效组合)的选择指令,此时,终端可以获取其存储的该视频特效组合的处理信息,根据处理信息中的每个滤镜、前景视频的开始时间点和结束时间点,对每个滤镜、前景视频所涉及的视频帧进行处理,并实时对处理后的视频图像输出到屏幕上进行显示。例如,第一视频特效组合中一个黑白滤镜的开始时间点是第5秒,结束时间点是第13秒,当终端根据第一视频特效组合对拍摄的视频图像进行组合特效处理时,可以将拍摄的图像的第5秒到第13秒的视频帧进行黑白滤镜处理。In the implementation, the user can select a video effect combination that he or she likes according to the option of the video special effect combination displayed on the preview interface. When the user clicks on one of the video effect combination options, the terminal will receive the video special effect combination (ie, The selection instruction of the first video special effect combination, at this time, the terminal can acquire the processing information of the video special effect combination stored therein, according to each filter in the processing information, the start time point and the end time point of the foreground video, for each The video frames involved in the filter and foreground video are processed, and the processed video images are output to the screen for display in real time. For example, the start time point of a black and white filter in the first video special effect combination is the 5th second, and the end time point is the 13th second. When the terminal performs combined special effects processing on the captured video image according to the first video special effect combination, The video frames from the 5th to 13th second of the captured image are subjected to black and white filter processing.
在对视频图像进行处理的同时,可以获取伴奏音频和录制的音频,根据每个视频帧的时间以及伴奏音频和录制的音频中每个音频帧的时间,对音视频进行同步播放。While the video image is being processed, the accompaniment audio and the recorded audio can be acquired, and the audio and video are played synchronously according to the time of each video frame and the time of each audio frame in the accompaniment audio and the recorded audio.
若预览界面中显示了多个视频特效组合的选项,用户选择了其中的一个视频特效组合预览后,用户还可以选择其他的视频特效组合进行预览,最终选择一个自己最喜欢的视频特效组合。If the preview interface displays multiple video effects combination options, after the user selects one of the video effects combination previews, the user can also select other video effects combinations for preview, and finally select a favorite video effect combination.
步骤104,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。Step 104: When receiving a click command for the composite button, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video.
在实施中,用户选择视频特效组合并预览后可以点击上述预览界面中的合 成按键,终端将会接收到对上述合成按键的点击指令,终端可以获取经过视频特效组合处理的视频图像,将其进行ffmpeg(是一套可以用来记录、转换数字音频、视频,并能将其转化为流的开源计算机程序)编码,此外,终端还可以获取伴奏音频和录制的音频,并对其进行音频编码,得到编码后的音频,之后将编码后的视频图像与编码后的音频进行ffmpeg合成,得到合成视频。In the implementation, after the user selects the video effect combination and previews, the user can click on the preview interface. When the button is pressed, the terminal will receive the click command for the above composite button, and the terminal can acquire the video image processed by the video special effect and perform ffmpeg (a set can be used for recording, converting digital audio, video, and can It is converted into a stream of open source computer programs) encoding, in addition, the terminal can also acquire the accompaniment audio and recorded audio, and audio-encode it to obtain the encoded audio, and then the encoded video image and the encoded audio. Ffmpeg synthesis, get a composite video.
可选的,用户在录制歌曲时还可以对录制的音频进行特效处理,相应的,还可以进行如下处理:显示至少一个音频特效的选项,当接收到对至少一个音频特效中的第一音频特效的选择指令时,根据第一音频特效,对录制的音频进行特效处理,并对处理后的音频进行播放。相应的,步骤104的处理过程可以如下:当接收到对合成按键的点击指令时,将处理后的音频、伴奏音频和处理后的视频图像进行合成,得到合成视频。Optionally, the user may also perform special effects on the recorded audio when recording the song, and correspondingly, the following processing may be performed: displaying an option of at least one audio special effect, when receiving the first audio special effect in the at least one audio special effect When the selection instruction is executed, the recorded audio is subjected to special effects according to the first audio effect, and the processed audio is played. Correspondingly, the processing of step 104 may be as follows: when receiving a click command for the composite button, synthesizing the processed audio, the accompaniment audio, and the processed video image to obtain a composite video.
在实施中,在上述的预览界面还可以显示一个或多个音频特效的选项,如娃娃音、留声机等音频特效,用户可以在显示的音频特效的选项中选择一个自己喜欢的音频特效(即第一音频特效),此时,终端将会接收到对该音频特效的选择指令,根据所选择的音频特效,对所录制的音频进行特效处理,并在预览界面对处理后的音频进行播放供用户进行预览。用户选择音频特效并预览后可以点击上述预览界面中显示的合成按键,可以触发终端获取编码后的视频图像,此外,终端还可以获取伴奏音频和处理后的音频并将其进行编码,得到编码后的音频,之后将编码后的视频图像与编码后的音频进行ffmpeg合成,得到合成视频。In the implementation, the above preview interface can also display one or more audio effects options, such as doll sounds, phonographs and other audio effects, the user can select an audio effect that he likes in the options of the displayed audio effects (ie, the first An audio effect), at this time, the terminal will receive a selection instruction for the audio effect, perform special effects processing on the recorded audio according to the selected audio special effect, and play the processed audio on the preview interface for the user. Preview it. After selecting the audio effect and previewing, the user can click the composite button displayed in the preview interface to trigger the terminal to obtain the encoded video image. In addition, the terminal can also acquire the accompaniment audio and the processed audio and encode it to obtain the coded image. The audio is then ffmpeg synthesized by the encoded video image and the encoded audio to obtain a composite video.
可选的,用户在录制歌曲时,还可以对录制的音频进行编辑处理,相应的,还可以进行如下处理:显示至少一个音频编辑处理的选项;当接收到对至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对录制的音频执行第一音频编辑处理,并对处理后的音频进行播放。相应的,步骤104的处理过程可以如下:当接收到对合成按键的点击指令时,将处理后的音频、伴奏音频和处理后的视频图像进行合成,得到合成视频。Optionally, when the user records the song, the recorded audio may also be edited, and correspondingly, the following processing may be performed: displaying at least one option of audio editing processing; when receiving the first in at least one audio editing process When an audio editing process selects an instruction, the first audio editing process is performed on the recorded audio, and the processed audio is played. Correspondingly, the processing of step 104 may be as follows: when receiving a click command for the composite button, synthesizing the processed audio, the accompaniment audio, and the processed video image to obtain a composite video.
在实施中,在上述的预览界面还可以显示一个或多个音频编辑处理的选项,比如调大音量、调小音量、人声向前移动、人声向后移动和去噪等选项,用户可以在显示的一个或多个音频编辑处理选项中选择其中一个音频编辑处理(即第一音频编辑处理),具体的,预览界面可以显示有音量调整、人声移动和去噪等选项,当用户需要对录制的音频进行音量调整时,可以点击音量调 整选项,此时,将会触发终端显示音量调整轴(音量从左到由逐渐增大,或,音量从下到上逐渐增大),在显示的音量调整轴上还可以显示有音量调整线,用户可以通过移动该音量调整线的位置调整音量的大小。当用户点击人声移动选项时,终端将会接收到用户对人声移动选项的选取指令,可以显示时间轴,在显示的时间轴上还可以显示有位于时间轴中间位置的人声移动线,用户可以通过移动该人声移动线的位置调整移动的大小,即当人声移动线向前移动时,终端可以将录制的音频向前移动,当人声移动线向后移动时,终端可以将录制的音频向后移动,这样,可以避免在录制的音频错位时重新录制音频。当用户点击去噪选项时,终端将会接收到对去噪选项的选取指令,进而,可以对录制的音频进行去噪处理。用户选择第一音频编辑处理选项后,可以点击预览界面中的预览按键,此时可以触发终端对录制的音频执行第一音频编辑处理,并对处理后的音频进行播放。用户选择音频编辑处理并预览后可以点击上述预览界面中显示的合成按键,可以触发终端获取编码后的视频图像,此外,终端还可以获取伴奏音频和处理后的音频并将其进行编码,得到编码后的音频,之后将编码后的视频图像与编码后的音频进行ffmpeg合成,得到合成视频。In the implementation, the above preview interface may also display one or more options for audio editing processing, such as adjusting the volume, turning down the volume, moving the vocal forward, moving the vocal backwards and denoising, etc., the user may Select one of the audio editing processing options (ie, the first audio editing processing) in one or more of the displayed audio editing processing options. Specifically, the preview interface may display options such as volume adjustment, vocal movement, and denoising when the user needs When you adjust the volume of the recorded audio, you can click on the volume The whole option, at this time, will trigger the terminal to display the volume adjustment axis (the volume gradually increases from left to right, or the volume gradually increases from bottom to top), and the volume adjustment line can also be displayed on the displayed volume adjustment axis. The user can adjust the volume by moving the position of the volume adjustment line. When the user clicks the vocal movement option, the terminal will receive the user's selection instruction for the vocal movement option, and the time axis can be displayed, and the vocal moving line at the middle position of the time axis can also be displayed on the displayed time axis. The user can adjust the size of the movement by moving the position of the vocal moving line, that is, when the vocal moving line moves forward, the terminal can move the recorded audio forward, and when the vocal moving line moves backward, the terminal can The recorded audio moves backwards, which avoids re-recording the audio when the recorded audio is misaligned. When the user clicks the denoising option, the terminal will receive a selection instruction for the denoising option, and then the denoised processing of the recorded audio can be performed. After the user selects the first audio editing processing option, the preview button in the preview interface can be clicked, and the terminal can be triggered to perform the first audio editing process on the recorded audio, and play the processed audio. After the user selects the audio editing process and previews, the synthesized button displayed in the preview interface can be clicked to trigger the terminal to obtain the encoded video image. In addition, the terminal can also acquire the accompaniment audio and the processed audio and encode the code to obtain the code. After the audio, the encoded video image is then ffmpeg synthesized with the encoded audio to obtain a composite video.
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效处理,从而可以增强歌曲视频录制的灵活性。In the embodiment of the present invention, the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed. At least one video effect combination option and displaying a composite button, wherein the video effect combination includes at least one filter and/or at least one foreground video, when receiving a selection of a first video effect combination in the at least one video effect combination When instructing, according to the first video special effect combination, the combined video effect is performed on the captured video image, and the processed video image is displayed, and the accompaniment audio and the recorded audio are played simultaneously, when receiving the click command on the composite button , the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video. The video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is processed by the video special effect, thereby enhancing the flexibility of the video recording of the song.
基于相同的技术构思,本发明实施例还提供了一种生成视频的装置,如图5所示,该装置包括:Based on the same technical concept, an embodiment of the present invention further provides a device for generating a video. As shown in FIG. 5, the device includes:
播放模块510,用于播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;The playing module 510 is configured to play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image capturing and audio recording;
显示模块520,用于在所述视频图像的拍摄和所述音频的录制结束后,显 示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;a display module 520, configured to display after the shooting of the video image and the recording of the audio An option of pre-stored at least one video effect combination is displayed, and a composite button is displayed; wherein the video effect combination includes at least one filter and/or at least one foreground video;
预览模块530,用于当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;The preview module 530 is configured to: when receiving the selection instruction for the first video special effect combination in the at least one video special effect combination, perform combined special effect processing on the captured video image according to the first video special effect combination, and The processed video image is displayed while playing the accompaniment audio and the recorded audio;
合成模块540,用于当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。The synthesizing module 540 is configured to synthesize the accompaniment audio, the recorded audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
可选的,所述,还包括确定模块,用于在播放待录制歌曲的伴奏音频之前,确定用户在目标歌曲中选取的歌曲片段;Optionally, the method further includes a determining module, configured to determine a song segment selected by the user in the target song before playing the accompaniment audio of the song to be recorded;
所述播放模块510,用于:播放所述歌曲片段的伴奏音频。The playing module 510 is configured to: play the accompaniment audio of the song segment.
可选的,所述确定模块,用于:Optionally, the determining module is configured to:
显示目标歌曲的歌词列表;Display a list of lyrics of the target song;
获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the lyrics list;
根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
可选的,所述确定模块,用于:Optionally, the determining module is configured to:
显示目标歌曲的播放时间轴;Display the playback timeline of the target song;
获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the playing time axis;
根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
可选的,所述显示模块520,还用于显示至少一个音频特效的选项;Optionally, the display module 520 is further configured to display an option of at least one audio effect;
所述预览模块530,还用于当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行播放;The preview module 530 is further configured to perform special effect processing on the recorded audio according to the first audio special effect when receiving a selection instruction for the first audio special effect in the at least one audio special effect, and The processed audio is played;
所述合成模块540,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。The synthesizing module 540 is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
可选的,所述显示模块520,还用于显示至少一个音频编辑处理的选项;Optionally, the display module 520 is further configured to display an option of at least one audio editing process;
所述预览模块530,还用于当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处 理,并对处理后的音频进行播放;The preview module 530 is further configured to: when the selection instruction of the first audio editing process in the at least one audio editing process is received, perform the first audio editing on the recorded audio And play the processed audio;
所述合成模块540,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。The synthesizing module 540 is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效处理,从而可以增强歌曲视频录制的灵活性。In the embodiment of the present invention, the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed. At least one video effect combination option and displaying a composite button, wherein the video effect combination includes at least one filter and/or at least one foreground video, when receiving a selection of a first video effect combination in the at least one video effect combination When instructing, according to the first video special effect combination, the combined video effect is performed on the captured video image, and the processed video image is displayed, and the accompaniment audio and the recorded audio are played simultaneously, when receiving the click command on the composite button , the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video. The video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is processed by the video special effect, thereby enhancing the flexibility of the video recording of the song.
需要说明的是:上述实施例提供的生成视频的装置在生成视频时,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将设备的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。另外,上述实施例提供的生成视频的装置与生成视频的方法实施例属于同一构思,其具体实现过程详见方法实施例,这里不再赘述。It should be noted that the device for generating a video provided by the foregoing embodiment is only illustrated by dividing the foregoing functional modules when generating a video. In an actual application, the function allocation may be completed by different functional modules as needed. The internal structure of the device is divided into different functional modules to complete all or part of the functions described above. In addition, the device for generating a video provided by the foregoing embodiment is the same as the method for generating a video. The specific implementation process is described in detail in the method embodiment, and details are not described herein again.
请参考图6,其示出了本发明实施例所涉及的终端的结构示意图,该终端可以用于实施上述实施例中提供的生成视频的方法。具体来讲:Please refer to FIG. 6 , which is a schematic structural diagram of a terminal according to an embodiment of the present invention. The terminal may be used to implement the method for generating video provided in the foregoing embodiment. Specifically:
终端900可以包括RF(Radio Frequency,射频)电路110、包括有一个或一个以上计算机可读存储介质的存储器120、输入单元130、显示单元140、传感器150、音频电路160、WiFi(wireless fidelity,无线保真)模块170、包括有一个或者一个以上处理核心的处理器180、以及电源190等部件。本领域技术人员可以理解,图6中示出的终端结构并不构成对终端的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。其中:The terminal 900 may include an RF (Radio Frequency) circuit 110, a memory 120 including one or more computer readable storage media, an input unit 130, a display unit 140, a sensor 150, an audio circuit 160, and a WiFi (wireless fidelity, wireless). The fidelity module 170 includes a processor 180 having one or more processing cores, and a power supply 190 and the like. It will be understood by those skilled in the art that the terminal structure shown in FIG. 6 does not constitute a limitation to the terminal, and may include more or less components than those illustrated, or a combination of certain components, or different component arrangements. among them:
RF电路110可用于收发信息或通话过程中,信号的接收和发送,特别地,将基站的下行信息接收后,交由一个或者一个以上处理器180处理;另外,将 涉及上行的数据发送给基站。通常,RF电路110包括但不限于天线、至少一个放大器、调谐器、一个或多个振荡器、用户身份模块(SIM)卡、收发信机、耦合器、LNA(Low Noise Amplifier,低噪声放大器)、双工器等。此外,RF电路110还可以通过无线通信与网络和其他设备通信。所述无线通信可以使用任一通信标准或协议,包括但不限于GSM(Global System of Mobile communication,全球移动通讯系统)、GPRS(General Packet Radio Service,通用分组无线服务)、CDMA(Code Division Multiple Access,码分多址)、WCDMA(Wideband Code Division Multiple Access,宽带码分多址)、LTE(Long Term Evolution,长期演进)、电子邮件、SMS(Short Messaging Service,短消息服务)等。The RF circuit 110 can be used for transmitting and receiving information or during a call, receiving and transmitting signals, and in particular, receiving downlink information of the base station and then processing it by one or more processors 180; The data related to the uplink is sent to the base station. Generally, the RF circuit 110 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a Subscriber Identity Module (SIM) card, a transceiver, a coupler, an LNA (Low Noise Amplifier). , duplexer, etc. In addition, RF circuitry 110 can also communicate with the network and other devices via wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System of Mobile communication), GPRS (General Packet Radio Service), CDMA (Code Division Multiple Access). , Code Division Multiple Access), WCDMA (Wideband Code Division Multiple Access), LTE (Long Term Evolution), e-mail, SMS (Short Messaging Service), and the like.
存储器120可用于存储软件程序以及模块,处理器180通过运行存储在存储器120的软件程序以及模块,从而执行各种功能应用以及数据处理。存储器120可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据终端900的使用所创建的数据(比如音频数据、电话本等)等。此外,存储器120可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。相应地,存储器120还可以包括存储器控制器,以提供处理器180和输入单元130对存储器120的访问。The memory 120 can be used to store software programs and modules, and the processor 180 executes various functional applications and data processing by running software programs and modules stored in the memory 120. The memory 120 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may be stored according to The data created by the use of the terminal 900 (such as audio data, phone book, etc.) and the like. Moreover, memory 120 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device. Accordingly, memory 120 may also include a memory controller to provide access to memory 120 by processor 180 and input unit 130.
输入单元130可用于接收输入的数字或字符信息,以及产生与用户设置以及功能控制有关的键盘、鼠标、操作杆、光学或者轨迹球信号输入。具体地,输入单元130可包括触敏表面131以及其他输入设备132。触敏表面131,也称为触摸显示屏或者触控板,可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触敏表面131上或在触敏表面131附近的操作),并根据预先设定的程式驱动相应的连接装置。可选的,触敏表面131可包括触摸检测装置和触摸控制器两个部分。其中,触摸检测装置检测用户的触摸方位,并检测触摸操作带来的信号,将信号传送给触摸控制器;触摸控制器从触摸检测装置上接收触摸信息,并将它转换成触点坐标,再送给处理器180,并能接收处理器180发来的命令并加以执行。此外,可以采用电阻式、电容式、红外线以及表面声波等多种类型实现触敏表面131。除了触敏表面131,输入单元130还可以包括其他输入设备132。具体地,其他输入设备 132可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆等中的一种或多种。The input unit 130 can be configured to receive input numeric or character information and to generate keyboard, mouse, joystick, optical or trackball signal inputs related to user settings and function controls. In particular, input unit 130 can include touch-sensitive surface 131 as well as other input devices 132. Touch-sensitive surface 131, also referred to as a touch display or trackpad, can collect touch operations on or near the user (such as a user using a finger, stylus, etc., on any suitable object or accessory on touch-sensitive surface 131 or The operation near the touch-sensitive surface 131) and driving the corresponding connecting device according to a preset program. Alternatively, the touch-sensitive surface 131 can include two portions of a touch detection device and a touch controller. Wherein, the touch detection device detects the touch orientation of the user, and detects a signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts the touch information into contact coordinates, and sends the touch information. The processor 180 is provided and can receive commands from the processor 180 and execute them. In addition, the touch-sensitive surface 131 can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic waves. In addition to the touch-sensitive surface 131, the input unit 130 can also include other input devices 132. Specifically, other input devices 132 may include, but is not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like.
显示单元140可用于显示由用户输入的信息或提供给用户的信息以及终端900的各种图形用户接口,这些图形用户接口可以由图形、文本、图标、视频和其任意组合来构成。显示单元140可包括显示面板141,可选的,可以采用LCD(Liquid Crystal Display,液晶显示器)、OLED(Organic Light-Emitting Diode,有机发光二极管)等形式来配置显示面板141。进一步的,触敏表面131可覆盖显示面板141,当触敏表面131检测到在其上或附近的触摸操作后,传送给处理器180以确定触摸事件的类型,随后处理器180根据触摸事件的类型在显示面板141上提供相应的视觉输出。虽然在图6中,触敏表面131与显示面板141是作为两个独立的部件来实现输入和输入功能,但是在某些实施例中,可以将触敏表面131与显示面板141集成而实现输入和输出功能。The display unit 140 can be used to display information entered by the user or information provided to the user and various graphical user interfaces of the terminal 900, which can be composed of graphics, text, icons, video, and any combination thereof. The display unit 140 may include a display panel 141. Alternatively, the display panel 141 may be configured in the form of an LCD (Liquid Crystal Display), an OLED (Organic Light-Emitting Diode), or the like. Further, the touch-sensitive surface 131 may cover the display panel 141, and when the touch-sensitive surface 131 detects a touch operation thereon or nearby, it is transmitted to the processor 180 to determine the type of the touch event, and then the processor 180 according to the touch event The type provides a corresponding visual output on display panel 141. Although in FIG. 6, touch-sensitive surface 131 and display panel 141 are implemented as two separate components to implement input and input functions, in some embodiments, touch-sensitive surface 131 can be integrated with display panel 141 for input. And output function.
终端900还可包括至少一种传感器150,比如光传感器、运动传感器以及其他传感器。具体地,光传感器可包括环境光传感器及接近传感器,其中,环境光传感器可根据环境光线的明暗来调节显示面板141的亮度,接近传感器可在终端900移动到耳边时,关闭显示面板141和/或背光。作为运动传感器的一种,重力加速度传感器可检测各个方向上(一般为三轴)加速度的大小,静止时可检测出重力的大小及方向,可用于识别手机姿态的应用(比如横竖屏切换、相关游戏、磁力计姿态校准)、振动识别相关功能(比如计步器、敲击)等;至于终端900还可配置的陀螺仪、气压计、湿度计、温度计、红外线传感器等其他传感器,在此不再赘述。Terminal 900 can also include at least one type of sensor 150, such as a light sensor, motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display panel 141 according to the brightness of the ambient light, and the proximity sensor may close the display panel 141 when the terminal 900 moves to the ear. / or backlight. As a kind of motion sensor, the gravity acceleration sensor can detect the magnitude of acceleration in all directions (usually three axes). When it is stationary, it can detect the magnitude and direction of gravity. It can be used to identify the gesture of the mobile phone (such as horizontal and vertical screen switching, related Game, magnetometer attitude calibration), vibration recognition related functions (such as pedometer, tapping), etc.; as for the terminal 900 can also be configured with gyroscopes, barometers, hygrometers, thermometers, infrared sensors and other sensors, here Let me repeat.
音频电路160、扬声器161,传声器162可提供用户与终端900之间的音频接口。音频电路160可将接收到的音频数据转换后的电信号,传输到扬声器161,由扬声器161转换为声音信号输出;另一方面,传声器162将收集的声音信号转换为电信号,由音频电路160接收后转换为音频数据,再将音频数据输出处理器180处理后,经RF电路110以发送给比如另一终端,或者将音频数据输出至存储器120以便进一步处理。音频电路160还可能包括耳塞插孔,以提供外设耳机与终端900的通信。The audio circuit 160, the speaker 161, and the microphone 162 can provide an audio interface between the user and the terminal 900. The audio circuit 160 can transmit the converted electrical data of the received audio data to the speaker 161 for conversion to the sound signal output by the speaker 161; on the other hand, the microphone 162 converts the collected sound signal into an electrical signal by the audio circuit 160. After receiving, it is converted into audio data, and then processed by the audio data output processor 180, transmitted to the terminal, for example, via the RF circuit 110, or outputted to the memory 120 for further processing. The audio circuit 160 may also include an earbud jack to provide communication of the peripheral earphones with the terminal 900.
WiFi属于短距离无线传输技术,终端900通过WiFi模块170可以帮助用户收发电子邮件、浏览网页和访问流式媒体等,它为用户提供了无线的宽带互联网访问。虽然图6示出了WiFi模块170,但是可以理解的是,其并不属于终 端900的必须构成,完全可以根据需要在不改变发明的本质的范围内而省略。WiFi is a short-range wireless transmission technology, and the terminal 900 can help users to send and receive emails, browse web pages, and access streaming media through the WiFi module 170, which provides wireless broadband Internet access for users. Although FIG. 6 shows the WiFi module 170, it can be understood that it does not belong to the end. The necessary configuration of the end 900 can be omitted as needed within the scope of not changing the essence of the invention.
处理器180是终端900的控制中心,利用各种接口和线路连接整个手机的各个部分,通过运行或执行存储在存储器120内的软件程序和/或模块,以及调用存储在存储器120内的数据,执行终端900的各种功能和处理数据,从而对手机进行整体监控。可选的,处理器180可包括一个或多个处理核心;优选的,处理器180可集成应用处理器和调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用程序等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器180中。The processor 180 is a control center of the terminal 900 that connects various portions of the entire handset using various interfaces and lines, by running or executing software programs and/or modules stored in the memory 120, and recalling data stored in the memory 120, The various functions and processing data of the terminal 900 are performed to perform overall monitoring of the mobile phone. Optionally, the processor 180 may include one or more processing cores; preferably, the processor 180 may integrate an application processor and a modem processor, where the application processor mainly processes an operating system, a user interface, an application, and the like. The modem processor primarily handles wireless communications. It can be understood that the above modem processor may not be integrated into the processor 180.
终端900还包括给各个部件供电的电源190(比如电池),优选的,电源可以通过电源管理系统与处理器180逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。电源190还可以包括一个或一个以上的直流或交流电源、再充电系统、电源故障检测电路、电源转换器或者逆变器、电源状态指示器等任意组件。The terminal 900 also includes a power source 190 (such as a battery) that supplies power to the various components. Preferably, the power source can be logically coupled to the processor 180 through a power management system to manage functions such as charging, discharging, and power management through the power management system. Power supply 190 may also include any one or more of a DC or AC power source, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and the like.
尽管未示出,终端900还可以包括摄像头、蓝牙模块等,在此不再赘述。具体在本实施例中,终端900的显示单元是触摸屏显示器,终端900还包括有存储器,以及一个或者一个以上的程序,其中一个或者一个以上程序存储于存储器中,且经配置以由一个或者一个以上处理器执行述一个或者一个以上程序来执行上述各个实施例所述的生成视频的方法。Although not shown, the terminal 900 may further include a camera, a Bluetooth module, and the like, and details are not described herein again. Specifically, in this embodiment, the display unit of the terminal 900 is a touch screen display, the terminal 900 further includes a memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be one or one The above processor executes one or more programs to perform the method of generating a video as described in the various embodiments above.
在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器,上述指令可由移动终端的处理器执行以完成上述生成视频的方法。例如,所述非临时性计算机可读存储介质可以是ROM(Read-Only Memory,只读存储器)、RAM(Random-Access Memory,随机存取存储器)、CD-ROM(Compact Disc Read-Only Memory,光盘只读存储器)、磁带、软盘和光数据存储设备等。In an exemplary embodiment, there is also provided a non-transitory computer readable storage medium comprising instructions, such as a memory comprising instructions executable by a processor of a mobile terminal to perform the method of generating video as described above. For example, the non-transitory computer readable storage medium may be a ROM (Read-Only Memory), a RAM (Random-Access Memory), or a CD-ROM (Compact Disc Read-Only Memory, CD-ROM, tape, floppy disk and optical data storage devices.
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。 A person skilled in the art may understand that all or part of the steps of implementing the above embodiments may be completed by hardware, or may be instructed by a program to execute related hardware, and the program may be stored in a computer readable storage medium. The storage medium mentioned may be a read only memory, a magnetic disk or an optical disk or the like.
以上所述仅为本发明的较佳实施例,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。 The above are only the preferred embodiments of the present invention, and are not intended to limit the present invention. Any modifications, equivalents, improvements, etc., which are within the spirit and scope of the present invention, should be included in the protection of the present invention. Within the scope.

Claims (18)

  1. 一种生成视频的方法,其特征在于,所述方法包括:A method of generating a video, the method comprising:
    播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;Playing the accompaniment audio of the song to be recorded, displaying the lyrics subtitle corresponding to the accompaniment audio, and performing video image shooting and audio recording;
    在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;After the capturing of the video image and the recording of the audio are ended, displaying an option of the at least one video effect combination stored in advance, and displaying the composite button; wherein the video effect combination includes at least one filter and/or at least one Foreground video
    当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;And when receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image Displaying, playing the accompaniment audio and recorded audio simultaneously;
    当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。When the click command for the composite button is received, the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  2. 根据权利要求1所述的方法,其特征在于,所述播放待录制歌曲的伴奏音频之前,还包括:确定用户在目标歌曲中选取的歌曲片段;The method according to claim 1, wherein before the playing the accompaniment audio of the song to be recorded, the method further comprises: determining a song segment selected by the user in the target song;
    所述播放待录制歌曲的伴奏音频,包括:播放所述歌曲片段的伴奏音频。The playing the accompaniment audio of the song to be recorded includes: playing the accompaniment audio of the song segment.
  3. 根据权利要求2所述的方法,其特征在于,所述确定用户在目标歌曲中选取的歌曲片段,包括:The method of claim 2, wherein the determining a piece of song selected by the user in the target song comprises:
    显示目标歌曲的歌词列表;Display a list of lyrics of the target song;
    获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the lyrics list;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
  4. 根据权利要求2所述的方法,其特征在于,所述确定用户在目标歌曲中选取的歌曲片段,包括:The method of claim 2, wherein the determining a piece of song selected by the user in the target song comprises:
    显示目标歌曲的播放时间轴;Display the playback timeline of the target song;
    获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the playing time axis;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
  5. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 further comprising:
    显示至少一个音频特效的选项;An option to display at least one audio effect;
    当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据 所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行播放;When receiving a selection instruction for the first audio effect in the at least one audio effect, according to The first audio special effect, performing special effects processing on the recorded audio, and playing the processed audio;
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:When the click command of the composite button is received, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video, including:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。When the click command for the composite button is received, the processed audio, the accompaniment audio, and the processed video image are combined to obtain a synthesized video.
  6. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 further comprising:
    显示至少一个音频编辑处理的选项;Display at least one option for audio editing processing;
    当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处理,并对处理后的音频进行播放;When receiving a selection instruction for the first audio editing process in the at least one audio editing process, performing the first audio editing process on the recorded audio, and playing the processed audio;
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:When the click command of the composite button is received, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video, including:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。When the click command for the composite button is received, the processed audio, the accompaniment audio, and the processed video image are combined to obtain a synthesized video.
  7. 一种生成视频的装置,其特征在于,所述装置包括:A device for generating a video, characterized in that the device comprises:
    播放模块,用于播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;a playing module, configured to play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording;
    显示模块,用于在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;a display module, configured to display an option of pre-stored at least one video special effect combination after the capturing of the video image and the recording of the audio, and display the composite button; wherein the video special effect combination includes at least one filter And/or at least one foreground video;
    预览模块,用于当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;a preview module, configured to perform combined special effect processing on the captured video image according to the first video special effect combination when receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, and processing The subsequent video image is displayed while playing the accompaniment audio and the recorded audio;
    合成模块,用于当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。And a synthesizing module, configured to synthesize the accompaniment audio, the recorded audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  8. 根据权利要求7所述的装置,其特征在于,所述装置还包括确定模块,用于:在所述播放待录制歌曲的伴奏音频之前,确定用户在目标歌曲中选取的歌曲片段; The device according to claim 7, wherein the device further comprises: a determining module, configured to: before the playing the accompaniment audio of the song to be recorded, determining a song segment selected by the user in the target song;
    所述播放模块,用于:播放所述歌曲片段的伴奏音频。The playing module is configured to: play the accompaniment audio of the song segment.
  9. 根据权利要求8所述的装置,其特征在于,所述确定模块,用于:The device according to claim 8, wherein the determining module is configured to:
    显示目标歌曲的歌词列表;Display a list of lyrics of the target song;
    获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the lyrics list;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
  10. 根据权利要求8所述的装置,其特征在于,所述确定模块,用于:The device according to claim 8, wherein the determining module is configured to:
    显示目标歌曲的播放时间轴;Display the playback timeline of the target song;
    获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the playing time axis;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
  11. 根据权利要求7所述的装置,其特征在于,所述显示模块,还用于显示至少一个音频特效的选项;The device according to claim 7, wherein the display module is further configured to display an option of at least one audio effect;
    所述预览模块,还用于当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行播放;The preview module is further configured to perform special effect processing on the recorded audio according to the first audio special effect when receiving a selection instruction for the first audio special effect in the at least one audio special effect, and process the After the audio is played;
    所述合成模块,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。The synthesizing module is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  12. 根据权利要求7所述的方法,其特征在于,所述显示模块,还用于显示至少一个音频编辑处理的选项;The method according to claim 7, wherein the display module is further configured to display an option of at least one audio editing process;
    所述预览模块,还用于当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处理,并对处理后的音频进行播放;The preview module is further configured to perform the first audio editing process on the recorded audio when receiving a selection instruction for the first audio editing process in the at least one audio editing process, and after processing Audio is played;
    所述合成模块,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。The synthesizing module is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  13. 一种终端,其特征在于,所述终端包括:A terminal, wherein the terminal comprises:
    一个或多个处理器;和One or more processors; and
    存储器;Memory
    所述存储器存储有一个或多个程序,所述一个或多个程序被配置成由所述一个或多个处理器执行,所述一个或多个程序包含用于进行以下操作的指令: The memory stores one or more programs, the one or more programs being configured to be executed by the one or more processors, the one or more programs including instructions for:
    播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;Playing the accompaniment audio of the song to be recorded, displaying the lyrics subtitle corresponding to the accompaniment audio, and performing video image shooting and audio recording;
    在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;After the capturing of the video image and the recording of the audio are ended, displaying an option of the at least one video effect combination stored in advance, and displaying the composite button; wherein the video effect combination includes at least one filter and/or at least one Foreground video
    当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;And when receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image Displaying, playing the accompaniment audio and recorded audio simultaneously;
    当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。When the click command for the composite button is received, the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  14. 根据权利要求13所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:The terminal of claim 13 wherein said one or more programs further comprise instructions for:
    确定用户在目标歌曲中选取的歌曲片段;Determining a song segment selected by the user in the target song;
    播放所述歌曲片段的伴奏音频。The accompaniment audio of the song segment is played.
  15. 根据权利要求14所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:The terminal of claim 14, wherein the one or more programs further comprise instructions for:
    显示目标歌曲的歌词列表;Display a list of lyrics of the target song;
    获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the lyrics list;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
  16. 根据权利要求14所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:The terminal of claim 14, wherein the one or more programs further comprise instructions for:
    显示目标歌曲的播放时间轴;Display the playback timeline of the target song;
    获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;Obtaining a starting point and a ending point of a song segment set by the user in the playing time axis;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。And determining, according to the starting point and the ending point, a piece of song selected by the user in the target song.
  17. 根据权利要求13所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:The terminal of claim 13 wherein said one or more programs further comprise instructions for:
    显示至少一个音频特效的选项;An option to display at least one audio effect;
    当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行 播放;When receiving a selection instruction for the first audio effect in the at least one audio effect, performing special effects processing on the recorded audio according to the first audio effect, and performing the processed audio Play
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:When the click command of the composite button is received, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video, including:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。When the click command for the composite button is received, the processed audio, the accompaniment audio, and the processed video image are combined to obtain a synthesized video.
  18. 根据权利要求13所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:The terminal of claim 13 wherein said one or more programs further comprise instructions for:
    显示至少一个音频编辑处理的选项;Display at least one option for audio editing processing;
    当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处理,并对处理后的音频进行播放;When receiving a selection instruction for the first audio editing process in the at least one audio editing process, performing the first audio editing process on the recorded audio, and playing the processed audio;
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:When the click command of the composite button is received, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video, including:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。 When the click command for the composite button is received, the processed audio, the accompaniment audio, and the processed video image are combined to obtain a synthesized video.
PCT/CN2016/080666 2015-05-04 2016-04-29 Video generation method and apparatus WO2016177296A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510221018.9A CN104967900B (en) 2015-05-04 2015-05-04 A kind of method and apparatus generating video
CN201510221018.9 2015-05-04

Publications (1)

Publication Number Publication Date
WO2016177296A1 true WO2016177296A1 (en) 2016-11-10

Family

ID=54221823

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/080666 WO2016177296A1 (en) 2015-05-04 2016-04-29 Video generation method and apparatus

Country Status (2)

Country Link
CN (1) CN104967900B (en)
WO (1) WO2016177296A1 (en)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109285532A (en) * 2018-10-31 2019-01-29 深圳市酷达通讯有限公司 A kind of order programme
CN109325219A (en) * 2018-08-24 2019-02-12 维沃移动通信有限公司 A kind of method, apparatus and system generating recording documents
CN109473117A (en) * 2018-12-18 2019-03-15 广州市百果园信息技术有限公司 Audio special efficacy stacking method, device and its terminal
CN109683993A (en) * 2018-12-12 2019-04-26 努比亚技术有限公司 A kind of applied program processing method, equipment and computer readable storage medium
CN109840879A (en) * 2017-11-28 2019-06-04 腾讯科技(深圳)有限公司 Image rendering method, device, computer storage medium and terminal
WO2019105438A1 (en) * 2017-11-30 2019-06-06 广州市百果园信息技术有限公司 Video special effect adding method and apparatus, and smart mobile terminal
CN110446097A (en) * 2019-08-26 2019-11-12 维沃移动通信有限公司 Record screen method and mobile terminal
CN111327855A (en) * 2020-03-10 2020-06-23 网易(杭州)网络有限公司 Video recording method and device and video positioning method and device
CN111491205A (en) * 2020-04-17 2020-08-04 维沃移动通信有限公司 Video processing method and device and electronic equipment
CN112653920A (en) * 2020-12-18 2021-04-13 北京字跳网络技术有限公司 Video processing method, device, equipment, storage medium and computer program product
CN112995536A (en) * 2021-02-04 2021-06-18 上海哔哩哔哩科技有限公司 Video synthesis method and system
CN113079332A (en) * 2021-03-16 2021-07-06 青岛海信移动通信技术股份有限公司 Mobile terminal and screen recording method thereof
CN113489899A (en) * 2021-06-29 2021-10-08 中国平安人寿保险股份有限公司 Special effect video recording method and device, computer equipment and storage medium
CN114390205A (en) * 2022-01-29 2022-04-22 西安维沃软件技术有限公司 Shooting method and device and electronic equipment
CN114390299A (en) * 2020-10-16 2022-04-22 腾讯科技(深圳)有限公司 Song on-demand method, device, equipment and computer readable storage medium
CN114915830A (en) * 2022-06-06 2022-08-16 武汉市芯中芯科技有限公司 Method for realizing audio and video synthesis of wifi visual equipment by using mobile phone microphone
CN115474088A (en) * 2022-09-07 2022-12-13 腾讯音乐娱乐科技(深圳)有限公司 Video processing method, computer equipment and storage medium
CN116668763A (en) * 2022-11-10 2023-08-29 荣耀终端有限公司 Screen recording method and device
CN116708899A (en) * 2022-06-30 2023-09-05 北京生数科技有限公司 Video processing method, device and storage medium applied to virtual image synthesis
US11948385B2 (en) * 2020-12-23 2024-04-02 Abbyy Development Inc. Zero-footprint image capture by mobile device
CN116668763B (en) * 2022-11-10 2024-04-19 荣耀终端有限公司 Screen recording method and device

Families Citing this family (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104967900B (en) * 2015-05-04 2018-08-07 腾讯科技(深圳)有限公司 A kind of method and apparatus generating video
TWI592021B (en) 2015-02-04 2017-07-11 騰訊科技(深圳)有限公司 Method, device, and terminal for generating video
CN106055671B (en) * 2016-06-03 2022-06-14 腾讯科技(深圳)有限公司 Multimedia data processing method and equipment thereof
CN106131472A (en) * 2016-07-26 2016-11-16 维沃移动通信有限公司 A kind of kinescope method and mobile terminal
CN106604113A (en) * 2016-12-15 2017-04-26 天脉聚源(北京)传媒科技有限公司 Method and apparatus for synthesizing videos intelligently
CN106847246A (en) * 2017-03-01 2017-06-13 华东交通大学 A kind of song chorus method, apparatus and system
CN107786876A (en) * 2017-09-21 2018-03-09 北京达佳互联信息技术有限公司 The synchronous method of music and video, device and mobile terminal
CN107707828B (en) * 2017-09-26 2019-07-26 维沃移动通信有限公司 A kind of method for processing video frequency and mobile terminal
CN108055490B (en) * 2017-10-25 2021-04-13 北京密境和风科技有限公司 Video processing method and device, mobile terminal and storage medium
CN108401124B (en) * 2018-03-16 2020-08-25 广州酷狗计算机科技有限公司 Video recording method and device
CN108810436A (en) * 2018-05-24 2018-11-13 广州音乐猫乐器科技有限公司 A kind of video recording method and system based on the He Zou of full-automatic musical instrument
CN108965757B (en) * 2018-08-02 2021-04-06 广州酷狗计算机科技有限公司 Video recording method, device, terminal and storage medium
CN109166596A (en) * 2018-08-10 2019-01-08 北京微播视界科技有限公司 Music editor's method, apparatus, terminal device and computer readable storage medium
CN109151356A (en) * 2018-09-05 2019-01-04 传线网络科技(上海)有限公司 video recording method and device
CN109522443A (en) * 2018-09-29 2019-03-26 上海与德通讯技术有限公司 Collection method, electronic equipment and the computer readable storage medium of melody part
CN109508393A (en) * 2018-09-29 2019-03-22 上海与德通讯技术有限公司 Collection method, electronic equipment and the computer readable storage medium of melody part
CN108962286B (en) * 2018-10-15 2020-12-01 腾讯音乐娱乐科技(深圳)有限公司 Audio recognition method, device and storage medium
CN109005359B (en) * 2018-10-31 2020-11-03 广州酷狗计算机科技有限公司 Video recording method, apparatus and storage medium
CN109348281B (en) * 2018-11-08 2020-02-21 北京微播视界科技有限公司 Video processing method, video processing device, computer equipment and storage medium
CN109587549B (en) * 2018-12-05 2021-08-13 广州酷狗计算机科技有限公司 Video recording method, device, terminal and storage medium
CN109413342B (en) * 2018-12-21 2021-01-08 广州酷狗计算机科技有限公司 Audio and video processing method and device, terminal and storage medium
CN110324718B (en) * 2019-08-05 2021-09-07 北京字节跳动网络技术有限公司 Audio and video generation method and device, electronic equipment and readable medium
CN111061405B (en) * 2019-12-13 2021-08-27 广州酷狗计算机科技有限公司 Method, device and equipment for recording song audio and storage medium
CN110996167A (en) * 2019-12-20 2020-04-10 广州酷狗计算机科技有限公司 Method and device for adding subtitles in video
CN111970571B (en) * 2020-08-24 2022-07-26 北京字节跳动网络技术有限公司 Video production method, device, equipment and storage medium
CN112312053B (en) * 2020-10-29 2023-05-23 维沃移动通信有限公司 Video recording method and device
CN112422831A (en) * 2020-11-20 2021-02-26 广州太平洋电脑信息咨询有限公司 Video generation method and device, computer equipment and storage medium
CN114245036B (en) * 2021-12-21 2024-03-12 北京达佳互联信息技术有限公司 Video production method and device
CN115767141A (en) * 2022-08-26 2023-03-07 维沃移动通信有限公司 Video playing method and device and electronic equipment

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1156377A (en) * 1995-12-27 1997-08-06 娱乐消遣技术株式会社 Karaoke system
CN1383543A (en) * 2000-06-20 2002-12-04 皇家菲利浦电子有限公司 Karaoka system
CN2617100Y (en) * 2003-01-29 2004-05-19 项青松 Self-made audio video editting-making device
WO2007071954A1 (en) * 2005-12-19 2007-06-28 Landesberg, Andrew Live performance entertainment apparatus and method
CN201477863U (en) * 2009-05-21 2010-05-19 董海涛 MTV synthesizer
CN102568527A (en) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 Method and system for easily cutting audio files and applied mobile handheld device
CN104079966A (en) * 2014-05-15 2014-10-01 惠州市水木网络科技有限公司 Karaoke sharing system based on set top box
CN104702856A (en) * 2013-12-10 2015-06-10 音圆国际股份有限公司 Real-time selfie special-effect MV (music video) compositing system device and real-time selfie special-effect MV compositing method applied to karaoke machines
CN104883516A (en) * 2015-06-05 2015-09-02 福建星网视易信息系统有限公司 Method and system for producing real-time singing video
CN104967900A (en) * 2015-05-04 2015-10-07 腾讯科技(深圳)有限公司 Video generating method and video generating device
CN104967801A (en) * 2015-02-04 2015-10-07 腾讯科技(深圳)有限公司 Video data processing method and apparatus

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101141598A (en) * 2006-09-08 2008-03-12 郭鹏飞 Karaoke CD for providing simultaneously song subtitling and foreground
CN201035651Y (en) * 2007-02-01 2008-03-12 李智 Self-aided K song recorder

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1156377A (en) * 1995-12-27 1997-08-06 娱乐消遣技术株式会社 Karaoke system
CN1383543A (en) * 2000-06-20 2002-12-04 皇家菲利浦电子有限公司 Karaoka system
CN2617100Y (en) * 2003-01-29 2004-05-19 项青松 Self-made audio video editting-making device
WO2007071954A1 (en) * 2005-12-19 2007-06-28 Landesberg, Andrew Live performance entertainment apparatus and method
CN201477863U (en) * 2009-05-21 2010-05-19 董海涛 MTV synthesizer
CN102568527A (en) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 Method and system for easily cutting audio files and applied mobile handheld device
CN104702856A (en) * 2013-12-10 2015-06-10 音圆国际股份有限公司 Real-time selfie special-effect MV (music video) compositing system device and real-time selfie special-effect MV compositing method applied to karaoke machines
CN104079966A (en) * 2014-05-15 2014-10-01 惠州市水木网络科技有限公司 Karaoke sharing system based on set top box
CN104967801A (en) * 2015-02-04 2015-10-07 腾讯科技(深圳)有限公司 Video data processing method and apparatus
CN104967900A (en) * 2015-05-04 2015-10-07 腾讯科技(深圳)有限公司 Video generating method and video generating device
CN104883516A (en) * 2015-06-05 2015-09-02 福建星网视易信息系统有限公司 Method and system for producing real-time singing video

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109840879A (en) * 2017-11-28 2019-06-04 腾讯科技(深圳)有限公司 Image rendering method, device, computer storage medium and terminal
WO2019105438A1 (en) * 2017-11-30 2019-06-06 广州市百果园信息技术有限公司 Video special effect adding method and apparatus, and smart mobile terminal
CN109325219A (en) * 2018-08-24 2019-02-12 维沃移动通信有限公司 A kind of method, apparatus and system generating recording documents
CN109285532A (en) * 2018-10-31 2019-01-29 深圳市酷达通讯有限公司 A kind of order programme
CN109683993B (en) * 2018-12-12 2023-12-29 努比亚技术有限公司 Application processing method, device and computer readable storage medium
CN109683993A (en) * 2018-12-12 2019-04-26 努比亚技术有限公司 A kind of applied program processing method, equipment and computer readable storage medium
CN109473117A (en) * 2018-12-18 2019-03-15 广州市百果园信息技术有限公司 Audio special efficacy stacking method, device and its terminal
CN110446097A (en) * 2019-08-26 2019-11-12 维沃移动通信有限公司 Record screen method and mobile terminal
CN110446097B (en) * 2019-08-26 2022-04-15 维沃移动通信有限公司 Screen recording method and mobile terminal
CN111327855A (en) * 2020-03-10 2020-06-23 网易(杭州)网络有限公司 Video recording method and device and video positioning method and device
CN111327855B (en) * 2020-03-10 2022-08-05 网易(杭州)网络有限公司 Video recording method and device and video positioning method and device
CN111491205A (en) * 2020-04-17 2020-08-04 维沃移动通信有限公司 Video processing method and device and electronic equipment
CN114390299A (en) * 2020-10-16 2022-04-22 腾讯科技(深圳)有限公司 Song on-demand method, device, equipment and computer readable storage medium
CN114390299B (en) * 2020-10-16 2024-02-02 腾讯科技(深圳)有限公司 Song requesting method, apparatus, device and computer readable storage medium
CN112653920A (en) * 2020-12-18 2021-04-13 北京字跳网络技术有限公司 Video processing method, device, equipment, storage medium and computer program product
US11948385B2 (en) * 2020-12-23 2024-04-02 Abbyy Development Inc. Zero-footprint image capture by mobile device
CN112995536A (en) * 2021-02-04 2021-06-18 上海哔哩哔哩科技有限公司 Video synthesis method and system
CN113079332A (en) * 2021-03-16 2021-07-06 青岛海信移动通信技术股份有限公司 Mobile terminal and screen recording method thereof
CN113489899A (en) * 2021-06-29 2021-10-08 中国平安人寿保险股份有限公司 Special effect video recording method and device, computer equipment and storage medium
CN114390205B (en) * 2022-01-29 2023-09-15 西安维沃软件技术有限公司 Shooting method and device and electronic equipment
CN114390205A (en) * 2022-01-29 2022-04-22 西安维沃软件技术有限公司 Shooting method and device and electronic equipment
CN114915830A (en) * 2022-06-06 2022-08-16 武汉市芯中芯科技有限公司 Method for realizing audio and video synthesis of wifi visual equipment by using mobile phone microphone
CN116708899A (en) * 2022-06-30 2023-09-05 北京生数科技有限公司 Video processing method, device and storage medium applied to virtual image synthesis
CN116708899B (en) * 2022-06-30 2024-01-23 北京生数科技有限公司 Video processing method, device and storage medium applied to virtual image synthesis
CN115474088A (en) * 2022-09-07 2022-12-13 腾讯音乐娱乐科技(深圳)有限公司 Video processing method, computer equipment and storage medium
CN116668763A (en) * 2022-11-10 2023-08-29 荣耀终端有限公司 Screen recording method and device
CN116668763B (en) * 2022-11-10 2024-04-19 荣耀终端有限公司 Screen recording method and device

Also Published As

Publication number Publication date
CN104967900B (en) 2018-08-07
CN104967900A (en) 2015-10-07

Similar Documents

Publication Publication Date Title
WO2016177296A1 (en) Video generation method and apparatus
US10841661B2 (en) Interactive method, apparatus, and system in live room
TWI592021B (en) Method, device, and terminal for generating video
WO2018184488A1 (en) Video dubbing method and device
US10255929B2 (en) Media presentation playback annotation
CN109302538B (en) Music playing method, device, terminal and storage medium
WO2019105438A1 (en) Video special effect adding method and apparatus, and smart mobile terminal
WO2020015333A1 (en) Video shooting method and apparatus, terminal device, and storage medium
US9924205B2 (en) Video remote-commentary synchronization method and system, and terminal device
CN108924464B (en) Video file generation method and device and storage medium
WO2017076143A1 (en) Method, apparatus, and system for switching video live stream to video-on-demand data
US11670339B2 (en) Video acquisition method and device, terminal and medium
CN111050203B (en) Video processing method and device, video processing equipment and storage medium
WO2019062541A1 (en) Real-time digital audio signal mixing method and device
WO2018157812A1 (en) Method and apparatus for implementing video branch selection and playback
US20210349678A1 (en) Methods and electronic devices for dynamic control of playlists
CN104636110B (en) Control the method and device of volume
CN111147779B (en) Video production method, electronic device, and medium
CN107948562B (en) Video recording method and video recording terminal
WO2017215661A1 (en) Scenario-based sound effect control method and electronic device
CN108476339B (en) Remote control method and terminal
US20210266633A1 (en) Real-time voice information interactive method and apparatus, electronic device and storage medium
CN104639977A (en) Program playing method and device
KR102186815B1 (en) Method, apparatus and recovering medium for clipping of contents
AU2014200042B2 (en) Method and apparatus for controlling contents in electronic device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16789290

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 11.04.2018)

122 Ep: pct application non-entry in european phase

Ref document number: 16789290

Country of ref document: EP

Kind code of ref document: A1