WO2016177296A1 - 一种生成视频的方法和装置 - Google Patents

一种生成视频的方法和装置 Download PDF

Info

Publication number
WO2016177296A1
WO2016177296A1 PCT/CN2016/080666 CN2016080666W WO2016177296A1 WO 2016177296 A1 WO2016177296 A1 WO 2016177296A1 CN 2016080666 W CN2016080666 W CN 2016080666W WO 2016177296 A1 WO2016177296 A1 WO 2016177296A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
video
song
video image
processed
Prior art date
Application number
PCT/CN2016/080666
Other languages
English (en)
French (fr)
Inventor
王超
李纯
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Publication of WO2016177296A1 publication Critical patent/WO2016177296A1/zh

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/272Means for inserting a foreground image in a background image, i.e. inlay, outlay
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • H04N21/4334Recording operations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/91Television signal processing therefor

Definitions

  • the embodiments of the present invention relate to the field of computer technologies, and in particular, to a method and an apparatus for generating a video.
  • the singing application (or karaoke application) is a very popular entertainment application.
  • the video provided in the song is generally a simple presentation of the content of the camera, and the flexibility of recording the video of the song is poor.
  • an embodiment of the present invention provides a method and apparatus for generating a video.
  • the technical solution is as follows:
  • a method of generating a video comprising:
  • the video effect combination includes at least one filter and/or at least one Foreground video
  • the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • an apparatus for generating a video comprising:
  • a playing module configured to play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording;
  • a display module configured to display an option of pre-stored at least one video special effect combination after the capturing of the video image and the recording of the audio, and display the composite button; wherein the video special effect combination includes at least one filter And/or at least one foreground video;
  • a preview module configured to perform combined special effect processing on the captured video image according to the first video special effect combination when receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, and processing The subsequent video image is displayed while playing the accompaniment audio and the recorded audio;
  • a synthesizing module configured to synthesize the accompaniment audio, the recorded audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • a terminal where the terminal includes:
  • One or more processors are One or more processors.
  • the memory stores one or more programs, the one or more programs being configured to be executed by the one or more processors, the one or more programs including instructions for:
  • the video effect combination includes at least one filter and/or at least one Foreground video
  • the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • FIG. 1 is a flowchart of a method for generating a video according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of interface display according to an embodiment of the present invention.
  • FIG. 3 is a schematic diagram of an interface display according to an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of interface display according to an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram of a video generating apparatus according to an embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • An embodiment of the present invention provides a method for generating a video. As shown in FIG. 1 , the processing procedure of the method may include the following steps:
  • Step 101 Play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording.
  • Step 102 After the shooting of the video image and the recording of the audio are finished, displaying an option of the pre-stored combination of the at least one video special effect, and displaying the composite button; wherein the video special effect combination includes at least one filter and/or at least one foreground video. .
  • Step 103 When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image. Display, playing accompaniment audio and recorded audio simultaneously.
  • Step 104 When receiving a click command for the composite button, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • the embodiment of the invention provides a method for generating a video, and the execution body of the method is a terminal.
  • the terminal may be any terminal with a video capture function, such as a mobile phone with a camera, a tablet, and the like, and an application for song and video recording may be installed on the terminal.
  • the terminal may be provided with a processor and a memory, and the processor may be used for processing video images and audio, and the memory may be used for storing data required in the following processing and generated data. It can be equipped with input and output devices such as camera, microphone, screen, audio output device, camera can be used for video image shooting, microphone can be used for audio recording, screen can be used for video, lyrics subtitles, etc. Type of screen, audio output device can be used for audio playback, can be headphones or speakers.
  • the terminal is a mobile phone as an example, and a detailed description of the solution is performed. Other situations are similar, and the embodiment is not described in detail.
  • Step 101 Play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image shooting and audio recording.
  • the song to be recorded may be a song or a song fragment that the user wants to make a K song.
  • the user can install the above application on the terminal and operate the application, and then trigger the terminal to display the main interface of the application, in the main interface, a little song button can be displayed, and the user clicks the song button.
  • the terminal can be triggered to switch to the corresponding song selection interface of the application, and the song selection interface can display a list of songs in which the user can select the song he likes (ie, the target song mentioned later). Songs with accompaniment files stored locally can be displayed in the song list, and songs with accompaniment files stored on the network side can also be displayed.
  • the terminal can use the song as the song to be recorded, and can display the recording interface of the song.
  • the recording interface can display a recording button.
  • the terminal can trigger the terminal to retrieve the song.
  • the accompaniment file of the song, and the accompaniment file is run to play the accompaniment audio of the song, and the lyrics subtitle corresponding to the song is displayed on the recording interface, and the terminal can start capturing the video image through the front camera of the terminal and recording through the microphone. Audio.
  • the video image captured by the front camera of the terminal can be transmitted to the screen for display in real time, and the user can adjust the position of the terminal based on the image displayed on the screen.
  • the user can select a favorite clip as the song to be recorded in the target song.
  • the following processing may be performed: determining the song segment selected by the user in the target song.
  • the accompaniment audio of the song segment can be played.
  • the user can select a song segment to be recorded in the song, and further, the terminal can obtain the song segment selected by the user in the target song.
  • the user can select a song segment in the target song in a variety of ways, and several alternative treatments are given below:
  • the lyrics list of the target song is displayed; the starting point and the ending point of the song segment set by the user in the lyrics list are obtained; and the song segment selected by the user in the target song is determined according to the starting point and the ending point.
  • the user can drag the start line and the end line displayed in the lyrics, and the song piece corresponding to the lyric content between the start line and the end line is the song piece selected by the user.
  • the lyric list of the target song may be displayed in the recording interface, and the start line and the end line may be displayed in the lyric list, and the user may drag the above start line and end line up and down.
  • the lyrics below the starting line are the starting lyrics (starting sentence) of the song segment selected by the user
  • the lyrics above the ending line are the ending lyrics (terminating sentence) of the song segment selected by the user
  • the user can click the record button, and the terminal can be triggered to obtain the start time point of the start lyrics and the end time point of terminating the lyrics as the start point and the end point of the song segment, and can be determined in the target song according to the start point and the end point.
  • Selected song clips Further, the terminal can play the accompaniment audio of the song segment.
  • the playing time axis of the target song is displayed; the starting point and the ending point of the song segment set by the user in the playing time axis are acquired; and the song segment selected by the user in the target song is determined according to the starting point and the ending point.
  • the recording interface can display the playing time axis of the target song and the recording button, and the two lines located at different positions can also be displayed on the displayed playing time axis, and the user can select the favorite by dragging the two lines.
  • the user can click the recording button in the recording interface, and the terminal can be triggered to obtain the playing time point of the two lines, and the two playing time points are the starting point of the song segment selected by the user.
  • the terminal can acquire a song segment between the start point and the end point in the target song, and then play the accompaniment audio of the song segment.
  • a variety of filter options can also be displayed in the recording interface, as shown in Figure 3, where the user can select a filter for real-time processing of the captured video image. .
  • the user can click the record button to start recording, and the application can perform corresponding image processing on each image frame captured according to the filter selected by the user, and output the filtered video image to the screen. Display and encode it and save it to a file in real time.
  • Step 102 After the shooting of the video image and the recording of the audio are finished, displaying an option of the at least one video special effect combination stored in advance, and displaying the composite button; wherein the video special effect combination includes at least one filter and/or at least one foreground video.
  • the video special effect combination is a plurality of video special effects for combining video images
  • the video special effects may be a filter, a foreground video, etc.
  • the filter may be used to adjust pixel values of each pixel in the video image to achieve Tools for a specific visual effect, such as black and white filters, quaint filters, etc.
  • the scene video can be a video that is hovering over the top of the video image.
  • the preview interface may display an option of one or more combinations of video effects pre-stored locally, wherein the video effect combination may use different different time segments of the video image. Filters and/or foreground video can also use different filters and/or foreground video for the same time period.
  • the processing information corresponding to each video special effect combination may be recorded in the above application, and the processing information may include each filter in the video special effect combination, a start time point and an end time point of the foreground video.
  • a composite button for synthesizing the video image and the audio may be displayed in the preview interface described above.
  • Step 103 When receiving a selection instruction for the first video special effect combination in the at least one video special effect combination, performing combined special effect processing on the captured video image according to the first video special effect combination, and performing the processed video image on the processed video image. Display, playing accompaniment audio and recorded audio simultaneously.
  • the user can select a video effect combination that he or she likes according to the option of the video special effect combination displayed on the preview interface.
  • the terminal will receive the video special effect combination (ie, The selection instruction of the first video special effect combination, at this time, the terminal can acquire the processing information of the video special effect combination stored therein, according to each filter in the processing information, the start time point and the end time point of the foreground video, for each The video frames involved in the filter and foreground video are processed, and the processed video images are output to the screen for display in real time.
  • the start time point of a black and white filter in the first video special effect combination is the 5th second
  • the end time point is the 13th second.
  • the accompaniment audio and the recorded audio can be acquired, and the audio and video are played synchronously according to the time of each video frame and the time of each audio frame in the accompaniment audio and the recorded audio.
  • the preview interface displays multiple video effects combination options, after the user selects one of the video effects combination previews, the user can also select other video effects combinations for preview, and finally select a favorite video effect combination.
  • Step 104 When receiving a click command for the composite button, synthesizing the accompaniment audio, the recorded audio, and the processed video image to obtain a composite video.
  • the terminal will receive the click command for the above composite button, and the terminal can acquire the video image processed by the video special effect and perform ffmpeg (a set can be used for recording, converting digital audio, video, and can It is converted into a stream of open source computer programs) encoding, in addition, the terminal can also acquire the accompaniment audio and recorded audio, and audio-encode it to obtain the encoded audio, and then the encoded video image and the encoded audio. Ffmpeg synthesis, get a composite video.
  • ffmpeg a set can be used for recording, converting digital audio, video, and can It is converted into a stream of open source computer programs
  • the user may also perform special effects on the recorded audio when recording the song, and correspondingly, the following processing may be performed: displaying an option of at least one audio special effect, when receiving the first audio special effect in the at least one audio special effect
  • the processing of step 104 may be as follows: when receiving a click command for the composite button, synthesizing the processed audio, the accompaniment audio, and the processed video image to obtain a composite video.
  • the above preview interface can also display one or more audio effects options, such as doll sounds, phonographs and other audio effects, the user can select an audio effect that he likes in the options of the displayed audio effects (ie, the first An audio effect), at this time, the terminal will receive a selection instruction for the audio effect, perform special effects processing on the recorded audio according to the selected audio special effect, and play the processed audio on the preview interface for the user. Preview it.
  • the user can click the composite button displayed in the preview interface to trigger the terminal to obtain the encoded video image.
  • the terminal can also acquire the accompaniment audio and the processed audio and encode it to obtain the coded image.
  • the audio is then ffmpeg synthesized by the encoded video image and the encoded audio to obtain a composite video.
  • the recorded audio may also be edited, and correspondingly, the following processing may be performed: displaying at least one option of audio editing processing; when receiving the first in at least one audio editing process When an audio editing process selects an instruction, the first audio editing process is performed on the recorded audio, and the processed audio is played.
  • the processing of step 104 may be as follows: when receiving a click command for the composite button, synthesizing the processed audio, the accompaniment audio, and the processed video image to obtain a composite video.
  • the above preview interface may also display one or more options for audio editing processing, such as adjusting the volume, turning down the volume, moving the vocal forward, moving the vocal backwards and denoising, etc., the user may Select one of the audio editing processing options (ie, the first audio editing processing) in one or more of the displayed audio editing processing options.
  • the preview interface may display options such as volume adjustment, vocal movement, and denoising when the user needs When you adjust the volume of the recorded audio, you can click on the volume The whole option, at this time, will trigger the terminal to display the volume adjustment axis (the volume gradually increases from left to right, or the volume gradually increases from bottom to top), and the volume adjustment line can also be displayed on the displayed volume adjustment axis.
  • the user can adjust the volume by moving the position of the volume adjustment line.
  • the terminal When the user clicks the vocal movement option, the terminal will receive the user's selection instruction for the vocal movement option, and the time axis can be displayed, and the vocal moving line at the middle position of the time axis can also be displayed on the displayed time axis.
  • the user can adjust the size of the movement by moving the position of the vocal moving line, that is, when the vocal moving line moves forward, the terminal can move the recorded audio forward, and when the vocal moving line moves backward, the terminal can The recorded audio moves backwards, which avoids re-recording the audio when the recorded audio is misaligned.
  • the terminal When the user clicks the denoising option, the terminal will receive a selection instruction for the denoising option, and then the denoised processing of the recorded audio can be performed.
  • the preview button in the preview interface can be clicked, and the terminal can be triggered to perform the first audio editing process on the recorded audio, and play the processed audio.
  • the synthesized button displayed in the preview interface can be clicked to trigger the terminal to obtain the encoded video image.
  • the terminal can also acquire the accompaniment audio and the processed audio and encode the code to obtain the code.
  • the encoded video image is then ffmpeg synthesized with the encoded audio to obtain a composite video.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • At least one video effect combination option and displaying a composite button wherein the video effect combination includes at least one filter and/or at least one foreground video, when receiving a selection of a first video effect combination in the at least one video effect combination
  • the combined video effect is performed on the captured video image, and the processed video image is displayed, and the accompaniment audio and the recorded audio are played simultaneously, when receiving the click command on the composite button , the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • the video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is processed by the video special effect, thereby enhancing the flexibility of the video recording of the song.
  • an embodiment of the present invention further provides a device for generating a video. As shown in FIG. 5, the device includes:
  • the playing module 510 is configured to play the accompaniment audio of the song to be recorded, display the lyrics subtitle corresponding to the accompaniment audio, and perform video image capturing and audio recording;
  • a display module 520 configured to display after the shooting of the video image and the recording of the audio
  • An option of pre-stored at least one video effect combination is displayed, and a composite button is displayed; wherein the video effect combination includes at least one filter and/or at least one foreground video;
  • the preview module 530 is configured to: when receiving the selection instruction for the first video special effect combination in the at least one video special effect combination, perform combined special effect processing on the captured video image according to the first video special effect combination, and The processed video image is displayed while playing the accompaniment audio and the recorded audio;
  • the synthesizing module 540 is configured to synthesize the accompaniment audio, the recorded audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • the method further includes a determining module, configured to determine a song segment selected by the user in the target song before playing the accompaniment audio of the song to be recorded;
  • the playing module 510 is configured to: play the accompaniment audio of the song segment.
  • the determining module is configured to:
  • the determining module is configured to:
  • the display module 520 is further configured to display an option of at least one audio effect
  • the preview module 530 is further configured to perform special effect processing on the recorded audio according to the first audio special effect when receiving a selection instruction for the first audio special effect in the at least one audio special effect, and The processed audio is played;
  • the synthesizing module 540 is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • the display module 520 is further configured to display an option of at least one audio editing process
  • the preview module 530 is further configured to: when the selection instruction of the first audio editing process in the at least one audio editing process is received, perform the first audio editing on the recorded audio And play the processed audio;
  • the synthesizing module 540 is configured to synthesize the processed audio, the accompaniment audio, and the processed video image to obtain a synthesized video when receiving a click instruction to the composite button.
  • the accompaniment audio of the song to be recorded is played, the lyrics subtitle corresponding to the accompaniment audio is displayed, and the video image is captured and the audio is recorded. After the video image is captured and the audio is recorded, the pre-stored display is displayed.
  • At least one video effect combination option and displaying a composite button wherein the video effect combination includes at least one filter and/or at least one foreground video, when receiving a selection of a first video effect combination in the at least one video effect combination
  • the combined video effect is performed on the captured video image, and the processed video image is displayed, and the accompaniment audio and the recorded audio are played simultaneously, when receiving the click command on the composite button , the accompaniment audio, the recorded audio, and the processed video image are combined to obtain a composite video.
  • the video image in the synthesized video thus obtained is not a simple presentation of the content captured by the camera, but is processed by the video special effect, thereby enhancing the flexibility of the video recording of the song.
  • the device for generating a video provided by the foregoing embodiment is only illustrated by dividing the foregoing functional modules when generating a video. In an actual application, the function allocation may be completed by different functional modules as needed. The internal structure of the device is divided into different functional modules to complete all or part of the functions described above.
  • the device for generating a video provided by the foregoing embodiment is the same as the method for generating a video. The specific implementation process is described in detail in the method embodiment, and details are not described herein again.
  • FIG. 6 is a schematic structural diagram of a terminal according to an embodiment of the present invention.
  • the terminal may be used to implement the method for generating video provided in the foregoing embodiment. Specifically:
  • the terminal 900 may include an RF (Radio Frequency) circuit 110, a memory 120 including one or more computer readable storage media, an input unit 130, a display unit 140, a sensor 150, an audio circuit 160, and a WiFi (wireless fidelity, wireless).
  • the fidelity module 170 includes a processor 180 having one or more processing cores, and a power supply 190 and the like. It will be understood by those skilled in the art that the terminal structure shown in FIG. 6 does not constitute a limitation to the terminal, and may include more or less components than those illustrated, or a combination of certain components, or different component arrangements. among them:
  • the RF circuit 110 can be used for transmitting and receiving information or during a call, receiving and transmitting signals, and in particular, receiving downlink information of the base station and then processing it by one or more processors 180; The data related to the uplink is sent to the base station.
  • the RF circuit 110 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a Subscriber Identity Module (SIM) card, a transceiver, a coupler, an LNA (Low Noise Amplifier). , duplexer, etc.
  • SIM Subscriber Identity Module
  • RF circuitry 110 can also communicate with the network and other devices via wireless communication.
  • the wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System of Mobile communication), GPRS (General Packet Radio Service), CDMA (Code Division Multiple Access). , Code Division Multiple Access), WCDMA (Wideband Code Division Multiple Access), LTE (Long Term Evolution), e-mail, SMS (Short Messaging Service), and the like.
  • GSM Global System of Mobile communication
  • GPRS General Packet Radio Service
  • CDMA Code Division Multiple Access
  • WCDMA Wideband Code Division Multiple Access
  • LTE Long Term Evolution
  • e-mail Short Messaging Service
  • the memory 120 can be used to store software programs and modules, and the processor 180 executes various functional applications and data processing by running software programs and modules stored in the memory 120.
  • the memory 120 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may be stored according to The data created by the use of the terminal 900 (such as audio data, phone book, etc.) and the like.
  • memory 120 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device. Accordingly, memory 120 may also include a memory controller to provide access to memory 120 by processor 180 and input unit 130.
  • the input unit 130 can be configured to receive input numeric or character information and to generate keyboard, mouse, joystick, optical or trackball signal inputs related to user settings and function controls.
  • input unit 130 can include touch-sensitive surface 131 as well as other input devices 132.
  • Touch-sensitive surface 131 also referred to as a touch display or trackpad, can collect touch operations on or near the user (such as a user using a finger, stylus, etc., on any suitable object or accessory on touch-sensitive surface 131 or The operation near the touch-sensitive surface 131) and driving the corresponding connecting device according to a preset program.
  • the touch-sensitive surface 131 can include two portions of a touch detection device and a touch controller.
  • the touch detection device detects the touch orientation of the user, and detects a signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts the touch information into contact coordinates, and sends the touch information.
  • the processor 180 is provided and can receive commands from the processor 180 and execute them.
  • the touch-sensitive surface 131 can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic waves.
  • the input unit 130 can also include other input devices 132.
  • other input devices 132 may include, but is not limited to, one or more of a physical keyboard, function keys (such as volume control buttons, switch buttons, etc.), trackballs, mice, joysticks, and the like.
  • the display unit 140 can be used to display information entered by the user or information provided to the user and various graphical user interfaces of the terminal 900, which can be composed of graphics, text, icons, video, and any combination thereof.
  • the display unit 140 may include a display panel 141.
  • the display panel 141 may be configured in the form of an LCD (Liquid Crystal Display), an OLED (Organic Light-Emitting Diode), or the like.
  • the touch-sensitive surface 131 may cover the display panel 141, and when the touch-sensitive surface 131 detects a touch operation thereon or nearby, it is transmitted to the processor 180 to determine the type of the touch event, and then the processor 180 according to the touch event The type provides a corresponding visual output on display panel 141.
  • touch-sensitive surface 131 and display panel 141 are implemented as two separate components to implement input and input functions, in some embodiments, touch-sensitive surface 131 can be integrated with display panel 141 for input. And output function.
  • Terminal 900 can also include at least one type of sensor 150, such as a light sensor, motion sensor, and other sensors.
  • the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may adjust the brightness of the display panel 141 according to the brightness of the ambient light, and the proximity sensor may close the display panel 141 when the terminal 900 moves to the ear. / or backlight.
  • the gravity acceleration sensor can detect the magnitude of acceleration in all directions (usually three axes). When it is stationary, it can detect the magnitude and direction of gravity.
  • the terminal 900 can also be configured with gyroscopes, barometers, hygrometers, thermometers, infrared sensors and other sensors, here Let me repeat.
  • the audio circuit 160, the speaker 161, and the microphone 162 can provide an audio interface between the user and the terminal 900.
  • the audio circuit 160 can transmit the converted electrical data of the received audio data to the speaker 161 for conversion to the sound signal output by the speaker 161; on the other hand, the microphone 162 converts the collected sound signal into an electrical signal by the audio circuit 160. After receiving, it is converted into audio data, and then processed by the audio data output processor 180, transmitted to the terminal, for example, via the RF circuit 110, or outputted to the memory 120 for further processing.
  • the audio circuit 160 may also include an earbud jack to provide communication of the peripheral earphones with the terminal 900.
  • WiFi is a short-range wireless transmission technology
  • the terminal 900 can help users to send and receive emails, browse web pages, and access streaming media through the WiFi module 170, which provides wireless broadband Internet access for users.
  • FIG. 6 shows the WiFi module 170, it can be understood that it does not belong to the end.
  • the necessary configuration of the end 900 can be omitted as needed within the scope of not changing the essence of the invention.
  • the processor 180 is a control center of the terminal 900 that connects various portions of the entire handset using various interfaces and lines, by running or executing software programs and/or modules stored in the memory 120, and recalling data stored in the memory 120, The various functions and processing data of the terminal 900 are performed to perform overall monitoring of the mobile phone.
  • the processor 180 may include one or more processing cores; preferably, the processor 180 may integrate an application processor and a modem processor, where the application processor mainly processes an operating system, a user interface, an application, and the like.
  • the modem processor primarily handles wireless communications. It can be understood that the above modem processor may not be integrated into the processor 180.
  • the terminal 900 also includes a power source 190 (such as a battery) that supplies power to the various components.
  • the power source can be logically coupled to the processor 180 through a power management system to manage functions such as charging, discharging, and power management through the power management system.
  • Power supply 190 may also include any one or more of a DC or AC power source, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator, and the like.
  • the terminal 900 may further include a camera, a Bluetooth module, and the like, and details are not described herein again.
  • the display unit of the terminal 900 is a touch screen display
  • the terminal 900 further includes a memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be one or one
  • the above processor executes one or more programs to perform the method of generating a video as described in the various embodiments above.
  • non-transitory computer readable storage medium comprising instructions, such as a memory comprising instructions executable by a processor of a mobile terminal to perform the method of generating video as described above.
  • the non-transitory computer readable storage medium may be a ROM (Read-Only Memory), a RAM (Random-Access Memory), or a CD-ROM (Compact Disc Read-Only Memory, CD-ROM, tape, floppy disk and optical data storage devices.
  • a person skilled in the art may understand that all or part of the steps of implementing the above embodiments may be completed by hardware, or may be instructed by a program to execute related hardware, and the program may be stored in a computer readable storage medium.
  • the storage medium mentioned may be a read only memory, a magnetic disk or an optical disk or the like.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Studio Devices (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)
  • Studio Circuits (AREA)

Abstract

本发明公开了一种生成视频的方法和装置,属于计算机技术领域。所述方法包括:播放待录制歌曲的伴奏音频,显示伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频;当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。采用本发明,可以增强歌曲视频录制的灵活性。

Description

一种生成视频的方法和装置
本申请要求于2015年5月4日提交中国专利局、申请号为201510221018.9、发明名称为“一种生成视频的方法和装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本发明实施例涉及计算机技术领域,特别涉及一种生成视频的方法和装置。
背景技术
随着计算机技术的发展,手机、计算机等终端得到了广泛的应用,相应的终端上的应用程序的种类越来越多、功能越来越丰富。歌唱类应用程序(或称K歌类应用程序)是一种很常用的娱乐应用程序。
用户可以通过歌唱类应用程序进行歌曲的录制,在录制歌曲的同时,还可以进行视频图像的拍摄,得到配有视频的歌曲。
在实现本发明实施例的过程中,发明人发现相关技术至少存在以下问题:
基于上述录制歌曲的过程,歌曲中配有的视频一般只是对摄像头拍摄内容的简单呈现,进行歌曲视频录制的灵活性较差。
发明内容
为了解决相关技术的问题,本发明实施例提供了一种生成视频的方法和装置。所述技术方案如下:
第一方面,提供了一种生成视频的方法,所述方法包括:
播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;
在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;
当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指 令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;
当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。
第二方面,提供了一种生成视频的装置,所述装置包括:
播放模块,用于播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;
显示模块,用于在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;
预览模块,用于当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;
合成模块,用于当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。
第三方面,提供了一种终端,所述终端包括:
一个或多个处理器;和
存储器;
所述存储器存储有一个或多个程序,所述一个或多个程序被配置成由所述一个或多个处理器执行,所述一个或多个程序包含用于进行以下操作的指令:
播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;
在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;
当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;
当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。
本发明实施例提供的技术方案带来的有益效果是:
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频。当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效组合处理,从而可以增强歌曲视频录制的灵活性。
附图说明
为了更清楚地说明本发明实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。
图1是本发明实施例提供的一种生成视频的方法流程图;
图2是本发明实施例提供的界面显示示意图;
图3是本发明实施例提供的界面显示示意图;
图4是本发明实施例提供的界面显示示意图;
图5是本发明实施例提供的一种生成视频装置的结构示意图;
图6是本发明实施例提供的一种终端的结构示意图。
具体实施方式
为使本发明的目的、技术方案和优点更加清楚,下面将结合附图对本发明实施方式作进一步地详细描述。
本发明实施例提供了一种生成视频的方法,如图1所示,该方法的处理流程可以包括如下的步骤:
步骤101,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制。
步骤102,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频。
步骤103,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频。
步骤104,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频。当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效组合处理,从而可以增强歌曲视频录制的灵活性。
本发明实施例提供了一种生成视频的方法,该方法的执行主体为终端。其中,终端可以是具有视频拍摄功能的任意终端,比如带有摄像头的手机、平板电脑等移动终端,终端上可以安装有用于歌曲和视频录制的应用程序。该终端中可以设置有处理器、存储器,处理器可以用于对视频图像和音频进行处理,存储器可以用于存储下述处理过程中需要的数据以及产生的数据。可以设置有摄像头、麦克风、屏幕、音频输出设备等输入输出设备,摄像头可以用于视频图像的拍摄,麦克风可以用于音频的录制,屏幕可以用于视频、歌词字幕等的显示,可以是触控式的屏幕,音频输出设备可以用于音频的播放,可以是耳机或喇叭等。本实施例中,以终端为手机为例,进行方案的详细说明,其它情况与之类似,本实施例不再累述。
下面将结合具体实施方式,对图1所示的处理流程进行详细的说明,内容可以如下:
步骤101,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制。
其中,待录制歌曲可以是用户想要进行K歌的歌曲或歌曲片段。
在实施中,用户可以在其终端上安装上述应用程序,并操作启动该应用程序,此时可以触发终端显示应用程序的主界面,在主界面中可以显示有点歌按键,用户点击点歌按键后,可以触发终端切换至应用程序相应的歌曲选择界面,在歌曲选择界面中可以显示歌曲列表,用户可以在其中选择自己喜欢的歌曲(即后面提到的目标歌曲)。歌曲列表中可以显示本地存储有伴奏文件的歌曲,还可以显示网络侧存储有伴奏文件的歌曲。用户在歌曲列表中选择某歌曲之后,终端可以将该歌曲作为待录制歌曲,并可以显示该歌曲的录制界面,录制界面中可以显示有录制按键,用户点击录制按键,则可以触发终端调取该歌曲的伴奏文件,并运行该伴奏文件,以播放该歌曲的伴奏音频,并在录制界面显示该歌曲对应的歌词字幕,同时,终端可以开始通过终端的前置摄像头拍摄视频图像,并通过麦克风录制音频。此外,通过终端的前置摄像头拍摄的视频图像可以实时传输至屏幕进行显示,用户可以基于屏幕显示的图像对终端的位置进行调节。
可选的,用户可以在目标歌曲中选择自己喜欢的片段作为待录制歌曲,相应的,在步骤101之前可以进行如下处理:确定用户在目标歌曲中选取的歌曲片段。相应的在步骤101中,可以播放该歌曲片段的伴奏音频。
在实施中,用户在上述歌曲列表中选择某歌曲(即目标歌曲)之后,用户可以在该歌曲中选取想要录制的歌曲片段,进而,终端可以获得用户在目标歌曲中选取的歌曲片段。
可选的,用户在目标歌曲中选取歌曲片段的处理方式可以多种多样,以下给出了几种可选的处理方式:
方式一,显示目标歌曲的歌词列表;获取用户在歌词列表中设置的歌曲片段的起始点和终止点;根据起始点和终止点,确定用户在目标歌曲中选取的歌曲片段。
在实施中,用户可以拖动歌词中显示的起始线和终止线,起始线和终止线之间的歌词内容对应的歌曲片段即为用户选取的歌曲片段。具体的,如图2所示,录制界面中可以显示目标歌曲的歌词列表,还可以在歌词列表中显示有起始线和终止线,用户可以通过上下拖动上述的起始线和终止线在歌词列表中选 取自己喜欢的歌曲片段,起始线下方的歌词为用户所选取歌曲片段的起始歌词(起始句),终止线上方的歌词为用户所选取歌曲片段的终止歌词(终止句),然后,用户可以点击录制按键,此时可以触发终端获取上述的起始歌词的开始时间点和终止歌词的结束时间点,作为歌曲片段起始点和终止点,根据起始点和终止点可以确定在目标歌曲中选取的歌曲片段。进而,终端可以播放该歌曲片段的伴奏音频。
另外,在应用程序中,还可以预先设置歌曲片段的时长上限,如30秒。如果上述起始点与终止点的时间差大于30秒,则可以设置录制界面中的录制按键进入无法点击状态。此外,在应用程序中还可以预先设置歌曲片段的时长下限,如10秒。如果上述起始点与终止点的时间差小于10秒,则可以设置录制界面中的录制按键进入无法点击状态。
方式二,显示目标歌曲的播放时间轴;获取用户在播放时间轴中设置的歌曲片段的起始点和终止点;根据起始点和终止点,确定用户在目标歌曲中选取的歌曲片段。
在实施中,录制界面中可以显示目标歌曲的播放时间轴和录制按键,在显示的播放时间轴上还可以显示位于不同位置的两条线,用户可以通过拖动这两条线来选取自己喜欢的歌曲片段,选择后,用户可以点击录制界面中的录制按键,此时可以触发终端获取上述两条线所在的播放时间点,这两个播放时间点即为用户选取的歌曲片段的起始点和终止点,终端可以在目标歌曲中获取起始点和终止点之间的歌曲片段,进而,播放该歌曲片段的伴奏音频。
可选的,在开始录制之前,在录制界面中还可以显示有多种滤镜的选项,如图3所示,用户可以在其中选择一种滤镜,用于对拍摄的视频图像进行实时处理。用户在选择滤镜后,可以点击录制按键开始录制,应用程序则可以根据用户选取的滤镜对拍摄到的每一个图像帧进行相应的图像处理,将滤镜处理后的视频图像输出到屏幕进行显示,并对其进行编码,实时保存到文件中。
步骤102,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,视频特效组合包括至少一个滤镜和/或至少一个前景视频。
其中,视频特效组合是用于对视频图像进行组合处理的多种视频特效,视频特效可以是滤镜、前景视频等,滤镜可以是用于对视频图像中各像素的像素值进行调整以达到某种特定视觉效果的工具,如黑白滤镜、古朴色滤镜等,前 景视频可以是悬浮显示在视频图像上层的视频。
在实施中,当待录制歌曲的伴奏音频的最后一个音频帧播放完毕时,或者用户在伴奏音频播放过程中点击录制界面中的结束按键时,视频图像的拍摄和音频的录制结束,终端可以相应的切换至应用程序的预览界面,如图4所示,预览界面中可以显示本地预先存储的一个或多个视频特效组合的选项,其中的视频特效组合可以在视频图像的不同时间段使用不同的滤镜和/或前景视频,也可以在相同的时间段使用不同的滤镜和/或前景视频。上述应用程序中可以记录有每种视频特效组合对应的处理信息,处理信息中可以包括视频特效组合中每个滤镜、前景视频的开始时间点和结束时间点。另外,在上述的预览界面中可以显示有合成按键,用于对视频图像、音频进行合成。
步骤103,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频。
在实施中,用户根据预览界面显示的视频特效组合的选项,可以选择自己喜欢的一个视频特效组合,用户点击其中的一个视频特效组合的选项时,终端将会接收到对该视频特效组合(即第一视频特效组合)的选择指令,此时,终端可以获取其存储的该视频特效组合的处理信息,根据处理信息中的每个滤镜、前景视频的开始时间点和结束时间点,对每个滤镜、前景视频所涉及的视频帧进行处理,并实时对处理后的视频图像输出到屏幕上进行显示。例如,第一视频特效组合中一个黑白滤镜的开始时间点是第5秒,结束时间点是第13秒,当终端根据第一视频特效组合对拍摄的视频图像进行组合特效处理时,可以将拍摄的图像的第5秒到第13秒的视频帧进行黑白滤镜处理。
在对视频图像进行处理的同时,可以获取伴奏音频和录制的音频,根据每个视频帧的时间以及伴奏音频和录制的音频中每个音频帧的时间,对音视频进行同步播放。
若预览界面中显示了多个视频特效组合的选项,用户选择了其中的一个视频特效组合预览后,用户还可以选择其他的视频特效组合进行预览,最终选择一个自己最喜欢的视频特效组合。
步骤104,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。
在实施中,用户选择视频特效组合并预览后可以点击上述预览界面中的合 成按键,终端将会接收到对上述合成按键的点击指令,终端可以获取经过视频特效组合处理的视频图像,将其进行ffmpeg(是一套可以用来记录、转换数字音频、视频,并能将其转化为流的开源计算机程序)编码,此外,终端还可以获取伴奏音频和录制的音频,并对其进行音频编码,得到编码后的音频,之后将编码后的视频图像与编码后的音频进行ffmpeg合成,得到合成视频。
可选的,用户在录制歌曲时还可以对录制的音频进行特效处理,相应的,还可以进行如下处理:显示至少一个音频特效的选项,当接收到对至少一个音频特效中的第一音频特效的选择指令时,根据第一音频特效,对录制的音频进行特效处理,并对处理后的音频进行播放。相应的,步骤104的处理过程可以如下:当接收到对合成按键的点击指令时,将处理后的音频、伴奏音频和处理后的视频图像进行合成,得到合成视频。
在实施中,在上述的预览界面还可以显示一个或多个音频特效的选项,如娃娃音、留声机等音频特效,用户可以在显示的音频特效的选项中选择一个自己喜欢的音频特效(即第一音频特效),此时,终端将会接收到对该音频特效的选择指令,根据所选择的音频特效,对所录制的音频进行特效处理,并在预览界面对处理后的音频进行播放供用户进行预览。用户选择音频特效并预览后可以点击上述预览界面中显示的合成按键,可以触发终端获取编码后的视频图像,此外,终端还可以获取伴奏音频和处理后的音频并将其进行编码,得到编码后的音频,之后将编码后的视频图像与编码后的音频进行ffmpeg合成,得到合成视频。
可选的,用户在录制歌曲时,还可以对录制的音频进行编辑处理,相应的,还可以进行如下处理:显示至少一个音频编辑处理的选项;当接收到对至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对录制的音频执行第一音频编辑处理,并对处理后的音频进行播放。相应的,步骤104的处理过程可以如下:当接收到对合成按键的点击指令时,将处理后的音频、伴奏音频和处理后的视频图像进行合成,得到合成视频。
在实施中,在上述的预览界面还可以显示一个或多个音频编辑处理的选项,比如调大音量、调小音量、人声向前移动、人声向后移动和去噪等选项,用户可以在显示的一个或多个音频编辑处理选项中选择其中一个音频编辑处理(即第一音频编辑处理),具体的,预览界面可以显示有音量调整、人声移动和去噪等选项,当用户需要对录制的音频进行音量调整时,可以点击音量调 整选项,此时,将会触发终端显示音量调整轴(音量从左到由逐渐增大,或,音量从下到上逐渐增大),在显示的音量调整轴上还可以显示有音量调整线,用户可以通过移动该音量调整线的位置调整音量的大小。当用户点击人声移动选项时,终端将会接收到用户对人声移动选项的选取指令,可以显示时间轴,在显示的时间轴上还可以显示有位于时间轴中间位置的人声移动线,用户可以通过移动该人声移动线的位置调整移动的大小,即当人声移动线向前移动时,终端可以将录制的音频向前移动,当人声移动线向后移动时,终端可以将录制的音频向后移动,这样,可以避免在录制的音频错位时重新录制音频。当用户点击去噪选项时,终端将会接收到对去噪选项的选取指令,进而,可以对录制的音频进行去噪处理。用户选择第一音频编辑处理选项后,可以点击预览界面中的预览按键,此时可以触发终端对录制的音频执行第一音频编辑处理,并对处理后的音频进行播放。用户选择音频编辑处理并预览后可以点击上述预览界面中显示的合成按键,可以触发终端获取编码后的视频图像,此外,终端还可以获取伴奏音频和处理后的音频并将其进行编码,得到编码后的音频,之后将编码后的视频图像与编码后的音频进行ffmpeg合成,得到合成视频。
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效处理,从而可以增强歌曲视频录制的灵活性。
基于相同的技术构思,本发明实施例还提供了一种生成视频的装置,如图5所示,该装置包括:
播放模块510,用于播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;
显示模块520,用于在所述视频图像的拍摄和所述音频的录制结束后,显 示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;
预览模块530,用于当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;
合成模块540,用于当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。
可选的,所述,还包括确定模块,用于在播放待录制歌曲的伴奏音频之前,确定用户在目标歌曲中选取的歌曲片段;
所述播放模块510,用于:播放所述歌曲片段的伴奏音频。
可选的,所述确定模块,用于:
显示目标歌曲的歌词列表;
获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;
根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
可选的,所述确定模块,用于:
显示目标歌曲的播放时间轴;
获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;
根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
可选的,所述显示模块520,还用于显示至少一个音频特效的选项;
所述预览模块530,还用于当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行播放;
所述合成模块540,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
可选的,所述显示模块520,还用于显示至少一个音频编辑处理的选项;
所述预览模块530,还用于当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处 理,并对处理后的音频进行播放;
所述合成模块540,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
本发明实施例中,播放待录制歌曲的伴奏音频,显示该伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制,在视频图像的拍摄和音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键,其中,该视频特效组合包括至少一个滤镜和/或至少一个前景视频,当接收到对至少一个视频特效组合中的第一视频特效组合的选择指令时,根据该第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放伴奏音频和录制的音频,当接收到对合成按键的点击指令时,将伴奏音频、录制的音频和处理后的视频图像进行合成,得到合成视频。这样得到的合成视频中的视频图像不是对摄像头拍摄内容的简单呈现,而是经过了视频特效处理,从而可以增强歌曲视频录制的灵活性。
需要说明的是:上述实施例提供的生成视频的装置在生成视频时,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将设备的内部结构划分成不同的功能模块,以完成以上描述的全部或者部分功能。另外,上述实施例提供的生成视频的装置与生成视频的方法实施例属于同一构思,其具体实现过程详见方法实施例,这里不再赘述。
请参考图6,其示出了本发明实施例所涉及的终端的结构示意图,该终端可以用于实施上述实施例中提供的生成视频的方法。具体来讲:
终端900可以包括RF(Radio Frequency,射频)电路110、包括有一个或一个以上计算机可读存储介质的存储器120、输入单元130、显示单元140、传感器150、音频电路160、WiFi(wireless fidelity,无线保真)模块170、包括有一个或者一个以上处理核心的处理器180、以及电源190等部件。本领域技术人员可以理解,图6中示出的终端结构并不构成对终端的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。其中:
RF电路110可用于收发信息或通话过程中,信号的接收和发送,特别地,将基站的下行信息接收后,交由一个或者一个以上处理器180处理;另外,将 涉及上行的数据发送给基站。通常,RF电路110包括但不限于天线、至少一个放大器、调谐器、一个或多个振荡器、用户身份模块(SIM)卡、收发信机、耦合器、LNA(Low Noise Amplifier,低噪声放大器)、双工器等。此外,RF电路110还可以通过无线通信与网络和其他设备通信。所述无线通信可以使用任一通信标准或协议,包括但不限于GSM(Global System of Mobile communication,全球移动通讯系统)、GPRS(General Packet Radio Service,通用分组无线服务)、CDMA(Code Division Multiple Access,码分多址)、WCDMA(Wideband Code Division Multiple Access,宽带码分多址)、LTE(Long Term Evolution,长期演进)、电子邮件、SMS(Short Messaging Service,短消息服务)等。
存储器120可用于存储软件程序以及模块,处理器180通过运行存储在存储器120的软件程序以及模块,从而执行各种功能应用以及数据处理。存储器120可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据终端900的使用所创建的数据(比如音频数据、电话本等)等。此外,存储器120可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。相应地,存储器120还可以包括存储器控制器,以提供处理器180和输入单元130对存储器120的访问。
输入单元130可用于接收输入的数字或字符信息,以及产生与用户设置以及功能控制有关的键盘、鼠标、操作杆、光学或者轨迹球信号输入。具体地,输入单元130可包括触敏表面131以及其他输入设备132。触敏表面131,也称为触摸显示屏或者触控板,可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触敏表面131上或在触敏表面131附近的操作),并根据预先设定的程式驱动相应的连接装置。可选的,触敏表面131可包括触摸检测装置和触摸控制器两个部分。其中,触摸检测装置检测用户的触摸方位,并检测触摸操作带来的信号,将信号传送给触摸控制器;触摸控制器从触摸检测装置上接收触摸信息,并将它转换成触点坐标,再送给处理器180,并能接收处理器180发来的命令并加以执行。此外,可以采用电阻式、电容式、红外线以及表面声波等多种类型实现触敏表面131。除了触敏表面131,输入单元130还可以包括其他输入设备132。具体地,其他输入设备 132可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆等中的一种或多种。
显示单元140可用于显示由用户输入的信息或提供给用户的信息以及终端900的各种图形用户接口,这些图形用户接口可以由图形、文本、图标、视频和其任意组合来构成。显示单元140可包括显示面板141,可选的,可以采用LCD(Liquid Crystal Display,液晶显示器)、OLED(Organic Light-Emitting Diode,有机发光二极管)等形式来配置显示面板141。进一步的,触敏表面131可覆盖显示面板141,当触敏表面131检测到在其上或附近的触摸操作后,传送给处理器180以确定触摸事件的类型,随后处理器180根据触摸事件的类型在显示面板141上提供相应的视觉输出。虽然在图6中,触敏表面131与显示面板141是作为两个独立的部件来实现输入和输入功能,但是在某些实施例中,可以将触敏表面131与显示面板141集成而实现输入和输出功能。
终端900还可包括至少一种传感器150,比如光传感器、运动传感器以及其他传感器。具体地,光传感器可包括环境光传感器及接近传感器,其中,环境光传感器可根据环境光线的明暗来调节显示面板141的亮度,接近传感器可在终端900移动到耳边时,关闭显示面板141和/或背光。作为运动传感器的一种,重力加速度传感器可检测各个方向上(一般为三轴)加速度的大小,静止时可检测出重力的大小及方向,可用于识别手机姿态的应用(比如横竖屏切换、相关游戏、磁力计姿态校准)、振动识别相关功能(比如计步器、敲击)等;至于终端900还可配置的陀螺仪、气压计、湿度计、温度计、红外线传感器等其他传感器,在此不再赘述。
音频电路160、扬声器161,传声器162可提供用户与终端900之间的音频接口。音频电路160可将接收到的音频数据转换后的电信号,传输到扬声器161,由扬声器161转换为声音信号输出;另一方面,传声器162将收集的声音信号转换为电信号,由音频电路160接收后转换为音频数据,再将音频数据输出处理器180处理后,经RF电路110以发送给比如另一终端,或者将音频数据输出至存储器120以便进一步处理。音频电路160还可能包括耳塞插孔,以提供外设耳机与终端900的通信。
WiFi属于短距离无线传输技术,终端900通过WiFi模块170可以帮助用户收发电子邮件、浏览网页和访问流式媒体等,它为用户提供了无线的宽带互联网访问。虽然图6示出了WiFi模块170,但是可以理解的是,其并不属于终 端900的必须构成,完全可以根据需要在不改变发明的本质的范围内而省略。
处理器180是终端900的控制中心,利用各种接口和线路连接整个手机的各个部分,通过运行或执行存储在存储器120内的软件程序和/或模块,以及调用存储在存储器120内的数据,执行终端900的各种功能和处理数据,从而对手机进行整体监控。可选的,处理器180可包括一个或多个处理核心;优选的,处理器180可集成应用处理器和调制解调处理器,其中,应用处理器主要处理操作系统、用户界面和应用程序等,调制解调处理器主要处理无线通信。可以理解的是,上述调制解调处理器也可以不集成到处理器180中。
终端900还包括给各个部件供电的电源190(比如电池),优选的,电源可以通过电源管理系统与处理器180逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。电源190还可以包括一个或一个以上的直流或交流电源、再充电系统、电源故障检测电路、电源转换器或者逆变器、电源状态指示器等任意组件。
尽管未示出,终端900还可以包括摄像头、蓝牙模块等,在此不再赘述。具体在本实施例中,终端900的显示单元是触摸屏显示器,终端900还包括有存储器,以及一个或者一个以上的程序,其中一个或者一个以上程序存储于存储器中,且经配置以由一个或者一个以上处理器执行述一个或者一个以上程序来执行上述各个实施例所述的生成视频的方法。
在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器,上述指令可由移动终端的处理器执行以完成上述生成视频的方法。例如,所述非临时性计算机可读存储介质可以是ROM(Read-Only Memory,只读存储器)、RAM(Random-Access Memory,随机存取存储器)、CD-ROM(Compact Disc Read-Only Memory,光盘只读存储器)、磁带、软盘和光数据存储设备等。
本领域普通技术人员可以理解实现上述实施例的全部或部分步骤可以通过硬件来完成,也可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,上述提到的存储介质可以是只读存储器,磁盘或光盘等。
以上所述仅为本发明的较佳实施例,并不用以限制本发明,凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。

Claims (18)

  1. 一种生成视频的方法,其特征在于,所述方法包括:
    播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;
    在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;
    当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;
    当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。
  2. 根据权利要求1所述的方法,其特征在于,所述播放待录制歌曲的伴奏音频之前,还包括:确定用户在目标歌曲中选取的歌曲片段;
    所述播放待录制歌曲的伴奏音频,包括:播放所述歌曲片段的伴奏音频。
  3. 根据权利要求2所述的方法,其特征在于,所述确定用户在目标歌曲中选取的歌曲片段,包括:
    显示目标歌曲的歌词列表;
    获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
  4. 根据权利要求2所述的方法,其特征在于,所述确定用户在目标歌曲中选取的歌曲片段,包括:
    显示目标歌曲的播放时间轴;
    获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
  5. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    显示至少一个音频特效的选项;
    当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据 所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行播放;
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
  6. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    显示至少一个音频编辑处理的选项;
    当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处理,并对处理后的音频进行播放;
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
  7. 一种生成视频的装置,其特征在于,所述装置包括:
    播放模块,用于播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;
    显示模块,用于在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;
    预览模块,用于当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;
    合成模块,用于当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。
  8. 根据权利要求7所述的装置,其特征在于,所述装置还包括确定模块,用于:在所述播放待录制歌曲的伴奏音频之前,确定用户在目标歌曲中选取的歌曲片段;
    所述播放模块,用于:播放所述歌曲片段的伴奏音频。
  9. 根据权利要求8所述的装置,其特征在于,所述确定模块,用于:
    显示目标歌曲的歌词列表;
    获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
  10. 根据权利要求8所述的装置,其特征在于,所述确定模块,用于:
    显示目标歌曲的播放时间轴;
    获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
  11. 根据权利要求7所述的装置,其特征在于,所述显示模块,还用于显示至少一个音频特效的选项;
    所述预览模块,还用于当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行播放;
    所述合成模块,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
  12. 根据权利要求7所述的方法,其特征在于,所述显示模块,还用于显示至少一个音频编辑处理的选项;
    所述预览模块,还用于当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处理,并对处理后的音频进行播放;
    所述合成模块,用于当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
  13. 一种终端,其特征在于,所述终端包括:
    一个或多个处理器;和
    存储器;
    所述存储器存储有一个或多个程序,所述一个或多个程序被配置成由所述一个或多个处理器执行,所述一个或多个程序包含用于进行以下操作的指令:
    播放待录制歌曲的伴奏音频,显示所述伴奏音频对应的歌词字幕,并进行视频图像的拍摄和音频的录制;
    在所述视频图像的拍摄和所述音频的录制结束后,显示预先存储的至少一个视频特效组合的选项,并显示合成按键;其中,所述视频特效组合包括至少一个滤镜和/或至少一个前景视频;
    当接收到对所述至少一个视频特效组合中的第一视频特效组合的选择指令时,根据所述第一视频特效组合,对拍摄的视频图像进行组合特效处理,并对处理后的视频图像进行显示,同时播放所述伴奏音频和录制的音频;
    当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频。
  14. 根据权利要求13所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:
    确定用户在目标歌曲中选取的歌曲片段;
    播放所述歌曲片段的伴奏音频。
  15. 根据权利要求14所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:
    显示目标歌曲的歌词列表;
    获取用户在所述歌词列表中设置的歌曲片段的起始点和终止点;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
  16. 根据权利要求14所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:
    显示目标歌曲的播放时间轴;
    获取用户在所述播放时间轴中设置的歌曲片段的起始点和终止点;
    根据所述起始点和终止点,确定所述用户在所述目标歌曲中选取的歌曲片段。
  17. 根据权利要求13所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:
    显示至少一个音频特效的选项;
    当接收到对所述至少一个音频特效中的第一音频特效的选择指令时,根据所述第一音频特效,对所述录制的音频进行特效处理,并对处理后的音频进行 播放;
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
  18. 根据权利要求13所述的终端,其特征在于,所述一个或多个程序还包含用于进行以下操作的指令:
    显示至少一个音频编辑处理的选项;
    当接收到对所述至少一个音频编辑处理中的第一音频编辑处理的选择指令时,对所述录制的音频执行所述第一音频编辑处理,并对处理后的音频进行播放;
    所述当接收到对所述合成按键的点击指令时,将所述伴奏音频、所述录制的音频和所述处理后的视频图像进行合成,得到合成视频,包括:
    当接收到对所述合成按键的点击指令时,将所述处理后的音频、所述伴奏音频和所述处理后的视频图像进行合成,得到合成视频。
PCT/CN2016/080666 2015-05-04 2016-04-29 一种生成视频的方法和装置 WO2016177296A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510221018.9 2015-05-04
CN201510221018.9A CN104967900B (zh) 2015-05-04 2015-05-04 一种生成视频的方法和装置

Publications (1)

Publication Number Publication Date
WO2016177296A1 true WO2016177296A1 (zh) 2016-11-10

Family

ID=54221823

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/080666 WO2016177296A1 (zh) 2015-05-04 2016-04-29 一种生成视频的方法和装置

Country Status (2)

Country Link
CN (1) CN104967900B (zh)
WO (1) WO2016177296A1 (zh)

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109285532A (zh) * 2018-10-31 2019-01-29 深圳市酷达通讯有限公司 一种点歌系统
CN109325219A (zh) * 2018-08-24 2019-02-12 维沃移动通信有限公司 一种生成记录文档的方法、装置及系统
CN109473117A (zh) * 2018-12-18 2019-03-15 广州市百果园信息技术有限公司 音频特效叠加方法、装置及其终端
CN109683993A (zh) * 2018-12-12 2019-04-26 努比亚技术有限公司 一种应用程序处理方法、设备及计算机可读存储介质
CN109840879A (zh) * 2017-11-28 2019-06-04 腾讯科技(深圳)有限公司 图像渲染方法、装置、计算机存储介质及终端
WO2019105438A1 (zh) * 2017-11-30 2019-06-06 广州市百果园信息技术有限公司 视频特效添加方法、装置及智能移动终端
CN110446097A (zh) * 2019-08-26 2019-11-12 维沃移动通信有限公司 录屏方法及移动终端
CN111327855A (zh) * 2020-03-10 2020-06-23 网易(杭州)网络有限公司 一种视频录制方法、装置以及视频定位方法、装置
CN111491205A (zh) * 2020-04-17 2020-08-04 维沃移动通信有限公司 视频处理方法、装置及电子设备
CN112653920A (zh) * 2020-12-18 2021-04-13 北京字跳网络技术有限公司 视频处理方法、装置、设备、存储介质及计算机程序产品
CN112995536A (zh) * 2021-02-04 2021-06-18 上海哔哩哔哩科技有限公司 视频合成方法及系统
CN113079332A (zh) * 2021-03-16 2021-07-06 青岛海信移动通信技术股份有限公司 移动终端及其录屏方法
CN113489899A (zh) * 2021-06-29 2021-10-08 中国平安人寿保险股份有限公司 特效视频录制方法、装置、计算机设备及存储介质
CN114390205A (zh) * 2022-01-29 2022-04-22 西安维沃软件技术有限公司 拍摄方法、装置和电子设备
CN114390299A (zh) * 2020-10-16 2022-04-22 腾讯科技(深圳)有限公司 歌曲点播方法、装置、设备及计算机可读存储介质
CN114915830A (zh) * 2022-06-06 2022-08-16 武汉市芯中芯科技有限公司 一种利用手机麦克风实现wifi可视设备音视频合成的方法
CN115474088A (zh) * 2022-09-07 2022-12-13 腾讯音乐娱乐科技(深圳)有限公司 一种视频处理方法、计算机设备及存储介质
CN116668763A (zh) * 2022-11-10 2023-08-29 荣耀终端有限公司 录屏方法及装置
CN116708899A (zh) * 2022-06-30 2023-09-05 北京生数科技有限公司 应用于合成虚拟形象的视频处理方法、装置及存储介质
US11948385B2 (en) * 2020-12-23 2024-04-02 Abbyy Development Inc. Zero-footprint image capture by mobile device
CN115474088B (zh) * 2022-09-07 2024-05-28 腾讯音乐娱乐科技(深圳)有限公司 一种视频处理方法、计算机设备及存储介质

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI592021B (zh) 2015-02-04 2017-07-11 騰訊科技(深圳)有限公司 生成視頻的方法、裝置及終端
CN104967900B (zh) * 2015-05-04 2018-08-07 腾讯科技(深圳)有限公司 一种生成视频的方法和装置
CN106055671B (zh) * 2016-06-03 2022-06-14 腾讯科技(深圳)有限公司 一种多媒体数据处理方法及其设备
CN106131472A (zh) * 2016-07-26 2016-11-16 维沃移动通信有限公司 一种录像方法及移动终端
CN106604113A (zh) * 2016-12-15 2017-04-26 天脉聚源(北京)传媒科技有限公司 一种智能合成视频的方法及装置
CN106847246A (zh) * 2017-03-01 2017-06-13 华东交通大学 一种歌曲合唱方法、装置及系统
CN107786876A (zh) * 2017-09-21 2018-03-09 北京达佳互联信息技术有限公司 音乐和视频的同步方法、装置及移动终端
CN107707828B (zh) * 2017-09-26 2019-07-26 维沃移动通信有限公司 一种视频处理方法及移动终端
CN108055490B (zh) * 2017-10-25 2021-04-13 北京密境和风科技有限公司 一种视频处理方法、装置、移动终端及存储介质
CN108401124B (zh) * 2018-03-16 2020-08-25 广州酷狗计算机科技有限公司 视频录制的方法和装置
CN108810436A (zh) * 2018-05-24 2018-11-13 广州音乐猫乐器科技有限公司 一种基于全自动乐器合奏的视频录制方法和系统
CN108965757B (zh) * 2018-08-02 2021-04-06 广州酷狗计算机科技有限公司 视频录制方法、装置、终端及存储介质
CN109166596A (zh) * 2018-08-10 2019-01-08 北京微播视界科技有限公司 音乐编辑方法、装置、终端设备及计算机可读存储介质
CN109151356A (zh) * 2018-09-05 2019-01-04 传线网络科技(上海)有限公司 视频录制方法及装置
CN109522443A (zh) * 2018-09-29 2019-03-26 上海与德通讯技术有限公司 乐曲部分的收集方法、电子设备及计算机可读存储介质
CN109508393A (zh) * 2018-09-29 2019-03-22 上海与德通讯技术有限公司 乐曲部分的收集方法、电子设备及计算机可读存储介质
CN108962286B (zh) * 2018-10-15 2020-12-01 腾讯音乐娱乐科技(深圳)有限公司 音频识别方法、装置及存储介质
CN109005359B (zh) * 2018-10-31 2020-11-03 广州酷狗计算机科技有限公司 视频录制方法、装置存储介质
CN109348281B (zh) * 2018-11-08 2020-02-21 北京微播视界科技有限公司 视频处理方法、装置、计算机设备和存储介质
CN109587549B (zh) * 2018-12-05 2021-08-13 广州酷狗计算机科技有限公司 视频录制方法、装置、终端及存储介质
CN109413342B (zh) 2018-12-21 2021-01-08 广州酷狗计算机科技有限公司 音视频处理方法、装置、终端及存储介质
CN110324718B (zh) * 2019-08-05 2021-09-07 北京字节跳动网络技术有限公司 音视频生成方法、装置、电子设备及可读介质
CN111061405B (zh) * 2019-12-13 2021-08-27 广州酷狗计算机科技有限公司 录制歌曲音频的方法、装置、设备及存储介质
CN110996167A (zh) * 2019-12-20 2020-04-10 广州酷狗计算机科技有限公司 在视频中添加字幕的方法及装置
CN111970571B (zh) * 2020-08-24 2022-07-26 北京字节跳动网络技术有限公司 视频制作方法、装置、设备及存储介质
CN112312053B (zh) * 2020-10-29 2023-05-23 维沃移动通信有限公司 视频录制方法及装置
CN112422831A (zh) * 2020-11-20 2021-02-26 广州太平洋电脑信息咨询有限公司 视频生成方法、装置、计算机设备和存储介质
CN113658343A (zh) * 2021-07-27 2021-11-16 珠海市大悦科技有限公司 多媒体互动方法、装置及可读介质
CN114245036B (zh) * 2021-12-21 2024-03-12 北京达佳互联信息技术有限公司 视频制作方法及装置
CN115767141A (zh) * 2022-08-26 2023-03-07 维沃移动通信有限公司 视频播放方法、装置和电子设备

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1156377A (zh) * 1995-12-27 1997-08-06 娱乐消遣技术株式会社 卡拉ok系统
CN1383543A (zh) * 2000-06-20 2002-12-04 皇家菲利浦电子有限公司 一种卡拉ok系统
CN2617100Y (zh) * 2003-01-29 2004-05-19 项青松 自制音像资料的编辑制作装置
WO2007071954A1 (en) * 2005-12-19 2007-06-28 Landesberg, Andrew Live performance entertainment apparatus and method
CN201477863U (zh) * 2009-05-21 2010-05-19 董海涛 一种音乐电视合成机
CN102568527A (zh) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 一种轻松剪辑音频文件的方法、系统及其应用的移动手持装置
CN104079966A (zh) * 2014-05-15 2014-10-01 惠州市水木网络科技有限公司 基于机顶盒的卡拉ok共享系统
CN104702856A (zh) * 2013-12-10 2015-06-10 音圆国际股份有限公司 应用于伴唱机的实时自拍特效合成mv的系统装置及方法
CN104883516A (zh) * 2015-06-05 2015-09-02 福建星网视易信息系统有限公司 一种制作实时演唱视频的方法及系统
CN104967801A (zh) * 2015-02-04 2015-10-07 腾讯科技(深圳)有限公司 一种视频数据处理方法和装置
CN104967900A (zh) * 2015-05-04 2015-10-07 腾讯科技(深圳)有限公司 一种生成视频的方法和装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101141598A (zh) * 2006-09-08 2008-03-12 郭鹏飞 一种用于同时提供歌曲字幕和前景的卡拉ok光盘
CN201035651Y (zh) * 2007-02-01 2008-03-12 李智 自助式k歌录制机

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1156377A (zh) * 1995-12-27 1997-08-06 娱乐消遣技术株式会社 卡拉ok系统
CN1383543A (zh) * 2000-06-20 2002-12-04 皇家菲利浦电子有限公司 一种卡拉ok系统
CN2617100Y (zh) * 2003-01-29 2004-05-19 项青松 自制音像资料的编辑制作装置
WO2007071954A1 (en) * 2005-12-19 2007-06-28 Landesberg, Andrew Live performance entertainment apparatus and method
CN201477863U (zh) * 2009-05-21 2010-05-19 董海涛 一种音乐电视合成机
CN102568527A (zh) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 一种轻松剪辑音频文件的方法、系统及其应用的移动手持装置
CN104702856A (zh) * 2013-12-10 2015-06-10 音圆国际股份有限公司 应用于伴唱机的实时自拍特效合成mv的系统装置及方法
CN104079966A (zh) * 2014-05-15 2014-10-01 惠州市水木网络科技有限公司 基于机顶盒的卡拉ok共享系统
CN104967801A (zh) * 2015-02-04 2015-10-07 腾讯科技(深圳)有限公司 一种视频数据处理方法和装置
CN104967900A (zh) * 2015-05-04 2015-10-07 腾讯科技(深圳)有限公司 一种生成视频的方法和装置
CN104883516A (zh) * 2015-06-05 2015-09-02 福建星网视易信息系统有限公司 一种制作实时演唱视频的方法及系统

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109840879A (zh) * 2017-11-28 2019-06-04 腾讯科技(深圳)有限公司 图像渲染方法、装置、计算机存储介质及终端
WO2019105438A1 (zh) * 2017-11-30 2019-06-06 广州市百果园信息技术有限公司 视频特效添加方法、装置及智能移动终端
CN109325219A (zh) * 2018-08-24 2019-02-12 维沃移动通信有限公司 一种生成记录文档的方法、装置及系统
CN109285532A (zh) * 2018-10-31 2019-01-29 深圳市酷达通讯有限公司 一种点歌系统
CN109683993A (zh) * 2018-12-12 2019-04-26 努比亚技术有限公司 一种应用程序处理方法、设备及计算机可读存储介质
CN109683993B (zh) * 2018-12-12 2023-12-29 努比亚技术有限公司 一种应用程序处理方法、设备及计算机可读存储介质
CN109473117A (zh) * 2018-12-18 2019-03-15 广州市百果园信息技术有限公司 音频特效叠加方法、装置及其终端
CN110446097B (zh) * 2019-08-26 2022-04-15 维沃移动通信有限公司 录屏方法及移动终端
CN110446097A (zh) * 2019-08-26 2019-11-12 维沃移动通信有限公司 录屏方法及移动终端
CN111327855A (zh) * 2020-03-10 2020-06-23 网易(杭州)网络有限公司 一种视频录制方法、装置以及视频定位方法、装置
CN111327855B (zh) * 2020-03-10 2022-08-05 网易(杭州)网络有限公司 一种视频录制方法、装置以及视频定位方法、装置
CN111491205A (zh) * 2020-04-17 2020-08-04 维沃移动通信有限公司 视频处理方法、装置及电子设备
CN114390299A (zh) * 2020-10-16 2022-04-22 腾讯科技(深圳)有限公司 歌曲点播方法、装置、设备及计算机可读存储介质
CN114390299B (zh) * 2020-10-16 2024-02-02 腾讯科技(深圳)有限公司 歌曲点播方法、装置、设备及计算机可读存储介质
CN112653920A (zh) * 2020-12-18 2021-04-13 北京字跳网络技术有限公司 视频处理方法、装置、设备、存储介质及计算机程序产品
US11948385B2 (en) * 2020-12-23 2024-04-02 Abbyy Development Inc. Zero-footprint image capture by mobile device
CN112995536A (zh) * 2021-02-04 2021-06-18 上海哔哩哔哩科技有限公司 视频合成方法及系统
CN113079332A (zh) * 2021-03-16 2021-07-06 青岛海信移动通信技术股份有限公司 移动终端及其录屏方法
CN113489899A (zh) * 2021-06-29 2021-10-08 中国平安人寿保险股份有限公司 特效视频录制方法、装置、计算机设备及存储介质
CN114390205A (zh) * 2022-01-29 2022-04-22 西安维沃软件技术有限公司 拍摄方法、装置和电子设备
CN114390205B (zh) * 2022-01-29 2023-09-15 西安维沃软件技术有限公司 拍摄方法、装置和电子设备
CN114915830A (zh) * 2022-06-06 2022-08-16 武汉市芯中芯科技有限公司 一种利用手机麦克风实现wifi可视设备音视频合成的方法
CN116708899A (zh) * 2022-06-30 2023-09-05 北京生数科技有限公司 应用于合成虚拟形象的视频处理方法、装置及存储介质
CN116708899B (zh) * 2022-06-30 2024-01-23 北京生数科技有限公司 应用于合成虚拟形象的视频处理方法、装置及存储介质
CN115474088A (zh) * 2022-09-07 2022-12-13 腾讯音乐娱乐科技(深圳)有限公司 一种视频处理方法、计算机设备及存储介质
CN115474088B (zh) * 2022-09-07 2024-05-28 腾讯音乐娱乐科技(深圳)有限公司 一种视频处理方法、计算机设备及存储介质
CN116668763A (zh) * 2022-11-10 2023-08-29 荣耀终端有限公司 录屏方法及装置
CN116668763B (zh) * 2022-11-10 2024-04-19 荣耀终端有限公司 录屏方法及装置

Also Published As

Publication number Publication date
CN104967900B (zh) 2018-08-07
CN104967900A (zh) 2015-10-07

Similar Documents

Publication Publication Date Title
WO2016177296A1 (zh) 一种生成视频的方法和装置
US10841661B2 (en) Interactive method, apparatus, and system in live room
TWI592021B (zh) 生成視頻的方法、裝置及終端
WO2018184488A1 (zh) 视频配音方法及装置
US10255929B2 (en) Media presentation playback annotation
WO2020015333A1 (zh) 视频拍摄方法、装置、终端设备及存储介质
WO2019105438A1 (zh) 视频特效添加方法、装置及智能移动终端
CN109302538B (zh) 音乐播放方法、装置、终端及存储介质
US9924205B2 (en) Video remote-commentary synchronization method and system, and terminal device
CN108924464B (zh) 视频文件的生成方法、装置及存储介质
WO2017076143A1 (zh) 视频的直播流转点播数据的方法、装置及系统
US11670339B2 (en) Video acquisition method and device, terminal and medium
CN111050203B (zh) 一种视频处理方法、装置、视频处理设备及存储介质
WO2019062541A1 (zh) 一种实时数字音频信号混音的方法及装置
WO2018157812A1 (zh) 一种实现视频分支选择播放的方法及装置
WO2016184295A1 (zh) 即时通讯方法、用户设备及系统
WO2017088527A1 (zh) 音频文件的重录方法、装置及存储介质
US20210349678A1 (en) Methods and electronic devices for dynamic control of playlists
CN107948562B (zh) 视频录制方法和视频录制终端
CN104636110B (zh) 控制音量的方法及装置
CN111147779B (zh) 视频制作方法、电子设备及介质
WO2017215661A1 (zh) 一种场景音效的控制方法、及电子设备
CN104639977A (zh) 节目播放的方法及装置
KR102186815B1 (ko) 컨텐츠 스크랩 방법, 장치 및 기록매체
AU2014200042B2 (en) Method and apparatus for controlling contents in electronic device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16789290

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 11.04.2018)

122 Ep: pct application non-entry in european phase

Ref document number: 16789290

Country of ref document: EP

Kind code of ref document: A1