WO2012016406A1 - Method and mobile terminal for adding background sound in recording procedure - Google Patents

Method and mobile terminal for adding background sound in recording procedure

Info

Publication number
WO2012016406A1
WO2012016406A1 PCT/CN2010/079242 CN2010079242W WO2012016406A1 WO 2012016406 A1 WO2012016406 A1 WO 2012016406A1 CN 2010079242 W CN2010079242 W CN 2010079242W WO 2012016406 A1 WO2012016406 A1 WO 2012016406A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
file
video
frame
mobile terminal
Prior art date
Application number
PCT/CN2010/079242
Other languages
French (fr)
Chinese (zh)
Inventor
刘洪霞
李�昊
Original Assignee
中兴通讯股份有限公司
Priority date
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2012016406A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/76 Television signal recording
    • H04N5/765 Interface circuits between an apparatus for recording and another apparatus
    • H04N5/77 Interface circuits between an apparatus for recording and another apparatus between a recording apparatus and a television camera
    • H04N5/772 Interface circuits between an apparatus for recording and another apparatus between a recording apparatus and a television camera the recording apparatus and the television camera being placed in the same enclosure
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/414 Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/41407 Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223 Cameras
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433 Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • H04N21/4334 Recording operations
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439 Processing of audio elementary streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/64 Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/65 Recording arrangements for recording a message from the calling party
    • H04M1/656 Recording arrangements for recording a message from the calling party for recording conversations
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M2250/00 Details of telephonic subscriber devices
    • H04M2250/52 Details of telephonic subscriber devices including functional features of a camera
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2101/00 Still video cameras

Definitions

  • The present invention relates to the field of mobile communications, and in particular to a method and a mobile terminal for loading a background sound during video recording.
  • The mobile phone camera function is one such key add-on function.
  • The recording file format commonly used by mobile phones is 3gp.
  • A file in 3gp format consists of several boxes, and boxes can be nested; that is, a box can contain multiple other boxes.
  • The first 4 bytes of each box are an integer that stores the length of the box; the next 4 bytes are a string that stores the type of the box, such as "moov" or "mvhd";
  • the remaining bytes store the content of the box.
  • The file type box is 24 bytes long and has the type "ftyp". The remaining bytes record the version number and compatible version numbers of the 3gp file.
  • In a 3gp file there is exactly one box of type ftyp, and it appears at the head of the file.
  • A decoder can judge whether a file is a 3gp file by checking whether the file header contains the "ftyp" keyword and whether it conforms to the ftyp box standard.
  • The media information box has a variable length and the type "moov". The remaining bytes record the basic information of the video and audio in the 3gp file, including the encoding method used, the total duration, the starting positions of the audio and video frames in the file, the positions of key frames, and the playback timestamps of the audio and video frames.
  • In a 3gp file there is exactly one media information box, which usually appears at the head of the file (that is, after the ftyp box) or at the end of the file. It is the most important part of the 3gp file format.
  • When a 3gp file is read for playback, the media information box must be analyzed first; when a 3gp file is created (for example by recording), the information collected beforehand is generally written at the end of the file.
  • The media data box has a variable length and the type "mdat". The remaining bytes are generally used to store video frame and audio frame data, and the audio and video frames are generally interleaved in chronological order.
  • The number of media data boxes is variable and their position in the file is not fixed. They store the actual audio and video data and may also store other related data.
  • Compared with a traditional digital camcorder, the recording quality of a mobile phone is somewhat rough; but given the phone's powerful processing capability, enriching the camera function and giving users a better experience would make the function more powerful and more attractive to consumers.
  • In many scenarios, the user wants to re-edit a recording and replace the background sound in the video.
  • In that case, the user must first complete the recording with a handheld device such as a digital camcorder or a mobile phone, then import the captured file into a PC and process it with professional video editing software.
  • This approach requires considerable expertise and is time-consuming and laborious, so ordinary users are unwilling to attempt it.
  • The technical problem to be solved by the present invention is to provide a method and a mobile terminal for loading a background sound during video recording, so as to overcome the defect in the prior art that a user must synchronize the recording file to a PC and edit it with professional video processing software in order to replace the background sound in the recording file.
  • To solve the above problem, the present invention provides a method for loading a background sound during video recording, including:
  • when the user starts recording with the mobile terminal after choosing to load a background sound, the mobile terminal parses the background sound file selected by the user and encodes the video frames acquired during recording, then alternately writes the parsed audio frames and the encoded video frames into the recording file, and, after recording is completed, writes the basic information required to play the audio frames and video frames into the recording file.
  • The above method may also have the following feature:
  • the basic information includes: the encoding mode, the total duration, the starting positions of the audio frames and video frames in the recording file, the positions of key frames, and the playback timestamps of the audio and video frames.
  • The above method may further include:
  • the mobile terminal adds first identification information to each parsed audio frame;
  • while parsing the background sound file selected by the user, the mobile terminal encodes the audio data acquired by its microphone into audio frames and adds second identification information to each such audio frame; and
  • the mobile terminal alternately writes the encoded audio frames, together with the parsed audio frames and the encoded video frames, into the recording file.
  • The above method may also have the following feature:
  • the background sound file selected by the user is an audio file that the user selects as the background sound from an interface provided by the mobile terminal.
  • The invention also provides a mobile terminal for loading a background sound during video recording, comprising: a storage module, a human-computer interaction module, an audio file parsing module, a video frame encoding module, and a recording file production module;
  • the storage module is configured to save audio files;
  • the human-computer interaction module is configured to provide a human-computer interaction interface that shows the user the audio files saved in the storage module and, when the user selects one of them as the background sound, to send information about the selected audio file to the audio file parsing module;
  • the audio file parsing module is configured to obtain the corresponding audio file from the storage module according to the received information, parse the audio file into audio frames, and send the audio frames to the recording file production module;
  • the video frame encoding module is configured to encode the video frames acquired during recording and send the encoded video frames to the recording file production module;
  • the recording file production module is configured to alternately write the received audio frames and video frames into the recording file and, after recording is completed, write the basic information required to play the audio frames and video frames into the recording file.
  • The above mobile terminal may also have the following feature:
  • the basic information includes: the encoding mode, the total duration, the starting positions of the audio frames and video frames in the recording file, the positions of key frames, and the playback timestamps of the audio and video frames.
  • The mobile terminal may further include an audio data encoding module;
  • the audio file parsing module is further configured to add first identification information to each parsed audio frame;
  • the audio data encoding module is configured to encode the audio data acquired by the microphone on the mobile terminal into audio frames and add second identification information to each such audio frame;
  • the recording file production module is further configured to alternately write the encoded audio frames, together with the parsed audio frames and the encoded video frames, into the recording file.
  • The invention is simple to implement and very convenient for users.
  • Meanwhile, because the actual ambient sound need not be recorded, the overhead of audio capture and encoding on the mobile phone is saved, which makes recording more power-efficient and allows longer recording times.
  • Recording with a loaded background sound adds an additional recording mode that satisfies certain special needs of users, especially young users who pursue fashion and personalization.
  • Users can shoot scenes they like at any time while choosing their favorite music for synthesis.
  • Once shooting is complete, the recorded multimedia file can be sent immediately to friends via MMS (Multimedia Messaging Service). This new function enriches the user experience and makes the mobile phone's camera function more powerful.
  • FIG. 1 is a schematic diagram of a format of a 3GP file in the prior art
  • FIG. 2 is a flowchart of a method for loading a background sound during a recording process according to an embodiment of the present invention
  • FIG. 3 is a flowchart of a method for simultaneously recording a real-time background sound and a background sound file selected by a user according to an embodiment of the present invention
  • FIG. 4 is a structural diagram of a mobile terminal according to an embodiment of the present invention.
  • Preferred embodiments of the invention
  • The basic idea of the method of the present invention is as follows: when the user starts recording with the mobile terminal after choosing to load a background sound, the mobile terminal parses the background sound file selected by the user and encodes the video frames acquired during recording, then alternately writes the parsed audio frames and the encoded video frames into the recording file; after recording is completed, the basic information required to play these audio frames and video frames is written into the recording file. The basic information includes the encoding mode, the total duration, the starting positions of the audio and video frames in the file, the positions of key frames, and the playback timestamps of the audio and video frames.
  • The mobile terminal needs to provide a menu selection interface through which the user chooses whether to load a background sound for the recording file; after the user chooses to load a background sound, the mobile terminal also needs to provide a music selection interface through which the user selects and confirms an audio file as the background sound.
  • The mobile terminal performs the following process:
  • 1) the camera on the mobile terminal transmits the raw video frame data it acquires (such as YUV data) to the screen of the mobile terminal for display, and the mobile terminal encodes the raw video frame data according to a preset video coding mode;
  • 2) the mobile terminal parses the background sound file selected by the user into audio frames;
  • steps 1) and 2) can be performed simultaneously, in no particular order;
  • video frames and audio frames are continuously written into the recording file in chronological order;
  • after recording is completed, index information such as the starting position, size, and timestamp of each audio frame and video frame in the recording file is written into the corresponding field at the end of the file.
  • In the recording method above, the audio data collected by the microphone on the mobile terminal is not processed. If the user wishes to keep the microphone data in the recording file while also adding the selected background sound, the following process is performed, as shown in FIG. 3:
  • 1) the camera on the mobile terminal transmits the raw video frame data it acquires (such as YUV data) to the screen of the mobile terminal for display, and the mobile terminal encodes the raw video frame data according to a preset video coding mode;
  • 2) the mobile terminal parses the background sound file selected by the user into audio frames and adds first identification information to each audio frame, the first identification information indicating that the audio frame comes from the background sound file selected by the user;
  • 3) the mobile terminal encodes the audio data acquired by the microphone (such as PCM (Pulse Code Modulation) data) into audio frames according to a preset audio coding mode and adds second identification information to each of these audio frames, the second identification information indicating that the audio frame corresponds to the voice data acquired by the microphone;
  • steps 1) to 3) can be performed simultaneously, in no particular order;
  • video frames and both kinds of audio frames are continuously written into the recording file in chronological order;
  • after recording is completed, index information such as the starting position, size, and timestamp of each audio frame and video frame in the recording file is written into the corresponding field at the end of the file.
  • Using the first and second identification information, a decoder can separately decode the audio frames that correspond to the background sound file and those that correspond to the actual ambient sound collected during recording.
  • The mobile terminal of the present invention that loads a background sound during video recording includes: a storage module, a human-computer interaction module, an audio file parsing module, a video frame encoding module, and a recording file production module;
  • the storage module is used to save audio files;
  • the human-computer interaction module is used to provide a human-computer interaction interface that shows the user the audio files saved in the storage module and, when the user selects one as the background sound, to send information about the selected audio file to the audio file parsing module;
  • the audio file parsing module is used to obtain the corresponding audio file from the storage module according to the received information, parse the audio file into audio frames, and send the audio frames to the recording file production module;
  • the video frame encoding module is used to encode the video frames acquired during recording and send the encoded video frames to the recording file production module;
  • the recording file production module is used to alternately write the received audio frames and video frames into the recording file and, after recording is completed, write the basic information required to play the audio frames and video frames into the recording file.
  • The basic information may include: the encoding mode, the total duration, the starting positions of the audio frames and video frames in the recording file, the positions of key frames, and the playback timestamps of the audio and video frames.
  • The mobile terminal may further include an audio data encoding module;
  • the audio file parsing module may further be used to add first identification information to each parsed audio frame;
  • the audio data encoding module may be used to encode the audio data acquired by the microphone on the mobile terminal into audio frames and add second identification information to each such audio frame;
  • the recording file production module is used to alternately write the encoded audio frames, together with the parsed audio frames and the encoded video frames, into the recording file.
  • The invention is simple to implement and very convenient for users.
  • Meanwhile, because the actual ambient sound need not be recorded, the overhead of audio capture and encoding on the mobile phone is saved, which makes recording more power-efficient and allows longer recording times.
  • Recording with a loaded background sound adds an additional recording mode that satisfies certain special needs of users, especially young users who pursue fashion and personalization.
  • Users can shoot scenes they like at any time while choosing their favorite music for synthesis.
  • Once shooting is complete, the recorded multimedia file can be sent immediately to friends via MMS. This new function enriches the user experience and makes the mobile phone's camera function more powerful.
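The playback step described above, in which a decoder uses the two pieces of identification information to tell background-sound frames from microphone frames, can be sketched as follows. This is a minimal illustration under stated assumptions: the source does not say how the identification information is encoded, so the integer values, the dictionary-based frame layout, and the function name `split_audio_tracks` are all hypothetical.

```python
from collections import defaultdict

BACKGROUND_ID = 1  # hypothetical value for the first identification information
MICROPHONE_ID = 2  # hypothetical value for the second identification information

def split_audio_tracks(frames):
    """Group audio frames by identification value, preserving file order.

    Each frame is a dict with "kind", "ts_ms" and "data"; audio frames
    additionally carry "source_id". Video frames are ignored here.
    """
    tracks = defaultdict(list)
    for frame in frames:
        if frame["kind"] == "audio":
            tracks[frame["source_id"]].append(frame)
    return dict(tracks)

# Hypothetical frames in the order they were written to the recording file.
recorded = [
    {"kind": "video", "ts_ms": 0, "data": b"v0"},
    {"kind": "audio", "ts_ms": 0, "data": b"bg0", "source_id": BACKGROUND_ID},
    {"kind": "audio", "ts_ms": 0, "data": b"mic0", "source_id": MICROPHONE_ID},
    {"kind": "audio", "ts_ms": 20, "data": b"bg1", "source_id": BACKGROUND_ID},
    {"kind": "audio", "ts_ms": 20, "data": b"mic1", "source_id": MICROPHONE_ID},
    {"kind": "video", "ts_ms": 33, "data": b"v1"},
]
tracks = split_audio_tracks(recorded)
```

Each resulting list can then be handed to whichever audio decoder matches its source, so the background music and the ambient sound are decoded independently.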

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The present invention discloses a method and a mobile terminal for adding a background sound in a recording procedure. The mobile terminal includes: a storage module, a human-computer interaction module, an audio file parsing module, a video frame encoding module and a recording file production module. The method includes the following steps: when a user starts recording with a mobile terminal after choosing to add the background sound, the mobile terminal parses the background sound file selected by the user and encodes the video frames obtained in the recording procedure; the parsed audio frames and the encoded video frames are then written into the recording file alternately, and the basic information needed for playing the audio frames and video frames is written into the recording file after the recording is completed. The invention thereby enriches the user experience and strengthens the camera function of the mobile phone.

Description

Method for loading background sound in video recording process and mobile terminal
Technical field
The present invention relates to the field of mobile communications, and in particular to a method and a mobile terminal for loading a background sound during video recording.
Background art
With the development of technology, mobile phones are being given more and more additional functions. Whether a phone has these additional functions, and whether they provide a better user experience, has become one of the key factors consumers consider when choosing a mobile phone. The mobile phone camera function is one such key additional function.
At present, the recording file format commonly used by mobile phones is 3gp. As shown in FIG. 1, a file in 3gp format consists of several boxes, and boxes can be nested; that is, a box can contain multiple other boxes. The first 4 bytes of each box are an integer that stores the length of the box; the next 4 bytes are a string that stores the type of the box, such as "moov" or "mvhd"; the remaining bytes store the content of the box. Several basic boxes are introduced below:
1) File type box:
This box is 24 bytes long and has the type "ftyp". The remaining bytes record the version number and compatible version numbers of the 3gp file. In a 3gp file there is exactly one box of type ftyp, and it appears at the head of the file. A decoder can judge whether a file is a 3gp file by checking whether the file header contains the "ftyp" keyword and whether it conforms to the ftyp box standard.
2) Media information box:
This box has a variable length and the type "moov". The remaining bytes record the basic information of the video and audio in the 3gp file, including the encoding method used, the total duration, the starting positions of the audio and video frames in the file, the positions of key frames, and the playback timestamps of the audio and video frames.
In a 3gp file there is exactly one media information box, which usually appears at the head of the file (that is, after the ftyp box) or at the end of the file. It is the most important part of the 3gp file format. When a 3gp file is read for playback, the media information box must be analyzed first; when a 3gp file is created (for example by recording), the information collected beforehand is generally written at the end of the file.
3) Media data box:
This box has a variable length and the type "mdat". The remaining bytes are generally used to store video frame and audio frame data, and the audio and video frames are generally interleaved in chronological order.
The number of media data boxes is variable and their position in the file is not fixed. They store the actual audio and video data and may also store other related data.
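The box layout described above (a 4-byte length, a 4-byte type, then the content) can be walked with a few lines of code. The following Python sketch is illustrative only: the names `iter_boxes`, `looks_like_3gp` and `make_box` and the sample bytes are assumptions introduced here. The length field is read as a big-endian integer that counts the 8 header bytes, as in the ISO base media file format on which 3gp is built; the 64-bit extended sizes that real files may use are not handled.

```python
import struct

def iter_boxes(data, offset=0, end=None):
    """Yield (type, content_start, content_end) for each box in data[offset:end]."""
    end = len(data) if end is None else end
    while offset + 8 <= end:
        size, box_type = struct.unpack_from(">I4s", data, offset)
        if size < 8 or offset + size > end:
            break  # malformed or unsupported size (e.g. 64-bit); stop walking
        yield box_type.decode("ascii", "replace"), offset + 8, offset + size
        offset += size

def looks_like_3gp(data):
    """Check that the first box has type 'ftyp', the test the text attributes to decoders."""
    first = next(iter_boxes(data), None)
    return first is not None and first[0] == "ftyp"

def make_box(box_type, content):
    """Build a box: 4-byte length, 4-byte type, then the content bytes."""
    return struct.pack(">I4s", 8 + len(content), box_type) + content

# Hypothetical sample: a 24-byte ftyp box, an mdat box, and a moov box
# that nests an mvhd box (contents are placeholders, not real media data).
ftyp = make_box(b"ftyp", b"3gp4" + struct.pack(">I", 0) + b"3gp4" + b"isom")
mdat = make_box(b"mdat", b"\x00" * 10)
moov = make_box(b"moov", make_box(b"mvhd", b"\x00" * 4))
sample = ftyp + mdat + moov

top_level = [box_type for box_type, _, _ in iter_boxes(sample)]
```

Because boxes nest, the same function can be called again on the content range of a container box such as moov to list its children.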
Compared with a traditional digital camcorder, the recording quality of a mobile phone is somewhat rough; but given the phone's powerful processing capability, enriching the camera function and giving users a better experience would make the function more powerful and more attractive to consumers.
In many scenarios, the user wants to re-edit a recording and replace the background sound in the video. In that case, the user must first complete the recording with a handheld device such as a digital camcorder or a mobile phone, then import the captured file into a PC and process it with professional video editing software. This approach requires considerable expertise and is time-consuming and laborious, so ordinary users are unwilling to attempt it.
Summary of the invention
The technical problem to be solved by the present invention is to provide a method and a mobile terminal for loading a background sound during video recording, so as to overcome the defect in the prior art that a user must synchronize the recording file to a PC and edit it with professional video processing software in order to replace the background sound in the recording file.
To solve the above problem, the present invention provides a method for loading a background sound during video recording, including:
when the user starts recording with the mobile terminal after choosing to load a background sound, the mobile terminal parses the background sound file selected by the user and encodes the video frames acquired during recording, then alternately writes the parsed audio frames and the encoded video frames into the recording file, and, after recording is completed, writes the basic information required to play the audio frames and video frames into the recording file.
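The write order described in this step (frames interleaved chronologically while recording, playback information appended once recording ends) can be sketched as below. This is a simplified illustration, not the patented implementation: the function name `write_recording`, the tuple-based frame format and the JSON index are assumptions made here, and the JSON payload is only a stand-in for a real "moov" box, whose actual layout is far more detailed.

```python
import heapq
import io
import json
import struct

def write_recording(out, audio_frames, video_frames):
    """Interleave frames by timestamp into one 'mdat' box, then append an index.

    Each frame is a (timestamp_ms, payload_bytes) tuple; both inputs are
    assumed to be sorted by timestamp already.
    """
    mdat_start = out.tell()
    out.write(struct.pack(">I4s", 0, b"mdat"))  # length is patched after recording
    index = []
    tagged_audio = ((ts, "audio", data) for ts, data in audio_frames)
    tagged_video = ((ts, "video", data) for ts, data in video_frames)
    for ts, kind, data in heapq.merge(tagged_audio, tagged_video, key=lambda f: f[0]):
        # Remember where each frame lands so a player can find it later.
        index.append({"kind": kind, "offset": out.tell(), "size": len(data), "ts": ts})
        out.write(data)
    mdat_end = out.tell()
    # Recording is complete: fix up the mdat length, then write the index last.
    out.seek(mdat_start)
    out.write(struct.pack(">I", mdat_end - mdat_start))
    out.seek(mdat_end)
    # Stand-in only: a real 'moov' box holds nested boxes such as mvhd, not JSON.
    payload = json.dumps(index).encode("utf-8")
    out.write(struct.pack(">I4s", 8 + len(payload), b"moov") + payload)
    return index

# Hypothetical usage with placeholder payloads written to an in-memory file.
buf = io.BytesIO()
idx = write_recording(buf, [(0, b"A0"), (20, b"A1")], [(0, b"V0V0"), (33, b"V1V1")])
```

Writing the index last is what lets the recorder stream frames to storage without knowing in advance how long the recording will be.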
The above method may also have the following feature:
the basic information includes: the encoding mode, the total duration, the starting positions of the audio frames and video frames in the recording file, the positions of key frames, and the playback timestamps of the audio and video frames.
The above method may further include:
the mobile terminal adds first identification information to each parsed audio frame;
while parsing the background sound file selected by the user, the mobile terminal encodes the audio data acquired by its microphone into audio frames and adds second identification information to each such audio frame; and
the mobile terminal alternately writes the encoded audio frames, together with the parsed audio frames and the encoded video frames, into the recording file.
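The tagging and interleaving steps above can be sketched as follows. The source does not specify how the identification information is represented, so the integer constants, the `Frame` dataclass and the helper names `tag_audio` and `interleave` are hypothetical choices made only for illustration.

```python
import heapq
from dataclasses import dataclass

BACKGROUND_ID = 1  # hypothetical value for the first identification information
MICROPHONE_ID = 2  # hypothetical value for the second identification information

@dataclass(frozen=True)
class Frame:
    ts_ms: int
    kind: str           # "audio" or "video"
    data: bytes
    source_id: int = 0  # 0 means untagged (used here for video frames)

def tag_audio(frames, source_id):
    """Wrap (timestamp_ms, payload) pairs as audio frames carrying source_id."""
    return [Frame(ts, "audio", data, source_id) for ts, data in frames]

def interleave(*streams):
    """Merge already-sorted streams into one chronological write order."""
    return list(heapq.merge(*streams, key=lambda f: f.ts_ms))

# Hypothetical placeholder frames from the three sources.
background = tag_audio([(0, b"bg0"), (20, b"bg1")], BACKGROUND_ID)
microphone = tag_audio([(0, b"mic0"), (20, b"mic1")], MICROPHONE_ID)
video = [Frame(0, "video", b"v0"), Frame(33, "video", b"v1")]
write_order = interleave(video, background, microphone)
```

Keeping the tag on each frame, rather than in a separate table, means the two audio sources stay distinguishable even though they share one interleaved stream.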
The above method may also have the following feature:
the background sound file selected by the user is an audio file that the user selects as the background sound from an interface provided by the mobile terminal.
The invention also provides a mobile terminal for loading a background sound during video recording, comprising: a storage module, a human-computer interaction module, an audio file parsing module, a video frame encoding module, and a recording file production module;
the storage module is configured to save audio files;
the human-computer interaction module is configured to provide a human-computer interaction interface that shows the user the audio files saved in the storage module and, when the user selects one of them as the background sound, to send information about the selected audio file to the audio file parsing module;
the audio file parsing module is configured to obtain the corresponding audio file from the storage module according to the received information, parse the audio file into audio frames, and send the audio frames to the recording file production module;
the video frame encoding module is configured to encode the video frames acquired during recording and send the encoded video frames to the recording file production module;
the recording file production module is configured to alternately write the received audio frames and video frames into the recording file and, after recording is completed, write the basic information required to play the audio frames and video frames into the recording file.
The above mobile terminal may also have the following feature:
the basic information includes: the encoding mode, the total duration, the starting positions of the audio frames and video frames in the recording file, the positions of key frames, and the playback timestamps of the audio and video frames.
The mobile terminal may further include an audio data encoding module;
the audio file parsing module is further configured to add first identification information to each parsed audio frame;
the audio data encoding module is configured to encode the audio data acquired by the microphone on the mobile terminal into audio frames and add second identification information to each such audio frame;
the recording file production module is further configured to alternately write the encoded audio frames, together with the parsed audio frames and the encoded video frames, into the recording file.
The present invention is simple to implement and convenient for the user. Because the real background sound need not be recorded, the handset is spared the overhead of audio capture and encoding, so recording consumes less power and can run longer. Recording with a loaded background sound adds an extra camera mode that meets certain special needs of users. Young users who value fashion and individuality, in particular, can shoot a scene they like at any time while choosing their favorite music to combine with it, and once shooting is finished they can immediately send the recorded multimedia file to friends via MMS (Multimedia Messaging Service). This new function enriches the user experience and makes the handset's camera function more powerful.
Brief Description of the Drawings
FIG. 1 is a schematic diagram of the format of a 3GP file in the prior art;
FIG. 2 is a flowchart of a method for loading a background sound during video recording according to an embodiment of the present invention;
FIG. 3 is a flowchart of a method for recording the real-time background sound and a user-selected background sound file simultaneously according to an embodiment of the present invention;
FIG. 4 is a structural diagram of a mobile terminal according to an embodiment of the present invention.
Preferred Embodiments of the Invention
The basic idea of the method of the present invention is as follows: after the user has chosen to load a background sound and starts recording with the mobile terminal, the mobile terminal parses the background sound file selected by the user and encodes the video frames captured during recording. It then writes the parsed audio frames and the encoded video frames alternately into the video file and, after recording is complete, writes the basic information required to play those audio frames and video frames into the video file. The basic information includes the encoding method, the total duration, the starting positions of the audio and video frames in the file, the positions of the key frames, the playback timestamps of the audio and video frames, and similar information.
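The "basic information" listed above can be pictured as a small record that is filled in during recording and saved when recording ends. The following is only a minimal sketch: the class names, field names, and codec strings are hypothetical, and the total duration is approximated by the largest frame timestamp, which the source does not specify.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FrameIndexEntry:
    """One audio or video frame as located inside the video file (hypothetical layout)."""
    kind: str            # "audio" or "video"
    offset: int          # starting position of the frame in the file, in bytes
    size: int            # frame size in bytes
    timestamp_ms: int    # playback timestamp in milliseconds
    is_key: bool = False  # True for video key frames


@dataclass
class BasicInfo:
    """Information a player needs before it can play the recorded frames."""
    video_codec: str
    audio_codec: str
    entries: List[FrameIndexEntry] = field(default_factory=list)

    @property
    def total_duration_ms(self) -> int:
        # Assumption: duration is taken as the largest frame timestamp.
        return max((e.timestamp_ms for e in self.entries), default=0)

    @property
    def key_frame_offsets(self) -> List[int]:
        # File positions of the key frames, in the order they were recorded.
        return [e.offset for e in self.entries if e.is_key]
```

Keeping one entry per frame means the total duration and the key frame positions can be derived when recording ends rather than tracked separately.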
In a specific implementation, the mobile terminal provides a menu selection interface through which the user chooses whether to load a background sound for the video file. Once the user has chosen to load a background sound, the terminal also provides a music selection interface through which the user confirms which audio file on the mobile terminal is to serve as the background sound.
As shown in FIG. 2, after the user has selected the background sound and chosen to start recording, the mobile terminal performs the following flow:
1) The camera on the mobile terminal sends the captured raw video frame data (such as YUV data) to the screen of the mobile terminal for display, while the mobile terminal encodes that raw video frame data according to a preset video encoding method;
2) The mobile terminal parses the background sound file selected by the user into audio frames. Note that steps 1) and 2) may be performed simultaneously, in no particular order;
3) The parsed audio frames and the encoded video frames are written alternately into the video file.
According to the file encoding specification, video frames and audio frames must be written into the video file alternately and in chronological order. After recording is complete, index information such as the starting position, size, and timestamp of each audio frame and video frame in the video file is written into the corresponding fields at the end of the file.
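The alternating write in step 3) and the index written at the end of the file can be sketched as below. This is an illustration under stated assumptions, not the 3GP layout: the function name `write_recording`, the JSON index, and the 8-byte trailer that records where the index starts are all hypothetical stand-ins for the "corresponding fields at the end of the file".

```python
import heapq
import io
import json
import struct


def write_recording(out, audio_frames, video_frames):
    """Interleave two frame streams by timestamp and append an index.

    audio_frames / video_frames: iterables of (timestamp_ms, payload_bytes),
    each already sorted by timestamp. Returns the index that was written.
    """
    index = []
    # The second tuple element breaks timestamp ties (audio first) so that
    # payloads from different streams are never compared with each other.
    tagged_audio = ((ts, 0, "audio", data) for ts, data in audio_frames)
    tagged_video = ((ts, 1, "video", data) for ts, data in video_frames)

    for ts, _, kind, data in heapq.merge(tagged_audio, tagged_video):
        index.append({"kind": kind, "offset": out.tell(),
                      "size": len(data), "ts": ts})
        out.write(data)

    # After recording: write the index, then a trailer pointing at it.
    index_offset = out.tell()
    out.write(json.dumps(index).encode("utf-8"))
    out.write(struct.pack(">Q", index_offset))
    return index
```

With an in-memory file, `write_recording(io.BytesIO(), audio, video)` returns the index, and a reader can find the index by reading the last 8 bytes first. Writing the index last means frame data can be written as soon as it is produced, which matches the order described above.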
As the above description shows, the recording method above does not process the audio data captured by the microphone on the mobile terminal. If the user wishes to keep the microphone's voice data, so that the video file contains both the data captured by the microphone and the data of the background sound the user chose to add, the following flow is performed, as shown in FIG. 3:
1) The camera on the mobile terminal sends the captured raw video frame data (such as YUV data) to the screen of the mobile terminal for display, while the mobile terminal encodes that raw video frame data according to a preset video encoding method;
2) The mobile terminal parses the background sound file selected by the user into audio frames and adds first identification information to each of these audio frames. The first identification information indicates that the audio frame belongs to the background sound file selected by the user;
3) The mobile terminal encodes the audio data captured by its microphone (such as PCM (Pulse Code Modulation) data) into audio frames according to a preset audio encoding method and adds second identification information to each of these audio frames. The second identification information indicates that the audio frame corresponds to voice data captured by the microphone;
Note that steps 1) to 3) may be performed simultaneously, in no particular order;
4) The audio frames and video frames obtained in the above steps are written alternately into the video file.
According to the file encoding specification, the video frames and the two kinds of audio frames must be written into the video file alternately and in chronological order. After recording is complete, index information such as the starting position, size, and timestamp of each audio frame and video frame in the video file is written into the corresponding fields at the end of the file.
During playback, the decoder can use the first identification information and the second identification information to decode, respectively, the audio frames belonging to the background sound file and the real background sound captured during recording.
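The role of the two pieces of identification information can be sketched as a source tag carried by every audio frame, which lets a player separate the two audio streams again. The constants, the `AudioFrame` tuple, and both helper functions below are hypothetical; the source does not say how the identification information is encoded.

```python
from collections import namedtuple

# Hypothetical values standing in for the two kinds of identification information.
BACKGROUND_FILE = 1  # first identification information: frame parsed from the chosen file
MICROPHONE = 2       # second identification information: frame encoded from microphone data

AudioFrame = namedtuple("AudioFrame", "source timestamp_ms payload")


def tag_frames(source, frames):
    """Attach the identification information to each (timestamp_ms, payload) pair."""
    return [AudioFrame(source, ts, data) for ts, data in frames]


def split_by_source(frames):
    """Playback side: route each audio frame to its own track using its tag."""
    tracks = {BACKGROUND_FILE: [], MICROPHONE: []}
    for frame in frames:
        tracks[frame.source].append(frame)
    return tracks
```

Because each frame carries its own tag, the two audio streams can share one file and still be decoded separately, whatever order they were written in.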
The mobile terminal of the present invention for loading a background sound during video recording, as shown in FIG. 4, includes a storage module, a human-computer interaction module, an audio file parsing module, a video frame encoding module, and a video file creation module;
The storage module is used to save audio files;
The human-computer interaction module is used to provide a human-computer interaction interface that shows the user the audio files saved in the storage module and, when the user selects one of them as the background sound, to send information about the selected audio file to the audio file parsing module;
The audio file parsing module is used to obtain the corresponding audio file from the storage module according to the received audio file information, parse that audio file into audio frames, and send the audio frames to the video file creation module;
The video frame encoding module is used to encode the video frames captured during recording and send the encoded video frames to the video file creation module;
The video file creation module is used to write the received audio frames and video frames alternately into the video file and, after recording is complete, to write the basic information required to play the audio frames and video frames into the video file.
The basic information may include: the encoding method, the total duration, the starting positions of the audio frames and video frames in the video file, the positions of the key frames, and the playback timestamps of the audio and video frames.
In addition, the mobile terminal may further include an audio data encoding module;
The audio file parsing module may further be used to add first identification information to each parsed audio frame;
The audio data encoding module may be used to encode the audio data captured by the microphone on the mobile terminal into audio frames and to add second identification information to each of these audio frames;
The video file creation module is used to write the encoded audio frames, together with the parsed audio frames and the encoded video frames, alternately into the video file.
Although preferred embodiments of the present invention have been disclosed for illustrative purposes, those skilled in the art will appreciate that various modifications, additions, and substitutions are possible; the scope of the present invention should therefore not be limited to the embodiments described above.
Those of ordinary skill in the art will understand that all or some of the steps of the above method may be carried out by a program that instructs the relevant hardware, and that the program may be stored in a computer-readable storage medium such as a read-only memory, a magnetic disk, or an optical disc. Optionally, all or some of the steps of the above embodiments may also be implemented with one or more integrated circuits. Accordingly, each module or unit in the above embodiments may be implemented in hardware or as a software function module. The present invention is not limited to any particular combination of hardware and software.
Industrial Applicability
The present invention is simple to implement and convenient for the user. Because the real background sound need not be recorded, the handset is spared the overhead of audio capture and encoding, so recording consumes less power and can run longer. Recording with a loaded background sound adds an extra camera mode that meets certain special needs of users. Young users who value fashion and individuality, in particular, can shoot a scene they like at any time while choosing their favorite music to combine with it, and once shooting is finished they can immediately send the recorded multimedia file to friends via MMS. This new function enriches the user experience and makes the handset's camera function more powerful.

Claims

1. A method for loading a background sound during video recording, the method comprising:
after a user has chosen to load a background sound and starts recording with a mobile terminal, the mobile terminal parsing the background sound file selected by the user into audio frames and encoding the video frames captured during recording, then writing the parsed audio frames and the encoded video frames alternately into a video file, and, after recording is complete, writing the basic information required to play the audio frames and video frames into the video file.
2. The method of claim 1, wherein:
the basic information includes: the encoding method, the total duration, the starting positions of the audio frames and video frames in the video file, the positions of the key frames, and the playback timestamps of the audio and video frames.
3. The method of claim 1 or 2, the method further comprising:
the mobile terminal adding first identification information to each parsed audio frame;
the mobile terminal, while parsing the background sound file selected by the user, encoding the audio data captured by its microphone into audio frames and adding second identification information to each of these audio frames; and
the mobile terminal writing the encoded audio frames, together with the parsed audio frames and the encoded video frames, alternately into the video file.
4. The method of claim 1, wherein
the background sound file selected by the user is an audio file that the user selects as the background sound from an interface provided by the mobile terminal.
5. A mobile terminal for loading a background sound during video recording, the mobile terminal comprising: a storage module, a human-computer interaction module, an audio file parsing module, a video frame encoding module, and a video file creation module;
the storage module being configured to save audio files;
the human-computer interaction module being configured to provide a human-computer interaction interface that shows the user the audio files saved in the storage module and, when the user selects one of them as the background sound, to send information about the selected audio file to the audio file parsing module;
the audio file parsing module being configured to obtain the corresponding audio file from the storage module according to the received audio file information, parse that audio file into audio frames, and send the audio frames to the video file creation module;
the video frame encoding module being configured to encode the video frames captured during recording and send the encoded video frames to the video file creation module;
the video file creation module being configured to write the received audio frames and video frames alternately into a video file and, after recording is complete, to write the basic information required to play the audio frames and video frames into the video file.
6. The mobile terminal of claim 5, wherein
the basic information includes: the encoding method, the total duration, the starting positions of the audio frames and video frames in the video file, the positions of the key frames, and the playback timestamps of the audio and video frames.
7. The mobile terminal of claim 5 or 6, the mobile terminal further comprising an audio data encoding module;
the audio file parsing module being further configured to add first identification information to each parsed audio frame;
the audio data encoding module being configured to encode the audio data captured by the microphone on the mobile terminal into audio frames and to add second identification information to each of these audio frames;
the video file creation module being configured to write the encoded audio frames, together with the parsed audio frames and the encoded video frames, alternately into the video file.
PCT/CN2010/079242 2010-08-03 2010-11-29 Method and mobile terminal for adding background sound in recording procedure WO2012016406A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN2010102456720A CN102348086A (en) 2010-08-03 2010-08-03 Method and mobile terminal for loading background sounds in video recording process
CN201010245672.0 2010-08-03

Publications (1)

Publication Number Publication Date
WO2012016406A1 true WO2012016406A1 (en) 2012-02-09

Family

ID=45546323

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/079242 WO2012016406A1 (en) 2010-08-03 2010-11-29 Method and mobile terminal for adding background sound in recording procedure

Country Status (2)

Country Link
CN (1) CN102348086A (en)
WO (1) WO2012016406A1 (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103973955B (en) * 2013-01-28 2017-08-25 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN105681713A (en) * 2016-01-04 2016-06-15 努比亚技术有限公司 Video recording method, video recording device and mobile terminal
CN106804005B (en) * 2017-03-27 2019-05-17 维沃移动通信有限公司 A kind of production method and mobile terminal of video
CN107592486A (en) * 2017-09-14 2018-01-16 光锐恒宇(北京)科技有限公司 A kind of video generation method and device
CN107566769B (en) * 2017-09-27 2019-12-03 维沃移动通信有限公司 A kind of video recording method and mobile terminal
CN108600825B (en) 2018-07-12 2019-10-25 北京微播视界科技有限公司 Select method, apparatus, terminal device and the medium of background music shooting video
CN109600650B (en) * 2018-08-01 2020-06-19 北京微播视界科技有限公司 Method and apparatus for processing data
JP7008870B2 (en) 2018-08-01 2022-01-25 北京微播視界科技有限公司 How and equipment to record video
CN109600660B (en) * 2018-08-01 2020-07-24 北京微播视界科技有限公司 Method and apparatus for recording video
CN109672837A (en) * 2019-01-24 2019-04-23 深圳慧源创新科技有限公司 Equipment of taking photo by plane real-time video method for recording, mobile terminal and computer storage medium
CN110312137A (en) * 2019-04-01 2019-10-08 浙江工业大学 A kind of audio plays the video file generation method of driving video recording

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101056322A (en) * 2006-04-13 2007-10-17 中兴通讯股份有限公司 A device and method for overlapping the background sound at the mobile communication terminal
CN101098523A (en) * 2006-06-29 2008-01-02 海尔集团公司 Method for realizing karaoke by mobile phone and mobile phone with karaoke function
CN101106770A (en) * 2006-07-13 2008-01-16 中兴通讯股份有限公司 A method for making shot animation with background music in mobile phone

Also Published As

Publication number Publication date
CN102348086A (en) 2012-02-08

Similar Documents

Publication Publication Date Title
WO2012016406A1 (en) Method and mobile terminal for adding background sound in recording procedure
KR101550462B1 (en) A method and an apparatus for embedding data in a media stream
JP2011087103A (en) Provision of content reproduction system, content reproduction device, program, content reproduction method, and content server
JP2007012112A (en) Data recording device and method thereof, program, and recording medium
JP2017505012A (en) Video processing method, apparatus, and playback apparatus
KR20080007148A (en) Playback apparatus, playback method, and program
CN101312460A (en) Method for converting media file of multiple formats into target device supported media file
CN104916298A (en) Coding and decoding methods, coding and decoding devices, electronic equipment and audio picture generating method
KR20040102078A (en) Information recording medium and manufacturing method thereof
WO2009010009A1 (en) Prompting message forming method and device for mobile terminal
JP5737357B2 (en) Music playback apparatus and music playback program
JP4404091B2 (en) Content distribution server and terminal for distributing content frames for playing music
US20090037006A1 (en) Device, medium, data signal, and method for obtaining audio attribute data
EP2179365A1 (en) Method and apparatus for generating and reproducing media object-based metadata
CN204928959U (en) Mobile terminal's music broadcast system
CN201392655Y (en) Digital photo frame with voice explanation function
JP4254297B2 (en) Image processing apparatus and method, and image processing system and program using the same
CN102169708A (en) Audio and video play system, method, mobile terminal and player
JP4489013B2 (en) Information recording apparatus and recorded information management method
JP3743321B2 (en) Data editing method, information processing apparatus, server, data editing program, and recording medium
JP2014131307A (en) Information processing apparatus, information processing method, and program
KR20190027645A (en) Method for producing multimedia book
JP5161323B2 (en) Reproduction method and apparatus
KR100687268B1 (en) Mobile communication terminal having a function of playing external MP3 data and the method thereof
JP2006127443A (en) E-mail transmitting terminal and e-mail system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10855544

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10855544

Country of ref document: EP

Kind code of ref document: A1