WO2011023017A1 - 一种转码的方法和装置 - Google Patents

一种转码的方法和装置 Download PDF

Info

Publication number
WO2011023017A1
WO2011023017A1 PCT/CN2010/073723 CN2010073723W WO2011023017A1 WO 2011023017 A1 WO2011023017 A1 WO 2011023017A1 CN 2010073723 W CN2010073723 W CN 2010073723W WO 2011023017 A1 WO2011023017 A1 WO 2011023017A1
Authority
WO
WIPO (PCT)
Prior art keywords
transcoding
audio
source files
multimedia
multimedia source
Prior art date
Application number
PCT/CN2010/073723
Other languages
English (en)
French (fr)
Inventor
陈敬昌
Original Assignee
腾讯科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 腾讯科技(深圳)有限公司 filed Critical 腾讯科技(深圳)有限公司
Priority to SG2011092145A priority Critical patent/SG176822A1/en
Publication of WO2011023017A1 publication Critical patent/WO2011023017A1/zh
Priority to US13/336,331 priority patent/US8583828B2/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440218Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/162User input
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/40Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2365Multiplexing of several video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4344Remultiplexing of multiplex streams, e.g. by modifying time stamps or remapping the packet identifiers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4347Demultiplexing of several video streams

Definitions

  • the present invention relates to the field of computer multimedia processing, and in particular, to a method and apparatus for transcoding. Background technique
  • H.263 proposed for video telephony and video conferencing services in multimedia technology
  • DVB Digital Video Broadcasting
  • HDTV High Definition Television
  • DVD for multimedia technology
  • Digital Versati le Disc Digital Multipurpose Disc
  • MPEG2 Moving Picture Expert Group
  • MPEG4 developed for network streaming service in multimedia technology
  • transcoders software or hardware tools that provide inter-standard conversion are called transcoders.
  • existing transcoders only provide one-to-one transcoding functions, that is, one source file is transcoded into one. An object file.
  • the inventors have found that the above prior art has at least the following shortcomings and deficiencies: Since the existing transcoder only provides one-to-one transcoding, that is, one source file is transcoded into one object file.
  • the transcoding method is single, and the transcoding efficiency is not high.
  • the user needs to start the transcoder to perform the transcoding operation multiple times, which affects the user experience, especially when the source file is short.
  • the multimedia files transcoded by the transcoder can only be played one by one, which cannot meet the user's desire to continuously play multiple multimedia files. Summary of the invention
  • the embodiment of the present invention provides a Transcoding method And equipment.
  • the technical solution is as follows:
  • a method of transcoding comprising:
  • the method further includes:
  • the separating the audio and video streams from the multiple multimedia source files includes:
  • transcoding each of the separated audio streams and video streams includes:
  • each of the separated audio streams and video streams is transcoded.
  • the separating the audio and video streams by the multiple multimedia source files in sequence includes:
  • the transcoding each of the separated audio streams and video streams in sequence includes:
  • Each of the separated audio streams and video streams is transcoded in turn according to the order in which the files are arranged after the merge transcoding is completed by the user.
  • the transcoding target parameter includes at least:
  • the combining the audio stream and the video stream that are transcoded by the plurality of multimedia source files into a multimedia object file includes:
  • the audio stream transcoded by each multimedia source file is combined with the video stream to obtain a multimedia object file corresponding to each of the multimedia source files.
  • the method further includes: combining the obtained multimedia object files corresponding to each of the multimedia source files to obtain a multimedia object file.
  • the combining the audio stream and the video stream that are transcoded by the plurality of multimedia source files into a multimedia object file includes:
  • a device for transcoding comprising:
  • a receiving unit configured to receive a plurality of multimedia source files selected by the user and the transcoding target parameters input by the user; and a separating unit, configured to perform audio and video stream separation on the plurality of multimedia source files;
  • transcoding unit configured to transcode each audio stream and video stream separated by the separation unit according to the transcoding target parameter received by the receiving unit
  • a merging unit configured to merge the audio stream and the video stream transcoded by the plurality of multimedia source files obtained by the transcoding unit into a multimedia object file.
  • the device further includes: a determining unit, configured to perform legality judgment on the transcoding target parameter input by the user received by the receiving unit, and if legal, provide the transcoding target parameter to the Transcoding unit.
  • a determining unit configured to perform legality judgment on the transcoding target parameter input by the user received by the receiving unit, and if legal, provide the transcoding target parameter to the Transcoding unit.
  • the separation unit includes:
  • a first separating subunit configured to sequentially separate the audio and video streams from the plurality of multimedia source files
  • a second separating subunit configured to simultaneously separate the audio and video streams from the plurality of multimedia source files
  • the transcoding unit includes:
  • a first transcoding unit configured to sequentially transcode each of the separated audio streams and video streams
  • the second transcoding unit is configured to simultaneously transcode each of the separated audio streams and video streams.
  • the first separating sub-unit is specifically configured to sequentially, after receiving the transcoding confirmation signal input by the user, the plurality of multimedia sources according to a sequence of creation times of the plurality of multimedia source files
  • the file is separated from the audio and video streams; or, according to the arrangement of the file names of the plurality of multimedia source files, the plurality of multimedia media are sequentially
  • the body source file is separated from the audio and video streams; or, according to the sequence of the files arranged by the user after the combined transcoding is completed, the plurality of multimedia source files are sequentially separated into audio and video streams;
  • the first transcoding unit is specifically configured to: perform transcoding each of the separated audio streams and video streams in sequence according to a sequence of creation times of the plurality of multimedia source files; or, according to the The order of the file names of the plurality of multimedia source files is sequentially and sequentially, and each of the separated audio streams and video streams is sequentially transcoded; or, according to the sequence of the files arranged by the user after the combined transcoding is completed, Each of the separated audio streams and video streams is transcoded.
  • the merging unit is specifically configured to combine the audio stream transcoded by each multimedia source file with the video stream to obtain a multimedia object file corresponding to each of the multimedia source files.
  • the merging unit is further configured to combine the obtained multimedia target files corresponding to each of the multimedia source files to obtain a multimedia object file.
  • the merging unit is specifically configured to combine the audio streams after transcoding all the multimedia source files to obtain a combined audio stream, and combine the video streams transcoded by the all multimedia source files to obtain a merge.
  • Video stream combining the combined audio stream and the merged video stream to obtain a multimedia object file.
  • multiple multimedia can be selected in the user.
  • the operation of transcoding multiple source files is performed at one time, which simplifies the steps of the user to start the transcoding operation multiple times, reduces the number of transcodings, thereby improving the user experience; and transcoding the audio stream and When the video streams are merged, they can be combined into one multimedia object file, which can meet the user's desire to continuously play multiple multimedia files, further improving the user experience.
  • by simultaneously transmitting a plurality of source files to be transcoded Video stream separation and transcoding can further improve transcoding efficiency.
  • Embodiment 1 is a flowchart of a method for transcoding provided by Embodiment 1 of the present invention
  • FIG. 2 is a schematic diagram of a transcoder structure provided by Embodiment 2 of the present invention.
  • FIG. 3 is a flowchart of a method for implementing transcoding based on the architecture diagram shown in FIG. 2 according to Embodiment 2 of the present invention
  • 4 is a schematic structural diagram of another transcoder provided by Embodiment 2 of the present invention
  • FIG. 5 is a schematic diagram of a device for transcoding provided in Embodiment 3 of the present invention.
  • FIG. 6 is a schematic diagram of another apparatus for transcoding provided by Embodiment 3 of the present invention.
  • FIG. 7 is a schematic diagram of still another apparatus for transcoding provided by Embodiment 3 of the present invention. detailed description
  • the embodiment of the present invention provides a method for transcoding. Referring to FIG. 1, the method is as follows:
  • the method provided by the embodiment of the present invention separates audio and video streams by using a plurality of source files to be transcoded selected by a user, and transcodes each of the separated audio streams and video streams according to the transcoding target parameters input by the user.
  • the operation of transcoding multiple source files can be performed at one time, which simplifies the steps of the user to start the transcoding operation multiple times, and reduces the number of transcodings, thereby improving the user experience.
  • Embodiment 2 In order to describe the method provided by the foregoing embodiments of the present invention in detail, refer to the following embodiments: Embodiment 2
  • the embodiment of the present invention provides a method for transcoding, and the method provided by the embodiment of the present invention is shown in FIG.
  • a schematic diagram of a transcoder architecture for implementing the method provided by the embodiment of the present invention the transcoder includes: a source file list management module, an audio and video stream separation module, an audio transcoding module, a video transcoding module, an audio and video stream synthesis module, and a merge Transcoding master control module and user parameter setting module.
  • a plurality of source files that the user desires to perform combined transcoding are used, and the file names are respectively B, C as an example. See Figure 3 for details.
  • the method is as follows: 201: The source file list management module of the transcoder receives the source files A and B selected by the user to be combined and transcoded.
  • the embodiment of the present invention does not limit the number and type of source files.
  • Table 1 is a schematic diagram of source file information provided by an embodiment of the present invention.
  • the source located in the source file list may be executed according to the user's needs.
  • the file is deleted, and the source files in the source file list are reordered.
  • the user parameter setting module of the transcoder receives the transcoding target parameter input by the user.
  • the above transcoding target parameters include, but are not limited to, a file format of a multimedia object file, a file size of a multimedia object file, a code stream of a multimedia object file, and the like.
  • the transcoding target parameter input by the user includes the file format of the target file and the file size of the target file as an example, for example, the file format of the target file in the transcoding target parameter input by the user is AVI. (Audio Video Interleaved, audio video interleaved format), the file size of the target file is 15M.
  • the user parameter setting module may further determine whether the transcoding target parameter is legal, and if it is legal, pass the parameter set by the user to the audio transcoding.
  • the method for determining whether the transcoding target parameter is legal is not limited in this embodiment.
  • the format parameter may be preset to determine whether the received transcoding target parameter is a preset format parameter, and if so, The transcoding target parameter is legal; otherwise, the transcoding target parameter is illegal, and the preset format parameter is not limited in this embodiment.
  • the combined transcoding master control module of the transcoder receives the transcoding confirmation signal input by the user.
  • the user when the user inputs the selected source file to be transcoded into the transcoder and inputs the transcoding target parameter into the transcoder, the user can trigger the transcoding confirmation signal to start transcoding.
  • the user implements a transcoding confirmation signal that triggers the start of transcoding by pressing a previously provided confirmation button to perform a combined transcoding function.
  • the combined transcoding master control module sequentially sends the source files A, B, and C to the audio and video stream separation module according to the order of creation of the source files, or the combined transcoding master control module according to the file name of the source file. Arrange the sequence before and after, or send the source files A, B, and C to the audio and video stream separation module in sequence according to the order of the files after the transcoding is completed.
  • the audio and video stream separation module can perform video stream separation for each source file in sequence according to the manner in which the source files in the source file list management module are sequentially sent to the audio and video stream separation module. After all the source files are separated, the video streams are separated for all the source files at the same time. This embodiment does not limit the specific separation mode.
  • the audio and video stream separation module of the transcoder After receiving the current source file, the audio and video stream separation module of the transcoder performs audio and video stream separation on the current source file, and sends the separated audio stream and video stream to the corresponding audio transcoding module and Video transcoding module.
  • the audio/video stream separation module receives the current source file A
  • the audio file is parsed for the source file A, and the audio stream of the current source file A and the current source file A are obtained, and the current source file A is obtained.
  • the audio stream is sent to the audio transcoding module, and the video stream is sent to the video transcoding module to separately transcode the audio stream and the video stream.
  • the audio transcoding module and the video transcoding module can also be Each audio stream or video stream is transcoded in sequence, and may be simultaneously transcoded. This embodiment does not limit the specific transcoding mode.
  • the audio transcoding module receives the audio stream of the current source file, receives the transcoding target parameter input by the user according to the user parameter setting module, performs audio transcoding, and obtains a target audio stream corresponding to the current source file.
  • the transcoding target parameter input by the user includes the file format AVI of the target file as an example.
  • the audio transcoding module receives the audio stream of the source file A
  • the file of the source file A is used.
  • the format is MPEG4, and the source file A audio stream is transcoded according to the file format AVI of the target file, thereby realizing transcoding the audio stream into the target audio stream of the audio format desired by the user.
  • the video transcoding module receives the video stream of the current source file, receives the transcoding target parameter input by the user according to the user parameter setting module, performs video transcoding, and obtains the target video stream corresponding to the current source file.
  • the video transcoding module is configured to implement transcoding a video stream into a target video stream of a user requested video format.
  • the transcoding target parameter input by the user including the file format AVI of the target file, is taken as an example.
  • the file file format AVI transcodes the source file A video stream, thereby transcoding the video stream to the desired view of the user.
  • Target video stream in frequency format.
  • the embodiment of the present invention does not limit the sequence of steps 206 and 207.
  • the target audio stream and the target video stream are both sent to the audio and video stream synthesis module.
  • the audio and video stream synthesis module combines the received audio stream and the video stream of the plurality of source files into a target file. For this step, it can be divided into the following two cases:
  • the multimedia object file corresponding to each multimedia source file can be obtained by combining the audio stream transcoded by each multimedia source file with the video stream.
  • the merged audio stream and the merged video stream are combined to obtain a multimedia object file.
  • the plurality of object files are combined into one object file, that is, each multimedia is obtained in the first case described above.
  • the obtained multimedia object files corresponding to each multimedia source file are combined to obtain a multimedia object file.
  • Table 2 is a schematic diagram of the separation and conversion of audio and video of the source file provided by the embodiment of the present invention.
  • the target file After the audio stream Axx of the target file corresponding to the source file A is merged with the video stream Ayy, the target file is obtained; after the audio stream Bxx of the target file corresponding to the source file B is merged with the video stream Byy, the target file B is obtained, and the source file is obtained. After the audio stream Cxx of the target file corresponding to C is merged with the video stream Cyy, the target file (T.
  • the object files in the above example may be the same or different, and may be determined according to the actual situation, and the method is not limited in this embodiment, so that audio and video can be well matched.
  • the manner in which the audio stream of each multimedia source file is merged with the video stream can be preferentially selected.
  • the separation of the audio and video streams and the merging of the audio and video streams belong to the prior art, and will not be further described in this embodiment.
  • the user adds the file to the source file list in a customized order, and transcodes the file in the source file list by using the method provided by the embodiment of the present invention.
  • the transcoder includes: a source file list management module, multiple audio and video stream separation modules, and multiple An audio transcoding module, a plurality of video transcoding modules, an audio and video stream synthesis module, a combined transcoding master control module, and a user parameter setting module.
  • the difference from the above steps 201 to 209 is that, in step 204, under the control of the combined transcoding master control module, multiple source files in the source file list may be separately sent to the respective sounds.
  • the audio and video separation modules simultaneously perform audio and video separation on each source file, and then respectively send them into corresponding audio transcoding modules and video transcoding modules.
  • the method is similar, and will not be described again. Simultaneously performing audio and video stream separation, audio transcoding and video transcoding for multiple source files, thereby further saving transcoding time and improving transcoding efficiency.
  • the method provided by the embodiment of the present invention does not require a file format of the source file, and may be the same or Different, if the source file format is different, the function performed is the combined transcoding function provided by the embodiment of the present invention; if the source file format is the same, and the user sets the target media format to the source file format, the embodiment of the present invention
  • the method provided can implement the media file merging function, that is, the transcoder provided by the embodiment of the present invention can also function as a media file combiner.
  • the method of the embodiment of the present invention can be used to implement the superimposition function of the video and the audio file, so that the existing image can be obtained. Sound multimedia files greatly enhance the user experience.
  • the method for transcoding separates the audio and video streams by using a plurality of source files to be transcoded by the user, and separates each of the obtained transcoded target parameters according to the user input.
  • the audio stream and the video stream are transcoded, and after the user selects multiple multimedia source files, the operation of transcoding multiple source files is performed at one time, which simplifies the steps of the user to start the transcoding operation multiple times, and reduces the transcoding.
  • the number of times can improve the user experience; and when the transcoded audio stream and the video stream are combined, they can be combined into one multimedia object file, thereby satisfying the user's desire to continuously play a plurality of multimedia files, thereby further improving the user experience.
  • the transcoding efficiency can be further improved by simultaneously separating and transcoding the audio and video streams of the source files to be transcoded.
  • the embodiment of the present invention provides a device for transcoding.
  • the device includes:
  • the receiving unit 501 is configured to receive a plurality of multimedia source files selected by the user and the transcoding target parameters input by the user, and the separating unit 502 is configured to perform audio and video stream separation on the plurality of multimedia source files.
  • the transcoding unit 503 is configured to perform transcoding on the audio stream and the video stream separated by the separating unit 502 according to the transcoding target parameter received by the receiving unit 501;
  • the merging unit 504 is configured to merge the audio stream and the video stream transcoded by the plurality of multimedia source files obtained by the transcoding unit 503 into a multimedia object file.
  • the determining unit 505 is configured to perform legality judgment on the transcoding target parameter input by the user received by the receiving unit 501, and if it is legal, provide the transcoding target parameter to the transcoding unit 503.
  • the method may be: determining whether the transcoded target parameter input by the user meets the preset format parameter, and if yes, transcoding the target parameter Legal, if no, the transcoding target parameter is invalid. This embodiment does not limit the preset format parameters.
  • the separation unit 502 includes:
  • a first separating subunit configured to sequentially separate a plurality of multimedia source files into audio and video streams
  • the second separating subunit is configured to simultaneously separate the audio and video streams of the plurality of multimedia source files.
  • the first separating sub-unit is specifically configured to: after receiving the transcoding confirmation signal input by the user, sequentially separating the plurality of multimedia source files into audio and video streams according to a sequence of creation times of the plurality of multimedia source files; Or, according to the arrangement of the file names of the plurality of multimedia source files, sequentially separating the plurality of multimedia source files into audio and video streams; or, according to the sequence of the files arranged after the combined transcoding is completed by the user, Multiple multimedia source files are separated for audio and video streams.
  • the transcoding unit 503 includes:
  • a first transcoding unit configured to sequentially transcode each of the separated audio streams and video streams
  • the second transcoding unit is configured to simultaneously transcode each of the separated audio streams and video streams.
  • the first transcoding unit is specifically configured to transcode each of the separated audio streams and video streams according to a sequence of creation times of the plurality of multimedia source files; or, according to the files of the plurality of multimedia source files
  • the order of the names is arranged in sequence, and each of the separated audio streams and video streams is transcoded in turn; or, according to the sequence of the files arranged after the combined transcoding is completed by the user, each of the separated audio streams is sequentially separated. Transcode with the video stream.
  • Transcoding target parameters include but are not limited to:
  • the file format of the multimedia object file or the file size of the multimedia object file.
  • the number of multimedia object files is one, and the file formats of multiple multimedia source files are the same or different.
  • the merging unit 504 is specifically configured to combine the audio stream transcoded by each multimedia source file with the video stream to obtain a multimedia object file corresponding to each multimedia source file.
  • the merging unit 504 is further configured to merge the obtained multimedia object files corresponding to each multimedia source file to obtain a multimedia object file.
  • the merging unit 504 is specifically configured to combine the audio streams transcoded by all the multimedia source files to obtain a combined audio stream, and combine the video streams transcoded by all the multimedia source files to obtain a combined video stream. ; Merging the combined audio stream and the merged video stream to obtain a multimedia object file.
  • the apparatus for transcoding provided by the embodiment of the present invention may further include:
  • the playing unit 506 is configured to play the multimedia object file obtained by the merging unit 504.
  • the units in the apparatus provided by the embodiments of the present invention may be combined into one module, and may be further split.
  • This embodiment does not specifically limit this.
  • the receiving unit 501 of the apparatus provided by the embodiment of the present invention may adopt the source file list management module and the user parameter setting mode shown in the method embodiment.
  • the separation unit 502 of the device provided by the embodiment of the present invention is implemented by using the audio and video stream separation module shown in the method embodiment.
  • the transcoding unit 503 of the device provided by the embodiment of the present invention may be used.
  • the audio transcoding module and the video transcoding module are implemented in the method embodiment.
  • the merging unit 504 of the device provided by the embodiment of the present invention may be implemented by using the audio and video stream synthesizing module shown in the method embodiment, where The audio and video stream module, the video transcoding module, and the video transcoding module can each be one or more.
  • the apparatus for transcoding separates audio and video streams by using a plurality of source files to be transcoded selected by a user, and separates each obtained according to a transcoding target parameter input by a user.
  • the audio stream and the video stream are transcoded, and after the user selects multiple multimedia source files, the operation of transcoding multiple source files is performed at one time, which simplifies the steps of the user to start the transcoding operation multiple times, and reduces the transcoding.
  • the number of times can improve the user experience; and when the transcoded audio stream and the video stream are combined, they can be combined into one multimedia object file, thereby satisfying the user's desire to continuously play a plurality of multimedia files, thereby further improving the user experience.
  • the transcoding efficiency can be further improved by simultaneously separating and transcoding the audio and video streams of the source files to be transcoded.
  • receiving in the embodiment of the present invention may be understood as being actively acquired from other modules, or may be receiving information transmitted by other modules.
  • modules in the apparatus in the embodiments may be distributed in the apparatus of the embodiment according to the embodiment, or may be correspondingly changed in one or more apparatuses different from the embodiment.
  • the modules of the above embodiments may be combined into one module, or may be further split into a plurality of sub-modules.
  • Some of the steps in the embodiment of the present invention may be implemented by using software, and the corresponding software program may be stored in a readable storage medium, such as an optical disk or a hard disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

本发明公开了一种转码的方法和装置,属于计算机媒体处理领域。所述方法包括:接收用户选择的多个多媒体源文件以及所述用户输入的转码目标参数;对多个多媒体源文件进行音视频流分离;根据所述转码目标参数,对分离得到的每个音频流和视频流进行转码;将所述多个多媒体源文件转码后的音频流和视频流合并成多媒体目标文件。所述装置包括:接收单元、分离单元、转码单元和合并单元。本发明扩展了转码方式、提高了用户的体验,具有很强的实用性。

Description

说 明 书
一种转码的方法和装置 技术领域
本发明涉及计算机多媒体处理领域, 特别涉及一种转码的方法和装置。 背景技术
在信息高速发展的今天, 多媒体技术已经成为人们工作、 学习中必不可少的重要组成部 分。 为了促进多媒体技术的进一步发展, 满足人们在日常工作、 生活中对多媒体业务服务的 不同需求, 一些标准化组织针对于不同业务需求提出了多种音视频编码标准。 例如, 针对多 媒体技术中的视频电话和视频会议业务提出的 H. 263标准;针对多媒体技术中的 DVB(Digital Video Broadcasting, 数字视频广播), HDTV (High Definition Television, 高清数字电视 技术) 和 DVD (Digital Versati le Disc, 数字多用途光盘)提出的 MPEG2 (Moving Picture Expert Group , 运动图像专家组) 标准; 针对多媒体技术中的网络流媒体业务服务开发的 MPEG4标准; 以及能够提供高视频压缩性能、 网络友好的 H. 264标准等等。 由于多种编码标 准的同时存在, 以及在多媒体应用中对压缩效率、 解压速度、 码流使用的目标设备等要求的 不同, 存在着多种多样的多媒体文件, 它们彼此之间所使用的文件格式、 视频格式、 压缩格 式各不相同。 为了有效地利用现有的多媒体资源, 实现不同标准之间的多媒体文件的转换, 转码技术受到了人们的广泛关注, 不断得到进一步的研究与开发。
现有技术中将提供标准间转换的软件或硬件工具称为转码器, 对于多媒体文件而言, 现 有的转码器仅提供了一对一的转码功能, 即一个源文件转码成一个目标文件。
发明人在实现本发明的过程中, 发现上述现有技术至少存在以下缺点和不足: 由于现有的转码器仅提供了一对一的转码方式, 即一个源文件转码成一个目标文件, 转 码方式单一, 转码效率不高; 当存在多个待转码的源文件时, 需要用户多次启动转码器进行 转码操作, 影响用户体验, 特别是当源文件较短时, 经过转码器转码后的多媒体文件只能一 个一个的播放, 无法满足用户希望将多个多媒体文件连续播放的需求。 发明内容
当存在多个待转码的源文件时, 为了降低转码的次数, 提高用户体验, 并提高转码效率, 满足用户希望将多个多媒体文件进行连续播放的需求, 本发明实施例提供了一种转码的方法 和装置。 所述技术方案如下:
一方面, 提供了一种转码的方法, 所述方法包括:
接收用户选择的多个多媒体源文件以及所述用户输入的转码目标参数;
对所述多个多媒体源文件进行音视频流分离;
根据所述转码目标参数, 对分离得到的每个音频流和视频流进行转码;
将所述多个多媒体源文件转码后的音频流和视频流合并成多媒体目标文件。
优选地, 所述接收所述用户输入的转码目标参数之后, 还包括:
判断所述接收的用户输入的转码目标参数是否合法, 如果是, 则执行后续步骤。
其中, 所述对所述多个多媒体源文件进行音视频流分离, 包括:
依次将所述多个多媒体源文件进行音视频流分离; 或,
同时将所述多个多媒体源文件进行音视频流分离;
相应地, 所述对分离得到的每个音频流和视频流进行转码, 包括:
依次对分离得到的每个音频流和视频流进行转码; 或,
同时对分离得到的每个音频流和视频流进行转码。
进一步地, 所述依次将所述多个多媒体源文件进行音视频流分离, 包括:
根据所述多个多媒体源文件的创建时间的先后顺序, 依次将所述多个多媒体源文件进行 音视频流分离; 或,
根据所述多个多媒体源文件的文件名称的排列前后顺序, 依次将所述多个多媒体源文件 进行音视频流分离; 或,
根据用户要求的合并转码完成后的各文件排列的先后顺序, 依次将所述多个多媒体源文 件进行音视频流分离;
相应地, 所述依次对分离得到的每个音频流和视频流进行转码, 包括:
根据所述多个多媒体源文件的创建时间的先后顺序, 依次对分离得到的每个音频流和视 频流进行转码; 或,
根据所述多个多媒体源文件的文件名称的排列前后顺序, 依次对分离得到的每个音频流 和视频流进行转码; 或,
根据用户要求的合并转码完成后的各文件排列的先后顺序, 依次对分离得到的每个音频 流和视频流进行转码。
其中, 所述转码目标参数至少包括:
多媒体目标文件的文件格式, 或, 多媒体目标文件的文件大小。 具体地,所述将所述多个多媒体源文件转码后的音频流和视频流合并成多媒体目标文件, 具体包括:
将每个多媒体源文件转码后的音频流与视频流进行合并, 得到所述每个多媒体源文件对 应的多媒体目标文件。
进一步地, 所述得到所述每个多媒体源文件对应的多媒体目标文件之后, 还包括: 将得到的所述每个多媒体源文件对应的多媒体目标文件进行合并, 得到一个多媒体目标 文件。
可选地,所述将所述多个多媒体源文件转码后的音频流和视频流合并成多媒体目标文件, 具体包括:
将所有多媒体源文件转码后的音频流进行合并, 得到合并的音频流;
将所述所有多媒体源文件转码后的视频流进行合并, 得到合并的视频流;
将所述合并的音频流及合并的视频流进行合并, 得到一个多媒体目标文件。
再一方面, 提供了一种转码的装置, 所述装置包括:
接收单元, 用于接收用户选择的多个多媒体源文件以及所述用户输入的转码目标参数; 分离单元, 用于对所述多个多媒体源文件进行音视频流分离;
转码单元, 用于根据所述接收单元接收的转码目标参数, 对所述分离单元分离得到的每 个音频流和视频流进行转码;
合并单元, 用于将所述转码单元得到的所述多个多媒体源文件转码后的音频流和视频流 合并成多媒体目标文件。
优选地, 所述装置还包括: 判断单元, 用于对所述接收单元接收的所述用户输入的转码 目标参数进行合法性判断, 如果合法, 则将所述转码目标参数提供给所述转码单元。
其中, 所述分离单元包括:
第一分离子单元, 用于依次将所述多个多媒体源文件进行音视频流分离; 或, 第二分离子单元, 用于同时将所述多个多媒体源文件进行音视频流分离;
相应地, 所述转码单元, 包括:
第一转码单元, 用于依次对分离得到的每个音频流和视频流进行转码; 或,
第二转码单元, 用于同时对分离得到的每个音频流和视频流进行转码。
进一步地, 所述第一分离子单元具体用于当接收到所述用户输入的转码确认信号后, 根 据所述多个多媒体源文件的创建时间的先后顺序, 依次将所述多个多媒体源文件进行音视频 流分离; 或, 根据所述多个多媒体源文件的文件名称的排列前后顺序, 依次将所述多个多媒 体源文件进行音视频流分离; 或, 根据用户要求的合并转码完成后的各文件排列的先后顺序, 依次将所述多个多媒体源文件进行音视频流分离;
相应地, 所述第一转码单元, 具体用于根据所述多个多媒体源文件的创建时间的先后顺 序, 依次对分离得到的每个音频流和视频流进行转码; 或, 根据所述多个多媒体源文件的文 件名称的排列前后顺序, 依次对分离得到的每个音频流和视频流进行转码; 或, 根据用户要 求的合并转码完成后的各文件排列的先后顺序, 依次对分离得到的每个音频流和视频流进行 转码。
具体地, 所述合并单元, 具体用于将每个多媒体源文件转码后的音频流与视频流进行合 并, 得到所述每个多媒体源文件对应的多媒体目标文件。
进一步地, 所述合并单元, 还用于将得到的所述每个多媒体源文件对应的多媒体目标文 件进行合并, 得到一个多媒体目标文件。
可选地, 所述合并单元, 具体用于将所有多媒体源文件转码后的音频流进行合并, 得到 合并的音频流; 将所述所有多媒体源文件转码后的视频流进行合并, 得到合并的视频流; 将 所述合并的音频流及合并的视频流进行合并, 得到一个多媒体目标文件。
本发明实施例提供的技术方案的有益效果是:
通过将用户选择的多个待转码的源文件进行音视频流分离, 并根据用户输入的转码目标 参数对分离得到的每个音频流和视频流进行转码, 可以在用户选择多个多媒体源文件后, 一 次性执行对多个源文件进行转码的操作, 简化了用户多次启动转码操作的步骤, 降低了转码 次数, 从而可以提升用户体验; 且将转码的音频流及视频流进行合并时, 可以合并成一个多 媒体目标文件, 从而可以满足用户希望将多个多媒体文件进行连续播放的需求, 进一步提高 用户体验; 另外, 通过将多个待转码的源文件同时进行音视频流分离及转码, 还可以进一步 提高转码效率。 附图说明
为了更清楚地说明本发明实施例中的技术方案, 下面将对实施例描述中所需要使用的附 图作简单地介绍, 显而易见地, 下面描述中的附图仅仅是本发明的一些实施例, 对于本领域 普通技术人员来讲, 在不付出创造性劳动的前提下, 还可以根据这些附图获得其他的附图。
图 1是本发明实施例 1提供的转码的方法流程图;
图 2是本发明实施例 2提供的一种转码器架构示意图;
图 3是本发明实施例 2提供的基于图 2所示的架构图实现转码的方法流程图; 图 4是本发明实施例 2提供的另一转码器架构示意图;
图 5是本发明实施例 3提供的转码的装置示意图;
图 6是本发明实施例 3提供的另一种转码的装置示意图;
图 7是本发明实施例 3提供的又一种转码的装置示意图。 具体实施方式
为使本发明的目的、 技术方案和优点更加清楚, 下面将结合附图对本发明实施方式作进 一步地详细描述。 实施例 1
当存在多个待转码的源文件时, 为了降低转码次数, 并提高用户体验, 本发明实施例提 供了一种转码的方法, 参见图 1, 该方法内容如下:
101 :接收用户选择的多个多媒体源文件以及用户输入的转码目标参数;
102:对多个多媒体源文件进行音视频流分离;
103 :根据转码目标参数, 对分离得到的每个音频流和视频流进行转码;
104 :将多个多媒体源文件转码后的音频流和视频流合并成多媒体目标文件。
本发明实施例提供的方法, 通过将用户选择的多个待转码的源文件进行音视频流分离, 并根据用户输入的转码目标参数对分离得到的每个音频流和视频流进行转码, 可以在用户选 择多个多媒体源文件后, 一次性执行对多个源文件进行转码的操作, 简化了用户多次启动转 码操作的步骤, 降低了转码次数, 从而可以提升用户体验。
为了对上述本发明实施例提供的方法进行详细说明, 请参见如下实施例: 实施例 2
当存在多个待转码的源文件时, 为了减少转码次数, 提高用户的体验, 本发明实施例提 供了一种转码的方法, 基于本发明实施例提供的方法, 参见图 2, 为实现本发明实施例提供 的方法的转码器架构示意图, 该转码器包括: 源文件列表管理模块、 音视频流分离模块、 音 频转码模块、视频转码模块、 音视频流合成模块、合并转码总控模块以及用户参数设置模块。 基于该转码器架构, 为了对本发明实施例提供的方法进行示意说明, 本实施例以用户希望进 行合并转码的源文件是多个, 文件名称分别是 、 B、 C为例进行说明。 详见图 3, 该方法内 容如下: 201: 转码器的源文件列表管理模块接收用户所选择出的待进行合并转码的源文件 A、 B、
C。
其中, 本发明实施例对源文件的数量以及类型不做限制。 参见表 1, 为本发明实施例提 供的源文件信息示意表。
表 1
Figure imgf000007_0001
进一步地, 当用户将所选择出的待进行合并转码的源文件 (1 个或多个) 添加进源文件 列表管理模块中后, 可以根据用户的需要, 执行对位于源文件列表里的源文件进行删除, 对 源文件列表里的源文件进行重排序等操作。
202:转码器的用户参数设置模块接收用户所输入的转码目标参数。
其中, 上述转码目标参数包括但不限于多媒体目标文件的文件格式、 多媒体目标文件的 文件大小、 多媒体目标文件的码流等。 为了便于说明, 本实施例以用户输入的转码目标参数 包括目标文件的文件格式和目标文件的文件大小为例进行说明, 如: 用户输入的转码目标参 数中的目标文件的文件格式为 AVI (Audio Video Interleaved, 音频视频交错格式)、 目标文 件的文件大小为 15M。
优选地,本发明实施例提供的该用户参数设置模块接收到用户所输入的转码目标参数后, 还可以判断该转码目标参数是否合法, 假如合法则将用户设置的参数传递到音频转码模块、 视频转码模块以及音视频流合成模块; 否则, 提示用户重新进行转码目标参数的输入。 关于 判断该转码目标参数是否合法的方式, 本实施例对此不做限定, 例如, 可预设格式参数, 判 断接收的输入的转码目标参数是否为预设格式参数, 如果是, 则认为该转码目标参数合法; 否则该转码目标参数非法, 本实施例不对预设格式参数进行限定。
203: 转码器的合并转码总控模块接收用户输入的转码确认信号。
其中, 当用户向转码器中输入了选择好的待进行转码的源文件, 且向转码器中输入了转 码目标参数后, 用户就可以触发开始进行转码的转码确认信号, 例如, 用户通过按下预先提 供的确认按钮, 实现触发开始进行转码的转码确认信号, 以便执行合并转码功能。
204: 在转码器的合并转码总控模块的控制下, 依次将源文件列表管理模块中的多个源文 件发送至音视频流分离模块。
关于如何依次将源文件列表管理模块中的源文件发送至音视频流分离模块, 本实施例对 此不做限制。 例如, 合并转码总控模块按照源文件的创建时间的先后顺序, 依次将源文件 A、 B、 C送至音视频流分离模块, 或者, 合并转码总控模块按照源文件的文件名称的排列前后顺 序, 或者, 按照转码完成后的各文件排列的先后顺序, 依次将源文件 A、 B、 C送至音视频流 分离模块。
通过上述几种依次将源文件列表管理模块中的源文件发送至音视频流分离模块的方式, 音视频流分离模块既可以按照接收的顺序依次对各个源文件进行视频流分离, 也可以在收到 所有源文件之后, 对所有源文件同时进行视频流分离, 本实施例不对具体分离方式进行限定。
205: 转码器的音视频流分离模块接收到当前源文件后, 对该当前源文件执行音视频流分 离, 并将分离后得到的音频流和视频流分别发送至相应的音频转码模块和视频转码模块。
例如, 音视频流分离模块接收到当前源文件 A, 则对该源文件 A执行音视频文件的解析, 得到当前源文件 A的音频流和当前源文件 A的视频流, 并将当前源文件 A的音频流发送给音 频转码模块, 视频流发送给视频转码模块, 以便对音频流与视频流分别进行转码。
当音视频流分离模块将分离后得到的音频流和视频流分别发送至相应的音频转码模块和 视频转码模块之后, 同音视频分离模块一样, 音频转码模块和视频转码模块同样可以对各个 音频流或视频流依次进行转码, 也可以对其进行同时转码, 本实施例不对具体转码方式进行 限定。
206: 音频转码模块接收到当前源文件的音频流, 根据用户参数设置模块接收用户所输入 的转码目标参数, 执行音频转码, 得到当前源文件对应的目标音频流。
如前所述, 本实施例以用户所输入的转码目标参数包括目标文件的文件格式 AVI为例进 行说明, 则音频转码模块接收到源文件 A的音频流后, 由于源文件 A的文件格式为 MPEG4, 则按照目标文件的文件格式 AVI, 对源文件 A音频流进行转码, 从而实现将音频流转码为用 户所希望的音频格式的目标音频流。
207: 视频转码模块接收到当前源文件的视频流, 根据用户参数设置模块接收用户所输入 的转码目标参数, 执行视频转码, 得到当前源文件对应的目标视频流。
与步骤 206类似, 视频转码模块用于实现将视频流转码为用户要求视频格式的目标视频 流。 仍以用户所输入的转码目标参数包括目标文件的文件格式 AVI为例进行说明, 视频转码 模块接收到当前源文件 A的视频流后, 由于源文件 A的文件格式为 MPEG4, 需要按照目标文 件的文件格式 AVI, 对源文件 A视频流进行转码, 从而实现将视频流转码为用户所希望的视 频格式的目标视频流。 本发明实施例不限制步骤 206和步骤 207执行的先后顺序。
208: 当音频转码模块得到当前源文件对应的目标音频流、视频模块得到当前源文件对应 的目标视频流后, 将目标音频流及目标视频流均发送至音视频流合成模块。
209: 音视频流合成模块将接收的多个源文件转码后的音频流和视频流合并成目标文件。 针对该步骤, 可以分为以下两种情况:
将多个源文件转码后的音频流和视频流合并成多个多媒体目标文件时:
可通过将每个多媒体源文件转码后的音频流与视频流进行合并, 得到每个多媒体源文件 对应的多媒体目标文件。
将多个多媒体源文件转码后的音频流和视频流合并成一个多媒体目标文件时, 包括但不 限于以下两种实现方式:
方式一:
将所有多媒体源文件转码后的音频流进行合并, 得到合并的音频流;
将所有多媒体源文件转码后的视频流进行合并, 得到合并的视频流;
将合并的音频流及合并的视频流进行合并, 得到一个多媒体目标文件。
方式二:
在将多个源文件转码后的音频流和视频流合并成多个多媒体目标文件的基础上, 再将多 个目标文件合并成一个目标文件, 即在上述第一种情况下得到每个多媒体源文件对应的多媒 体目标文件后, 将得到的每个多媒体源文件对应的多媒体目标文件进行合并, 得到一个多媒 体目标文件。
例如, 参见表 2, 为本发明实施例提供的源文件音视频分离、 转换后的示意表。
表 2
Figure imgf000009_0001
如表 2所示, 音视频流合成模块将得到的目标文件的音频流、 目标文件的视频流, 进行 合并时, 可采取以下几种方式:
、 采取将多个多媒体源文件转码后的音频流和视频流合并成多个多媒体目标文件的方 式:
即将源文件 A对应的目标文件的音频流 Axx与视频流 Ayy合并之后, 得到目标文件 ; 将源文件 B对应的目标文件的音频流 Bxx与视频流 Byy合并之后, 得到目标文件 B、 将源文 件 C对应的目标文件的音频流 Cxx与视频流 Cyy合并之后, 得到目标文件 (T。
二、 采取将多个多媒体源文件转码后的音频流和视频流合并成一个多媒体目标文件的方 式:
1 )、 在方式一的基础上, 将目标文件 A 和 (T进行合并, 得到一个目标文件(设为目 标文件 )。
2 )、 将源文件4、 源文件 B和源文件 C所对应的目标文件的音频流进行合并, 得到合并 的音频流; 将源文件 A、 源文件 B和源文件 C所对应的目标文件的视频流进行合并, 得到合 并的视频流; 再将合并的视频流及音频流进行合并,得到一个目标文件(设为 N )。即将 Axx、 Bxx和 Cxx合并, 得到 Nxx, 将 Ayy、 Byy和 Cyy合并, 得到 Nyy, 再将 Nxx和 Nyy合并, 得 到 N 。
其中, 上述举例中的目标文件 和 可以相同, 也可以不同, 需根据实际情况而定, 且 对于选取何种方式进行合并, 本实施例不做具体限定, 为了能够使音频与视频实现良好对应, 可优先选取将每个多媒体源文件的音频流与视频流进行合并的方式。 另外, 对音视频流分离 以及对音视频流合并属于现有技术, 本实施例对此不再赘述。
至此, 通过上述步骤 201-209, 用户选择出待进行转码的源文件后, 按照自定义的顺序 添加到源文件列表中, 利用本发明实施例提供的方法将源文件列表里的文件转码合并成一个 目标文件时, 特别是针对源文件较短时的情况, 通过合并转码, 可以得到一个播放时间较长 的目标文件, 提高了用户的使用体验, 丰富了转码方式。
优选地, 为了进一步提高转码效率, 参见图 4, 为本发明实施例提供的另一转码器架构 示意图, 该转码器包括: 源文件列表管理模块、 多个音视频流分离模块、 多个音频转码模块、 多个视频转码模块、 音视频流合成模块、 合并转码总控模块以及用户参数设置模块。 基于该 转码器架构示意图, 与上述步骤 201至 209的不同在于, 在步骤 204中, 在合并转码总控模 块控制下, 还可以将源文件列表中的多个源文件分别送入各个音视频分离模块中, 由各个音 视频分离模块同时对各源文件执行音视频的分离, 然后再分别送入各自对应的音频转码模块 和视频转码模块中, 方法类似, 不再赘述, 由于可以对多个源文件同步执行音视频流分离、 音频转码和视频转码, 从而进一步地节约了转码时间、 提高了转码效率。
进一步地, 本发明实施例提供的方法, 对源文件的文件格式没有要求, 可以相同也可以 不同, 若各源文件格式不同, 则所执行的功能为上述本发明实施例提供的合并转码功能; 若 各源文件格式相同, 且用户设置目标媒体格式为源文件格式, 这样本发明实施例提供的方法 又能实现媒体文件合并功能, 即本发明实施例提供的转码器又可以充当媒体文件合并器。
除此之外, 本发明实施例提供的方法还可以应用于以下场合:
当用户对某个场景分别拍摄了该场景的视频文件, 录制了该场景的音频文件, 则可以利 用本发明实施例提供的方法实现视频和音频文件的叠加功能, 从而可以得到既有图像又有声 音的多媒体文件, 大大提高了用户的使用体验。
综上所述, 本发明实施例提供的转码的方法, 通过将用户选择的多个待转码的源文件进 行音视频流分离, 并根据用户输入的转码目标参数对分离得到的每个音频流和视频流进行转 码, 可以在用户选择多个多媒体源文件后, 一次性执行对多个源文件进行转码的操作, 简化 了用户多次启动转码操作的步骤, 降低了转码次数, 从而可以提升用户体验; 且将转码的音 频流及视频流进行合并时, 可以合并成一个多媒体目标文件, 从而可以满足用户希望将多个 多媒体文件进行连续播放的需求, 进一步提高用户体验; 另外, 通过将多个待转码的源文件 同时进行音视频流分离及转码, 还可以进一步提高转码效率。 实施例 3
当存在多个待转码的源文件时, 为了降低转码次数, 并提高用户的体验, 本发明实施例 提供了一种转码的装置, 参见图 5, 该装置包括:
接收单元 501, 用于接收用户选择的多个多媒体源文件以及用户输入的转码目标参数; 分离单元 502, 用于对多个多媒体源文件进行音视频流分离;
转码单元 503, 用于根据接收单元 501接收的转码目标参数, 对分离单元 502分离得到 的音频流和视频流进行转码;
合并单元 504, 用于将转码单元 503得到的多个多媒体源文件转码后的音频流和视频流 合并成多媒体目标文件。
其中, 参见图 6, 本发明实施例提供的转码的装置还包括:
判断单元 505, 用于对接收单元 501接收的用户输入的转码目标参数进行合法性判断, 如果合法, 则将转码目标参数提供给转码单元 503。
在该判断单元 505对接收的用户输入的转码目标参数是否合法进行判断时,具体可以为: 判断接收的用户输入的转码目标参数是否符合预设格式参数, 如果是, 则转码目标参数合法, 如果否, 则转码目标参数不合法。 本实施例不对预设格式参数进行限定。 其中, 分离单元 502包括:
第一分离子单元, 用于依次将多个多媒体源文件进行音视频流分离; 或,
第二分离子单元, 用于同时将多个多媒体源文件进行音视频流分离。
其中, 该第一分离子单元, 具体用于当接收到用户输入的转码确认信号后, 根据多个多 媒体源文件的创建时间的先后顺序, 依次将多个多媒体源文件进行音视频流分离; 或, 根据 多个多媒体源文件的文件名称的排列前后顺序,依次将多个多媒体源文件进行音视频流分离; 或, 根据用户要求的合并转码完成后的各文件排列的先后顺序, 依次将多个多媒体源文件进 行音视频流分离。
转码单元 503包括:
第一转码单元, 用于依次对分离得到的每个音频流和视频流进行转码; 或,
第二转码单元, 用于同时对分离得到的每个音频流和视频流进行转码。
其中, 第一转码单元, 具体用于根据多个多媒体源文件的创建时间的先后顺序, 依次对 分离得到的每个音频流和视频流进行转码; 或, 根据多个多媒体源文件的文件名称的排列前 后顺序, 依次对分离得到的每个音频流和视频流进行转码; 或, 根据用户要求的合并转码完 成后的各文件排列的先后顺序, 依次对分离得到的每个音频流和视频流进行转码。
转码目标参数包括但不限于:
多媒体目标文件的文件格式, 或, 多媒体目标文件的文件大小。
多媒体目标文件的个数为一个, 多个多媒体源文件的文件格式相同或不同。
具体地, 合并单元 504, 具体用于将每个多媒体源文件转码后的音频流与视频流进行合 并, 得到每个多媒体源文件对应的多媒体目标文件。
进一步地, 合并单元 504, 还用于将得到的每个多媒体源文件对应的多媒体目标文件进 行合并, 得到一个多媒体目标文件。
可选地, 合并单元 504, 具体用于将所有多媒体源文件转码后的音频流进行合并, 得到 合并的音频流; 将所有多媒体源文件转码后的视频流进行合并, 得到合并的视频流; 将合并 的音频流及合并的视频流进行合并, 得到一个多媒体目标文件。
参见图 7, 本发明实施例提供的转码的装置, 还可以包括:
播放单元 506, 用于对合并单元 504得到的多媒体目标文件进行播放。
与上述方法实施例相应地, 本发明实施例提供的装置中的各单元可以合并为一个模块, 也可以进一步拆分, 本实施例对此不作具体限定。 例如, 具体实现时, 本发明实施例提供的 装置的接收单元 501, 可以采用方法实施例中所示的源文件列表管理模块和用户参数设置模 块来实现; 再如, 本发明实施例提供的装置的分离单元 502, 采用方法实施例中所示的音视 频流分离模块来实现; 本发明实施例提供的装置的转码单元 503, 可以采用方法实施例中所 示的音频转码模块以及视频转码模块来实现; 本发明实施例提供的装置的合并单元 504, 可 以采用方法实施例中所示的音视频流合成模块来实现, 其中, 音视频流模块、 视频转码模块 和视频转码模块均可以为一至多个。
综上所述, 本发明实施例提供的转码的装置, 通过将用户选择的多个待转码的源文件进 行音视频流分离, 并根据用户输入的转码目标参数对分离得到的每个音频流和视频流进行转 码, 可以在用户选择多个多媒体源文件后, 一次性执行对多个源文件进行转码的操作, 简化 了用户多次启动转码操作的步骤, 降低了转码次数, 从而可以提升用户体验; 且将转码的音 频流及视频流进行合并时, 可以合并成一个多媒体目标文件, 从而可以满足用户希望将多个 多媒体文件进行连续播放的需求, 进一步提高用户体验; 另外, 通过将多个待转码的源文件 同时进行音视频流分离及转码, 还可以进一步提高转码效率。
本发明实施例中的 "接收"一词可以理解为主动从其他模块获取, 也可以是接收其他模 块发送来的信息。
本领域技术人员可以理解附图只是一个优选实施例的示意图, 附图中的模块或流程并不 一定是实施本发明所必须的。
本领域技术人员可以理解实施例中的装置中的模块可以按照实施例描述分布于实施例的 装置中, 也可以进行相应变化位于不同于本实施例的一个或多个装置中。 上述实施例的模块 可以合并为一个模块, 也可以进一步拆分成多个子模块。
上述本发明实施例序号仅仅为了描述, 不代表实施例的优劣。
本发明实施例中的部分步骤, 可以利用软件实现, 相应的软件程序可以存储在可读取的 存储介质中, 如光盘或硬盘等。
以上所述仅为本发明的较佳实施例, 并不用以限制本发明, 凡在本发明的精神和原则之 内, 所作的任何修改、 等同替换、 改进等, 均应包含在本发明的保护范围之内。

Claims

权 利 要 求 书
1、 一种转码的方法, 其特征在于, 所述方法包括:
接收用户选择的多个多媒体源文件以及所述用户输入的转码目标参数;
对所述多个多媒体源文件进行音视频流分离;
根据所述转码目标参数, 对分离得到的每个音频流和视频流进行转码;
将所述多个多媒体源文件转码后的音频流和视频流合并成多媒体目标文件。
2、如权利要求 1所述的方法,其特征在于,所述接收所述用户输入的转码目标参数之后, 还包括:
判断所述接收的用户输入的转码目标参数是否合法, 如果是, 则执行后续步骤。
3、如权利要求 1所述的方法,其特征在于,所述对多个多媒体源文件进行音视频流分离, 包括:
依次将所述多个多媒体源文件进行音视频流分离; 或,
同时将所述多个多媒体源文件进行音视频流分离;
相应地, 所述对分离得到的每个音频流和视频流进行转码, 包括:
依次对分离得到的每个音频流和视频流进行转码; 或,
同时对分离得到的每个音频流和视频流进行转码。
4、 如权利要求 3所述的方法, 其特征在于, 所述依次将所述多个多媒体源文件进行音视 频流分离, 包括:
根据所述多个多媒体源文件的创建时间的先后顺序, 依次将所述多个多媒体源文件进行 音视频流分离; 或,
根据所述多个多媒体源文件的文件名称的排列前后顺序, 依次将所述多个多媒体源文件 进行音视频流分离; 或,
根据用户要求的合并转码完成后的各文件排列的先后顺序, 依次将所述多个多媒体源文 件进行音视频流分离;
相应地, 所述依次对分离得到的每个音频流和视频流进行转码, 包括:
根据所述多个多媒体源文件的创建时间的先后顺序, 依次对分离得到的每个音频流和视 频流进行转码; 或,
根据所述多个多媒体源文件的文件名称的排列前后顺序, 依次对分离得到的每个音频流 和视频流进行转码; 或,
根据用户要求的合并转码完成后的各文件排列的先后顺序, 依次对分离得到的每个音频 流和视频流进行转码。
5、 如权利要求 1所述的方法, 其特征在于, 所述转码目标参数至少包括:
多媒体目标文件的文件格式, 或, 多媒体目标文件的文件大小。
6、 如权利要求 1所述的方法, 其特征在于, 所述将所述多个多媒体源文件转码后的音频 流和视频流合并成多媒体目标文件, 具体包括:
将每个多媒体源文件转码后的音频流与视频流进行合并, 得到所述每个多媒体源文件对 应的多媒体目标文件。
7、根据权利要求 6所述的方法, 其特征在于, 所述得到所述每个多媒体源文件对应的多 媒体目标文件之后, 还包括:
将得到的所述每个多媒体源文件对应的多媒体目标文件进行合并, 得到一个多媒体目标 文件。
8、 如权利要求 1所述的方法, 其特征在于, 所述将所述多个多媒体源文件转码后的音频 流和视频流合并成多媒体目标文件, 具体包括:
将所有多媒体源文件转码后的音频流进行合并, 得到合并的音频流;
将所述所有多媒体源文件转码后的视频流进行合并, 得到合并的视频流;
将所述合并的音频流及合并的视频流进行合并, 得到一个多媒体目标文件。
9、 一种转码的装置, 其特征在于, 所述装置包括:
接收单元, 用于接收用户选择的多个多媒体源文件以及所述用户输入的转码目标参数; 分离单元, 用于对所述多个多媒体源文件进行音视频流分离;
转码单元, 用于根据所述接收单元接收的转码目标参数, 对所述分离单元分离得到的每 个音频流和视频流进行转码; 合并单元, 用于将所述转码单元得到的所述多个多媒体源文件转码后的音频流和视频流 合并成多媒体目标文件。
10、 如权利要求 9所述的装置, 其特征在于, 所述装置还包括:
判断单元, 用于对所述接收单元接收的所述用户输入的转码目标参数进行合法性判断, 如果合法, 则将所述转码目标参数提供给所述转码单元。
11、 如权利要求 9所述的装置, 其特征在于, 所述分离单元包括:
第一分离子单元, 用于依次将所述多个多媒体源文件进行音视频流分离; 或, 第二分离子单元, 用于同时将所述多个多媒体源文件进行音视频流分离;
相应地, 所述转码单元, 包括:
第一转码单元, 用于依次对分离得到的每个音频流和视频流进行转码; 或,
第二转码单元, 用于同时对分离得到的每个音频流和视频流进行转码。
12、 如权利要求 11所述的装置, 其特征在于, 所述第一分离子单元具体用于当接收到所 述用户输入的转码确认信号后, 根据所述多个多媒体源文件的创建时间的先后顺序, 依次将 所述多个多媒体源文件进行音视频流分离; 或, 根据所述多个多媒体源文件的文件名称的排 列前后顺序, 依次将所述多个多媒体源文件进行音视频流分离; 或, 根据用户要求的合并转 码完成后的各文件排列的先后顺序, 依次将所述多个多媒体源文件进行音视频流分离;
相应地, 所述第一转码单元, 具体用于根据所述多个多媒体源文件的创建时间的先后顺 序, 依次对分离得到的每个音频流和视频流进行转码; 或, 根据所述多个多媒体源文件的文 件名称的排列前后顺序, 依次对分离得到的每个音频流和视频流进行转码; 或, 根据用户要 求的合并转码完成后的各文件排列的先后顺序, 依次对分离得到的每个音频流和视频流进行 转码。
13、 如权利要求 9所述的装置, 其特征在于, 所述合并单元, 具体用于将每个多媒体源 文件转码后的音频流与视频流进行合并,得到所述每个多媒体源文件对应的多媒体目标文件。
14、 根据权利要求 13所述的装置, 其特征在于, 所述合并单元, 还用于将得到的所述每 个多媒体源文件对应的多媒体目标文件进行合并, 得到一个多媒体目标文件。
15、 如权利要求 9所述的装置, 其特征在于, 所述合并单元, 具体用于将所有多媒体源 文件转码后的音频流进行合并, 得到合并的音频流; 将所述所有多媒体源文件转码后的视频 流进行合并, 得到合并的视频流; 将所述合并的音频流及合并的视频流进行合并, 得到一个 多媒体目标文件。
PCT/CN2010/073723 2009-08-26 2010-06-09 一种转码的方法和装置 WO2011023017A1 (zh)

Priority Applications (2)

Application Number Priority Date Filing Date Title
SG2011092145A SG176822A1 (en) 2009-08-26 2010-06-09 Method and device for transcoding
US13/336,331 US8583828B2 (en) 2009-08-26 2011-12-23 Method and device for transcoding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200910168146.6 2009-08-26
CN2009101681466A CN101635854B (zh) 2009-08-26 2009-08-26 一种实现合并转码的方法和装置

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/336,331 Continuation US8583828B2 (en) 2009-08-26 2011-12-23 Method and device for transcoding

Publications (1)

Publication Number Publication Date
WO2011023017A1 true WO2011023017A1 (zh) 2011-03-03

Family

ID=41594884

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2010/073723 WO2011023017A1 (zh) 2009-08-26 2010-06-09 一种转码的方法和装置

Country Status (5)

Country Link
US (1) US8583828B2 (zh)
CN (1) CN101635854B (zh)
MY (1) MY163613A (zh)
SG (1) SG176822A1 (zh)
WO (1) WO2011023017A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107529092A (zh) * 2017-09-30 2017-12-29 北京元心科技有限公司 用户设备、多媒体信息处理的方法及装置
CN113873176A (zh) * 2021-10-27 2021-12-31 北京奇艺世纪科技有限公司 一种媒体文件合并方法及装置

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101635854B (zh) * 2009-08-26 2012-07-04 腾讯科技(深圳)有限公司 一种实现合并转码的方法和装置
CN102163201A (zh) * 2010-02-24 2011-08-24 腾讯科技(深圳)有限公司 一种多媒体文件切割方法、装置及转码器
CN102263941A (zh) * 2010-05-31 2011-11-30 苏州闻道网络科技有限公司 一种视频文件的转码方法和装置
CN102137250A (zh) * 2011-03-16 2011-07-27 深圳市捷视飞通科技有限公司 一种视频会议的方法及系统
EP2648364B1 (en) * 2012-03-07 2018-06-06 Accenture Global Services Limited Communication collaboration
CN102595242B (zh) 2012-03-12 2015-03-11 华为技术有限公司 动态调整视频的系统、终端和方法
CN103327401B (zh) * 2012-03-19 2016-08-03 深圳市快播科技有限公司 多媒体转码器及转码方法、多媒体播放终端
CN103902648B (zh) * 2014-02-10 2018-05-04 深圳市永兴元科技股份有限公司 多文件处理系统及方法
CN103929655B (zh) * 2014-04-25 2017-06-06 网易传媒科技(北京)有限公司 对音视频文件进行转码处理的方法和设备
CN104093072B (zh) 2014-06-30 2017-06-16 京东方科技集团股份有限公司 一种视频信息播放系统和方法
US20160037176A1 (en) * 2014-07-30 2016-02-04 Arris Enterprises, Inc. Automatic and adaptive selection of profiles for adaptive bit rate streaming
CN104159127B (zh) * 2014-08-21 2019-02-22 北京奇艺世纪科技有限公司 一种视频转码方法、装置及系统
US10187684B2 (en) 2015-06-23 2019-01-22 Facebook, Inc. Streaming media presentation system
CN105898448A (zh) * 2015-12-14 2016-08-24 乐视云计算有限公司 转码属性信息的提交方法和装置
CN106131666A (zh) * 2016-07-18 2016-11-16 杭州当虹科技有限公司 一种基于多音轨视频合成技术的机顶盒视频导航系统
CN107016631B (zh) * 2017-03-31 2021-02-12 弘成科技发展有限公司 跨平台课件智能合成方法
CN108989831A (zh) * 2017-05-31 2018-12-11 北京视联动力国际信息技术有限公司 一种多码流的网络录制方法和装置
CN110069455B (zh) * 2017-09-21 2021-12-14 北京华为数字技术有限公司 一种文件合并方法及装置
CN107959884B (zh) * 2017-12-07 2020-10-16 上海网达软件股份有限公司 一种单声道多音频流媒体文件的转码处理方法
US10764396B2 (en) 2017-12-18 2020-09-01 The Directv Group, Inc. Media transcoding based on priority of media
CN112788374B (zh) * 2019-11-05 2023-02-28 腾讯科技(深圳)有限公司 一种信息处理方法、装置、设备及存储介质
CN111352572B (zh) * 2020-05-25 2020-08-25 深圳传音控股股份有限公司 资源处理方法、移动终端和计算机可读存储介质
US20220086197A1 (en) * 2020-09-14 2022-03-17 Damaka, Inc. System and method for establishing and managing multiple call sessions from a centralized control interface
CN112689194B (zh) * 2020-12-21 2023-02-10 展讯半导体(成都)有限公司 功能机视频配乐方法、装置、终端设备及存储介质
CN114827751B (zh) * 2022-03-28 2023-01-17 慧之安信息技术股份有限公司 基于wasm的web端无插件监控录像播放方法
CN115510825B (zh) * 2022-11-18 2023-04-07 深圳市徐港电子有限公司 一种音频参数配置方法、装置、电子设备及存储介质

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101098483A (zh) * 2007-07-19 2008-01-02 上海交通大学 以图像组结构为并行处理单元的视频集群转码系统
US20090003458A1 (en) * 2007-06-29 2009-01-01 The Hong Kong University Of Science And Technology Video transcoding quality enhancement
CN101635854A (zh) * 2009-08-26 2010-01-27 腾讯科技(深圳)有限公司 一种实现合并转码的方法和装置

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8010711B2 (en) * 2007-01-26 2011-08-30 Digital Video Chip, Llc Universal multimedia
CN100496129C (zh) * 2007-06-05 2009-06-03 南京大学 基于h.264多路视频转码复用的方法

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090003458A1 (en) * 2007-06-29 2009-01-01 The Hong Kong University Of Science And Technology Video transcoding quality enhancement
CN101098483A (zh) * 2007-07-19 2008-01-02 上海交通大学 以图像组结构为并行处理单元的视频集群转码系统
CN101635854A (zh) * 2009-08-26 2010-01-27 腾讯科技(深圳)有限公司 一种实现合并转码的方法和装置

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
WARM APPLAUSE VIDEO FORMAT GRABBED ALL VIDEO JOINER COMPUTER ENTHUSIASTS, no. 12, June 2005 (2005-06-01), pages 59 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107529092A (zh) * 2017-09-30 2017-12-29 北京元心科技有限公司 用户设备、多媒体信息处理的方法及装置
CN113873176A (zh) * 2021-10-27 2021-12-31 北京奇艺世纪科技有限公司 一种媒体文件合并方法及装置
CN113873176B (zh) * 2021-10-27 2024-03-08 北京奇艺世纪科技有限公司 一种媒体文件合并方法及装置

Also Published As

Publication number Publication date
MY163613A (en) 2017-10-13
US20120185610A1 (en) 2012-07-19
SG176822A1 (en) 2012-01-30
CN101635854A (zh) 2010-01-27
US8583828B2 (en) 2013-11-12
CN101635854B (zh) 2012-07-04

Similar Documents

Publication Publication Date Title
WO2011023017A1 (zh) 一种转码的方法和装置
US8477950B2 (en) Home theater component for a virtualized home theater system
JP6493765B2 (ja) 情報処理装置および方法
WO2018082284A1 (zh) 3d全景音视频直播系统及音视频采集方法
US20150249848A1 (en) Intelligent Video Quality Adjustment
US20110261151A1 (en) Video and audio processing method, multipoint control unit and videoconference system
WO2011054208A1 (zh) 一种媒体文件的压缩方法和系统
WO2017101369A1 (zh) 直播视频的转码方法及装置
CN108040061B (zh) 一种云会议直播方法
WO2013037241A1 (zh) 移动多媒体实时转码播放系统、装置、存储介质及方法
WO2018077259A1 (zh) 一种媒体信息的切换方法及服务器、计算机存储介质
WO2011050690A1 (zh) 用于录制和回播多媒体会议的方法和系統
US10812841B2 (en) Apparatus for encoding and transcoding which applies multi-format ultra-high definition high-efficiency codec
JP7207447B2 (ja) 受信装置、受信方法、送信装置および送信方法
WO2005104579A1 (en) Interactive broadcast system
WO2021143043A1 (zh) 多人即时通讯方法、系统、装置及电子设备
JP4741325B2 (ja) 多地点会議方法及び多地点会議システム
KR102137858B1 (ko) 송신 장치, 송신 방법, 수신 장치, 수신 방법 및 프로그램
WO2010020193A1 (zh) 一种提供实时场景的多媒体系统及其实现方法
JP2008288974A (ja) ビデオ会議システム及びビデオ会議装置
WO2021235048A1 (ja) 無観客ライブ配信方法及びシステム
WO2014012384A1 (zh) 通信数据的发送方法、系统及接收装置
WO2020258976A1 (zh) 一种会议录制方法、装置及会议录制系统
WO2013082750A1 (zh) 实时转码方法及设备
TW200829000A (en) Apparatus, system and method for remotely opearting multimedia streaming

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10811177

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO F1205A DATED 06-07-2012)

122 Ep: pct application non-entry in european phase

Ref document number: 10811177

Country of ref document: EP

Kind code of ref document: A1