WO2017166583A1 - 一种视频流转码方法、装置和电子设备 - Google Patents
一种视频流转码方法、装置和电子设备 Download PDFInfo
- Publication number
- WO2017166583A1 WO2017166583A1 PCT/CN2016/095971 CN2016095971W WO2017166583A1 WO 2017166583 A1 WO2017166583 A1 WO 2017166583A1 CN 2016095971 W CN2016095971 W CN 2016095971W WO 2017166583 A1 WO2017166583 A1 WO 2017166583A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- track information
- track
- audio
- video stream
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 27
- 238000012216 screening Methods 0.000 claims abstract description 18
- 238000004590 computer program Methods 0.000 claims description 15
- 238000001914 filtration Methods 0.000 claims description 13
- 230000005540 biological transmission Effects 0.000 abstract description 4
- 238000010586 diagram Methods 0.000 description 6
- 238000012545 processing Methods 0.000 description 5
- 230000006870 function Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4341—Demultiplexing of audio and video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4398—Processing of audio elementary streams involving reformatting operations of audio signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/4402—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
- H04N21/440236—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by media transcoding, e.g. video is transformed into a slideshow of still pictures, audio is converted into text
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
Definitions
- the invention relates to a video transcoding technology, in particular to a video stream transcoding method, device and electronic device.
- the object of the present invention is to provide a video stream transcoding method, apparatus and electronic device, which can save video stream storage space and transmission bandwidth.
- the present invention provides a video stream transcoding method, including:
- the video stream of the filtered output track information is transcoded.
- the method of the present invention wherein the step of parsing the preset video stream source information including one video track information and a plurality of track information further includes language information and an audio encoding format. information.
- step of filtering out the output track information in the parsed plurality of track information further comprises: filtering an output for each language included in the plurality of track information Track information.
- the step of parsing the preset video stream source information including one video track information and the plurality of track information further includes language information and an audio encoding format.
- Information, audio code rate information, channel number information; the audio track with the best audio quality is selected according to audio coding format information, audio code rate information, and channel number information.
- the present invention further provides a video stream transcoding device, including:
- a source information parsing unit configured to parse a preset video stream source information including one video track information and a plurality of track information
- An output track screening unit configured to filter out output track information according to the plurality of track information parsed by the slice source parsing unit
- the video stream transcoding unit is configured to transcode and output the video stream of the output track information filtered by the output track screening unit.
- the slice source information parsing unit is further configured to parse a preset video stream source information including one video track information and a plurality of track information, each track information including language information And audio encoding format information.
- the output track screening unit is further configured to filter out one output track information for each language information included in the parsed plurality of track information.
- the output track screening unit is further configured to filter out an output track information with the best audio quality for each language information included in the parsed plurality of track information.
- the slice source information parsing unit is further configured to parse a preset video stream source information including one video track information and a plurality of track information, each track information including language information And audio code format information, audio code rate information, and channel number information;
- the output audio filtering unit is further configured to: use the audio code parsed by the slice source information parsing unit
- the format information, the audio code rate information, and the channel number information are used to filter out an output audio track information with the best audio quality for each language information included in the parsed plurality of audio track information.
- Embodiments of the present invention also provide an electronic device including at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor, Executing, by the at least one processor, the at least one processor to: parse a preset video stream source information including one video track information and a plurality of track information; The output track information is filtered out from the plurality of track information; and the video stream of the filtered output track information is transcoded and output.
- each of the track information further includes language information and audio encoding format information.
- the filtering out the output track information in the parsed plurality of track information further comprises: filtering out one output track information for each language included in the plurality of track information.
- an output track information having the best audio quality is filtered for each language included in the plurality of track information.
- each track information further includes language information, audio encoding format information, and audio.
- the bit rate information and the channel number information; the audio track with the best audio quality is selected according to the audio encoding format information, the audio bit rate information, and the channel number information.
- Embodiments of the present invention also provide a non-volatile computer storage medium, wherein the storage medium stores computer-executable instructions that, when executed by an electronic device, enable the electronic device to: resolve a pre-set Video stream source information including one video track information and a plurality of track information; filtering out output track information in the parsed plurality of track information; and transcoding the filtered video stream of the output track information Output.
- the storage medium stores computer-executable instructions that, when executed by an electronic device, enable the electronic device to: resolve a pre-set Video stream source information including one video track information and a plurality of track information; filtering out output track information in the parsed plurality of track information; and transcoding the filtered video stream of the output track information Output.
- each of the track information further includes language information and audio Encoding format information.
- the non-volatile computer storage medium wherein the filtering out the output audio track information in the parsed plurality of audio track information further comprises: filtering one for each language included in the plurality of audio track information Output track information.
- each of the track information further includes language information and audio
- Embodiments of the present invention also provide a computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising program instructions, when the program instructions are executed by a computer
- the computer is caused to perform the method of any of the above.
- a video stream transcoding method, apparatus, and electronic device provided by an embodiment of the present invention, by parsing a preset video stream source information including one video track information and a plurality of track information; The output track information is filtered out from the parsed plurality of track information; finally, the video stream of the filtered output track information is transcoded and output. It realizes saving video stream storage space and also saves video streaming bandwidth.
- FIG. 1 is a flowchart of an embodiment of a video stream transcoding method according to the present invention
- FIG. 2 is a structural block diagram of an embodiment of a video stream transcoding device according to the present invention.
- FIG. 3 is a schematic structural diagram of hardware of an electronic device according to an embodiment of the present invention.
- connection or integral connection; may be mechanical connection or electrical connection; may be directly connected, may also be indirectly connected through an intermediate medium, or may be internal communication of two components, may be wireless connection, or may be wired connection.
- connection or integral connection; may be mechanical connection or electrical connection; may be directly connected, may also be indirectly connected through an intermediate medium, or may be internal communication of two components, may be wireless connection, or may be wired connection.
- FIG. 1 is a flow chart of an embodiment of a video stream transcoding method of the present invention.
- Step 100 Parsing preset video stream source information including one video track information and multiple track information; each track information further includes language information and audio encoding format information, audio bit rate information, and channel Number information. If a plurality of language information is included in the plurality of track information, the track information corresponding to each language information is filtered out as output track information; if there are two or more identical languages having different audio codes Format, then the best audio track information is filtered out as the output track information corresponding to this language. Here, the best judgment condition of the audio quality is judged by the audio encoding format information, the audio bit rate information, and the channel number information.
- Step 200 Filter out output track information in the parsed plurality of track information
- Step 300 Transcode output of the filtered video stream of the output track information.
- the video stream source information including one video track information and the plurality of track information is preset by parsing; and the output track information is filtered out in the parsed plurality of track information; The video stream of the filtered output track information is transcoded. It realizes saving video stream storage space and also saves video streaming bandwidth.
- FIG. 2 is a block diagram showing the structure of an embodiment of a video stream transcoding apparatus of the present invention.
- An apparatus 1 includes: a source information parsing unit 2, an output audio track screening unit 3, and a video stream transcoding unit 4.
- the source information parsing unit 2 is configured to parse the preset video stream source information including one video track information and multiple track information;
- the output track screening unit 3 is configured to filter out the output track information according to the plurality of track information parsed by the slice source parsing unit;
- the video stream transcoding unit 4 is configured to perform transcoding output on the video stream of the output track information filtered by the output track screening unit.
- the source information parsing unit 2 is further configured to parse a preset video stream source information including one video track information and a plurality of track information, each track information including language information and audio. Encoding format information.
- the output track screening unit 2 filters out one output track information for each language information included in the parsed plurality of track information. For example, if a plurality of language information is included in a plurality of track information, the track information corresponding to each language information is filtered out as output track information; if two or more of the same languages have different
- the audio encoding format then filters out the best output audio information for an audio quality.
- the best judgment condition of the audio quality is judged by the audio encoding format information, the audio bit rate information, and the channel number information.
- the device in this embodiment is used to implement the corresponding method in the foregoing first embodiment, and has the beneficial effects of the corresponding method embodiments, and details are not described herein again.
- an embodiment of the present invention further discloses an electronic device including at least one processor 810; and a memory 800 communicably connected to the at least one processor 810; wherein the memory 800 stores Executing instructions executed by at least one processor 810, the instructions being executed by the at least one processor 810 to enable the at least one processor 810 to: parse a predetermined set of video track information and a plurality of track information Video stream source information; outputting the output track information in the parsed plurality of track information; transcoding and outputting the filtered video stream of the output track information.
- the electronic device also includes an input device 830 and an output device 840 that are electrically coupled to the memory 800 and the processor, the electrical connections preferably being connected by a bus.
- the parsing the preset video stream source information including one video track information and the plurality of track information, each of the track information further includes language information and an audio encoding format. information.
- the screening is performed on the parsed plurality of audio track information
- Selecting the output track information further includes: filtering out one output track information for each language included in the plurality of track information.
- the electronic device of the embodiment preferably filters out one output audio track information having the best audio quality for each language included in the plurality of pieces of track information.
- the parsing the preset video stream source information including one video track information and the plurality of track information each of the track information further includes language information and an audio encoding format.
- Information, audio code rate information, channel number information; the audio track with the best audio quality is selected according to audio coding format information, audio code rate information, and channel number information.
- Embodiments of the present invention also disclose a non-volatile computer storage medium, wherein the storage medium stores computer-executable instructions that, when executed by an electronic device, enable the electronic device to: parse the preset The video stream source information including one video track information and a plurality of track information; the output track information is filtered out in the parsed plurality of track information; and the filtered video stream of the output track information is rotated Code output.
- each of the track information further includes a language Information and audio encoding format information.
- the filtering out the output audio track information in the parsed plurality of audio track information further includes: for each language included in the plurality of audio track information Filter out an output track information.
- the nonvolatile computer storage medium of the present embodiment preferably filters out one of the best audio quality output track information for each of the plurality of audio track information.
- each of the track information further includes a language Information, audio encoding format information, audio rate information, channel number information; the audio track with the best audio quality is selected according to audio encoding format information, audio bit rate information, and channel number information.
- the embodiment of the invention further provides a computer program product comprising a computer program stored on a non-transitory computer readable storage medium, the computer program comprising A sequence instruction that, when executed by a computer, causes the computer to perform the method described in the above embodiments.
- embodiments of the present invention can be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or a combination of software and hardware. Moreover, the invention can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) including computer usable program code.
- computer-usable storage media including but not limited to disk storage, CD-ROM, optical storage, etc.
- the computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising the instruction device.
- the apparatus implements the functions specified in one or more blocks of a flow or a flow and/or block diagram of the flowchart.
- These computer program instructions can also be loaded onto a computer or other programmable data processing device such that a series of operational steps are performed on a computer or other programmable device to produce computer-implemented processing for execution on a computer or other programmable device.
- the instructions provide steps for implementing the functions specified in one or more of the flow or in a block or blocks of a flow diagram.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
一种视频流转码方法、装置和电子设备,所述装置包括:片源信息解析单元,用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;输出音轨筛选单元,用于根据片源解析单元解析出的所述多个音轨信息中筛选出输出音轨信息;视频流转码单元,用于对输出音轨筛选单元筛选的输出音轨信息的视频流进行转码输出。实现了节约视频流存储空间及传输带宽的效果。
Description
交叉引用
本申请要求在2016年03月31日提交中国专利局、申请号为201610200823.8、发明名称为“一种视频流转码方法及装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
本发明涉及视频转码技术,特别是指一种视频流转码方法、装置和电子设备。
随着网络视频技术的发展,用户对视频播放的体验越来越看重。在现有的视频流进行播放时,通常只有一个视频轨对应一个音轨。或者即使是多音轨时,也是多个视频轨分别绑定多个音轨。比如,一个视频有中文、英文配音时,其实,也是中文配音对应一个视频,英文配音对应一个视频。也就是说两个不同的音轨但对应的视频轨数据是完全相同的,这样以来,造成了存储空间和传输带宽的浪费。
因此,如何提供节约存储和传输带宽的一种视频流转码方法、装置和电子设备成为亟待解决的技术问题。
发明内容
有鉴于此,本发明的目的在于提出一种视频流转码方法、装置和电子设备,实现节约视频流存储空间及传输带宽。
基于上述目的本发明提供了一种视频流转码方法,包括:
解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;
在解析出的所述多个音轨信息中筛选出输出音轨信息;
对筛选的输出音轨信息的视频流进行转码输出。
本发明所述的方法,其中,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息的步骤中所述每个音轨信息进一步包含语种信息和音频编码格式信息。
本发明所述的方法,其中,所述在解析出的所述多个音轨信息中筛选出输出音轨信息的步骤进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
本发明所述的方法,其中,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
本发明所述的方法,其中,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息的步骤中所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
基于上述目的本发明还提供了一种视频流转码装置,包括:
片源信息解析单元,用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;
输出音轨筛选单元,用于根据片源解析单元解析出的所述多个音轨信息中筛选出输出音轨信息;
视频流转码单元,用于对输出音轨筛选单元筛选的输出音轨信息的视频流进行转码输出。
本发明所述的装置,其中,所述片源信息解析单元,进一步用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息,每个音轨信息包含语种信息和音频编码格式信息。
本发明所述的装置,其中,所述输出音轨筛选单元,进一步用于针对解析出的多个音轨信息中包含的每个语种信息筛选出一个输出音轨信息。
本发明所述的装置,其中,所述输出音轨筛选单元,进一步用于针对解析出的多个音轨信息中包含的每个语种信息筛选出一个音频质量最好的输出音轨信息。
本发明所述的装置,其中,所述片源信息解析单元,进一步用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息,每个音轨信息含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述输出音频筛选单元,进一步用于根据所述片源信息解析单元解析出的音频编码
格式信息、音频码率信息、声道数信息,针对解析出的多个音轨信息中包含的每个语种信息筛选出一个音频质量最好的输出音轨信息。
本发明实施例还提供一种电子设备,包括至少一个处理器;以及,与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;对筛选的输出音轨信息的视频流进行转码输出。
上述的电子设备,其中,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,每个所述音轨信息进一步包含语种信息和音频编码格式信息。
上述的电子设备,其中,所述在解析出的所述多个音轨信息中筛选出输出音轨信息进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
上述的电子设备,其中,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
上述的电子设备,其中,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
本发明实施例还提供一种非易失性计算机存储介质,其中,所述存储介质存储有计算机可执行指令,所述计算机可执行指令当由电子设备执行时使得电子设备能够:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;对筛选的输出音轨信息的视频流进行转码输出。
上述的非易失性计算机存储介质,其中,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,每个所述音轨信息进一步包含语种信息和音频编码格式信息。
上述的非易失性计算机存储介质,其中,所述在解析出的所述多个音轨信息中筛选出输出音轨信息进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
上述的非易失性计算机存储介质,其中,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
上述的非易失性计算机存储介质,其中,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
本发明实施例还提供了一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,使所述计算机执行上述任一所述的方法。
从上面所述可以看出,本发明实施例提供的一种视频流转码方法、装置和电子设备,通过解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;最后对筛选的输出音轨信息的视频流进行转码输出。实现了节约视频流存储空间的同时,也节约了视频流传输带宽。
图1为本发明一种视频流转码方法的实施例的流程图;
图2为本发明一种视频流转码装置的实施例的结构框图;
图3为本发明实施例的电子设备的硬件结构示意图。
下面将结合附图对本发明的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。
在本发明的描述中,需要说明的是,术语“中心”、“上”、“下”、“左”、“右”、“竖直”、“水平”、“内”、“外”等指示的方位或位置关系为基于附图所示的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,
因此不能理解为对本发明的限制。此外,术语“第一”、“第二”、“第三”仅用于描述目的,而不能理解为指示或暗示相对重要性。
在本发明的描述中,需要说明的是,除非另有明确的规定和限定,术语“安装”、“相连”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是机械连接,也可以是电连接;可以是直接相连,也可以通过中间媒介间接相连,还可以是两个元件内部的连通,可以是无线连接,也可以是有线连接。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本发明中的具体含义。
此外,下面所描述的本发明不同实施方式中所涉及的技术特征只要彼此之间未构成冲突就可以相互结合。
实施例一
参照图1,是本发明一种视频流转码方法的实施例的流程图。
本实施例所述的一种视频流转码方法,包括:
步骤100:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;所述每个音轨信息还包含了语种信息和音频编码格式信息、音频码率信息、声道数信息。如果多个音轨信息中包含了多种语种信息,那么针对每种语种信息对应的音轨信息被筛选出来作为输出音轨信息;如果有两个或两个以上的同一语种具有不同的音频编码格式,那么筛选出音频质量最好的音轨信息作为这种语种对应的输出音轨信息。此处,音频质量最好的判断条件是通过音频编码格式信息、音频码率信息、声道数信息来判断的。
步骤200:在解析出的所述多个音轨信息中筛选出输出音轨信息;
步骤300:对筛选的输出音轨信息的视频流进行转码输出。
本实施例可以看出,通过解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;最后对筛选的输出音轨信息的视频流进行转码输出。实现了节约视频流存储空间的同时,也节约了视频流传输带宽。
实施例二
参照图2,是本发明一种视频流转码装置的实施例的结构框图。
本实施例所述的一种装置1,包括:片源信息解析单元2、输出音轨筛选单元3、视频流转码单元4。
片源信息解析单元2,用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;
输出音轨筛选单元3,用于根据片源解析单元解析出的所述多个音轨信息中筛选出输出音轨信息;
视频流转码单元4,用于对输出音轨筛选单元筛选的输出音轨信息的视频流进行转码输出。
在本实施例中,所述片源信息解析单元2,还用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息,每个音轨信息包含语种信息和音频编码格式信息。
当所述输出音轨筛选单元2针对解析出的多个音轨信息中包含的每个语种信息筛选出一个输出音轨信息。例如,如果多个音轨信息中包含了多种语种信息,那么针对每种语种信息对应的音轨信息被筛选出来作为输出音轨信息;如果有两个或两个以上的同一语种具有不同的音频编码格式,那么筛选出一个音频质量最好的输出音轨信息。此处,音频质量最好的判断条件是通过音频编码格式信息、音频码率信息、声道数信息来判断的。
本实施例的装置用于实现前述实施例一中相应的方法,并且具有相应的方法实施例的有益效果,在此不再赘述。
实施例三
参照图3,本发明实施例又公开了一种电子设备,包括至少一个处理器810;以及,与所述至少一个处理器810通信连接的存储器800;其中,所述存储器800存储有可被所述至少一个处理器810执行的指令,所述指令被所述至少一个处理器810执行,以使所述至少一个处理器810能够:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;对筛选的输出音轨信息的视频流进行转码输出。所述电子设备还包括与所述存储器800和所述处理器电连接的输入装置830和输出装置840,所述电连接优选为通过总线连接。
本实施例的电子设备,优选地,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,每个所述音轨信息进一步包含语种信息和音频编码格式信息。
本实施例的电子设备,优选地,所述在解析出的所述多个音轨信息中筛
选出输出音轨信息进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
本实施例的电子设备,优选地,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
本实施例的电子设备,优选地,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
实施例四
本发明实施例还公开了一种非易失性计算机存储介质,其中,所述存储介质存储有计算机可执行指令,所述计算机可执行指令当由电子设备执行时使得电子设备能够:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;对筛选的输出音轨信息的视频流进行转码输出。
本实施例的非易失性计算机存储介质,优选地,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,每个所述音轨信息进一步包含语种信息和音频编码格式信息。
本实施例的非易失性计算机存储介质,优选地,所述在解析出的所述多个音轨信息中筛选出输出音轨信息进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
本实施例的非易失性计算机存储介质,优选地,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
本实施例的非易失性计算机存储介质,优选地,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
实施例五
本发明实施例还提供了一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算机程序,所述计算机程序包括程
序指令,当所述程序指令被计算机执行时,使所述计算机执行上述实施例所述的方法。
本领域内的技术人员应明白,本发明的实施例可提供为方法、系统、或计算机程序产品。因此,本发明可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。
本发明是参照根据本发明实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。
显然,上述实施例仅仅是为清楚地说明所作的举例,而并非对实施方式的限定。对于所属领域的普通技术人员来说,在上述说明的基础上还可以做出其它不同形式的变化或变动。这里无需也无法对所有的实施方式予以穷举。而由此所引伸出的显而易见的变化或变动仍处于本发明创造的保护范围之中。
Claims (21)
- 一种视频流转码方法,应用于终端,其特征在于,包括:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;对筛选的输出音轨信息的视频流进行转码输出。
- 根据权利要求1所述的方法,其特征在于,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息的步骤中所述每个音轨信息进一步包含语种信息和音频编码格式信息。
- 根据权利要求2所述的方法,其特征在于,所述在解析出的所述多个音轨信息中筛选出输出音轨信息的步骤进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
- 根据权利要求3所述的方法,其特征在于,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
- 根据权利要求4所述的方法,其特征在于:所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息的步骤中所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
- 一种视频流转码装置,其特征在于包括:片源信息解析单元,用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;输出音轨筛选单元,用于根据片源解析单元解析出的所述多个音轨信息中筛选出输出音轨信息;视频流转码单元,用于对输出音轨筛选单元筛选的输出音轨信息的视频流进行转码输出。
- 根据权利要求6所述的装置,其特征在于:所述片源信息解析单元,进一步用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息,每个音轨信息包含语种信息和音频编 码格式信息。
- 根据权利要求7所述的装置,其特征在于:所述输出音轨筛选单元,进一步用于针对解析出的多个音轨信息中包含的每个语种信息筛选出一个输出音轨信息。
- 根据权利要求7所述的装置,其特征在于:所述输出音轨筛选单元,进一步用于针对解析出的多个音轨信息中包含的每个语种信息筛选出一个音频质量最好的输出音轨信息。
- 根据权利要求7所述的装置,其特征在于:所述片源信息解析单元,进一步用于解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息,每个音轨信息含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述输出音频筛选单元,进一步用于根据所述片源信息解析单元解析出的音频编码格式信息、音频码率信息、声道数信息,针对解析出的多个音轨信息中包含的每个语种信息筛选出一个音频质量最好的输出音轨信息。
- 一种电子设备,其特征在于,包括至少一个处理器;以及,与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;对筛选的输出音轨信息的视频流进行转码输出。
- 根据权利要求11所述的电子设备,其特征在于,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,每个所述音轨信息进一步包含语种信息和音频编码格式信息。
- 根据权利要求12所述的电子设备,其特征在于,所述在解析出的所述多个音轨信息中筛选出输出音轨信息进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
- 根据权利要求13所述的电子设备,其特征在于,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
- 根据权利要求14所述的电子设备,其特征在于,所述解析预先设置 的含有一个视频轨信息和多个音轨信息的视频流片源信息中,所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
- 一种非易失性计算机存储介质,其特征在于,所述存储介质存储有计算机可执行指令,所述计算机可执行指令当由电子设备执行时使得电子设备能够:解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息;在解析出的所述多个音轨信息中筛选出输出音轨信息;对筛选的输出音轨信息的视频流进行转码输出。
- 根据权利要求16所述的非易失性计算机存储介质,其特征在于,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,每个所述音轨信息进一步包含语种信息和音频编码格式信息。
- 根据权利要求17所述的非易失性计算机存储介质,其特征在于,所述在解析出的所述多个音轨信息中筛选出输出音轨信息进一步包括:针对多个音轨信息中包含的每个语种筛选出一个输出音轨信息。
- 根据权利要求18所述的非易失性计算机存储介质,其特征在于,针对多个音轨信息中包含的每个语种筛选出一个音频质量最好的输出音轨信息。
- 根据权利要求19所述的非易失性计算机存储介质,其特征在于,所述解析预先设置的含有一个视频轨信息和多个音轨信息的视频流片源信息中,所述每个音轨信息进一步包含语种信息、音频编码格式信息、音频码率信息、声道数信息;所述音频质量最好的音轨是根据音频编码格式信息、音频码率信息、声道数信息选择出来的。
- 一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,其特征在于,当所述程序指令被计算机执行时,使所述计算机执行上述任一权利要求所述的方法。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610200823.8A CN105872727A (zh) | 2016-03-31 | 2016-03-31 | 一种视频流转码方法及装置 |
CN201610200823.8 | 2016-03-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2017166583A1 true WO2017166583A1 (zh) | 2017-10-05 |
Family
ID=56626723
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2016/095971 WO2017166583A1 (zh) | 2016-03-31 | 2016-08-19 | 一种视频流转码方法、装置和电子设备 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN105872727A (zh) |
WO (1) | WO2017166583A1 (zh) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105872727A (zh) * | 2016-03-31 | 2016-08-17 | 乐视控股(北京)有限公司 | 一种视频流转码方法及装置 |
EP3783906A4 (en) * | 2018-05-29 | 2021-02-24 | Huawei Technologies Co., Ltd. | METHOD AND DEVICE FOR SELECTING THE AUDIO TRACK FROM AUDIO AND VIDEO FILES |
CN112735445A (zh) * | 2020-12-25 | 2021-04-30 | 广州朗国电子科技有限公司 | 自适应选择音轨的方法、装置及存储介质 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070136777A1 (en) * | 2005-12-09 | 2007-06-14 | Charles Hasek | Caption data delivery apparatus and methods |
CN103929655A (zh) * | 2014-04-25 | 2014-07-16 | 网易传媒科技(北京)有限公司 | 对音视频文件进行转码处理的方法和设备 |
CN104768052A (zh) * | 2015-04-02 | 2015-07-08 | 无锡天脉聚源传媒科技有限公司 | 一种根据语言提取音频及字幕的方法及装置 |
CN104796759A (zh) * | 2015-04-07 | 2015-07-22 | 无锡天脉聚源传媒科技有限公司 | 一种从多路音频中提取一路音频的方法及装置 |
CN105872727A (zh) * | 2016-03-31 | 2016-08-17 | 乐视控股(北京)有限公司 | 一种视频流转码方法及装置 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SG150415A1 (en) * | 2007-09-05 | 2009-03-30 | Creative Tech Ltd | A method for incorporating a soundtrack into an edited video-with-audio recording and an audio tag |
CN103916692A (zh) * | 2014-03-25 | 2014-07-09 | 小米科技有限责任公司 | 视频播放方法、装置及播放终端 |
CN105392028B (zh) * | 2015-10-12 | 2019-05-24 | 天脉聚源(北京)传媒科技有限公司 | 一种数据的传输方法及装置 |
-
2016
- 2016-03-31 CN CN201610200823.8A patent/CN105872727A/zh active Pending
- 2016-08-19 WO PCT/CN2016/095971 patent/WO2017166583A1/zh active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070136777A1 (en) * | 2005-12-09 | 2007-06-14 | Charles Hasek | Caption data delivery apparatus and methods |
CN103929655A (zh) * | 2014-04-25 | 2014-07-16 | 网易传媒科技(北京)有限公司 | 对音视频文件进行转码处理的方法和设备 |
CN104768052A (zh) * | 2015-04-02 | 2015-07-08 | 无锡天脉聚源传媒科技有限公司 | 一种根据语言提取音频及字幕的方法及装置 |
CN104796759A (zh) * | 2015-04-07 | 2015-07-22 | 无锡天脉聚源传媒科技有限公司 | 一种从多路音频中提取一路音频的方法及装置 |
CN105872727A (zh) * | 2016-03-31 | 2016-08-17 | 乐视控股(北京)有限公司 | 一种视频流转码方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
CN105872727A (zh) | 2016-08-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6475228B2 (ja) | コンテナフォーマットでのメディアファイルの構文を意識した操作 | |
WO2017166583A1 (zh) | 一种视频流转码方法、装置和电子设备 | |
TWI595480B (zh) | 用以處理音訊信號之方法及裝置、音訊解碼器及音訊編碼器 | |
US10298931B2 (en) | Coupling sample metadata with media samples | |
EP3171593B1 (en) | Testing system and method | |
MX349110B (es) | Método para codificar video para un entorno de decodificador y dispositivo para el mismo y método para decodificar video con base en el entorno de decodificador y dispositivo para el mismo. | |
US11818189B2 (en) | Method and apparatus for media streaming | |
RU2015104987A (ru) | Способ кодирования видео и устройство кодирования видео и способ декодирования видео и устройство декодирования видео для сигнализации параметров sao | |
US11303688B2 (en) | Methods and apparatuses for dynamic adaptive streaming over HTTP | |
US11490169B2 (en) | Events in timed metadata tracks | |
WO2021067187A1 (en) | Methods and apparatuses for dynamic adaptive streaming over http | |
CN106358047A (zh) | 一种播放流媒体视频的方法及装置 | |
WO2020026009A1 (zh) | 一种视频对象的推荐方法、装置和设备/终端/服务器 | |
US11632599B2 (en) | Manifest file updating and early termination of content | |
WO2017076325A1 (zh) | 码流播出的方法及装置 | |
US11700415B2 (en) | Audio transitions when streaming audiovisual media titles | |
CN104796732A (zh) | 一种音视频编辑方法及装置 | |
TWI552573B (zh) | 具有初始化片段之視訊及音訊之寫碼 | |
CN105812922A (zh) | 多媒体文件数据的处理方法及系统、播放器和客户端 | |
CN111225210B (zh) | 视频编码方法、视频编码装置及终端设备 | |
US20170289585A1 (en) | Information processing apparatus and information processing method | |
JP2016116148A (ja) | デコード装置及びデコード方法 | |
WO2022183841A1 (zh) | 解码方法、装置和计算机可读存储介质 | |
US20130336408A1 (en) | Information processing apparatus, information processing method and non-transitory storage medium | |
US20150098022A1 (en) | Methods and systems for file based content verification using multicore architecture |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16896355 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 16896355 Country of ref document: EP Kind code of ref document: A1 |