CN104768052A - Method and device for extracting voice frequency and subtitles according to language - Google Patents

Info

Publication number
CN104768052A
Authority
CN
China
Prior art keywords
language
captions
data
frequency
video
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510155980.7A
Other languages
Chinese (zh)
Inventor
彭岳松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuxi Tvmining Juyuan Media Technology Co Ltd
Original Assignee
Wuxi Tvmining Juyuan Media Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuxi Tvmining Juyuan Media Technology Co Ltd
Priority to CN201510155980.7A
Publication of CN104768052A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434 Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4341 Demultiplexing of audio and video streams
    • H04N21/4348 Demultiplexing of additional data and video streams

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Studio Circuits (AREA)

Abstract

The invention discloses a method and a device for extracting audio and subtitles according to language. They are used to extract one audio track and one subtitle track of a designated language from a video file that has multiple audio and subtitle tracks, so that a multi-stream video is converted into a single-stream video. The method comprises the following steps: the video file with multiple audio and subtitle tracks is decapsulated to obtain the video data together with the multiple audio tracks and multiple subtitle tracks, which are stored mixed together; according to the format of the video file, the language information of each audio track and each subtitle track is obtained from the information header of the video file; according to that language information, the audio data and subtitle data of the designated language are extracted from the mixed audio and subtitle data; and the audio data and subtitle data of the designated language are merged with the video data.

Description

A method and device for extracting audio and subtitles according to language
Technical field
The present invention relates to the field of multimedia technology, and in particular to a method and a device for extracting audio and subtitles according to language.
Background
With the rapid development of Internet video, existing video files often have to be decoded and then re-encoded and re-encapsulated to obtain a file in a format that a local player or a current video website can play. Common video file formats include FLV, DV, MP4, MKV, MOV, TS and 3GP. In formats such as FLV, DV and MP4, audio and video are two streams, one of each. In MKV, MOV and TS, by contrast, there can be multiple audio and video streams, and MKV can also carry multiple subtitle streams. However, existing video decoding tools accept only single-stream input and produce single-stream output; they do not support multi-stream video formats. A scheme is therefore needed that converts a multi-stream video format (MKV in particular) into a single-stream video format, that is, a scheme that extracts one designated audio track and one designated subtitle track from a video file with multiple audio and subtitle tracks, so that the video can subsequently be transcoded, played and otherwise processed.
Summary of the invention
The invention provides a method and a device for extracting audio and subtitles according to language, used to extract one audio track and one subtitle track of a designated language from a video file that has multiple audio and subtitle tracks, thereby converting a multi-stream video into a single-stream video.
The invention provides a method for extracting audio and subtitles according to language, comprising:
Decapsulating a video file that has multiple audio and subtitle tracks to obtain the video data together with the multiple audio tracks and multiple subtitle tracks, which are stored mixed together;
According to the format of the video file, obtaining the language information of each audio track and each subtitle track from the information header of the video file;
According to the language information of each audio track and each subtitle track, extracting the audio data and subtitle data of the designated language from the mixed audio and subtitle data;
Merging the audio data and subtitle data of the designated language with the video data.
Some beneficial effects of the embodiments of the invention may include:
Based on the language information recorded for each audio track and each subtitle track in the information header of the video file, one audio track and one subtitle track of the designated language are extracted and merged with the video data, so that a multi-stream video is converted into a single-stream video. In addition, the audio and subtitles of the resulting single-stream video can be chosen to match the viewers' language, which greatly improves the viewing experience.
In one embodiment, the video file with multiple audio and subtitle tracks is an MKV multimedia container file.
Some beneficial effects of the embodiments of the invention may include:
MKV is a relatively new container format that can encapsulate content in many formats, and it is expected to become increasingly common. The method provided by the invention can be applied to MKV files and is therefore widely applicable.
In one embodiment, obtaining the language information of each audio track and each subtitle track from the information header of the video file comprises:
Reading the Track information header of the MKV multimedia container file;
Reading, from the Track information header, the TrackEntry corresponding to each audio track and each subtitle track;
Reading the Language field in each TrackEntry;
Obtaining the language information of each audio track and each subtitle track from the Language field.
Some beneficial effects of the embodiments of the invention may include:
Because an MKV file has a specific format, locating its Track information header and reading the Language field in the TrackEntry of each audio track and each subtitle track quickly identifies the language of every audio and subtitle track. One audio track and one subtitle track of the designated language can then be extracted easily and merged with the video data, so that a multi-stream video is converted into a single-stream video.
In one embodiment, when the video file with multiple audio and subtitle tracks is intended for mainland China, extracting the audio data and subtitle data of the designated language from the mixed audio and subtitle data comprises: extracting the Mandarin (Standard Chinese) audio data and the Simplified Chinese subtitle data from the mixed audio and subtitle data.
Some beneficial effects of the embodiments of the invention may include:
The designated language can be determined from the application scenario that requires a single audio track and a single subtitle track, or from the audience, so the choice of language is flexible. For example, when the viewers are mainly from mainland China, the Mandarin audio data and the Simplified Chinese subtitle data can be extracted and merged with the video data, which greatly improves the viewing experience.
A device for extracting audio and subtitles according to language, comprising:
A video decapsulation module, configured to decapsulate a video file that has multiple audio and subtitle tracks, obtain the video data together with the multiple audio tracks and multiple subtitle tracks stored mixed together, and output the data obtained;
A language information acquisition module, configured to obtain, according to the format of the video file, the language information of each audio track and each subtitle track from the information header of the video file, and to output it;
An audio and subtitle identification module, configured to extract, according to the language information output by the language information acquisition module, the audio data and subtitle data of the designated language from the mixed audio and subtitle data output by the video decapsulation module, and to output them;
A synthesis module, configured to receive the audio data and subtitle data of the designated language output by the audio and subtitle identification module, and to merge them with the video data output by the video decapsulation module.
In one embodiment, the video file with multiple audio and subtitle tracks is an MKV multimedia container file, and the language information acquisition module comprises:
An information header reading unit, configured to read the Track information header of the current MKV multimedia container file and to read, from the Track information header, the TrackEntry corresponding to each audio track and each subtitle track;
A language information acquiring unit, configured to read the Language field in each TrackEntry obtained by the information header reading unit, to obtain the language information of each audio track and each subtitle track from the Language field, and to output it.
Other features and advantages of the invention are set forth in the description that follows; in part they will be apparent from the description, or may be learned by practicing the invention. The objects and other advantages of the invention may be realized and attained by the structure particularly pointed out in the written description, the claims and the accompanying drawings.
The technical solution of the invention is described in further detail below with reference to the drawings and embodiments.
Brief description of the drawings
The accompanying drawings provide a further understanding of the invention and form part of the specification. Together with the embodiments they serve to explain the invention and do not limit it. In the drawings:
Fig. 1 is a flow chart of a method for extracting audio and subtitles according to language in an embodiment of the invention;
Fig. 2 is a schematic diagram of the MKV file format;
Fig. 3 is a flow chart of the method for obtaining the language information of each audio track and each subtitle track;
Fig. 4 is a structural diagram of a device for extracting audio and subtitles according to language in an embodiment of the invention;
Fig. 5 is a structural diagram of the language information acquisition module.
Detailed description
Preferred embodiments of the invention are described below with reference to the accompanying drawings. It should be understood that the preferred embodiments described here serve only to illustrate and explain the invention and are not intended to limit it.
Fig. 1 is a flow chart of a method for extracting audio and subtitles according to language in an embodiment of the invention. As shown in Fig. 1, the method comprises the following steps:
Step S101: Decapsulate the video file that has multiple audio and subtitle tracks to obtain the video data together with the multiple audio tracks and multiple subtitle tracks, which are stored mixed together;
Step S102: According to the format of the video file, obtain the language information of each audio track and each subtitle track from the information header of the video file;
Step S103: According to the language information of each audio track and each subtitle track, extract the audio data and subtitle data of the designated language from the mixed audio and subtitle data;
Step S104: Merge the audio data and subtitle data of the designated language with the video data.
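Steps S101 to S104 can be sketched on an in-memory model of the file. This is only an illustration under stated assumptions: the `Track` record and the `extract_by_language` function are hypothetical names that do not appear in the patent, the tracks are assumed to have been demultiplexed already, and "merging" is reduced to returning the selected tracks as one list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Track:
    """Hypothetical in-memory stand-in for one demultiplexed track."""
    number: int
    kind: str        # "video", "audio" or "subtitle"
    language: str    # language code taken from the container header
    payload: bytes = b""


def extract_by_language(tracks, language):
    """Return the video track(s) plus one audio and one subtitle track."""
    # S101: separate the video data from the mixed audio/subtitle tracks.
    video = [t for t in tracks if t.kind == "video"]
    mixed = [t for t in tracks if t.kind in ("audio", "subtitle")]

    # S102: collect the language information of every audio/subtitle track.
    languages = {t.number: t.language for t in mixed}

    # S103: pick the first audio and the first subtitle track in the
    # designated language (None if the file has no such track).
    def first(kind):
        return next(
            (t for t in mixed
             if t.kind == kind and languages[t.number] == language),
            None,
        )

    chosen = [t for t in (first("audio"), first("subtitle")) if t is not None]

    # S104: merge the chosen tracks with the video data.
    return video + chosen
```

For a file with one video track, English and Chinese audio, and English and Chinese subtitles, calling `extract_by_language(tracks, "chi")` keeps the video track and the two Chinese tracks only.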
In the technical solution provided by this embodiment, one audio track and one subtitle track of the designated language are extracted, based on the language information recorded for each audio track and each subtitle track in the information header of the video file, and merged with the video data, so that a multi-stream video is converted into a single-stream video. In addition, the audio and subtitles of the resulting single-stream video can be chosen to match the viewers' language, which greatly improves the viewing experience.
In one embodiment, the video file with multiple audio and subtitle tracks is an MKV multimedia container file.
MKV is a relatively new container format that can encapsulate content in many formats, and it is expected to become increasingly common. Fig. 2 is a schematic diagram of the MKV file format. An MKV file as a whole consists of an EBML Header and a Segment. The EBML Header contains information such as the version and the document type of the file. The Segment holds the actual audio and video data of the media file and comprises child elements such as Track and Clusters.
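The nesting described above can be made concrete by writing out an empty skeleton with the same layout. The sketch below is an assumption-laden illustration, not a valid playable file: the element IDs are those published in the Matroska and EBML specifications (which name the track container `Tracks` and the data container `Cluster`), while the helper names `encode_size`, `element` and `skeleton_mkv` are hypothetical, and mandatory elements such as segment information are omitted.

```python
# Element IDs as published in the EBML / Matroska specifications.
EBML_HEADER = 0x1A45DFA3
DOCTYPE = 0x4282
SEGMENT = 0x18538067
TRACKS = 0x1654AE6B
CLUSTER = 0x1F43B675


def encode_size(n):
    """Encode a payload length as an EBML variable-length integer."""
    for width in range(1, 9):
        # The all-ones value of each width is reserved for "unknown size".
        if n < (1 << (7 * width)) - 1:
            # The marker bit records how many bytes the integer occupies.
            return ((1 << (7 * width)) | n).to_bytes(width, "big")
    raise ValueError("payload too large for an 8-byte size field")


def element(elem_id, payload=b""):
    """Serialize one element as ID bytes, size field, then payload."""
    id_bytes = elem_id.to_bytes((elem_id.bit_length() + 7) // 8, "big")
    return id_bytes + encode_size(len(payload)) + payload


def skeleton_mkv():
    """EBML Header followed by a Segment holding empty Tracks and Cluster."""
    header = element(EBML_HEADER, element(DOCTYPE, b"matroska"))
    segment = element(SEGMENT, element(TRACKS) + element(CLUSTER))
    return header + segment
```

Every element follows the same ID, size, payload pattern, which is why a reader can skip elements it does not understand and jump straight to the track information.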
In one embodiment, obtaining the language information of each audio track and each subtitle track from the information header of the video file in step S102 comprises, as shown in Fig. 3, the following steps:
Step S301: Read the Track information header of the MKV multimedia container file. The Track information header of an MKV file contains the basic information about the audio and video, such as the audio/video codec type, the video resolution and the audio sampling rate. Parsing the Track part yields this basic information.
Step S302: From the Track information header, read the TrackEntry corresponding to each audio track and each subtitle track. Each TrackEntry holds the information for one track. Within a TrackEntry, TrackNumber gives the number of the track that the entry describes, and TrackType gives the type of the track, which can be audio, video, subtitle and so on.
Step S303: Read the Language field in each TrackEntry. The Language field in a TrackEntry gives the language of the corresponding track as a three-letter code taken from ISO 639-2.
Step S304: Obtain the language information of each audio track and each subtitle track from the Language field.
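Steps S302 to S304 can be sketched as a small parser over the payload of the track information element. The sketch assumes that step S301 has already located that payload in the file; the element IDs, the TrackType values and the default language `eng` follow the published Matroska specification, while the function names `read_vint`, `children` and `track_languages` are hypothetical. Elements of unknown size and malformed input are not handled.

```python
TRACK_ENTRY = 0xAE
TRACK_NUMBER, TRACK_TYPE, LANGUAGE = 0xD7, 0x83, 0x22B59C
TRACK_KINDS = {1: "video", 2: "audio", 17: "subtitle"}


def read_vint(buf, pos, keep_marker):
    """Read an EBML variable-length integer; return (value, next_pos)."""
    first = buf[pos]
    length, mask = 1, 0x80
    # The position of the first set bit gives the width in bytes.
    while length <= 8 and not (first & mask):
        mask >>= 1
        length += 1
    if length > 8:
        raise ValueError("invalid variable-length integer")
    # Element IDs keep the marker bit; size fields drop it.
    value = first if keep_marker else first & (mask - 1)
    for byte in buf[pos + 1:pos + length]:
        value = (value << 8) | byte
    return value, pos + length


def children(buf, start, end):
    """Yield (element_id, data_start, data_end) for elements in a range."""
    pos = start
    while pos < end:
        elem_id, pos = read_vint(buf, pos, keep_marker=True)
        size, pos = read_vint(buf, pos, keep_marker=False)
        yield elem_id, pos, pos + size
        pos += size


def track_languages(tracks_payload):
    """Return a list of (track_number, kind, language) tuples."""
    result = []
    # S302: visit every TrackEntry in the track information header.
    for elem_id, lo, hi in children(tracks_payload, 0, len(tracks_payload)):
        if elem_id != TRACK_ENTRY:
            continue
        # Matroska treats a missing Language element as "eng".
        number, kind, language = None, "other", "eng"
        for field_id, a, b in children(tracks_payload, lo, hi):
            data = tracks_payload[a:b]
            if field_id == TRACK_NUMBER:
                number = int.from_bytes(data, "big")
            elif field_id == TRACK_TYPE:
                kind = TRACK_KINDS.get(int.from_bytes(data, "big"), "other")
            elif field_id == LANGUAGE:
                # S303: the Language field holds the three-letter code.
                language = data.decode("ascii").rstrip("\x00")
        # S304: record the language information for this track.
        result.append((number, kind, language))
    return result
```

Because only the header is parsed, the language of every track is known before any audio or subtitle payload is read, which is the speed advantage the description claims for this approach.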
In the technical solution provided by this embodiment, because an MKV file has a specific format, locating its Track information header and reading the Language field in the TrackEntry of each audio track and each subtitle track quickly identifies the language of every audio and subtitle track. One audio track and one subtitle track of the designated language can then be extracted easily and merged with the video data, so that a multi-stream video is converted into a single-stream video.
In one embodiment, when the video file with multiple audio and subtitle tracks is intended for mainland China, extracting the audio data and subtitle data of the designated language from the mixed audio and subtitle data in step S103 can be implemented as: extracting the Mandarin (Standard Chinese) audio data and the Simplified Chinese subtitle data from the mixed audio and subtitle data.
In the technical solution provided by this embodiment, the designated language can be determined from the application scenario that requires a single audio track and a single subtitle track, or from the audience, so the choice of language is flexible. For example, when the viewers are mainly from mainland China, the Mandarin audio data and the Simplified Chinese subtitle data can be extracted and merged with the video data, which greatly improves the viewing experience.
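One practical wrinkle is that an ISO 639-2 code such as `chi` (or its alternative form `zho`) identifies Chinese but does not by itself separate Mandarin from other spoken varieties, or Simplified from Traditional script. The patent does not say how that distinction is made. The sketch below assumes, purely for illustration, that each track also carries a free-text name and uses keyword hints to prefer the mainland variants; the hint lists, the dictionary layout and the function names are all hypothetical.

```python
CHINESE_CODES = {"chi", "zho"}  # ISO 639-2 bibliographic / terminology codes

# Hypothetical keyword hints looked up in the free-text track name.
MANDARIN_HINTS = ("普通话", "国语", "mandarin")
SIMPLIFIED_HINTS = ("简体", "简中", "simplified", "chs")


def pick(tracks, kind, hints):
    """Pick one Chinese track of the given kind, preferring hinted names."""
    candidates = [
        t for t in tracks
        if t["kind"] == kind and t["language"] in CHINESE_CODES
    ]
    for track in candidates:
        name = track.get("name", "").lower()
        if any(hint in name for hint in hints):
            return track
    # Fall back to the first Chinese track when no name matches a hint.
    return candidates[0] if candidates else None


def select_mainland(tracks):
    """Return (audio_track, subtitle_track) for a mainland China audience."""
    return (
        pick(tracks, "audio", MANDARIN_HINTS),
        pick(tracks, "subtitle", SIMPLIFIED_HINTS),
    )
```

Falling back to the first Chinese track keeps the output single-stream even when the names are missing or unhelpful, at the cost of possibly picking the wrong variant.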
Corresponding to the method for extracting audio and subtitles according to language provided by the embodiments above, an embodiment of the invention also provides a device for extracting audio and subtitles according to language which, as shown in Fig. 4, comprises:
A video decapsulation module 41, configured to decapsulate a video file that has multiple audio and subtitle tracks, obtain the video data together with the multiple audio tracks and multiple subtitle tracks stored mixed together, and output the data obtained;
A language information acquisition module 42, configured to obtain, according to the format of the video file, the language information of each audio track and each subtitle track from the information header of the video file, and to output it;
An audio and subtitle identification module 43, configured to extract, according to the language information output by the language information acquisition module 42, the audio data and subtitle data of the designated language from the mixed audio and subtitle data output by the video decapsulation module 41, and to output them;
A synthesis module 44, configured to receive the audio data and subtitle data of the designated language output by the audio and subtitle identification module 43, and to merge them with the video data output by the video decapsulation module 41.
In one embodiment, the video file with multiple audio and subtitle tracks is an MKV multimedia container file. In this case, as shown in Fig. 5, the language information acquisition module 42 comprises:
An information header reading unit 51, configured to read the Track information header of the current MKV multimedia container file and to read, from the Track information header, the TrackEntry corresponding to each audio track and each subtitle track;
A language information acquiring unit 52, configured to read the Language field in each TrackEntry obtained by the information header reading unit 51, to obtain the language information of each audio track and each subtitle track from the Language field, and to output it.
The device for extracting audio and subtitles according to language provided by this embodiment extracts one audio track and one subtitle track of the designated language, based on the language information recorded for each audio track and each subtitle track in the information header of the video file, and merges them with the video data, so that a multi-stream video is converted into a single-stream video. In addition, the audio and subtitles of the resulting single-stream video can be chosen to match the viewers' language, which greatly improves the viewing experience.
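For comparison, the four modules map roughly onto a single invocation of the FFmpeg command-line tool, which demultiplexes the input, selects streams by their language metadata and remultiplexes the result. This is not the implementation described in the patent, only an analogous sketch: the function name `build_ffmpeg_command` is hypothetical, the command is built but not executed, and the metadata selector keeps every audio or subtitle stream tagged with the language, so a file with two audio tracks in the same language would still need a further selection step.

```python
def build_ffmpeg_command(src, dst, language):
    """Build an FFmpeg argument list that keeps streams of one language."""
    return [
        "ffmpeg",
        "-i", src,
        # Keep the first video stream of input 0.
        "-map", "0:v:0",
        # Keep audio streams whose "language" metadata matches.
        "-map", f"0:a:m:language:{language}",
        # Same for subtitles; the trailing "?" makes this mapping optional,
        # so a file without matching subtitles does not cause an error.
        "-map", f"0:s:m:language:{language}?",
        # Copy the selected streams without re-encoding.
        "-c", "copy",
        dst,
    ]
```

The list can be passed to `subprocess.run` on a machine where FFmpeg is installed; stream copying keeps the operation fast because nothing is decoded.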
Those skilled in the art will understand that embodiments of the invention can be provided as a method, a system or a computer program product. The invention can therefore take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the invention can take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage and optical storage) that contain computer-usable program code.
The invention is described with reference to flow charts and/or block diagrams of methods, devices (systems) and computer program products according to embodiments of the invention. It should be understood that each flow and/or block in the flow charts and/or block diagrams, and combinations of flows and/or blocks in them, can be implemented by computer program instructions. These computer program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor or another programmable data processing device to produce a machine, so that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flow charts and/or one or more blocks of the block diagrams.
These computer program instructions can also be stored in a computer-readable memory that can direct a computer or another programmable data processing device to operate in a particular manner, so that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement the functions specified in one or more flows of the flow charts and/or one or more blocks of the block diagrams.
These computer program instructions can also be loaded onto a computer or another programmable data processing device, so that a series of operational steps is performed on the computer or other programmable device to produce a computer-implemented process. The instructions executed on the computer or other programmable device thus provide steps for implementing the functions specified in one or more flows of the flow charts and/or one or more blocks of the block diagrams.
Obviously, those skilled in the art can make various changes and modifications to the invention without departing from its spirit and scope. If these changes and modifications fall within the scope of the claims of the invention and their technical equivalents, the invention is intended to cover them as well.

Claims (6)

1. A method for extracting audio and subtitles according to language, characterized by comprising:
Decapsulating a video file that has multiple audio and subtitle tracks to obtain the video data together with the multiple audio tracks and multiple subtitle tracks, which are stored mixed together;
According to the format of the video file with multiple audio and subtitle tracks, obtaining the language information of each audio track and each subtitle track from the information header of the video file;
According to the language information of each audio track and each subtitle track, extracting the audio data and subtitle data of the designated language from the mixed audio and subtitle data;
Merging the audio data and subtitle data of the designated language with the video data.
2. The method for extracting audio and subtitles according to language as claimed in claim 1, characterized in that the video file with multiple audio and subtitle tracks is an MKV multimedia container file.
3. The method for extracting audio and subtitles according to language as claimed in claim 2, characterized in that obtaining the language information of each audio track and each subtitle track from the information header of the video file comprises:
Reading the Track information header of the MKV multimedia container file;
Reading, from the Track information header, the TrackEntry corresponding to each audio track and each subtitle track;
Reading the Language field in each TrackEntry;
Obtaining the language information of each audio track and each subtitle track from the Language field.
4. The method for extracting audio and subtitles according to language as claimed in any one of claims 1-3, characterized in that, when the video file with multiple audio and subtitle tracks is intended for mainland China, extracting the audio data and subtitle data of the designated language from the mixed audio and subtitle data comprises: extracting the Mandarin (Standard Chinese) audio data and the Simplified Chinese subtitle data from the mixed audio and subtitle data.
5. A device for extracting audio and subtitles according to language, characterized by comprising:
A video decapsulation module, configured to decapsulate a video file that has multiple audio and subtitle tracks, obtain the video data together with the multiple audio tracks and multiple subtitle tracks stored mixed together, and output the data obtained;
A language information acquisition module, configured to obtain, according to the format of the video file with multiple audio and subtitle tracks, the language information of each audio track and each subtitle track from the information header of the video file, and to output it;
An audio and subtitle identification module, configured to extract, according to the language information output by the language information acquisition module, the audio data and subtitle data of the designated language from the mixed audio and subtitle data output by the video decapsulation module, and to output them;
A synthesis module, configured to receive the audio data and subtitle data of the designated language output by the audio and subtitle identification module, and to merge them with the video data output by the video decapsulation module.
6. The device for extracting audio and subtitles according to language as claimed in claim 5, characterized in that the video file with multiple audio and subtitle tracks is an MKV multimedia container file, and the language information acquisition module comprises:
An information header reading unit, configured to read the Track information header of the current MKV multimedia container file and to read, from the Track information header, the TrackEntry corresponding to each audio track and each subtitle track;
A language information acquiring unit, configured to read the Language field in each TrackEntry obtained by the information header reading unit, to obtain the language information of each audio track and each subtitle track from the Language field, and to output it.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510155980.7A CN104768052A (en) 2015-04-02 2015-04-02 Method and device for extracting voice frequency and subtitles according to language

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510155980.7A CN104768052A (en) 2015-04-02 2015-04-02 Method and device for extracting voice frequency and subtitles according to language

Publications (1)

Publication Number Publication Date
CN104768052A true CN104768052A (en) 2015-07-08

Family

ID=53649600

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510155980.7A Pending CN104768052A (en) 2015-04-02 2015-04-02 Method and device for extracting voice frequency and subtitles according to language

Country Status (1)

Country Link
CN (1) CN104768052A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105744287A (en) * 2016-02-01 2016-07-06 杭州当虹科技有限公司 DVD-VIDEO video positive film content extraction device
CN105872727A (en) * 2016-03-31 2016-08-17 乐视控股(北京)有限公司 Video stream transcoding method and device
CN107959884A (en) * 2017-12-07 2018-04-24 上海网达软件股份有限公司 A kind of trans-coding treatment method of monophonic Multi-audio-frequency files in stream media
CN109803173A (en) * 2017-11-16 2019-05-24 腾讯科技(深圳)有限公司 A kind of video transcoding method, device and storage equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1937609A (en) * 2006-08-29 2007-03-28 华为技术有限公司 Method and system for supporting multi-audio-track content by flow media platform and flow media server
CN103093776A (en) * 2011-11-04 2013-05-08 腾讯科技(深圳)有限公司 Method and system of multi-audio-track content play in network seeing and hearing
WO2014181969A1 (en) * 2013-05-07 2014-11-13 Seok Cheol Recording medium recorded with multi-track media file, method for editing multi-track media file, and apparatus for editing multi-track media file

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105744287A (en) * 2016-02-01 2016-07-06 杭州当虹科技有限公司 DVD-VIDEO video positive film content extraction device
CN105744287B (en) * 2016-02-01 2019-03-01 杭州当虹科技有限公司 A kind of DVD-VIDEO video feature film content extraction element
CN105872727A (en) * 2016-03-31 2016-08-17 乐视控股(北京)有限公司 Video stream transcoding method and device
WO2017166583A1 (en) * 2016-03-31 2017-10-05 乐视控股(北京)有限公司 Video stream transcoding method and apparatus, and electronic device
CN109803173A (en) * 2017-11-16 2019-05-24 腾讯科技(深圳)有限公司 A kind of video transcoding method, device and storage equipment
CN107959884A (en) * 2017-12-07 2018-04-24 上海网达软件股份有限公司 A kind of trans-coding treatment method of monophonic Multi-audio-frequency files in stream media
CN107959884B (en) * 2017-12-07 2020-10-16 上海网达软件股份有限公司 Transcoding processing method of single track multi-audio streaming media file

Similar Documents

Publication Publication Date Title
KR102130429B1 (en) Method and device for decoding multimedia file
US10129587B2 (en) Fast switching of synchronized media using time-stamp management
US9870799B1 (en) System and method for processing ancillary data associated with a video stream
CN105376612A (en) Video playing method, media equipment, playing equipment and multimedia system
US10529383B2 (en) Methods and systems for processing synchronous data tracks in a media editing system
CN101796828A (en) Method and apparatus for reproducing multi-stream
CN103093776A (en) Method and system of multi-audio-track content play in network seeing and hearing
CN102150424B (en) Method for file formation according to freeview AV service
KR20160135301A (en) Audiovisual content item data streams
CN104768052A (en) Method and device for extracting voice frequency and subtitles according to language
CN105704579A (en) Real-time automatic caption translation method during media playing and system
CN104410902A (en) Playing method and terminal for live program, as well as generation method and equipment for index document
CN104918097A (en) Subtitle generation method and device
KR20090009847A (en) Method and apparatus for re-constructing media from a media representation
JP7218772B2 (en) Receiving device and receiving method
CN106060628A (en) DirectShow-based method and system supporting variable coding
US20160322080A1 (en) Unified Processing of Multi-Format Timed Data
US9911460B2 (en) Fast and smart video trimming at frame accuracy on generic platform
CN104796759A (en) Method and device for extracting one-channel audio frequency from multiple-channel audio frequency
CN105898320A (en) Panorama video decoding method and device and terminal equipment based on Android platform
CN102811383A (en) Video file playing method and device based on set top box
JP2016072858A (en) Media data generation method, media data reproduction method, media data generation device, media data reproduction device, computer readable recording medium and program
CN101803378B (en) Method and apparatus for generating and accessing metadata in media file format
JP2006020102A (en) Broadcast recording/reproducing device and broadcast recording/reproducing processing program
CN104575542A (en) Method and device for realizing audio regional play

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by SIPO to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150708