WO2008061416A1 - Method and system for supporting media data of various coding formats - Google Patents

Method and system for supporting media data of various coding formats

Info

Publication number
WO2008061416A1
WO2008061416A1 PCT/CN2007/002148 CN2007002148W WO2008061416A1 WO 2008061416 A1 WO2008061416 A1 WO 2008061416A1 CN 2007002148 W CN2007002148 W CN 2007002148W WO 2008061416 A1 WO2008061416 A1 WO 2008061416A1
Authority
WO
WIPO (PCT)
Prior art keywords
media
frame
video
file
information
Prior art date
Application number
PCT/CN2007/002148
Other languages
English (en)
French (fr)
Inventor
Bailiang Yang
Zhihao Shang
Hui Huang
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd.
Priority to EP07785125A (published as EP2086240A4)
Publication of WO2008061416A1
Priority to US12/343,099 (published as US20090106288A1)

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/63 Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/637 Control signals issued by the client directed to the server or network components
    • H04N21/6377 Control signals issued by the client directed to the server or network components directed to server
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234 Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343 Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234309 Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4 or from Quicktime to Realvideo
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/63 Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643 Communication protocols
    • H04N21/6437 Real-time Transport Protocol [RTP]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/65 Transmission of management data between client and server
    • H04N21/658 Transmission by the client directed to the server
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/65 Transmission of management data between client and server
    • H04N21/658 Transmission by the client directed to the server
    • H04N21/6587 Control parameters, e.g. trick play commands, viewpoint selection
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83 Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845 Structuring of content, e.g. decomposing content into time segments
    • H04N21/8455 Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/16 Analogue secrecy systems; Analogue subscription systems
    • H04N7/173 Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309 Transmission or handling of upstream communications
    • H04N7/17318 Direct or substantially direct transmission and handling of requests

Definitions

  • The present invention relates to the field of streaming media technologies and, in particular, to a method and system for supporting media data of multiple encoding formats.
  • Background
  • Streaming media technology compresses the multimedia data (images, sound, etc.) of a program and stores it on a network server.
  • By accessing the network server, the user can watch and listen to the program while it is downloading, rather than only after the entire multimedia file of the program has been downloaded.
  • Streaming media technology delivers high-quality audio and video in real time in low-bandwidth environments.
  • Streaming media services have a wide range of applications, and many application services require streaming media technology support.
  • Streaming media has become a mainstream service of third-generation mobile communication technology and broadband networks.
  • Streaming media data is the original data compressed by a data encoding technology, which reduces the amount of data transmitted and the load on the transmission network without affecting the perceived visual quality.
  • Many data encoding technologies exist, including the ISO/IEC MPEG-2/MPEG-4 standards, the ITU-T H.263/H.264 standards, and China's AVS standard.
  • In the prior art, different streaming media servers are used to carry streaming media data encoded according to different coding standards.
  • As shown in Figure 1, an MPEG-2/MPEG-4 streaming server, an H.263/H.264 streaming server, and an AVS streaming server respectively carry streaming media data encoded according to the MPEG-2/MPEG-4 standards, the ITU-T H.263/H.264 standards, and the AVS standard.
  • The streaming server that stores the media data of the program requested by the client first encapsulates the media file with the Real-time Transport Protocol according to the client's request, and then sends the encapsulated media file to the client.
  • Streaming media data encoded according to different coding standards cannot be shared among the streaming media servers, and each kind of streaming media server can play only the encoding formats it supports. Even when there are few client users, multiple servers are still needed to serve them, which increases the cost of the system. If the system needs to support a new encoding format, a new server must be added, which further increases the cost and also makes system integration more difficult. Moreover, because each kind of streaming media server carries streaming media data of a different encoding format, at a given moment one kind of server may be lightly loaded while another is overloaded, so load balancing across the streaming media servers cannot be achieved.
  • Summary of the invention
  • The present invention provides a method, a communication system, and a streaming media server that support media data of multiple encoding formats.
  • A method of supporting media data of multiple encoding formats comprises the following:
  • the streaming media server converts each received media file of a different encoding format into a media file of a specific file format, where the media file of the specific file format includes media data information and index information;
  • the streaming media server determines the corresponding media file according to an operation command from the client, and then sends the corresponding media data information in that media file to the client according to the index information in that media file.
  • A streaming media server comprises:
  • a receiving unit, configured to receive media files of different encoding formats and operation commands sent by the client;
  • a converting unit, configured to convert each received media file of a different encoding format into a media file of a specific file format pre-packaged by the Real-time Transport Protocol, where the media file of the specific file format includes media data information and index information;
  • a storage unit, configured to store the media file of the specific file format;
  • a processing unit, configured to determine the corresponding media file according to an operation command sent by the client, determine the corresponding video key frame according to the index information in that media file, set the starting position of that video key frame in the media file, and read media data information from that starting position;
  • a sending unit, configured to return the corresponding media data information to the client.
  • A communication system comprises: a client, configured to send an operation command to the streaming media server and to receive the media data information returned by the streaming media server;
  • and a streaming media server, configured to convert each received media file of a different encoding format into a media file of a specific file format, where the media file of the specific file format includes media data information and index information; to determine the corresponding media file according to an operation command sent by the client; and to determine and return the corresponding media data information in that media file according to the index information in that media file.
  • The invention converts the received media files of different encoding formats into media files of a specific file format pre-packaged by the Real-time Transport Protocol, so that the same streaming media server can provide system services for media files of multiple encoding formats.
  • This reduces the cost of the system to a certain extent, lowers its integration difficulty, and further achieves load balancing of the system. In addition, because the media files are pre-packaged by the Real-time Transport Protocol, the time the streaming server spends processing information is shortened to some extent, which improves the user experience.
  • FIG. 1 is an architecture diagram of an existing streaming media service system.
  • FIG. 2A is an architecture diagram of a streaming media service system according to an embodiment of the present invention.
  • FIG. 2B is a functional structure diagram of a streaming media server according to an embodiment of the present invention.
  • FIG. 3 is a flowchart of a streaming media server converting media files of different encoding formats into media files of a specific file format according to an embodiment of the present invention.
  • FIG. 4 is a structural diagram of a PES packet in an embodiment of the present invention.
  • Detailed description
  • In the prior art, a streaming media server can provide system services for media files of only one format, which tends to increase system cost and unbalance the system load.
  • In the embodiments of the present invention, after receiving media files of different encoding formats (for example, media files compressed according to standards such as MPEG-2, MPEG-4, H.263, H.264, and AVS), the streaming media server converts each of them into a media file of a specific file format.
  • The conversion process is as follows:
  • The streaming media server first parses the media file to determine its encoding format, obtains the corresponding video frames, audio frames, and index parameters according to that encoding format, and pre-packages the video frames and audio frames of the media file with the Real-time Transport Protocol (RTP) to generate the corresponding media data information.
  • The video frames include video key frames and video prediction frames.
  • The streaming media server also generates index information for locating the video key frames according to the index parameters of the media file; in the media data information, video prediction frames and audio frames are placed between two adjacent video key frames.
  • In this way, the streaming media server converts the media files of different encoding formats into media files of a specific file format pre-packaged by RTP, each of which includes media data information and index information. Depending on the design of the file system, the media data information and index information can be stored together in one file or separately in two different files.
  • A system for providing a streaming media service includes a streaming media server 20 and a client 21.
  • The streaming media server 20 is configured to convert the received media files of different encoding formats into media files of a specific file format pre-packaged by RTP, and to return the corresponding media data information according to the operation command sent by the client and the corresponding index information.
  • The client 21 is configured to send operation commands to the streaming media server 20 and to receive the corresponding media data information returned by the streaming media server 20.
  • The streaming media server 20 includes a receiving unit 201, a converting unit 202, a storage unit 203, a processing unit 204, and a sending unit 205.
  • The receiving unit 201 is configured to receive media files of different encoding formats and the operation commands sent by the client 21.
  • The converting unit 202 is configured to parse a media file to determine its encoding format; obtain the corresponding video frames, audio frames, and index parameters according to that encoding format; perform RTP pre-encapsulation on the video frames and audio frames to generate the corresponding media data information; and generate index information for locating the video key frames according to the index parameters of the media file.
  • The storage unit 203 is configured to store the media file of the specific file format.
  • The processing unit 204 is configured to determine, according to the operation command sent by the client 21 and the corresponding index information, the starting position of the corresponding video key frame in the media file, and to read the corresponding media data information from that starting position.
  • The sending unit 205 is configured to return the corresponding media data information to the client 21.
  • The media data information includes all of the stream data, arranged sequentially in the form of data frames.
  • To simplify the sending of media data packets and to support audio-video synchronization, the streaming server 20 places all the original data collected from the same media file into the same media data packet. Referring to Table 1, the streaming server 20 obtains the video frames and audio frames of a media file according to its encoding format. The video frames are further divided into video key frames (I frames) and video prediction frames (P frames or B frames). An I frame contains the complete video data for one picture, and a P frame or B frame is used to adjust the corresponding I frame to obtain a new picture.
  • I frame: video key frame
  • P frame: video prediction frame
  • For example, if the video data saved in the first I frame corresponds to picture A, and the subsequent pictures B and C change little relative to picture A, then the video frames corresponding to pictures B and C (e.g., P frames or B frames) need not save the complete video data again but only the corresponding prediction information.
  • When such a P frame or B frame is played, the video data saved in the first I frame is adjusted accordingly, and pictures B and C are generated.
  • Within the same media data packet, the I frames, P frames, B frames, and audio frames are arranged in sequence.
  • Unlike the prior art, the streaming media server 20 performs RTP pre-encapsulation on the I frames, P frames, B frames, and audio frames; that is, it encapsulates the different data frames into separate RTP packets as it obtains them.
  • Each I frame, P frame, B frame, and audio frame is split into one or more RTP packets, where VI1R1 denotes the first RTP packet of the first I frame, VI1R2 the second RTP packet of the first I frame, A1R1 the first RTP packet of the first audio frame, A1R2 the second RTP packet of the first audio frame, VP1R1 the first RTP packet of the first P frame, VB1R1 the first RTP packet of the first B frame, VI2R1 the first RTP packet of the second I frame, and so on.
  • Each RTP packet includes three data parts: Real Time Streaming Protocol header information (rtsp header), Real-time Transport Protocol header information (rtp header), and media data (sample).
  • The rtsp header contains 4 bytes: the first byte identifies the data as RTP data, the second byte is the port number, and the third and fourth bytes give the length of the RTP packet. The rtp header is the header information of the RTP packet, and sample is the sampled video data or audio data.
  • Because the media data is pre-packaged when the media file is received, the streaming media server 20 can send the corresponding RTP packets directly to the user after receiving the user's operation command. In the prior art, by contrast, the media data to be sent is RTP-encapsulated only after the operation command is received and is then sent to the user, so the same streaming media information is encapsulated again every time the same operation command is received. Pre-encapsulation reduces the amount of processing on the streaming media server 20 to a certain extent and shortens the time it spends processing information when providing the streaming media service, thereby improving the user experience.
  • The index information includes description information for each I frame: the starting position of the current I frame, the data size of the current I frame, the data size from the current I frame to the next I frame, the sampling time, and the time identifier.
  • The streaming media server 20 saves these index parameters in the corresponding fields of an index table. Through the index table, the streaming media server 20 can quickly locate each I frame and thus play, seek, fast forward, and rewind the media data information.
  • In the index table (Table 4), file-offset indicates the starting position of the current I frame in the media data information;
  • size indicates the data size from the current I frame to the next I frame;
  • iframesize indicates the data size of the current I frame;
  • time indicates the absolute time in the media data information and is used for time control; tS is the time identifier.
  • Referring to Table 1, all P frames and B frames between the first I frame and the second I frame are used to adjust the first I frame, while all audio frames between the first I frame and the second I frame are matched to the corresponding video frames (I frames, P frames, and B frames) by their own time identifiers. When the time identifier carried in an audio frame is the same as that in a video frame, the audio frame and the video frame should be played simultaneously.
  • The following takes as examples a media file with the encoding format MPEG-4 and the suffix ".mp4", and a media file with the encoding format MPEG-2 and the suffix ".ts".
  • Referring to FIG. 3, the detailed flow in which the streaming server 20 converts media files encoded in MPEG-4 and MPEG-2 into media files of a specific file format is as follows:
  • Step 300: After receiving the media files suffixed with ".mp4" and ".ts", the streaming media server 20 determines from the header information in the source files that their encoding formats are "MPEG-4" and "MPEG-2", respectively.
  • The media file suffixed with ".mp4" consists of several atoms, each consisting of three parts: length (size), type (type), and contained data (data).
  • The media file suffixed with ".mp4" also includes a sample table atom. As shown in Table 6, the sample table atom includes a description of each sample (sample description atom), the time information for each sample (time-to-sample atom), the sequence number of each sample containing video key frame data (sync sample atom), and other parameters used for indexing.
  • The media file suffixed with ".ts" consists of several TS packets, which are formed by splitting several PES packets. As shown in Figure 4, the first TS packet of each PES packet is composed of three parts: a TS header, a PES header, and DATA. The other TS packets are composed of a TS header and DATA. The TS header identifies information such as the transmission priority, the PES header contains index parameters such as time identifiers, and DATA stores the corresponding media data.
  • Step 310: The streaming media server 20 extracts the MPEG-4 encoded media data from the data part of each atom of the ".mp4" media file, and extracts the MPEG-2 encoded DATA parts of the ".ts" media file, to form media data packets as shown in Table 1. Each media data packet includes the video frames and audio frames of the media file.
  • Step 320: The streaming media server 20 performs RTP pre-encapsulation on the video frames and audio frames. As shown in Table 2, the streaming media server 20 splits each video frame and audio frame into one or more RTP packets.
  • Step 330: The streaming media server 20 generates index information as shown in Table 4 from the index parameters in the sample table atom of the MPEG-4 ".mp4" media file, and from the index parameters in the PES headers of the MPEG-2 ".ts" media file. The index information is used to quickly locate the I frames among the video frames.
  • Once the streaming media server 20 has converted the media files of the different encoding formats into media files of the specific file format, it reads the index information of the corresponding I frame from the index table, locates the starting position of that I frame in the media data information, reads valid media data from that starting position, and sends the corresponding video key frames, video prediction frames, and audio frames to the client 21.
  • The streaming server 20 calculates the absolute time C of the I frame corresponding to scenario B, finds the record in the index table whose "time" field is equal to C, and thereby obtains the starting position D of that I frame in movie A. The streaming server 20 then reads, from position D, that I frame and the subsequent I frames, P frames, B frames, and corresponding audio frames, and sends the corresponding RTP packets to the client 21 through the network.
  • Through the client 21, the user can also fast forward or fast reverse movie A with a fast forward command or a fast reverse command. Fast forward and fast reverse operations are classified by speed into 1X, 2X, and 4X.
  • After acquiring the start position D of the I frame corresponding to scenario B, the streaming server 20 reads the I frame at position D and then reads all I frames continuously backward or forward, without reading P frames, B frames, or audio frames.
  • When the user performs a 2X or 4X fast forward or fast reverse operation on movie A from scenario B, the streaming server 20, after acquiring the start position D of the I frame corresponding to scenario B, reads the I frame at position D and then reads only every I frame that follows a gap of one or more I frames, backward or forward. During a fast forward or fast reverse operation, the streaming server 20 determines the size of each I frame to be read from the "iframesize" field in the index table.
  • The same streaming media server 20 can process media files of different encoding formats, so that in the streaming media service system a small number of streaming media servers 20 can provide content-rich system services to users. This reduces the cost of the system to a certain extent, lowers its integration difficulty, and further achieves load balancing of the system.
  • The present invention is intended to cover modifications and variations that do not depart from its spirit and scope.
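The fast forward and fast reverse behaviour described in the bullets above can be sketched as a selection over the index table. This is only an illustration: the function name `trick_play_offsets`, the tuple layout, and the rule that an NX speed reads every Nth I frame are assumptions, since the text says only that 2X and 4X skip "one or more" I frames between reads.

```python
def trick_play_offsets(index, start, speed=1, reverse=False):
    """Pick the I frames to read for fast forward or fast reverse.

    index   -- list of (file_offset, iframesize) tuples, one per I frame, in file order
    start   -- position in `index` of the I frame found at start position D
    speed   -- 1, 2 or 4; assumed here to mean "read every Nth I frame"
    reverse -- True for fast reverse, False for fast forward
    """
    step = max(1, speed)
    if reverse:
        # Walk towards the beginning of the file, starting at D.
        return index[start::-step]
    # Walk towards the end of the file, starting at D.
    return index[start::step]


if __name__ == "__main__":
    table = [(0, 10), (100, 12), (220, 9), (330, 11), (450, 10)]
    # Each tuple tells the server where to seek and how many bytes to read,
    # so P frames, B frames and audio frames are never touched.
    for offset, nbytes in trick_play_offsets(table, start=0, speed=2):
        print(f"read {nbytes} bytes at offset {offset}")
```

Only the `file-offset` and `iframesize` fields are needed here, which matches the statement that the server sizes each read from the "iframesize" field.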

Description

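Method and System for Supporting Media Data of Multiple Encoding Formats
Technical Field
The present invention relates to the field of streaming media technology, and in particular to a method and system for supporting media data of multiple encoding formats.
Background
With the development of third-generation mobile communication technology and broadband networks, network communication capability has steadily improved, bringing a sharp increase in the number and variety of services and greatly enriching users' lives. Streaming media technology compresses a program's multimedia data (video, sound, and so on) and stores it on a network server; by accessing the network server, a user can watch and listen to the program while it is downloading, rather than only after the program's entire multimedia file has been downloaded. Streaming media technology delivers high-quality audio and video in real time in low-bandwidth environments. Streaming media services have a wide range of applications, many application services depend on streaming media technology, and streaming media has become a mainstream service of third-generation mobile communication technology and broadband networks.
Streaming media data is the original data compressed by a data encoding technology, which reduces the amount of data transmitted and the load on the transmission network without affecting the perceived visual quality. Many data encoding technologies exist today, including the ISO/IEC MPEG-2/MPEG-4 standards, the ITU-T H.263/H.264 standards, and China's AVS standard.
In the prior art, to support multiple coding standards in the same streaming media system, different streaming media servers carry streaming media data encoded according to different coding standards. As shown in Figure 1, an MPEG-2/MPEG-4 streaming media server, an H.263/H.264 streaming media server, and an AVS streaming media server respectively carry streaming media data encoded according to the MPEG-2/MPEG-4 standards, the ITU-T H.263/H.264 standards, and the AVS standard. After a client sends a media playback request, the streaming media server that stores the media data of the requested program first encapsulates the media file with the Real-time Transport Protocol according to the client's request, and then sends the encapsulated media file to the client.
With the prior-art approach, streaming media data encoded according to different coding standards cannot be shared among the streaming media servers, and each kind of streaming media server can play only the encoding formats it supports. Even when there are few client users, multiple servers are still needed to serve them, which increases the cost of the system. If the system needs to support a new encoding format, a new server must be added, which further increases the cost and also makes system integration more difficult. Moreover, because each kind of streaming media server carries streaming media data of a different encoding format, at a given moment one kind of server may be lightly loaded while another is already overloaded, so load balancing across the streaming media servers cannot be achieved.
Summary of the Invention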
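The present invention provides a method, a communication system, and a streaming media server that support media data of multiple encoding formats.
A method for supporting media data of multiple encoding formats comprises:
a streaming media server converting each received media file of a different encoding format into a media file of a specific file format, where the media file of the specific file format includes media data information and index information; and
the streaming media server determining the corresponding media file according to an operation command from a client, and then, according to the index information in that media file, sending the corresponding media data information in that media file to the client.
A streaming media server comprises:
a receiving unit, configured to receive media files of different encoding formats and operation commands sent by a client;
a converting unit, configured to convert each received media file of a different encoding format into a media file of a specific file format pre-packaged by the Real-time Transport Protocol, where the media file of the specific file format includes media data information and index information;
a storage unit, configured to store the media file of the specific file format;
a processing unit, configured to determine the corresponding media file according to an operation command sent by the client, determine the corresponding video key frame according to the index information in that media file, set the starting position of that video key frame in the media file, and read media data information from that starting position; and
a sending unit, configured to return the corresponding media data information to the client.
A communication system comprises:
a client, configured to send operation commands to a streaming media server and to receive the media data information returned by the streaming media server; and
a streaming media server, configured to convert each received media file of a different encoding format into a media file of a specific file format, where the media file of the specific file format includes media data information and index information, to determine the corresponding media file according to an operation command sent by the client, and to determine and return the corresponding media data information in that media file according to the index information in that media file.
By converting the received media files of different encoding formats into media files of a specific file format pre-packaged by the Real-time Transport Protocol, the present invention allows the same streaming media server to provide system services for media files of multiple encoding formats. This reduces the cost of the system to a certain extent, lowers its integration difficulty, and further achieves load balancing of the system. In addition, because the media files are pre-packaged by the Real-time Transport Protocol, the time the streaming media server spends processing information is shortened to some extent, which improves the user experience.
Brief Description of the Drawings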
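FIG. 1 is an architecture diagram of an existing streaming media service system;
FIG. 2A is an architecture diagram of a streaming media service system in an embodiment of the present invention;
FIG. 2B is a functional structure diagram of a streaming media server in an embodiment of the present invention;
FIG. 3 is a flowchart of a streaming media server converting media files of different encoding formats into media files of a specific file format in an embodiment of the present invention;
FIG. 4 is a structural diagram of a PES packet in an embodiment of the present invention.
Detailed Description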
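In the prior art, a streaming media server can provide system services for media files of only one format, which tends to increase system cost and unbalance the system load. To solve this problem, in the embodiments of the present invention the streaming media server, after receiving media files of different encoding formats (for example, media files compressed according to standards such as MPEG-2, MPEG-4, H.263, H.264, and AVS), converts each of them into a media file of a specific file format. The conversion proceeds as follows. The streaming media server first parses the media file to determine its encoding format, obtains the corresponding video frames, audio frames, and index parameters according to that encoding format, and performs Real-time Transport Protocol (RTP) pre-encapsulation on the video frames and audio frames of the media file to generate the corresponding media data information; the video frames include video key frames and video prediction frames. The streaming media server also generates, from the index parameters of the media file, index information for locating the video key frames. In the media data information, video prediction frames and audio frames are placed between two adjacent video key frames.
In this way, the streaming media server converts the media files of different encoding formats into media files of a specific file format pre-packaged by RTP, each of which includes media data information and index information. Depending on the design of the file system, the media data information and index information can be stored together in one file or separately in two different files.
Preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings.
Referring to FIG. 2A, in this embodiment the system that provides the streaming media service includes a streaming media server 20 and a client 21. The streaming media server 20 is configured to convert the received media files of different encoding formats into media files of a specific file format pre-packaged by RTP, and to return the corresponding media data information according to the operation command sent by the client and the corresponding index information. The client 21 is configured to send operation commands to the streaming media server 20 and to receive the corresponding media data information returned by the streaming media server 20.
Referring to FIG. 2B, the streaming media server 20 includes a receiving unit 201, a converting unit 202, a storage unit 203, a processing unit 204, and a sending unit 205. The receiving unit 201 is configured to receive media files of different encoding formats and the operation commands sent by the client 21. The converting unit 202 is configured to parse a media file to determine its encoding format, obtain the corresponding video frames, audio frames, and index parameters according to that encoding format, perform RTP pre-encapsulation on the video frames and audio frames to generate the corresponding media data information, and generate index information for locating the video key frames according to the index parameters of the media file. The storage unit 203 is configured to store the media file of the specific file format. The processing unit 204 is configured to determine, according to the operation command sent by the client 21 and the corresponding index information, the starting position of the corresponding video key frame in the media file, and to read the corresponding media data information from that starting position. The sending unit 205 is configured to return the corresponding media data information to the client 21.
In this embodiment, the media data information contains all of the stream data, arranged sequentially in the form of data frames. To simplify the sending of media data packets and to support audio-video synchronization, the streaming media server 20 places all the original data collected from the same media file into the same media data packet. Referring to Table 1, the streaming media server 20 obtains the video frames and audio frames of a media file according to its encoding format. The video frames are further divided into video key frames (I frames) and video prediction frames (P frames or B frames). An I frame holds the complete video data for one picture, while a P frame or B frame is used to adjust the corresponding I frame to obtain a new picture. For example, if the video data saved in the first I frame corresponds to picture A, and the subsequent pictures B and C change little relative to picture A, then the video frames corresponding to pictures B and C (such as P frames or B frames) need not save the complete video data again but only the corresponding prediction information. When such a P frame or B frame is played, the video data saved in the first I frame is adjusted accordingly to generate pictures B and C. As shown in Table 1, within the same media data packet the I frames, P frames, B frames, and audio frames are arranged in sequence.
Unlike the prior art, in this embodiment the streaming media server 20 performs RTP pre-encapsulation on the I frames, P frames, B frames, and audio frames; that is, it encapsulates the various data frames into separate RTP packets as it obtains them. Referring to Table 2, each I frame, P frame, B frame, and audio frame is split into one or more RTP packets, where VI1R1 denotes the first RTP packet of the first I frame and VI1R2 the second RTP packet of the first I frame; A1R1 denotes the first RTP packet of the first audio frame and A1R2 the second RTP packet of the first audio frame; VP1R1 denotes the first RTP packet of the first P frame; VB1R1 denotes the first RTP packet of the first B frame; VI2R1 denotes the first RTP packet of the second I frame; and so on.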
Figure imgf000007_0001
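VI1R1 | VI1R2 | A1R1 | A1R2 | VP1R1 | VB1R1 | VI2R1 | ...
Referring to Table 3, each RTP packet includes three data parts: Real Time Streaming Protocol header information (rtsp header), Real-time Transport Protocol header information (rtp header), and media data (sample). The rtsp header contains 4 bytes: the first byte identifies the data as RTP data, the second byte is the port number, and the third and fourth bytes give the length of the RTP packet. The rtp header is the header information of the RTP packet, and sample is the sampled video data or audio data.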
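Table 3
rtsp header | rtp header | sample
Because the streaming media server 20 performs RTP pre-encapsulation on the media data information of a media file (including the I-frames, B-frames, P-frames and audio frames) as soon as it receives the file, it can send the corresponding RTP packets directly to a user after receiving that user's operation command. In the prior art, by contrast, the server performs RTP encapsulation on the media data information to be sent only after it receives the user's operation command and then sends that information to the user, and it must repeat the RTP encapsulation of the same streaming media information every time it receives the same operation command. The pre-encapsulation therefore reduces, to some extent, the amount of information the streaming media server 20 has to process, shortens the time it spends processing information when providing the streaming media service, and thus improves the user experience.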
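The three-part packet layout of Table 3 can be illustrated with a short sketch. Several details here are assumptions rather than statements from the patent: the first byte is set to 0x24 ('$') as in RTSP interleaved framing, the "port number" byte is modelled as a one-byte channel identifier, the length field is taken to be big-endian, and the rtp header is the 12-byte fixed RTP header without CSRC entries or extensions. The function names are hypothetical.

```python
import struct

RTSP_MAGIC = 0x24   # '$', assumed marker byte for "RTP data"
RTP_VERSION = 2


def build_rtp_header(payload_type: int, seq: int, timestamp: int,
                     ssrc: int, marker: bool = False) -> bytes:
    """Build a 12-byte fixed RTP header (no CSRC list, no extension)."""
    byte0 = RTP_VERSION << 6                          # V=2, P=0, X=0, CC=0
    byte1 = (int(marker) << 7) | (payload_type & 0x7F)
    return struct.pack("!BBHII", byte0, byte1, seq & 0xFFFF,
                       timestamp & 0xFFFFFFFF, ssrc & 0xFFFFFFFF)


def build_packet(channel: int, rtp_header: bytes, sample: bytes) -> bytes:
    """Concatenate rtsp header (4 bytes), rtp header and sample."""
    length = len(rtp_header) + len(sample)            # length of the RTP packet
    rtsp_header = struct.pack("!BBH", RTSP_MAGIC, channel, length)
    return rtsp_header + rtp_header + sample


def parse_packet(blob: bytes):
    """Split a stored packet back into (marker, channel, rtp header, sample)."""
    magic, channel, length = struct.unpack("!BBH", blob[:4])
    rtp = blob[4:4 + length]
    return magic, channel, rtp[:12], rtp[12:]
```

Because the packet is stored in this ready-to-send form, serving a request amounts to reading bytes and writing them to the connection, which is the saving the paragraph above describes.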
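In this embodiment, the index information includes a description of each I-frame. The description includes the start position of the current I-frame, the data size of the current I-frame, the data size from the current I-frame to the next I-frame, the sampling time and the time identifier. The streaming media server 20 stores these index parameters in the corresponding fields of an index table. Through the index table, the streaming media server 20 can quickly locate each I-frame and thereby perform operations such as play, positioning, fast-forward and rewind on the media data information. Referring to Table 4, in the index table file-offset is the start position of the current I-frame in the media data information; size is the data size from the current I-frame to the next I-frame; iframesize is the data size of the current I-frame; time is the absolute time within the media data information and is used for time control; and ts is the time identifier.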
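As an illustration of how such an index table supports positioning, the sketch below models one record per I-frame and looks up the record for a requested time. The `IndexEntry` class and `find_keyframe` function are hypothetical names; the field names follow the text with underscores in place of hyphens. Returning the last I-frame at or before the requested time, and measuring `time` in seconds, are assumptions made here.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List


@dataclass
class IndexEntry:
    file_offset: int  # start position of the I-frame in the media data information
    size: int         # data size from this I-frame to the next I-frame
    iframesize: int   # data size of the I-frame itself
    time: float       # absolute time within the media data (seconds, assumed unit)
    ts: int           # time identifier


def find_keyframe(index: List[IndexEntry], t: float) -> IndexEntry:
    """Return the last I-frame whose time is <= t; entries must be sorted by time."""
    times = [entry.time for entry in index]
    pos = bisect_right(times, t) - 1
    return index[max(pos, 0)]  # clamp requests before the first I-frame
```

A binary search keeps the lookup logarithmic in the number of I-frames, so positioning does not require scanning the media data itself.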
Table 4
file-offset | size | iframesize | time | ts
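Referring to Table 1, all P-frames and B-frames between the first I-frame and the second I-frame are used to adjust the first I-frame, while all audio frames between the first I-frame and the second I-frame are matched to the corresponding video frames (I-frames, P-frames and B-frames) by means of their own time identifiers. When the time identifier carried in an audio frame is the same as that in a video frame, the audio frame and the video frame are to be played at the same time.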
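The matching rule above (equal time identifiers mean simultaneous playback) can be sketched as below. Representing a frame as a `(time identifier, label)` tuple and the function name `pair_for_playback` are assumptions for illustration only.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Frame = Tuple[int, str]  # (time identifier, label), a hypothetical representation


def pair_for_playback(video: List[Frame],
                      audio: List[Frame]) -> Dict[int, Tuple[List[str], List[str]]]:
    """Group video and audio frames that carry the same time identifier."""
    video_by_ts = defaultdict(list)
    for ts, label in video:
        video_by_ts[ts].append(label)

    paired: Dict[int, Tuple[List[str], List[str]]] = {}
    for ts, label in audio:
        if ts in video_by_ts:  # an audio frame matches only on an equal identifier
            entry = paired.setdefault(ts, (video_by_ts[ts], []))
            entry[1].append(label)
    return paired
```

Audio frames whose identifier has no video counterpart are simply left out of the result in this sketch.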
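The following takes as examples a media file whose encoding format is MPEG-4 and whose suffix is ".mp4", and a media file whose encoding format is MPEG-2 and whose suffix is ".ts". Referring to FIG. 3, the detailed procedure by which the streaming media server 20 converts the MPEG-4 and MPEG-2 media files into media files of the specific file format is as follows: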
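Step 300: After receiving the media files with the suffixes ".mp4" and ".ts", the streaming media server 20 determines from the file header information in the source files that their encoding formats are "MPEG-4" and "MPEG-2" respectively.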
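A minimal sketch of such a header check is given below. The patent does not state which header fields are inspected, so the signatures used here are assumptions: an `ftyp` type code in bytes 4 to 7 for MP4 files, and the 0x47 sync byte repeated at 188-byte intervals for transport stream files. The function name `detect_format` is hypothetical.

```python
TS_PACKET_SIZE = 188
TS_SYNC_BYTE = 0x47


def detect_format(head: bytes) -> str:
    """Guess the format from the first bytes of a file."""
    # MP4 files commonly start with an atom whose type code is 'ftyp'.
    if len(head) >= 8 and head[4:8] == b"ftyp":
        return "MPEG-4"
    # Transport streams repeat the sync byte at the start of every 188-byte packet.
    if (len(head) > TS_PACKET_SIZE
            and head[0] == TS_SYNC_BYTE
            and head[TS_PACKET_SIZE] == TS_SYNC_BYTE):
        return "MPEG-2"
    return "unknown"
```

Checking content rather than the file suffix means a misnamed file is still routed to the right parser.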
Referring to Table 5, a media file with the ".mp4" suffix consists of a number of atoms. Each atom contains three parts: length (size), type (type) and the data it contains (data).
Table 5
size | type | data
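The size/type/data layout of Table 5 can be walked with a few lines of code. This sketch assumes the common on-disk encoding, a 4-byte big-endian size that counts the 8-byte header followed by a 4-character type code, and it deliberately stops at the special size values 0 and 1 rather than handling them. The function name `iter_atoms` is hypothetical.

```python
import struct
from typing import Iterator, Tuple


def iter_atoms(buf: bytes) -> Iterator[Tuple[str, bytes]]:
    """Yield (type, data) for each top-level atom in buf."""
    pos = 0
    while pos + 8 <= len(buf):
        (size,) = struct.unpack_from("!I", buf, pos)
        kind = buf[pos + 4:pos + 8].decode("latin-1")
        if size < 8 or pos + size > len(buf):
            break  # extended or open-ended sizes are out of scope for this sketch
        yield kind, buf[pos + 8:pos + size]
        pos += size
```

Container atoms can be traversed by calling `iter_atoms` again on the returned data.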
Table 6
Sample description atom
Time-to-sample atom
Time-to-sample atom
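In addition, a media file with the ".mp4" suffix also contains a sample table atom (Sample table atom). Referring to Table 6, the Sample table atom includes parameters used for indexing, such as the description of each sample (Sample description atom), the time information corresponding to each sample (Time-to-sample atom), and the sequence numbers of the samples that contain video key frame data (Sync sample atom).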
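To show how these parameters can yield key frame times, the sketch below combines a time-to-sample table with a list of sync sample numbers. It assumes the usual representation of these tables, which the patent does not spell out: time-to-sample entries are `(sample_count, sample_delta)` runs in timescale units, and sync sample numbers are 1-based. The function names are hypothetical.

```python
from typing import List, Tuple


def sample_times(stts: List[Tuple[int, int]], timescale: int) -> List[float]:
    """Expand (sample_count, sample_delta) runs into per-sample start times in seconds."""
    times: List[float] = []
    t = 0
    for count, delta in stts:
        for _ in range(count):
            times.append(t / timescale)
            t += delta
    return times


def keyframe_times(stts: List[Tuple[int, int]], stss: List[int],
                   timescale: int) -> List[float]:
    """Return the start time of every sync (key frame) sample."""
    times = sample_times(stts, timescale)
    return [times[n - 1] for n in stss]  # sync sample numbers are 1-based
```

The resulting times are the kind of value that would populate the time field of the index information.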
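A media file with the ".ts" suffix consists of a number of TS packets, which are obtained by splitting PES packets. Referring to FIG. 4, the first TS packet that carries a given PES packet consists of three parts: a TS header, a PES header and DATA. The other TS packets consist of two parts: a TS header and DATA. The TS header identifies information such as the transmission priority, the PES header contains index parameters such as the time identifier, and DATA holds the corresponding media data.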
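The TS header fields mentioned above can be read with simple bit masks. This sketch relies on the standard MPEG-2 transport stream layout (188-byte packets, sync byte 0x47), which is background knowledge rather than something the patent states; the function name `parse_ts_header` and the dictionary keys are hypothetical.

```python
TS_PACKET_SIZE = 188
SYNC_BYTE = 0x47


def parse_ts_header(packet: bytes) -> dict:
    """Decode selected fields of the 4-byte TS packet header."""
    if len(packet) != TS_PACKET_SIZE or packet[0] != SYNC_BYTE:
        raise ValueError("not a TS packet")
    return {
        # Set on the TS packet that starts a new PES packet (and so carries the PES header).
        "payload_unit_start": bool(packet[1] & 0x40),
        "transport_priority": bool(packet[1] & 0x20),
        "pid": ((packet[1] & 0x1F) << 8) | packet[2],
        "continuity_counter": packet[3] & 0x0F,
    }
```

A converter would use `payload_unit_start` to know when to parse a PES header for the time identifier and when to treat the payload as plain DATA.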
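Step 310: The streaming media server 20 extracts the MPEG-4 encoded media data from the data part of each atom of the ".mp4" media file, and extracts the MPEG-2 encoded DATA parts from the ".ts" media file. Each set of extracted data is assembled into a media data package as shown in Table 1, which includes the video frames and audio frames of the media file.
Step 320: The streaming media server 20 performs RTP pre-encapsulation on the video frames and audio frames. As shown in Table 2, the streaming media server 20 splits each video frame and audio frame into one or more RTP packets.
Step 330: The streaming media server 20 generates the index information shown in Table 4 from the index parameters in the Sample table atom of the MPEG-4 encoded ".mp4" media file, and generates the index information shown in Table 4 from the index parameters in the PES headers of the MPEG-2 encoded ".ts" media file. The index information is used to quickly locate the I-frames among the video frames.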
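The outcome of steps 310 to 330 can be sketched as a single pass over the pre-encapsulated frames that emits one index record per I-frame. The `PackedFrame` type and `build_index` function are hypothetical, and the sketch assumes that `size` means the number of bytes from the start of one I-frame to the start of the next (or to the end of the data for the last I-frame).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PackedFrame:
    kind: str    # "I", "P", "B" or "A"
    nbytes: int  # bytes occupied after RTP pre-encapsulation
    time: float  # absolute time of the frame
    ts: int      # time identifier


def build_index(frames: List[PackedFrame]) -> List[dict]:
    """Create one index record per I-frame while accumulating byte offsets."""
    index: List[dict] = []
    offset = 0
    for frame in frames:
        if frame.kind == "I":
            if index:  # close the previous record at the start of this I-frame
                index[-1]["size"] = offset - index[-1]["file-offset"]
            index.append({"file-offset": offset, "size": 0,
                          "iframesize": frame.nbytes,
                          "time": frame.time, "ts": frame.ts})
        offset += frame.nbytes
    if index:  # the last record runs to the end of the media data
        index[-1]["size"] = offset - index[-1]["file-offset"]
    return index
```

Because the index is computed from the stored layout rather than from the source container, the same lookup code serves files that originated as ".mp4" and as ".ts".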
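According to the above embodiment, after the streaming media server 20 has converted media files of several different encoding formats into media files of the specific file format, when the client 21 requests playback of a media file the streaming media server 20 reads the index information of the corresponding I-frame from the index table, locates the start position of that I-frame in the media data information, reads valid media data starting from that position, and sends the corresponding video key frames, video predicted frames and audio frames to the client 21.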
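For example, when a user logs in to the streaming media server 20 through the client 21 and uses a positioning command to jump to scene B of the on-demand movie A, the streaming media server 20 calculates the absolute time C of the I-frame corresponding to scene B, looks up in the index table the record whose "time" field equals C, and thereby obtains the start position D of that I-frame in movie A. The streaming media server 20 then reads, starting from position D, that I-frame and all subsequent I-frames, P-frames, B-frames and corresponding audio frames, and sends the corresponding RTP packets to the client 21 over the network.
In addition, the user may fast-forward or rewind movie A by sending a fast-forward command or a rewind command through the client 21. Fast-forward and rewind operations are divided by speed into levels such as 1X, 2X and 4X. When the user fast-forwards or rewinds movie A at 1X starting from scene B, the streaming media server 20 obtains the start position D of the I-frame corresponding to scene B, reads the I-frame at position D, and then reads only I-frames, one after another in the forward or backward direction, without reading P-frames, B-frames or audio frames. When the user fast-forwards or rewinds movie A at 2X or 4X starting from scene B, the streaming media server 20 obtains the start position D of the I-frame corresponding to scene B, reads the I-frame at position D, and then, moving in the forward or backward direction, skips one or more I-frames between each I-frame it reads. During a fast-forward or rewind operation, the streaming media server 20 uses the "iframesize" field in the index table to determine the size of each I-frame to be read.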
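The I-frame selection for fast-forward and rewind can be sketched as a stride over the index table. The mapping from speed level to stride (1X reads every I-frame, 2X every second one, 4X every fourth one) is an assumption; the text only says that one or more I-frames are skipped at higher speeds. The function name `trick_play_reads` is hypothetical, and the index records are modelled as dictionaries keyed by the field names used in the text.

```python
from typing import List, Tuple


def trick_play_reads(index: List[dict], start: int, speed: int,
                     forward: bool = True) -> List[Tuple[int, int]]:
    """Return (file-offset, iframesize) pairs for the I-frames to read.

    start is the position in the index of the I-frame at position D.
    """
    if speed < 1:
        raise ValueError("speed must be >= 1")
    step = speed if forward else -speed
    stop = len(index) if forward else -1
    return [(index[i]["file-offset"], index[i]["iframesize"])
            for i in range(start, stop, step)]
```

Each returned pair tells the server where to seek and how many bytes to read, so P-frames, B-frames and audio frames are never touched during these operations.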
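With the above method, one and the same streaming media server 20 can process media files of different encoding formats. As a result, in a streaming media service system a small number of streaming media servers 20 can provide users with content-rich system services, which to some extent reduces the cost of the system, lowers the difficulty of system integration, and further achieves load balancing of the system.
Obviously, those skilled in the art may make various modifications and variations to the present invention without departing from the spirit and scope of the present invention. Thus, if these modifications and variations fall within the scope of the claims of the present invention and their equivalent technologies, the present invention is also intended to cover them.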

Claims

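1. A method for supporting media data of multiple encoding formats, characterized by comprising the steps of: a streaming media server converting received media files of different encoding formats into media files of a specific file format, the media file of the specific file format comprising media data information and index information; and the streaming media server determining the corresponding media file according to an operation command from a client, and further sending, according to the index information in the corresponding media file, the corresponding media data information in the corresponding media file to the client.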
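2. The method according to claim 1, characterized in that the step of converting into a media file of the specific file format comprises: obtaining the corresponding video frames, audio frames and index parameters according to the encoding format of the media file; generating the corresponding media data information from the corresponding video frames and audio frames, wherein the video frames comprise video key frames and video predicted frames; and generating, from the corresponding index parameters, index information used to locate the video key frames among the corresponding video frames.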
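3. The method according to claim 2, characterized in that the step of generating the corresponding media data information from the corresponding video frames and audio frames comprises: obtaining, according to the Real-time Transport Protocol and by splitting the video key frames, video predicted frames and audio frames, media data information comprising one or more Real-time Transport Protocol data packets, wherein each Real-time Transport Protocol packet comprises Real Time Streaming Protocol header information, Real-time Transport Protocol header information and media data.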
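4. The method according to claim 2, characterized in that the index information comprises the start position of a video key frame, the data size of the video key frame itself, the data size between the video key frame and the next video key frame, the sampling time and the time identifier.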
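5. The method according to claim 4, characterized in that the step of locating a video key frame according to the index information comprises: determining the corresponding audio frame according to the time identifier in the index information and the time identifiers contained in the audio frames, and determining the video key frame among the corresponding video frames according to the time identifier in the corresponding audio frame.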
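6. The method according to any one of claims 1 to 5, characterized in that the operation command comprises a play command, a positioning command, a fast-forward command and a rewind command.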
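7. The method according to claim 6, characterized in that, after receiving a play command or a positioning command, the streaming media server determines, according to the index information, the video key frame among the corresponding video frames and sets the start position of the video key frame in the corresponding media file, and reads, starting from the start position, the video key frame and its subsequent video key frames, video predicted frames and audio frames.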
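8. The method according to claim 6, characterized in that, after receiving a fast-forward command or a rewind command, the streaming media server determines, according to the index information, the video key frame among the corresponding video frames and sets the start position of the video key frame in the corresponding media file, reads the video key frame starting from the start position, and then either reads only video key frames consecutively in the forward or backward direction, or reads, in the forward or backward direction, the corresponding video key frame after each interval of one or more video key frames.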
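9. A streaming media server, characterized by comprising:
a receiving unit, configured to receive media files of different encoding formats and operation commands sent by a client;
a conversion unit, configured to convert the received media files of different encoding formats into media files of a specific file format, the media file of the specific file format comprising media data information and index information;
a storage unit, configured to store the media files of the specific file format;
a processing unit, configured to determine the corresponding media file according to the operation command sent by the client, determine the corresponding video key frame according to the index information in the corresponding media file and set the start position of the video key frame in the corresponding media file, and read the media data information starting from the start position; and
a sending unit, configured to return the corresponding media data information to the client.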
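10. The streaming media server according to claim 9, characterized in that the conversion unit is further configured to obtain, according to the Real-time Transport Protocol and by splitting the video key frames, video predicted frames and audio frames, media data information comprising one or more Real-time Transport Protocol data packets.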
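11. A communication system, characterized by comprising:
a client, configured to send operation commands to a streaming media server and to receive the media data information returned by the streaming media server; and
a streaming media server, configured to convert received media files of different encoding formats into media files of a specific file format, the media file of the specific file format comprising media data information and index information, and to determine the corresponding media file according to the operation command sent by the client, and further to determine and return, according to the index information in the corresponding media file, the corresponding media data information in the corresponding media file.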
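12. The communication system according to claim 11, characterized in that the streaming media server comprises:
a receiving unit, configured to receive media files of different encoding formats and operation commands sent by the client;
a conversion unit, configured to convert the received media files of different encoding formats into media files of a specific file format that have been pre-encapsulated according to the Real-time Transport Protocol, the media file of the specific file format comprising media data information and index information;
a storage unit, configured to store the media files of the specific file format;
a processing unit, configured to determine the corresponding media file according to the operation command sent by the client, further determine, according to the index information in the corresponding media file, the corresponding video key frame and the start position of the video key frame in the corresponding media file, and read the media data information starting from the start position; and
a sending unit, configured to return the corresponding media data information to the client.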
PCT/CN2007/002148 2006-11-21 2007-07-13 Procédé et système permettant d'accepter des données media de divers formats de codage WO2008061416A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP07785125A EP2086240A4 (en) 2006-11-21 2007-07-13 METHOD AND SYSTEM FOR SUPPORTING MEDIA DATA OF VARIOUS CODING FORMATS
US12/343,099 US20090106288A1 (en) 2006-11-21 2008-12-23 Method and system for supporting media data of various coding formats

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CNB2006101448171A CN100544439C (zh) 2006-11-21 2006-11-21 一种支持多种编码格式的媒体数据的方法及系统
CN200610144817.1 2006-11-21

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US12/343,099 Continuation US20090106288A1 (en) 2006-11-21 2008-12-23 Method and system for supporting media data of various coding formats

Publications (1)

Publication Number Publication Date
WO2008061416A1 true WO2008061416A1 (fr) 2008-05-29

Family

ID=38019229

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2007/002148 WO2008061416A1 (fr) 2006-11-21 2007-07-13 Procédé et système permettant d'accepter des données media de divers formats de codage

Country Status (4)

Country Link
US (1) US20090106288A1 (zh)
EP (1) EP2086240A4 (zh)
CN (1) CN100544439C (zh)
WO (1) WO2008061416A1 (zh)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1435056A (zh) * 2000-06-09 2003-08-06 通用仪器公司 视频尺度转换及从mpeg-2到mpeg-4的编码转换
US20040175097A1 (en) * 2003-03-07 2004-09-09 Rami Caspi System and method for e-mail control of an integrated communications center
CN1551631A (zh) * 2003-05-16 2004-12-01 用于媒体回放体系结构的方法和系统

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7159233B2 (en) * 2000-01-28 2007-01-02 Sedna Patent Services, Llc Method and apparatus for preprocessing and postprocessing content in an interactive information distribution system
US7159235B2 (en) * 2000-01-28 2007-01-02 Sedna Patent Services, Llc Method and apparatus for content distribution via non-homogeneous access networks
JP3951695B2 (ja) * 2001-12-11 2007-08-01 ソニー株式会社 画像配信システムおよび方法、画像配信装置および方法、画像受信装置および方法、記録媒体、並びにプログラム
EP1967006A2 (en) * 2005-12-23 2008-09-10 Koninklijke Philips Electronics N.V. Splitting of a data stream

Also Published As

Publication number Publication date
CN100544439C (zh) 2009-09-23
CN1949876A (zh) 2007-04-18
EP2086240A1 (en) 2009-08-05
US20090106288A1 (en) 2009-04-23
EP2086240A4 (en) 2010-03-17

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07785125

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2007785125

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE