WO2012094975A1 - Method and device for transmitting and receiving multimedia data - Google Patents

Method and device for transmitting and receiving multimedia data Download PDF

Info

Publication number
WO2012094975A1
WO2012094975A1 PCT/CN2012/070161 CN2012070161W WO2012094975A1 WO 2012094975 A1 WO2012094975 A1 WO 2012094975A1 CN 2012070161 W CN2012070161 W CN 2012070161W WO 2012094975 A1 WO2012094975 A1 WO 2012094975A1
Authority
WO
WIPO (PCT)
Prior art keywords
code stream
dtq
svc
presentation
description information
Prior art date
Application number
PCT/CN2012/070161
Other languages
French (fr)
Chinese (zh)
Inventor
赵宇
王芳
孙健
刘继年
李加周
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2012094975A1 publication Critical patent/WO2012094975A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8451Structuring of content, e.g. decomposing content into time segments using Advanced Video Coding [AVC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440227Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4621Controlling the complexity of the content stream or additional data, e.g. lowering the resolution or bit-rate of the video stream for a mobile client with a small screen

Definitions

  • the invention relates to the field of multimedia, in particular to a method for transmitting and receiving multimedia data and a transmission and receiving device thereof. Background technique
  • the Real-time Transport Protocol is an example.
  • the packet structure includes two parts, a protocol header and a payload part.
  • the protocol header includes the payload media type, packet sequence number, timestamp, synchronization source identifier, and so on.
  • the payload portion is typically a simple sequential storage of multimedia data.
  • Another common streaming media transfer protocol such as the MPEG (Moving Pictures Experts Group) TS (Transport Stream) packet, is the same for its packet structure.
  • MPEG Motion Picture Experts Group
  • TS Transport Stream
  • the client cannot obtain any auxiliary information from the packet header when extracting a subset of them in the transport stream.
  • the current MPEG-2 TS bears the SVC stream. Only one substream corresponding to one airspace hierarchy is determined, as shown in Figure 1. There is no further subdivision of the quality and time domain grading in the same airspace grading, which makes it easy to find the code stream corresponding to a certain airspace grading by PID (program identification), but in the airspace grading, I want to find a certain Quality grading or time domain grading is difficult, and can only be obtained by traversing the load and comparing the NALU (Network Abstraction Layer Unit) header.
  • NALU Network Abstraction Layer Unit
  • MPEG2-TS is used as a storage file.
  • the client frequently switches the attributes of the viewing stream, for example, frequently switches the size of the viewing stream (spatial grading), frame rate (time domain grading).
  • MPEG2-TS is used as a file, it is not possible to efficiently extract the required code stream from the file.
  • the user extracts the transmitted code stream, it must parse to SEI (Supplemental Enhancement Information) to determine the DTQ (Space Time Quality) corresponding to the code stream to be extracted.
  • SEI Supplemental Enhancement Information
  • the invention provides a method for transmitting and receiving multimedia data and a transmission and receiving device thereof.
  • One of the objectives of the present invention is to provide information of each presentation stream in a multimedia encoded stream and its corresponding identification to simplify processing of multimedia data by the receiving end.
  • the present invention discloses a method for transmitting multimedia data, including:
  • the media forwarding server generates, according to the additional information (SEI) in the layerable video coding (SVC) code stream to be sent, description information of each presentation code stream included in the SVC code stream, and a spatial time quality (DTQ) identifier.
  • SEI additional information
  • SVC layerable video coding
  • DTQ spatial time quality
  • the method further includes: after the multimedia client receives the SVC code stream, determining, according to the received description information of each of the presentation code streams and the correspondence between the DTQ identifiers, that the client supports playing Deriving the DTQ identifier corresponding to the description information of the code stream, traversing the SVC code stream to obtain the display code stream corresponding to the determined DTQ identifier for decoding display.
  • the above method further includes:
  • the original code stream of the SVC code stream is divided into one or more spatial grading code stream segments according to spatial grading, according to time domain grading and quality grading.
  • each spatial grading code stream segment is subjected to secondary grading, and each secondary grading code stream segment is hierarchically classified three times according to another hierarchical manner, to be divided into different presentation code streams, and each presentation is performed.
  • the code streams correspond to a DTQ identifier.
  • the multimedia client receives the description information of each presentation code stream and the correspondence between the DTQ identifiers, and determines a DTQ label corresponding to the description information of the presentation code stream supported by the client. Obtaining, the selected code stream corresponding to the determined DTQ identifier is selected from the received SVC code stream for decoding display.
  • the present invention discloses a method for transmitting multimedia data, including:
  • the media forwarding server divides the original code stream of the received SVC code stream into one or more spatial grading code stream segments according to spatial grading, and respectively classifies each space according to a hierarchical manner in time domain grading and quality grading.
  • the flow segment is subjected to secondary grading, and the code segments of each secondary grading are respectively classified into three times according to another grading manner, and are divided into different presentation code streams, and each of the presentation code streams respectively corresponds to one DTQ identifier; the media forwarding The server sends the SVC code stream that has been hierarchically graded as needed.
  • the multimedia client receives the SVC code stream, according to the
  • the SEI in the SVC code stream generates description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers, and determines a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, from the The display code stream corresponding to the determined DTQ identifier is selected and decoded in the SVC code stream.
  • the method further includes: before the sending, by the media forwarding server, the description information of each presentation stream included in the SVC code stream according to the SEI in the SVC code stream before sending the SVC code stream. And the corresponding relationship between the DTQ identifiers, and the corresponding description information of the generated presentation code streams and the spatial temporal quality DTQ identifiers are sent before the SVC code stream is sent or before the SVC code stream is sent.
  • the multimedia client receives the description information of each of the presentation code streams and the correspondence between the DTQ identifiers, and determines a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, from the received SVC.
  • the code stream corresponding to the determined DTQ identifier is selected and decoded for display.
  • Still another object of the present invention is to simplify the data processing operation when the multimedia client receives the information of each of the presentation streams in the multimedia encoded stream and its corresponding identifier.
  • the present invention discloses a method for receiving multimedia data, including:
  • the multimedia client determines, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the presentation code supported by the client.
  • the DTQ identifier corresponding to the description information of the stream is traversed to the SVC code stream to obtain the display code stream corresponding to the determined DTQ identifier for decoding display.
  • Another object of the present invention is to improve the efficiency of a multimedia client in extracting a multimedia code stream.
  • the present invention discloses a method for receiving multimedia data, including: when a multimedia client receives an SVC code stream, determining the present information from the description information of each code stream and the correspondence between the DTQ identifiers in the SVC code stream.
  • the client supports the DTQ identifier corresponding to the description information of the displayed presentation stream, and selects the presentation stream corresponding to the determined DTQ identifier from the SVC stream to perform decoding and display.
  • the multimedia client determines, according to the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the DTQ corresponding to the description information of the presentation code stream supported by the client.
  • the process of identification is as follows:
  • the multimedia client generates, according to the additional information SEI in the received SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines that the client supports the playback.
  • the DTQ identifier corresponding to the description information of the code stream is displayed.
  • the multimedia client receives the description information of each presentation stream in the SVC stream sent by the multimedia forwarding server and the correspondence between the DTQ identifiers, and determines the description information of the presentation stream supported by the client. Corresponding DTQ identifier.
  • Still another object of the present invention is to provide a transmission apparatus that can provide information of each presentation stream in a multimedia encoded stream and its corresponding identification.
  • the present invention discloses a multimedia data transmission device, including:
  • a first module configured to: generate, according to an SEI in the SVC code stream to be sent, description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers;
  • a second module configured to: after sending the SVC code stream to the multimedia client or before sending the SVC code stream, send the generated description information of each presentation code stream and a correspondence relationship of the DTQ identifier.
  • the foregoing transmission device further includes: a third module, configured to: divide the original code stream of the SVC code stream into one or more spatial grading code stream segments according to spatial grading, according to time domain grading and quality grading a grading method to separately classify each spatial grading code stream segment, according to another
  • the grading mode divides the code streams of each of the two gradings into three times, and divides them into different presentation streams, and sends them to the multimedia client, where each of the presentation streams respectively corresponds to one DTQ identifier.
  • Still another object of the present invention is to provide a transmission device that can implement ordered layering of a multimedia code stream.
  • the present invention discloses a multimedia data transmission device, including:
  • a first module configured to: divide the original code stream of the SVC code stream into one or more spatial grading code stream segments according to a spatial grading;
  • the second module is configured to: separately classify each spatial grading code stream segment according to a grading manner in time domain grading and quality grading, and separately code each code segment of the second grading according to another grading manner Perform three times of grading to divide into different presentation streams and send them to the multimedia client, where each presentation stream corresponds to a DTQ identifier.
  • the foregoing transmission device further includes: a third module, configured to: generate, according to the additional addition information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and correspondence of the DTQ identifier And the relationship between the description information of each generated presentation stream and the DTQ identifier is sent before the SVC code stream is sent to the multimedia client or before the SVC code stream is sent.
  • a third module configured to: generate, according to the additional addition information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and correspondence of the DTQ identifier And the relationship between the description information of each generated presentation stream and the DTQ identifier is sent before the SVC code stream is sent to the multimedia client or before the SVC code stream is sent.
  • Still another object of the present invention is to provide a receiving device for multimedia data which simplifies data processing operations.
  • the present invention discloses a receiving device for multimedia data, including:
  • the first module is configured to: receive the SVC code stream, and determine, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the presentation code stream supported by the client to play Describe the DTQ identifier corresponding to the information;
  • a second module configured to: traverse the SVC code stream to obtain a decoded display stream corresponding to the determined DTQ identifier.
  • Another object of the present invention is to provide a receiving device for extracting multimedia data with high efficiency of a multimedia code stream.
  • the present invention discloses a receiving device for multimedia data, including: a first module, configured to: receive an SVC code stream, and determine, according to the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, a DTQ corresponding to the description information of the presentation code stream supported by the client.
  • a second module configured to: select a presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding display.
  • the first module is further configured to: generate, according to the SEI in the received SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence between DTQ identifiers And determining, from the DTQ identifier corresponding to the description information of the presentation stream supported by the client.
  • the first module is further configured to: receive the description information of each presentation code stream in the SVC code stream sent by the multimedia forwarding server, and the correspondence between the DTQ identifiers, and determine the client support The DTQ identifier corresponding to the description information of the displayed presentation stream.
  • One embodiment of the present invention reorders existing SVC code streams to provide regular storage, i.e., provides an ordered hierarchical multimedia code stream. Therefore, the server or the client can conveniently find the required code stream when the code stream is extracted, switched, etc., and the response speed of the server or the client is improved. Still another embodiment of the present invention provides information of each presentation stream in the SVC code stream and its corresponding DTQ identifier, thereby simplifying processing of the multimedia data by the receiving end side. BRIEF abstract
  • FIG. 1 is a schematic structural diagram of a prior art MPEG2-TS bearer SVC
  • FIG. 3 is a schematic structural diagram of an MPEG2-TS bearer SVC according to Embodiment 3 of the present invention
  • FIG. 4 is a schematic diagram of a SVC code stream multicast networking in Embodiment 4 of the present invention
  • FIG. 5 is a schematic diagram of MPEG2-TS bearer SVC code stream substream division according to Embodiment 4 of the present invention
  • FIG. 6 is a flowchart of a client extracting a substream according to Embodiment 4 of the present invention.
  • the applicant of the present invention considers that when the streaming media server sends the media stream, the media stream carries
  • SVC loads are divided according to airspace, quality, time domain or space, time domain, and quality layer. That is, the outermost level is the airspace rating. Quality and time domain grading are not in order.
  • the media forwarding server or the client parses the SVC payload, the code stream is selected according to these ratings, thereby achieving personalized viewing. If the SVC load in the SVC stream sent by the media forwarding server is directly sorted according to the airspace, quality, time domain or space, time domain, and quality layer, the client of the SVC does not need to traverse the entire load. Determine the DTQ corresponding to the code stream to be extracted.
  • the embodiment provides a multimedia data transmission process, in which the media forwarding server divides the original code stream of the existing SVC code stream into one or more spatial grading code streams according to spatial grading. Segments, according to a hierarchical manner in time domain grading and quality grading, respectively categorize each spatial grading code stream segment, and classify each second grading code stream segment three times according to another grading manner. For different presentation streams, each presentation stream corresponds to one DTQ identifier; the media forwarding server sends the three-stage SVC stream as needed.
  • the media forwarding server sends the SVC code stream after three times of classification according to the requirement:
  • the media forwarding server uses the unicast mode.
  • the D (space class grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream required for the unicast transmission are selected from the three-stage SVC code stream.
  • the corresponding spatial domain grading code stream and its dependent code stream are found according to the D value.
  • the corresponding time domain hierarchical code stream and its dependent code stream are found according to the T value.
  • the corresponding quality grading code stream and its dependent code stream are found according to the Q value.
  • the media forwarding server sends the selected presentation stream.
  • the multimedia client when receiving the SVC code stream, the multimedia client first determines the DTQ identifier corresponding to the description information of the presentation code stream supported by the client in the SVC code stream, and selects the determined DTQ identifier from the SVC code stream.
  • the corresponding presentation code stream is decoded and displayed.
  • the multimedia client may generate, according to the SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers, thereby determining a description of the presentation code stream supported by the client.
  • the DTQ identifier corresponding to the information.
  • the applicant of the present invention further proposes a method for transmitting multimedia data, including:
  • the media forwarding server Before the SVC code stream is sent, the media forwarding server generates, according to the SEI in the SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers; the media forwarding server needs to send the SVC code. At the time of streaming or before transmitting the SVC code stream, the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifier are transmitted.
  • the multimedia client can receive the description information of each of the foregoing presentation streams and the correspondence between the DTQ identifiers, the DTQ identifier corresponding to the description information of the presentation stream supported by the client is determined, and the received SVC code is received. The stream of the presentation code corresponding to the determined DTQ identifier is found in the stream for decoding and display.
  • This embodiment provides a preferred solution, which combines the technical means of the foregoing Embodiments 1 and 2 to perform multimedia data transmission.
  • the specific transmission process is as shown in FIG. 2, and includes the following process:
  • Step 200 The multimedia forwarding server generates a code stream description that is displayed and a correspondence relationship with the DTQ, that is, an overall description information of the SVC code stream.
  • the multimedia forwarding server traverses the SVC code stream to find the SEI, parses the SEI, and obtains the total number of presentation streams that the SVC code stream can provide; parses the SEI to generate description information for each presentation stream; parses the SEI, and obtains the DTQ corresponding to each presentation stream. That is, the multimedia forwarding server generates an overall description information of the SVC code stream.
  • the multimedia data of the MPEG2-TS is taken as an example, and the SVC code stream overall description information svc_total_descriptor is added to the PMT, which can be defined as a program level information, which can be defined as follows:
  • Table 1 is the svc_total_descriptor definition table
  • PresentationNum indicates the number of views (presentation streams) in the SVC stream;
  • DID the identifier of the airspace hierarchy;
  • TID the identifier of the time domain hierarchy
  • QID the identification of the quality rating
  • PresentationDescription A description of the presentation stream that is provided to the user for reading and selects the stream to watch based on the description.
  • Step 201 The multimedia forwarding server generates a layer-level hierarchical SVC code stream for storage. Specifically, the operation process of the step is as follows:
  • the multimedia forwarding server traverses the SVC code stream generated by the current technology, accesses the extended header of the SVC code stream NALU unit, and divides the original code stream into one or more according to the DID of the spatial hierarchy based on the original code stream of the SVC code stream.
  • DID stream segment The length and DID of the code stream segment are added before the code stream segment.
  • Performing a second grading on the code streams of different DIDs respectively traversing the code segment of the spatial domain grading, and dividing the spatially classified code stream into one or more TID codes according to the time domain grading corresponding TID Flow segment. The length and TID of the code stream segment are added before the code stream segment.
  • Performing a third grading on the code stream segments of different TIDs respectively traversing the code stream segments of the time domain grading, and dividing the code stream of the time domain grading into one or more QID code stream segments according to the QID corresponding to the quality grading.
  • the length and QID of the code stream segment are added to the code stream segment.
  • the multimedia forwarding server may first perform secondary classification according to the QID corresponding to the quality classification, and the spatially graded code stream. Dividing into one or more QID code stream segments; thereafter, traversing the quality-graded code stream segments, performing a third grading according to the TID corresponding to the time domain grading, and dividing the quality grading code stream into one or more TID code stream segments .
  • Step 202 When the multimedia forwarding server uses the multicast sending mode (the multimedia forwarding server may also be referred to as a multicast server), the sending operation is performed according to the large and complete SVC code stream of the good level; the multimedia forwarding server uses the unicast In the case of the transmission mode (the multimedia forwarding server may also be referred to as a unicast server), the corresponding code stream segment is selected from the well-classed code streams and then transmitted.
  • the multimedia forwarding server sends the SVC code stream overall description information generated in step 200 at the same time as or before the SVC code stream is sent.
  • the SVC code stream generated in step 201 is sent as a load, including the newly added length and other fields, according to the large and complete code stream of the good level.
  • the structure of the SVC is as shown in FIG. 3, which is consistent with the current SVC standard.
  • the substream On the basis of the substream, it is divided into one or more mass substreams, and based on the mass substream, it is divided into one or more time substreams.
  • the sending further includes:
  • the media forwarding server determines the D (space domain grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream to be selected from the SVC code stream overall description information generated by the step 200 as needed.
  • the corresponding spatial domain-classified code stream and its dependent code stream are found according to the D value.
  • the corresponding time domain hierarchical code stream and its dependent code stream are found according to the T value.
  • the corresponding quality grading code stream and its dependent code stream are found according to the Q value.
  • the media forwarding server sends the selected presentation stream.
  • Step 203 The multimedia client receives the foregoing SVC code stream, and performs decoding and displaying.
  • the multimedia client When the multimedia client receives the SVC code stream sent by the multicast mode, it selects and displays the corresponding presentation code stream for decoding and display, and the process is as follows:
  • the multimedia client determines the D (space domain grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream to be selected according to the received SVC code stream overall description information.
  • the corresponding airspace-classified code stream and its dependent code stream are found according to the Q value.
  • the code stream of the spatial domain grading and the code stream it depends on the corresponding time domain grading code stream and its dependent code stream are found according to the T value.
  • the corresponding quality grading code stream and its dependent code stream are found according to the Q value.
  • the selected code streams are reordered and passed to the decoder for decoding.
  • the SVC code stream overall description information generated by step 200 and the layered SVC code stream generated by step 201 are generated. Just do it once. For the stored hierarchical SVC streams, it does not need to be regenerated.
  • the mobile phone accesses the SVC code stream multicast as an example to describe the method for receiving the multimedia data.
  • the terminal access devices also referred to as multimedia clients
  • the terminal access devices can be classified into three types, one is a mobile phone with a small screen, and the other is a moderate screen.
  • PDA devices one type of laptop or television device with a large screen. These three types of devices receive the multicast stream sent by the server. Each device extracts the code streams supported by each of the multicast streams.
  • the mobile phone device extracts the image size from the code stream to QQVGA (1/4 of the size of QVGA), the frame rate is 15 frames/s, and the code stream of high definition image quality.
  • the PDA device extracts an image size of QVGA from the code stream, a frame rate of 15 frames/s, and a high-definition picture quality stream.
  • the computer device extracts the image size from the code stream to VGA, the frame rate is 30 frames/s, and the high-definition picture quality stream.
  • the multimedia forwarding server sends a stream of PID 201 in the MPEG2-TS code stream carrying the SVC to carry the substream of the QQVGA.
  • a code stream with a PID of 202 carries a substream of QVGA, and a code stream with a PID of 203 carries a substream of VGA.
  • the code stream of each airspace classification it is further divided into quality classification and time domain classification.
  • the process of accessing the SVC multicast by the mobile phone is as shown in FIG. 6, and includes the following steps:
  • Step 601 The client (that is, the mobile phone) receives the TS data.
  • Step 602 Determine whether the received TS data is completed. If the receiving is complete, go to step 610. Otherwise skip to step 603.
  • Step 603 Determine whether the received data is PSI data, and if yes, skip to step 604, otherwise, go to step 605.
  • Step 604 Parse the PSI data, and obtain a PID corresponding to the QQVGA code stream. Obtain the t value corresponding to 15 frames/second to obtain the q value corresponding to HD.
  • Step 605 determining whether the PSI data has been received. If yes, then go to step 606, otherwise skip to step 607.
  • Step 606 Determine whether the PID of the received code stream is the PID of the QQVGA code stream. If yes, go to step 608. Otherwise, go to step 607.
  • Step 607 discarding the data.
  • Step 608 parsing the load, skipping the length field, and determining whether the Q field is less than or equal to the q value corresponding to the high definition. If yes, it is judged whether the T field is equal to the t value corresponding to 15 frames/s. If yes, select the codestream. Otherwise, the stream segment is not selected. After that, the code stream segment is skipped, and the code stream is selected by the above judgment method. Until the end of the code stream, then step 609 is performed.
  • Step 609 Send the selected code stream segment to the decoder.
  • Step 610 ending.
  • the embodiment provides a multimedia data transmission device, which is built in a media forwarding server or other multimedia network element.
  • the device includes:
  • the first module generates, according to the SEI in the SVC code stream to be sent, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers;
  • the second module sends the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifiers before sending the SVC code stream to the multimedia client or before transmitting the SVC code stream.
  • the transmission device may further include a third module, and the original code stream of the SVC code stream is divided into one or more spatial grading code stream segments according to a spatial grading, according to a hierarchical manner in time domain grading and quality grading.
  • the spatial grading code stream segments are separately graded separately, and the code segments of each secondary grading are respectively graded three times according to another grading manner, and are divided into different presentation code streams, and sent to The multimedia client, wherein each of the presentation streams respectively corresponds to a DTQ identifier.
  • the embodiment further provides a multimedia data transmission device, which may also be built in a media forwarding server, or other multimedia network element.
  • the device includes:
  • the first module divides the original code stream of the layerable video coding SVC code stream into one or more spatial grading code stream segments according to a spatial grading;
  • each spatial grading code stream segment is separately categorized according to one of the time domain grading and the quality grading, and the second grading code stream segments are respectively categorized three times according to another grading manner. Divided into different presentation streams, and sent to the multimedia client, where each presentation stream corresponds to a spatial time quality DTQ identifier.
  • the transmission device further includes: a third module, generating, according to the additional information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence relationship between the DTQ identifiers, And before the multimedia client sends the SVC code stream or sends the SVC code stream, sending the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifier.
  • a third module generating, according to the additional information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence relationship between the DTQ identifiers.
  • the embodiment provides a multimedia data receiving device, which can be a multimedia client such as a mobile phone, a PDA device, a computer, and a television.
  • the device can include:
  • the first module receives the SVC code stream, and determines, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the description information corresponding to the presentation code stream supported by the client.
  • DTQ logo
  • the second module traverses the received SVC code stream to obtain the decoded code stream corresponding to the determined DTQ identifier for decoding display.
  • the SVC code stream received by the first module is an SVC code stream that has not been hierarchically sorted. Therefore, the second module needs to traverse the received SVC code stream to obtain the corresponding DTQ identifier corresponding to the display. Code stream.
  • the embodiment provides a receiving device for multimedia data, which may be a mobile phone, a PDA device, or an electric Brain and TV and other multimedia clients.
  • the device can include:
  • the first module the SVC code stream is received, and the DTQ identifier corresponding to the description information of the presentation code stream supported by the client is determined from the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers;
  • the first module generates, according to the SEI in the received SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines the presentation code stream supported by the client.
  • the description information corresponds to the DTQ identifier.
  • the description information of each presentation stream in the SVC stream sent by the multimedia forwarding server and the correspondence between the DTQ identifiers are received, and the DTQ identifier corresponding to the description information of the presentation stream supported by the client is determined.
  • the second module selects the presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding and display.
  • the SVC code stream received by the first module is the SVC code stream that is hierarchically sorted. Therefore, the second module selects the presentation code stream corresponding to the determined DTQ identifier from the hierarchically sorted SVC code streams. Just fine.
  • One embodiment of the present invention reorders existing SVC code streams to provide regular storage, i.e., provides an ordered hierarchical multimedia code stream. Therefore, the server or the client can conveniently find the required code stream when the code stream is extracted, switched, etc., and the response speed of the server or the client is improved. Still another embodiment of the present invention provides information of each presentation stream in the SVC code stream and its corresponding DTQ identifier, thereby simplifying processing of the multimedia data by the receiving end side.

Abstract

Provided are a method and device for transmitting and receiving multimedia data, relating to the multimedia field. The method for transmitting multimedia data disclosed in the present invention includes: a media forwarding server generates descriptive information about each rendering code stream included in a Scalable Video Coding (SVC) code stream to be transmitted and the correlation identified by the spatial time quality (DTQ) according to the Supplemental Enhancement Information (SEI) in the SVC code stream; and the media forwarding server transmits the generated descriptive information about each rendering code stream and the correlation identified by DTQ while or before transmitting the SVC code stream to a multimedia client. The method enables the receiver to simplify the multimedia data processing after receiving the multimedia data.

Description

一种多媒体数据的传输、 接收方法及其传输、 接收设备  Method for transmitting and receiving multimedia data and transmission and receiving device thereof
技术领域 Technical field
本发明涉及多媒体领域, 特别是一种多媒体数据的传输、 接收方法及其 传输、 接收设备。 背景技术  The invention relates to the field of multimedia, in particular to a method for transmitting and receiving multimedia data and a transmission and receiving device thereof. Background technique
随着编码技术的发展, 在流媒体典型的应用系统中, 支持的视频格式越 来越多。其中 H264/AVC( Advanced Video Coding )格式的码流应用较为广泛。 并且在 H264/AVC码流的基础上发展了 SVC ( Scalable Video Coding, 可分层 视频编码) 。 SVC码流有空域分层, 时域分层和质量分层。 由于码流的分层 特性, SVC给用户带来了全新的用户体验。 然而, 目前常用的流媒体传输协 议规范对于传输内容而言几乎是透明的, 其只提供了流媒体传输的通道。  With the development of coding technology, in the typical application system of streaming media, more and more video formats are supported. Among them, the code stream of H264/AVC (Advanced Video Coding) format is widely used. And based on the H264/AVC code stream, SVC (Scalable Video Coding) has been developed. SVC code streams have spatial domain layering, time domain layering and quality layering. Due to the layering nature of the code stream, SVC brings a new user experience to the user. However, the currently used streaming media protocol protocol specification is almost transparent to the transmission content, and it only provides a channel for streaming media transmission.
以流媒体传输协议 RTP ( Real-time Transport protocol, 实时传输协议 )为 例, 其分组包结构包括两部分, 协议头和负载部分。 协议头包括了负载媒体 类型, 包序号, 时间戳, 同步源标识等。 而负载部分通常是多媒体数据的简 单连续顺序存储。 另一种常用流媒体传输协议如 MPEG ( Moving Pictures Experts Group, 运动图像专家组) TS ( Transport Stream, 传输流)分组, 其 分组包结构同样如此。 发明内容  For example, the Real-time Transport Protocol (RTP) is an example. The packet structure includes two parts, a protocol header and a payload part. The protocol header includes the payload media type, packet sequence number, timestamp, synchronization source identifier, and so on. The payload portion is typically a simple sequential storage of multimedia data. Another common streaming media transfer protocol, such as the MPEG (Moving Pictures Experts Group) TS (Transport Stream) packet, is the same for its packet structure. Summary of the invention
对于 SVC和 MVC码流来说, 对于在传输码流中提取其中的子集时, 客 户无法从分组包头中获得任何辅助信息, 在目前的标准规范中, 目前的 MPEG-2 TS承载 SVC码流只确定了一个子流对应一个空域分级,如图 1所示。 对于同一个空域分级中的质量和时域分级没有进一步细分,导致通过 PID(节 目标识)可以很方便地找到某个空域分级对应的码流, 但是, 在该空域分级 中, 想找到某个质量分级或时域分级的码流就比较困难, 只能通过遍历负载, 比对 NALU (网络抽象层单元) 头才能获取。 在 MPEG2-TS作为存储文件时, 也有这个问题。 当客户端频繁地切换观 看码流的属性, 例如频繁地切换观看码流的大小 (空间分级) , 帧率(时域 分级) 。 MPEG2-TS作为存储文件时, 不能高效地将需要的码流从文件中提 取出来。 并且用户提取传输的码流时, 必须解析到 SEI ( Supplemental enhancement information, 附加增加信息)才能确定要提取的码流对应的 DTQ (空间时间质量) 。 而 SEI在传输码流中不一定存在。 For SVC and MVC streams, the client cannot obtain any auxiliary information from the packet header when extracting a subset of them in the transport stream. In the current standard specification, the current MPEG-2 TS bears the SVC stream. Only one substream corresponding to one airspace hierarchy is determined, as shown in Figure 1. There is no further subdivision of the quality and time domain grading in the same airspace grading, which makes it easy to find the code stream corresponding to a certain airspace grading by PID (program identification), but in the airspace grading, I want to find a certain Quality grading or time domain grading is difficult, and can only be obtained by traversing the load and comparing the NALU (Network Abstraction Layer Unit) header. This problem also occurs when MPEG2-TS is used as a storage file. When the client frequently switches the attributes of the viewing stream, for example, frequently switches the size of the viewing stream (spatial grading), frame rate (time domain grading). When MPEG2-TS is used as a file, it is not possible to efficiently extract the required code stream from the file. And when the user extracts the transmitted code stream, it must parse to SEI (Supplemental Enhancement Information) to determine the DTQ (Space Time Quality) corresponding to the code stream to be extracted. The SEI does not necessarily exist in the transport stream.
本发明提供一种多媒体数据的传输、 接收方法及其传输、 接收设备。 本发明目的之一是, 提供多媒体编码流中各展现码流的信息及其对应的 标识以便简化接收端侧对多媒体数据的处理。  The invention provides a method for transmitting and receiving multimedia data and a transmission and receiving device thereof. One of the objectives of the present invention is to provide information of each presentation stream in a multimedia encoded stream and its corresponding identification to simplify processing of multimedia data by the receiving end.
为此, 本发明公开了一种多媒体数据的传输方法, 包括:  To this end, the present invention discloses a method for transmitting multimedia data, including:
媒体转发服务器根据所要发送的可分层视频编码(SVC )码流中的附加 增加信息 (SEI )生成该 SVC码流中所包含的各展现码流的描述信息以及空 间时间质量(DTQ )标识的对应关系;  And the media forwarding server generates, according to the additional information (SEI) in the layerable video coding (SVC) code stream to be sent, description information of each presentation code stream included in the SVC code stream, and a spatial time quality (DTQ) identifier. Correspondence relationship
所述媒体转发服务器在向多媒体客户端发送所述 SVC码流的同时或者 发送所述 SVC码流前, 发送所生成的各展现码流的描述信息以及 DTQ标识 的对应关系。  And sending, by the media forwarding server, the generated description information of each presentation code stream and the correspondence between the DTQ identifiers before sending the SVC code stream to the multimedia client or before transmitting the SVC code stream.
较佳地, 上述方法还包括: 所述多媒体客户端接收所述 SVC码流后, 从 所收到的所述各展现码流的描述信息以及 DTQ标识的对应关系中,确定本客 户端支持播放的展现码流的描述信息对应的 DTQ标识, 遍历所述 SVC码流 以获取所确定的 DTQ标识对应的展现码流进行解码显示。  Preferably, the method further includes: after the multimedia client receives the SVC code stream, determining, according to the received description information of each of the presentation code streams and the correspondence between the DTQ identifiers, that the client supports playing Deriving the DTQ identifier corresponding to the description information of the code stream, traversing the SVC code stream to obtain the display code stream corresponding to the determined DTQ identifier for decoding display.
较佳地, 上述方法还包括:  Preferably, the above method further includes:
所述媒体转发服务器将向多媒体客户端发送所述 SVC码流之前,将所述 SVC码流的原始码流按照空间分级分为一个或多个空间分级码流段, 按照时 域分级和质量分级中的一种分级方式分别将各空间分级码流段进行二次分 级, 按照另一种分级方式分别将各二次分级的码流段进行三次分级, 以分为 不同的展现码流, 各展现码流分别对应一个 DTQ标识。  Before the media forwarding server sends the SVC code stream to the multimedia client, the original code stream of the SVC code stream is divided into one or more spatial grading code stream segments according to spatial grading, according to time domain grading and quality grading. In a hierarchical manner, each spatial grading code stream segment is subjected to secondary grading, and each secondary grading code stream segment is hierarchically classified three times according to another hierarchical manner, to be divided into different presentation code streams, and each presentation is performed. The code streams correspond to a DTQ identifier.
较佳地,所述多媒体客户端接收所述各展现码流的描述信息以及 DTQ标 识的对应关系,确定本客户端支持播放的展现码流的描述信息对应的 DTQ标 识, 从所接收到的 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行 解码显示。 Preferably, the multimedia client receives the description information of each presentation code stream and the correspondence between the DTQ identifiers, and determines a DTQ label corresponding to the description information of the presentation code stream supported by the client. Obtaining, the selected code stream corresponding to the determined DTQ identifier is selected from the received SVC code stream for decoding display.
本发明又一目的是, 实现多媒体码流的有序分层。  It is yet another object of the present invention to achieve an ordered layering of a multimedia code stream.
为此, 本发明公开了一种多媒体数据的传输方法, 包括:  To this end, the present invention discloses a method for transmitting multimedia data, including:
所述媒体转发服务器将收到的 SVC码流的原始码流按照空间分级分为 一个或多个空间分级码流段, 按照时域分级和质量分级中的一种分级方式分 别将各空间分级码流段进行二次分级, 按照另一种分级方式分别将各二次分 级的码流段进行三次分级, 以分为不同的展现码流, 各展现码流分别对应一 个 DTQ标识;所述媒体转发服务器根据需要发送经过三次分级的 SVC码流。  The media forwarding server divides the original code stream of the received SVC code stream into one or more spatial grading code stream segments according to spatial grading, and respectively classifies each space according to a hierarchical manner in time domain grading and quality grading. The flow segment is subjected to secondary grading, and the code segments of each secondary grading are respectively classified into three times according to another grading manner, and are divided into different presentation code streams, and each of the presentation code streams respectively corresponds to one DTQ identifier; the media forwarding The server sends the SVC code stream that has been hierarchically graded as needed.
较佳地, 上述方法中, 所述多媒体客户端接收所述 SVC码流, 根据该 Preferably, in the above method, the multimedia client receives the SVC code stream, according to the
SVC码流中的 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标识的对应关系, 确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识 , 从所述 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行 解码显示。 The SEI in the SVC code stream generates description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers, and determines a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, from the The display code stream corresponding to the determined DTQ identifier is selected and decoded in the SVC code stream.
较佳地, 上述方法还包括: 所述媒体转发服务器在发送所述 SVC码流之 前, 才艮据所述 SVC码流中的 SEI生成该 SVC码流中所包含的各展现码流的 描述信息以及 DTQ标识的对应关系, 在发送所述 SVC码流的同时或者发送 所述 SVC码流前, 发送所生成的各展现码流的描述信息以及空间时间质量 DTQ标识的对应关系。  Preferably, the method further includes: before the sending, by the media forwarding server, the description information of each presentation stream included in the SVC code stream according to the SEI in the SVC code stream before sending the SVC code stream. And the corresponding relationship between the DTQ identifiers, and the corresponding description information of the generated presentation code streams and the spatial temporal quality DTQ identifiers are sent before the SVC code stream is sent or before the SVC code stream is sent.
较佳地,所述多媒体客户端接收所述各展现码流的描述信息以及 DTQ标 识的对应关系,确定本客户端支持播放的展现码流的描述信息对应的 DTQ标 识, 从所接收到的 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行 解码显示。  Preferably, the multimedia client receives the description information of each of the presentation code streams and the correspondence between the DTQ identifiers, and determines a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, from the received SVC. The code stream corresponding to the determined DTQ identifier is selected and decoded for display.
本发明还有一目的是, 多媒体客户端接收到多媒体编码流中各展现码流 的信息及其对应的标识时, 简化数据处理操作。  Still another object of the present invention is to simplify the data processing operation when the multimedia client receives the information of each of the presentation streams in the multimedia encoded stream and its corresponding identifier.
为此, 本发明公开了一种多媒体数据的接收方法, 包括:  To this end, the present invention discloses a method for receiving multimedia data, including:
多媒体客户端接收到 SVC码流时, 从已收到的该 SVC码流中各展现码 流的描述信息以及 DTQ标识的对应关系中,确定本客户端支持播放的展现码 流的描述信息对应的 DTQ标识,遍历所述 SVC码流以获取所确定的 DTQ标 识对应的展现码流进行解码显示。 When receiving the SVC code stream, the multimedia client determines, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the presentation code supported by the client. The DTQ identifier corresponding to the description information of the stream is traversed to the SVC code stream to obtain the display code stream corresponding to the determined DTQ identifier for decoding display.
本发明另一目的是, 提高多媒体客户端提取多媒体码流的效率。  Another object of the present invention is to improve the efficiency of a multimedia client in extracting a multimedia code stream.
为此, 本发明公开了一种多媒体数据的接收方法, 包括: 多媒体客户端 接收到 SVC码流时,从该 SVC码流中各展现码流的描述信息以及 DTQ标识 的对应关系中,确定本客户端支持播放的展现码流的描述信息对应的 DTQ标 识, 从所述 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行解码显 示。  To this end, the present invention discloses a method for receiving multimedia data, including: when a multimedia client receives an SVC code stream, determining the present information from the description information of each code stream and the correspondence between the DTQ identifiers in the SVC code stream. The client supports the DTQ identifier corresponding to the description information of the displayed presentation stream, and selects the presentation stream corresponding to the determined DTQ identifier from the SVC stream to perform decoding and display.
较佳地, 上述方法中, 所述多媒体客户端从该 SVC码流中各展现码流的 描述信息以及 DTQ标识的对应关系中,确定本客户端支持播放的展现码流的 描述信息对应的 DTQ标识的过程如下:  Preferably, in the foregoing method, the multimedia client determines, according to the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the DTQ corresponding to the description information of the presentation code stream supported by the client. The process of identification is as follows:
所述多媒体客户端根据所接收到的 SVC码流中的附加增加信息 SEI生成 该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标识的对应关系, 从中确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识。  The multimedia client generates, according to the additional information SEI in the received SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines that the client supports the playback. The DTQ identifier corresponding to the description information of the code stream is displayed.
较佳地,所述多媒体客户端接收所述多媒体转发服务器发送的该 SVC码 流中各展现码流的描述信息以及 DTQ标识的对应关系,从中确定本客户端支 持播放的展现码流的描述信息对应的 DTQ标识。  Preferably, the multimedia client receives the description information of each presentation stream in the SVC stream sent by the multimedia forwarding server and the correspondence between the DTQ identifiers, and determines the description information of the presentation stream supported by the client. Corresponding DTQ identifier.
本发明还有一目的, 提出可提供多媒体编码流中各展现码流的信息及其 对应的标识的传输设备。  Still another object of the present invention is to provide a transmission apparatus that can provide information of each presentation stream in a multimedia encoded stream and its corresponding identification.
为此, 本发明公开了一种多媒体数据的传输设备, 包括:  To this end, the present invention discloses a multimedia data transmission device, including:
第一模块, 其设置为: 根据所要发送的 SVC码流中的 SEI生成该 SVC 码流中所包含的各展现码流的描述信息以及 DTQ标识的对应关系;  a first module, configured to: generate, according to an SEI in the SVC code stream to be sent, description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers;
第二模块, 其设置为: 在向多媒体客户端发送所述 SVC码流的同时或者 发送所述 SVC码流前, 发送所生成的各展现码流的描述信息以及 DTQ标识 的对应关系。  And a second module, configured to: after sending the SVC code stream to the multimedia client or before sending the SVC code stream, send the generated description information of each presentation code stream and a correspondence relationship of the DTQ identifier.
较佳地, 上述传输设备还包括: 第三模块, 设置为: 将所述 SVC码流的 原始码流按照空间分级分为一个或多个空间分级码流段, 按照时域分级和质 量分级中的一种分级方式分别将各空间分级码流段进行二次分级, 按照另一 种分级方式分别将各二次分级的码流段进行三次分级, 以分为不同的展现码 流, 发送给多媒体客户端, 其中, 各展现码流分别对应一个 DTQ标识。 本发明还有一目的, 是提供一种传输设备, 可实现多媒体码流的有序分 层的。 Preferably, the foregoing transmission device further includes: a third module, configured to: divide the original code stream of the SVC code stream into one or more spatial grading code stream segments according to spatial grading, according to time domain grading and quality grading a grading method to separately classify each spatial grading code stream segment, according to another The grading mode divides the code streams of each of the two gradings into three times, and divides them into different presentation streams, and sends them to the multimedia client, where each of the presentation streams respectively corresponds to one DTQ identifier. Still another object of the present invention is to provide a transmission device that can implement ordered layering of a multimedia code stream.
为此, 本发明公开了一种多媒体数据的传输设备, 包括:  To this end, the present invention discloses a multimedia data transmission device, including:
第一模块, 其设置为: 将 SVC码流的原始码流按照空间分级分为一个或 多个空间分级码流段;  a first module, configured to: divide the original code stream of the SVC code stream into one or more spatial grading code stream segments according to a spatial grading;
第二模块, 其设置为: 按照时域分级和质量分级中的一种分级方式分别 将各空间分级码流段进行二次分级, 按照另一种分级方式分别将各二次分级 的码流段进行三次分级, 以分为不同的展现码流, 发送给多媒体客户端, 其 中, 各展现码流分别对应一个 DTQ标识。  The second module is configured to: separately classify each spatial grading code stream segment according to a grading manner in time domain grading and quality grading, and separately code each code segment of the second grading according to another grading manner Perform three times of grading to divide into different presentation streams and send them to the multimedia client, where each presentation stream corresponds to a DTQ identifier.
较佳地, 上述传输设备还包括: 第三模块, 设置为: 根据所述 SVC码流 中的附加增加信息 SEI生成该 SVC码流中所包含的各展现码流的描述信息以 及 DTQ标识的对应关系, 在向多媒体客户端发送所述 SVC码流的同时或者 发送所述 SVC码流前, 发送所生成的各展现码流的描述信息以及 DTQ标识 的对应关系。  Preferably, the foregoing transmission device further includes: a third module, configured to: generate, according to the additional addition information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and correspondence of the DTQ identifier And the relationship between the description information of each generated presentation stream and the DTQ identifier is sent before the SVC code stream is sent to the multimedia client or before the SVC code stream is sent.
本发明还有一目的是, 提供一种多媒体数据的接收设备, 可简化数据处 理操作。  Still another object of the present invention is to provide a receiving device for multimedia data which simplifies data processing operations.
为此, 本发明公开了一种多媒体数据的接收设备, 包括:  To this end, the present invention discloses a receiving device for multimedia data, including:
第一模块, 其设置为: 接收到 SVC码流, 从已收到的该 SVC码流中各 展现码流的描述信息以及 DTQ标识的对应关系中 ,确定本客户端支持播放的 展现码流的描述信息对应的 DTQ标识;  The first module is configured to: receive the SVC code stream, and determine, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the presentation code stream supported by the client to play Describe the DTQ identifier corresponding to the information;
第二模块, 其设置为: 遍历所述 SVC码流以获取所确定的 DTQ标识对 应的展现码流进行解码显示。  And a second module, configured to: traverse the SVC code stream to obtain a decoded display stream corresponding to the determined DTQ identifier.
本发明另一目的是, 提供一种提取多媒体码流效率高的多媒体数据的接 收设备。  Another object of the present invention is to provide a receiving device for extracting multimedia data with high efficiency of a multimedia code stream.
为此, 本发明公开了一种多媒体数据的接收设备, 包括: 第一模块, 其设置为: 接收 SVC码流, 从该 SVC码流中各展现码流的 描述信息以及 DTQ标识的对应关系中 ,确定本客户端支持播放的展现码流的 描述信息对应的 DTQ标识; To this end, the present invention discloses a receiving device for multimedia data, including: a first module, configured to: receive an SVC code stream, and determine, according to the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, a DTQ corresponding to the description information of the presentation code stream supported by the client. Identification
第二模块, 其设置为: 从所述 SVC码流中挑选所确定的 DTQ标识对应 的展现码流进行解码显示。  And a second module, configured to: select a presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding display.
较佳地, 上述设备中, 所述第一模块还设置为: 根据所接收到的 SVC码 流中的 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标 识的对应关系, 从中确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识。  Preferably, in the foregoing device, the first module is further configured to: generate, according to the SEI in the received SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence between DTQ identifiers And determining, from the DTQ identifier corresponding to the description information of the presentation stream supported by the client.
较佳地, 上述设备中, 所述第一模块还设置为: 接收所述多媒体转发服 务器发送的该 SVC码流中各展现码流的描述信息以及 DTQ标识的对应关系, 从中确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识。  Preferably, in the foregoing device, the first module is further configured to: receive the description information of each presentation code stream in the SVC code stream sent by the multimedia forwarding server, and the correspondence between the DTQ identifiers, and determine the client support The DTQ identifier corresponding to the description information of the displayed presentation stream.
本发明的一个实施例对现有 SVC码流重新排序, 使其有规律的存放, 即 提供了一种有序分层的多媒体码流。 从而使服务器或者客户端在对该码流进 行抽取、 切换等操作时可以很方便地找到所需的码流, 提高了服务器或客户 端的响应速度。本发明还有一个实施例提供了 SVC码流中各展现码流的信息 及其对应的 DTQ标识, 从而简化接收端侧对多媒体数据的处理。 附图概述  One embodiment of the present invention reorders existing SVC code streams to provide regular storage, i.e., provides an ordered hierarchical multimedia code stream. Therefore, the server or the client can conveniently find the required code stream when the code stream is extracted, switched, etc., and the response speed of the server or the client is improved. Still another embodiment of the present invention provides information of each presentation stream in the SVC code stream and its corresponding DTQ identifier, thereby simplifying processing of the multimedia data by the receiving end side. BRIEF abstract
图 1为现有技术 MPEG2-TS承载 SVC的结构示意图;  1 is a schematic structural diagram of a prior art MPEG2-TS bearer SVC;
图 2为本发明实施例 3中传输多媒体数据的流程图;  2 is a flowchart of transmitting multimedia data in Embodiment 3 of the present invention;
图 3为本发明实施例 3中 MPEG2-TS承载 SVC的结构示意图; 图 4为本发明实施例 4中 SVC码流组播组网示意图;  3 is a schematic structural diagram of an MPEG2-TS bearer SVC according to Embodiment 3 of the present invention; FIG. 4 is a schematic diagram of a SVC code stream multicast networking in Embodiment 4 of the present invention;
图 5为本发明实施例 4中 MPEG2-TS承载 SVC码流子流划分示意图; 图 6为本发明实施例 4中客户端提取子流的流程图。 本发明的较佳实施方式 下面结合附图及具体实施例对本发明技术方案做进一步详细说明。 需要 说明的是, 在不冲突的情况下, 本申请中的实施例及实施例中的特征可以相 互任意组合。 5 is a schematic diagram of MPEG2-TS bearer SVC code stream substream division according to Embodiment 4 of the present invention; FIG. 6 is a flowchart of a client extracting a substream according to Embodiment 4 of the present invention. Preferred embodiment of the invention The technical solutions of the present invention are further described in detail below with reference to the accompanying drawings and specific embodiments. It should be noted that, in the case of no conflict, the features in the embodiments and the embodiments in the present application may be arbitrarily combined with each other.
实施例 1  Example 1
本发明申请人考虑到, 目前流媒体服务器发送媒体流时, 媒体流携带的 The applicant of the present invention considers that when the streaming media server sends the media stream, the media stream carries
SVC负载都是根据空域、 质量、 时域或者空间、 时域、 质量层层划分的。 即 最外一层分级是空域分级。 而质量和时域分级则没有先后顺序。 媒体转发服 务器或者客户端解析 SVC负载时, 根据这些分级对码流进行选择, 从而实现 个性化观看。 如果媒体转发服务器下送的 SVC码流中 SVC负载直接按照空 域、 质量、 时域或者空间、 时域、 质量层层分级排序, 则接收到该 SVC的客 户端时, 无需遍历整个负载, 直接可确定要提取的码流对应的 DTQ。 SVC loads are divided according to airspace, quality, time domain or space, time domain, and quality layer. That is, the outermost level is the airspace rating. Quality and time domain grading are not in order. When the media forwarding server or the client parses the SVC payload, the code stream is selected according to these ratings, thereby achieving personalized viewing. If the SVC load in the SVC stream sent by the media forwarding server is directly sorted according to the airspace, quality, time domain or space, time domain, and quality layer, the client of the SVC does not need to traverse the entire load. Determine the DTQ corresponding to the code stream to be extracted.
因此, 基于上述思想, 本实施例提供一种多媒体数据的传输过程, 在该 过程中,媒体转发服务器将现有的 SVC码流的原始码流按照空间分级分为一 个或多个空间分级码流段, 按照时域分级和质量分级中的一种分级方式分别 将各空间分级码流段进行二次分级, 按照另一种分级方式分别将各二次分级 的码流段进行三次分级, 以分为不同的展现码流, 各展现码流分别对应一个 DTQ标识; 媒体转发服务器根据需要发送经过三次分级的 SVC码流。  Therefore, based on the above idea, the embodiment provides a multimedia data transmission process, in which the media forwarding server divides the original code stream of the existing SVC code stream into one or more spatial grading code streams according to spatial grading. Segments, according to a hierarchical manner in time domain grading and quality grading, respectively categorize each spatial grading code stream segment, and classify each second grading code stream segment three times according to another grading manner. For different presentation streams, each presentation stream corresponds to one DTQ identifier; the media forwarding server sends the three-stage SVC stream as needed.
其中, 媒体转发服务器根据需要发送经过三次分级的 SVC码流指: 媒体转发服务器釆用组播方式发送时,将经过三次分级的 SVC码流当成 负载全部发送出去;媒体转发服务器釆用单播方式发送时, 则从经过三次分 级的 SVC码流中挑选该单播发送所需要的展现码流的 D (空域分级)值、 T (时域分级)值、 Q (质量分级)值。 根据 D值找到对应的空域分级的码流 及其依赖的码流。 在该空域分级的码流中和其依赖的码流中, 根据 T值找到 对应的时域分级的码流及其依赖的码流。 在找到的时域分级的码流和其依赖 的码流中, 根据 Q值找到对应的质量分级的码流及其依赖的码流。 最后, 媒 体转发服务器将选择的展现码流发送出去。  The media forwarding server sends the SVC code stream after three times of classification according to the requirement: When the media forwarding server sends the multicast mode, the three-level SVC code stream is sent as a load; the media forwarding server uses the unicast mode. When transmitting, the D (space class grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream required for the unicast transmission are selected from the three-stage SVC code stream. The corresponding spatial domain grading code stream and its dependent code stream are found according to the D value. In the air stream of the air domain hierarchy and its dependent code stream, the corresponding time domain hierarchical code stream and its dependent code stream are found according to the T value. In the found time domain grading code stream and its dependent code stream, the corresponding quality grading code stream and its dependent code stream are found according to the Q value. Finally, the media forwarding server sends the selected presentation stream.
此时, 多媒体客户端接收到上述 SVC码流时, 先确定该 SVC码流中本 客户端支持播放的展现码流的描述信息对应的 DTQ标识, 并从 SVC码流中 挑选所确定的 DTQ标识对应的展现码流进行解码显示。 其中, 多媒体客户端可根据该 SVC码流中的 SEI生成该 SVC码流中所 包含的各展现码流的描述信息以及 DTQ标识的对应关系,从而确定本客户端 支持播放的展现码流的描述信息对应的 DTQ标识。 At this time, when receiving the SVC code stream, the multimedia client first determines the DTQ identifier corresponding to the description information of the presentation code stream supported by the client in the SVC code stream, and selects the determined DTQ identifier from the SVC code stream. The corresponding presentation code stream is decoded and displayed. The multimedia client may generate, according to the SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers, thereby determining a description of the presentation code stream supported by the client. The DTQ identifier corresponding to the information.
实施例 2  Example 2
本发明申请人又提出一种多媒体数据的传输方法, 包括:  The applicant of the present invention further proposes a method for transmitting multimedia data, including:
媒体转发服务器在发送 SVC码流之前, 根据该 SVC码流中的 SEI生成 该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标识的对应关系; 媒体转发服务器在需要发送上述 SVC码流时或者在发送该 SVC码流前, 发送所生成的各展现码流的描述信息以及 DTQ标识的对应关系。  Before the SVC code stream is sent, the media forwarding server generates, according to the SEI in the SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers; the media forwarding server needs to send the SVC code. At the time of streaming or before transmitting the SVC code stream, the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifier are transmitted.
这样,当多媒体客户端可以接收上述各展现码流的描述信息以及 DTQ标 识的对应关系, 以确定本客户端支持播放的展现码流的描述信息对应的 DTQ 标识, 并从所接收到的 SVC码流中找到所确定的 DTQ标识对应的展现码流 进行解码显示即可。  In this way, when the multimedia client can receive the description information of each of the foregoing presentation streams and the correspondence between the DTQ identifiers, the DTQ identifier corresponding to the description information of the presentation stream supported by the client is determined, and the received SVC code is received. The stream of the presentation code corresponding to the determined DTQ identifier is found in the stream for decoding and display.
实施例 3  Example 3
本实施例提出一种优选方案, 其结合上述实施例 1和 2的技术手段, 以 进行多媒体数据的传输, 具体传输过程如图 2所示, 包括如下过程:  This embodiment provides a preferred solution, which combines the technical means of the foregoing Embodiments 1 and 2 to perform multimedia data transmission. The specific transmission process is as shown in FIG. 2, and includes the following process:
步骤 200、 多媒体转发服务器生成展示的码流描述以及和 DTQ的对应关 系, 即 SVC码流总体描述信息。  Step 200: The multimedia forwarding server generates a code stream description that is displayed and a correspondence relationship with the DTQ, that is, an overall description information of the SVC code stream.
具体地, 该步骤的操作过程如下:  Specifically, the operation process of this step is as follows:
多媒体转发服务器遍历 SVC码流找到 SEI, 解析 SEI, 获得 SVC码流可 以提供的展现码流总数;解析 SEI,生成每个展现码流的描述信息;解析 SEI, 获得每个展现码流对应的 DTQ, 即多媒体转发服务器生成 SVC码流总体描 述信息。  The multimedia forwarding server traverses the SVC code stream to find the SEI, parses the SEI, and obtains the total number of presentation streams that the SVC code stream can provide; parses the SEI to generate description information for each presentation stream; parses the SEI, and obtains the DTQ corresponding to each presentation stream. That is, the multimedia forwarding server generates an overall description information of the SVC code stream.
本实施例以 MPEG2-TS的多媒体数据为例, 在其中的 PMT中增加 SVC 码流总体描述信息 svc— total— descriptor, 其可定义为一个节目级别的信息, 具 体可定义如下:  In this embodiment, the multimedia data of the MPEG2-TS is taken as an example, and the SVC code stream overall description information svc_total_descriptor is added to the PMT, which can be defined as a program level information, which can be defined as follows:
表 1为 svc— total— descriptor定义表  Table 1 is the svc_total_descriptor definition table
语法 ( Syntax ) bits 助记符 ( Mnemonic ) svc— total— descriptorO Syntax ( Bytes) Mnemonic Svc— total— descriptorO
{ uimsbf  { uimsbf
descriptor— tag 8 uimsbf  Descriptor — tag 8 uimsbf
descriptor— length 8 uimsbf  Descriptor — length 8 uimsbf
PresentationNum 32 uimsbf  PresentationNum 32 uimsbf
for(i=0; i< PresentationNum; i++)  For(i=0; i< PresentationNum; i++)
/  /
\  \
DID, 32 uimsbf  DID, 32 uimsbf
TID, 128 bslbf TID, 128 bslbf
QID, 32 uimsbf QID, 32 uimsbf
PresentationDescription PresentationDescription
}  }
32 uimsbf  32 uimsbf
} 上述表格中, 各字段的含义如下:  } In the above table, the meaning of each field is as follows:
PresentationNum: 表明该 SVC码流中包含 view (展现码流) 的个数; DID: 空域分级的标识;  PresentationNum: indicates the number of views (presentation streams) in the SVC stream; DID: the identifier of the airspace hierarchy;
TID: 时域分级的标识;  TID: the identifier of the time domain hierarchy;
QID: 质量分级的标识;  QID: the identification of the quality rating;
PresentationDescription: 该展现码流的描述, 其可提供给用户阅读, 并根 据描述选择观看的码流。  PresentationDescription: A description of the presentation stream that is provided to the user for reading and selects the stream to watch based on the description.
步骤 201、 多媒体转发服务器生成层层分级的 SVC码流用于存储; 具体地, 该步骤的操作过程如下:  Step 201: The multimedia forwarding server generates a layer-level hierarchical SVC code stream for storage. Specifically, the operation process of the step is as follows:
多媒体转发服务器遍历目前技术下生成的 SVC码流, 访问 SVC码流 NALU单元的扩展头, 在 SVC码流的原始码流的基础上, 根据空间分级对应 的 DID将原始码流分成一个或多个 DID码流段。在码流段的前面加上该码流 段的长度和 DID。 分别在不同 DID的码流上进行第二次分级, 遍历空域分级 的码流段, 根据时域分级对应 TID将空间分级的码流分成 1个或多个 TID码 流段。 在该码流段的前面加上该码流段的长度和 TID。 分别在不同的 TID的 码流段上进行第三次分级,遍历时域分级的码流段,根据质量分级对应的 QID 将时域分级的码流分成 1个或多个 QID码流段。 在该码流段上加上该码流段 的长度和 QID。 The multimedia forwarding server traverses the SVC code stream generated by the current technology, accesses the extended header of the SVC code stream NALU unit, and divides the original code stream into one or more according to the DID of the spatial hierarchy based on the original code stream of the SVC code stream. DID stream segment. The length and DID of the code stream segment are added before the code stream segment. Performing a second grading on the code streams of different DIDs respectively, traversing the code segment of the spatial domain grading, and dividing the spatially classified code stream into one or more TID codes according to the time domain grading corresponding TID Flow segment. The length and TID of the code stream segment are added before the code stream segment. Performing a third grading on the code stream segments of different TIDs respectively, traversing the code stream segments of the time domain grading, and dividing the code stream of the time domain grading into one or more QID code stream segments according to the QID corresponding to the quality grading. The length and QID of the code stream segment are added to the code stream segment.
当然在其他场景中, 多媒体转发服务器根据空间分级对应的 DID将原始 码流分成一个或多个 DID码流段后 ,也可以先按照质量分级对应的 QID进行 二次分级, 将空间分级的码流分成 1个或多个 QID码流段; 之后, 遍历质量 分级的码流段, 根据时域分级对应的 TID进行第三次分级, 将质量分级的码 流分成 1个或多个 TID码流段。  Of course, in other scenarios, after the multimedia forwarding server divides the original code stream into one or more DID code stream segments according to the DID corresponding to the spatial hierarchy, the multimedia forwarding server may first perform secondary classification according to the QID corresponding to the quality classification, and the spatially graded code stream. Dividing into one or more QID code stream segments; thereafter, traversing the quality-graded code stream segments, performing a third grading according to the TID corresponding to the time domain grading, and dividing the quality grading code stream into one or more TID code stream segments .
步骤 202、 多媒体转发服务器釆用组播发送方式时(此时多媒体转发服 务器也可以称为组播服务器)根据分好级的大而全的 SVC码流进行发送操作; 多媒体转发服务器釆用单播发送方式时(此时多媒体转发服务器也可以称为 单播服务器) , 从分好级的码流中挑选相应的码流段后发送。 其中, 多媒体 转发服务器在发送 SVC码流的同时或者之前, 还将发送步骤 200所生成的 SVC码流总体描述信息。  Step 202: When the multimedia forwarding server uses the multicast sending mode (the multimedia forwarding server may also be referred to as a multicast server), the sending operation is performed according to the large and complete SVC code stream of the good level; the multimedia forwarding server uses the unicast In the case of the transmission mode (the multimedia forwarding server may also be referred to as a unicast server), the corresponding code stream segment is selected from the well-classed code streams and then transmitted. The multimedia forwarding server sends the SVC code stream overall description information generated in step 200 at the same time as or before the SVC code stream is sent.
该步骤中, 根据分好级的大而全的码流进行发送操作指, 将步骤 201生 成的 SVC码流当成负载全部发送出去, 包含新增的长度等字段。 例如, 釆用 该步骤实现 MPEG2-TS承载 SVC时, SVC的结构如图 3所示, 其与目前的 SVC的标准保持一致。 在子流的基础上分为 1或多个质量子流, 在质量子流 的基础上, 分为 1个或多个时间子流。  In this step, the SVC code stream generated in step 201 is sent as a load, including the newly added length and other fields, according to the large and complete code stream of the good level. For example, when this step is implemented to implement MPEG2-TS bearer SVC, the structure of the SVC is as shown in FIG. 3, which is consistent with the current SVC standard. On the basis of the substream, it is divided into one or more mass substreams, and based on the mass substream, it is divided into one or more time substreams.
从分好级的码流中挑选相应的码流段 (即分级后 DTQ标识的展现码流 ) 后发送进一步包括:  After selecting the corresponding code stream segment from the code stream that is classified into a good level (that is, the presentation code stream of the DTQ identifier after the classification), the sending further includes:
媒体转发服务器会根据需要从由步骤 200生成的 SVC码流总体描述信息 中确定出需要挑选的展现码流的 D (空域分级)值、 T (时域分级)值、 Q (质 量分级)值。 根据 D值找到对应的空域分级的码流及其依赖的码流。 在该空 域分级的码流中和其依赖的码流中, 根据 T值找到对应的时域分级的码流及 其依赖的码流。 在找到的时域分级的码流和其依赖的码流中, 根据 Q值找到 对应的质量分级的码流及其依赖的码流。 最后, 媒体转发服务器将选择的展 现码流发送出去。 步骤 203、 多媒体客户端接收上述 SVC码流, 进行解码显示; The media forwarding server determines the D (space domain grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream to be selected from the SVC code stream overall description information generated by the step 200 as needed. The corresponding spatial domain-classified code stream and its dependent code stream are found according to the D value. In the code stream of the spatial domain classification and its dependent code stream, the corresponding time domain hierarchical code stream and its dependent code stream are found according to the T value. In the found time domain grading code stream and its dependent code stream, the corresponding quality grading code stream and its dependent code stream are found according to the Q value. Finally, the media forwarding server sends the selected presentation stream. Step 203: The multimedia client receives the foregoing SVC code stream, and performs decoding and displaying.
其中, 当多媒体客户接收到组播方式发送的 SVC码流时, 从中进行挑选 相应的展现码流进行解码显示, 过程如下:  When the multimedia client receives the SVC code stream sent by the multicast mode, it selects and displays the corresponding presentation code stream for decoding and display, and the process is as follows:
多媒体客户端根据收到的 SVC码流总体描述信息确定出需要挑选的展 现码流的 D (空域分级)值、 T (时域分级)值、 Q (质量分级)值。 根据 Q 值找到对应的空域分级的码流及其依赖的码流。 在该空域分级的码流中和其 依赖的码流中, 根据 T值找到对应的时域分级的码流及其依赖的码流。 在找 到的时域分级的码流和其依赖的码流中, 根据 Q值找到对应的质量分级的码 流及其依赖的码流。 将选择的码流重排序后交给解码器解码。  The multimedia client determines the D (space domain grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream to be selected according to the received SVC code stream overall description information. The corresponding airspace-classified code stream and its dependent code stream are found according to the Q value. In the code stream of the spatial domain grading and the code stream it depends on, the corresponding time domain grading code stream and its dependent code stream are found according to the T value. In the found time domain grading code stream and its dependent code stream, the corresponding quality grading code stream and its dependent code stream are found according to the Q value. The selected code streams are reordered and passed to the decoder for decoding.
上述流程中, 步骤 200生成的 SVC码流总体描述信息以及步骤 201生成 的各级分层的 SVC码流。只需要做一次即可。对于存储好的各级分层的 SVC 码流, 可不需要再重新生成的。  In the above process, the SVC code stream overall description information generated by step 200 and the layered SVC code stream generated by step 201 are generated. Just do it once. For the stored hierarchical SVC streams, it does not need to be regenerated.
实施例 4  Example 4
本实施例以图 4所示的网络中,手机接入 SVC码流组播为例说明多媒体 数据的接收方法。 如图 4所示, 多媒体转发服务器组播 SVC码流时, 终端接 入设备(也可称为多媒体客户端)可分为三类, 一类是屏幕较小的手机, 一 类是屏幕适中的 PDA设备, 一类是屏幕较大的笔记本电脑或电视设备。 这三 类设备接收服务器发的组播码流。 每种设备从组播码流中提取各自所支持播 放的码流。例如,手机设备从码流中提取图像大小为 QQVGA (大小为 QVGA 的 1/4 ) , 帧率为 15帧 /s, 高清画质的码流。 而 PDA设备从码流中提取图像 大小为 QVGA, 帧率为 15帧 /s, 高清画质的码流。 而电脑设备从码流中提取 图像大小为 VGA, 帧率为 30帧 /s, 高清画质的码流。 具体地, 可以图 5所示 为例, 多媒体转发服务器发送承载 SVC的 MPEG2-TS码流中 PID为 201的 码流承载 QQVGA的子流。 PID为 202的码流承载 QVGA的子流, PID为 203 的码流承载 VGA的子流。在各个空域分级的码流中,又分为质量分级和时域 分级。 此时, 手机接入 SVC组播的过程如图 6所示, 包括如下步骤:  In this embodiment, in the network shown in FIG. 4, the mobile phone accesses the SVC code stream multicast as an example to describe the method for receiving the multimedia data. As shown in FIG. 4, when the multimedia forwarding server multicasts the SVC code stream, the terminal access devices (also referred to as multimedia clients) can be classified into three types, one is a mobile phone with a small screen, and the other is a moderate screen. PDA devices, one type of laptop or television device with a large screen. These three types of devices receive the multicast stream sent by the server. Each device extracts the code streams supported by each of the multicast streams. For example, the mobile phone device extracts the image size from the code stream to QQVGA (1/4 of the size of QVGA), the frame rate is 15 frames/s, and the code stream of high definition image quality. The PDA device extracts an image size of QVGA from the code stream, a frame rate of 15 frames/s, and a high-definition picture quality stream. The computer device extracts the image size from the code stream to VGA, the frame rate is 30 frames/s, and the high-definition picture quality stream. Specifically, as shown in FIG. 5, the multimedia forwarding server sends a stream of PID 201 in the MPEG2-TS code stream carrying the SVC to carry the substream of the QQVGA. A code stream with a PID of 202 carries a substream of QVGA, and a code stream with a PID of 203 carries a substream of VGA. In the code stream of each airspace classification, it is further divided into quality classification and time domain classification. At this point, the process of accessing the SVC multicast by the mobile phone is as shown in FIG. 6, and includes the following steps:
步骤 601 , 客户端 (即为手机)接收 TS数据。  Step 601: The client (that is, the mobile phone) receives the TS data.
步骤 602, 判断接收 TS数据是否完毕, 如果接收完毕, 则跳到步骤 610, 否则跳到步骤 603。 Step 602: Determine whether the received TS data is completed. If the receiving is complete, go to step 610. Otherwise skip to step 603.
步骤 603 ,判断接收到的数据是否为 PSI数据,如果是,则跳到步骤 604, 否则跳到步骤 605。  Step 603: Determine whether the received data is PSI data, and if yes, skip to step 604, otherwise, go to step 605.
步骤 604, 解析 PSI数据, 获取 QQVGA码流对应的 PID。 获取 15帧 / 秒对应的 t值, 获取高清对应的 q值。  Step 604: Parse the PSI data, and obtain a PID corresponding to the QQVGA code stream. Obtain the t value corresponding to 15 frames/second to obtain the q value corresponding to HD.
步骤 605, 判断是否接收过 PSI数据。 如果是, 则跳到步骤 606, 否则跳 到步骤 607。  Step 605, determining whether the PSI data has been received. If yes, then go to step 606, otherwise skip to step 607.
步骤 606, 判断接收到的码流的 PID是否为 QQVGA的码流的 PID, 如 果是, 则跳到步骤 608, 否则, 跳到步骤 607。  Step 606: Determine whether the PID of the received code stream is the PID of the QQVGA code stream. If yes, go to step 608. Otherwise, go to step 607.
步骤 607, 丟弃该数据。  Step 607, discarding the data.
步骤 608, 解析负载, 跳过长度字段, 判断 Q字段是否小于等于高清对 应的 q值。 如果是, 则判断 T字段是否等于 15帧 /s对应的 t值。 如果是, 则 选择该码流段。 否则, 不选择该码流段。 之后, 跳过该码流段, 釆用上述判 断方法选择码流。 直至码流结束为止, 然后执行步骤 609。  Step 608, parsing the load, skipping the length field, and determining whether the Q field is less than or equal to the q value corresponding to the high definition. If yes, it is judged whether the T field is equal to the t value corresponding to 15 frames/s. If yes, select the codestream. Otherwise, the stream segment is not selected. After that, the code stream segment is skipped, and the code stream is selected by the above judgment method. Until the end of the code stream, then step 609 is performed.
步骤 609, 将选择的码流段送入解码器。  Step 609: Send the selected code stream segment to the decoder.
步骤 610, 结束。  Step 610, ending.
实施例 5  Example 5
本实施例提供一种多媒体数据的传输设备, 该传输设备内置于媒体转发 服务器, 或者其他多媒体网元。 具体地, 该设备包括:  The embodiment provides a multimedia data transmission device, which is built in a media forwarding server or other multimedia network element. Specifically, the device includes:
第一模块, 才艮据所要发送的 SVC码流中的 SEI生成该 SVC码流中所包 含的各展现码流的描述信息以及 DTQ标识的对应关系;  The first module generates, according to the SEI in the SVC code stream to be sent, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers;
第二模块, 在向多媒体客户端发送 SVC码流的同时或者发送 SVC码流 前, 发送所生成的各展现码流的描述信息以及 DTQ标识的对应关系。  The second module sends the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifiers before sending the SVC code stream to the multimedia client or before transmitting the SVC code stream.
优选方案中, 该传输设备还可以包括第三模块, 将 SVC码流的原始码流 按照空间分级分为一个或多个空间分级码流段, 按照时域分级和质量分级中 的一种分级方式分别将各空间分级码流段进行二次分级, 按照另一种分级方 式分别将各二次分级的码流段进行三次分级, 分为不同的展现码流, 发送给 多媒体客户端, 其中, 各展现码流分别对应一个 DTQ标识。 In a preferred solution, the transmission device may further include a third module, and the original code stream of the SVC code stream is divided into one or more spatial grading code stream segments according to a spatial grading, according to a hierarchical manner in time domain grading and quality grading. The spatial grading code stream segments are separately graded separately, and the code segments of each secondary grading are respectively graded three times according to another grading manner, and are divided into different presentation code streams, and sent to The multimedia client, wherein each of the presentation streams respectively corresponds to a DTQ identifier.
实施例 6  Example 6
本实施例再提供一种多媒体数据的传输设备, 该传输设备也可内置于媒 体转发服务器, 或者其他多媒体网元等。 该设备包括:  The embodiment further provides a multimedia data transmission device, which may also be built in a media forwarding server, or other multimedia network element. The device includes:
第一模块,将可分层视频编码 SVC码流的原始码流按照空间分级分为一 个或多个空间分级码流段;  The first module divides the original code stream of the layerable video coding SVC code stream into one or more spatial grading code stream segments according to a spatial grading;
第二模块, 按照时域分级和质量分级中的一种分级方式分别将各空间分 级码流段进行二次分级, 按照另一种分级方式分别将各二次分级的码流段进 行三次分级, 分为不同的展现码流, 发送给多媒体客户端, 其中, 各展现码 流分别对应一个空间时间质量 DTQ标识。  In the second module, each spatial grading code stream segment is separately categorized according to one of the time domain grading and the quality grading, and the second grading code stream segments are respectively categorized three times according to another grading manner. Divided into different presentation streams, and sent to the multimedia client, where each presentation stream corresponds to a spatial time quality DTQ identifier.
优选地, 该传输设备还包括: 第三模块, 根据所述 SVC码流中的附加增 加信息 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标 识的对应关系, 在向多媒体客户端发送所述 SVC码流的同时或者发送所述 SVC码流前, 发送所生成的各展现码流的描述信息以及 DTQ标识的对应关 系。  Preferably, the transmission device further includes: a third module, generating, according to the additional information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence relationship between the DTQ identifiers, And before the multimedia client sends the SVC code stream or sends the SVC code stream, sending the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifier.
实施例 7  Example 7
本实施例提供一种多媒体数据的接收设备, 可以是手机、 PDA设备、 电 脑以及电视等多媒体客户端。 该设备可包括:  The embodiment provides a multimedia data receiving device, which can be a multimedia client such as a mobile phone, a PDA device, a computer, and a television. The device can include:
第一模块, 接收到 SVC码流, 从已收到的该 SVC码流中各展现码流的 描述信息以及 DTQ标识的对应关系中 ,确定本客户端支持播放的展现码流的 描述信息对应的 DTQ标识;  The first module receives the SVC code stream, and determines, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the description information corresponding to the presentation code stream supported by the client. DTQ logo;
第二模块, 遍历所收到的 SVC码流以获取所确定的 DTQ标识对应的展 现码流进行解码显示。  The second module traverses the received SVC code stream to obtain the decoded code stream corresponding to the determined DTQ identifier for decoding display.
本实施例中, 第一模块所接收到的 SVC码流为未经过分级排序的 SVC 码流, 因此, 第二模块需遍历所收到的 SVC码流才可以获取所确定的 DTQ 标识对应的展现码流。  In this embodiment, the SVC code stream received by the first module is an SVC code stream that has not been hierarchically sorted. Therefore, the second module needs to traverse the received SVC code stream to obtain the corresponding DTQ identifier corresponding to the display. Code stream.
实施例 8  Example 8
本实施例提供一种多媒体数据的接收设备, 可以是手机、 PDA设备、 电 脑以及电视等多媒体客户端。 该设备可包括: The embodiment provides a receiving device for multimedia data, which may be a mobile phone, a PDA device, or an electric Brain and TV and other multimedia clients. The device can include:
第一模块, 接收 SVC码流, 从该 SVC码流中各展现码流的描述信息以 及 DTQ标识的对应关系中,确定本客户端支持播放的展现码流的描述信息对 应的 DTQ标识;  The first module, the SVC code stream is received, and the DTQ identifier corresponding to the description information of the presentation code stream supported by the client is determined from the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers;
其中, 第一模块根据所接收到的 SVC码流中的 SEI生成该 SVC码流中 所包含的各展现码流的描述信息以及 DTQ标识的对应关系,从中确定本客户 端支持播放的展现码流的描述信息对应的 DTQ标识。 或者,接收多媒体转发 服务器发送的该 SVC码流中各展现码流的描述信息以及 DTQ标识的对应关 系, 从中确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识。  The first module generates, according to the SEI in the received SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines the presentation code stream supported by the client. The description information corresponds to the DTQ identifier. Alternatively, the description information of each presentation stream in the SVC stream sent by the multimedia forwarding server and the correspondence between the DTQ identifiers are received, and the DTQ identifier corresponding to the description information of the presentation stream supported by the client is determined.
第二模块, 从 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行 解码显示。  The second module selects the presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding and display.
本实施例中, 第一模块所接收到的 SVC码流为进行了分级排序的 SVC 码流, 因此, 第二模块从已分级排序的 SVC码流中挑选所确定的 DTQ标识 对应的展现码流即可。  In this embodiment, the SVC code stream received by the first module is the SVC code stream that is hierarchically sorted. Therefore, the second module selects the presentation code stream corresponding to the determined DTQ identifier from the hierarchically sorted SVC code streams. Just fine.
以上所述仅为本发明的优选实施例而已, 并不用于限制本发明, 对于本 领域的技术人员来说, 本发明可以有各种更改和变化。 凡在本发明的精神和 原则之内, 所作的任何修改、 等同替换、 改进等, 均应包含在本发明的保护 范围之内。  The above description is only the preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes can be made to the present invention. Any modifications, equivalent substitutions, improvements, etc. made within the spirit and scope of the present invention are intended to be included within the scope of the present invention.
工业实用性 Industrial applicability
本发明的一个实施例对现有 SVC码流重新排序, 使其有规律的存放, 即 提供了一种有序分层的多媒体码流。 从而使服务器或者客户端在对该码流进 行抽取、 切换等操作时可以很方便地找到所需的码流, 提高了服务器或客户 端的响应速度。本发明还有一个实施例提供了 SVC码流中各展现码流的信息 及其对应的 DTQ标识, 从而简化接收端侧对多媒体数据的处理。  One embodiment of the present invention reorders existing SVC code streams to provide regular storage, i.e., provides an ordered hierarchical multimedia code stream. Therefore, the server or the client can conveniently find the required code stream when the code stream is extracted, switched, etc., and the response speed of the server or the client is improved. Still another embodiment of the present invention provides information of each presentation stream in the SVC code stream and its corresponding DTQ identifier, thereby simplifying processing of the multimedia data by the receiving end side.

Claims

权 利 要 求 书 Claim
1、 一种多媒体数据的传输方法, 该方法包括: A method for transmitting multimedia data, the method comprising:
媒体转发服务器根据所要发送的可分层视频编码 SVC码流中的附加增 加信息 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及空间时间 质量 DTQ标识的对应关系;  The media forwarding server generates, according to the additional enhanced information SEI of the layered video coding SVC code stream to be sent, description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the spatial time quality DTQ identifiers;
所述媒体转发服务器在向多媒体客户端发送所述 SVC码流的同时或者 发送所述 SVC码流前, 发送所生成的各展现码流的描述信息以及 DTQ标识 的对应关系。  And sending, by the media forwarding server, the generated description information of each presentation code stream and the correspondence between the DTQ identifiers before sending the SVC code stream to the multimedia client or before transmitting the SVC code stream.
2、 如权利要求 1所述的方法, 其中, 该方法还包括: 2. The method of claim 1, wherein the method further comprises:
所述多媒体客户端接收所述 SVC码流后,从所收到的所述各展现码流的 描述信息以及 DTQ标识的对应关系中 ,确定本客户端支持播放的展现码流的 描述信息对应的 DTQ标识,遍历所述 SVC码流以获取所确定的 DTQ标识对 应的展现码流进行解码显示。  After receiving the SVC code stream, the multimedia client determines, according to the received description information of each of the presentation code streams and the DTQ identifier, the description information corresponding to the presentation code stream supported by the client. The DTQ identifier traverses the SVC code stream to obtain a display code stream corresponding to the determined DTQ identifier for decoding display.
3、 如权利要求 1所述的方法, 其中, 该方法还包括: 3. The method of claim 1, wherein the method further comprises:
所述媒体转发服务器将向多媒体客户端发送所述 SVC码流之前,将所述 Before the media forwarding server sends the SVC code stream to the multimedia client, the
SVC码流的原始码流按照空间分级分为一个或多个空间分级码流段, 按照时 域分级和质量分级中的一种分级方式分别将各空间分级码流段进行二次分 级, 按照另一种分级方式分别将各二次分级的码流段进行三次分级, 以分为 不同的展现码流, 各展现码流分别对应一个 DTQ标识。 The original code stream of the SVC code stream is divided into one or more spatial grading code stream segments according to spatial grading, and each spatial grading code stream segment is separately graded according to a hierarchical manner in time domain grading and quality grading, according to another A hierarchical manner separately divides the code segments of each secondary grading into three different gradations to be divided into different presentation code streams, and each of the presentation code streams respectively corresponds to one DTQ identifier.
4、 如权利要求 3所述的方法, 其中, 4. The method of claim 3, wherein
所述多媒体客户端接收所述各展现码流的描述信息以及 DTQ标识的对 应关系, 确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识,从 所接收到的 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行解码显 示。  Receiving, by the multimedia client, the description information of each of the presentation code streams and the correspondence between the DTQ identifiers, determining a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, and selecting from the received SVC code streams. The display code stream corresponding to the determined DTQ identifier is decoded and displayed.
5、 一种多媒体数据的传输方法, 该方法包括: 5. A method for transmitting multimedia data, the method comprising:
所述媒体转发服务器将收到的可分层视频编码 SVC码流的原始码流按 照空间分级分为一个或多个空间分级码流段, 按照时域分级和质量分级中的 一种分级方式分别将各空间分级码流段进行二次分级, 按照另一种分级方式 分别将各二次分级的码流段进行三次分级, 以分为不同的展现码流, 各展现 码流分别对应一个空间时间质量 DTQ标识; The media forwarding server presses the original code stream of the layered video coded SVC code stream received Dividing into one or more spatial grading code stream segments according to spatial grading, respectively classifying each spatial grading code stream segment according to a grading manner in time domain grading and quality grading, respectively, according to another grading manner The second-stage code stream segment is divided into three times to be divided into different presentation code streams, and each of the presentation code streams respectively corresponds to a spatial time quality DTQ identifier;
所述媒体转发服务器根据需要发送经过三次分级的 SVC码流。  The media forwarding server sends the SVC code stream that has been hierarchically graded as needed.
6、 如权利要求 5所述的方法, 其中, 6. The method of claim 5, wherein
所述多媒体客户端接收所述 SVC码流, 根据该 SVC码流中的附加增加 信息 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标识 的对应关系, 确定本客户端支持播放的展现码流的描述信息对应的 DTQ标 识, 从所述 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行解码显 示。  The multimedia client receives the SVC code stream, and generates, according to the additional information SEI in the SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines the client. The DTQ identifier corresponding to the description information of the displayed presentation stream is selected, and the presentation code stream corresponding to the determined DTQ identifier is selected from the SVC code stream for decoding display.
7、 如权利要求 5所述的方法, 其中, 该方法还包括: 7. The method of claim 5, wherein the method further comprises:
所述媒体转发服务器在发送所述 SVC码流之前, 根据所述 SVC码流中 的 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标识的 对应关系, 在发送所述 SVC码流的同时或者发送所述 SVC码流前, 发送所 生成的各展现码流的描述信息以及 DTQ标识的对应关系。  Before the SVC code stream is sent, the media forwarding server generates, according to the SEI in the SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and sends the Before the SVC code stream is sent or before the SVC code stream is sent, the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifier are transmitted.
8、 如权利要求 7所述的方法, 其中, 8. The method of claim 7, wherein
所述多媒体客户端接收所述各展现码流的描述信息以及 DTQ标识的对 应关系, 确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识,从 所接收到的 SVC码流中挑选所确定的 DTQ标识对应的展现码流进行解码显 示。  Receiving, by the multimedia client, the description information of each of the presentation code streams and the correspondence between the DTQ identifiers, determining a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, and selecting from the received SVC code streams. The display code stream corresponding to the determined DTQ identifier is decoded and displayed.
9、 一种多媒体数据的接收方法, 该方法包括: 9. A method of receiving multimedia data, the method comprising:
多媒体客户端接收到可分层视频编码 SVC码流时, 从已收到的该 SVC 码流中各展现码流的描述信息以及空间时间质量 DTQ标识的对应关系中,确 定本客户端支持播放的展现码流的描述信息对应的 DTQ标识,遍历所述 SVC 码流以获取所确定的 DTQ标识对应的展现码流进行解码显示。 When receiving the splicable video coding SVC code stream, the multimedia client determines, according to the description information of each presentation code stream in the SVC code stream and the correspondence between the spatial time quality DTQ identifiers, determining that the client supports playing. The DTQ identifier corresponding to the description information of the code stream is traversed, and the SVC code stream is traversed to obtain the display code stream corresponding to the determined DTQ identifier for decoding display.
10、 一种多媒体数据的接收方法, 该方法包括: 10. A method of receiving multimedia data, the method comprising:
多媒体客户端接收到可分层视频编码 SVC码流时, 从该 SVC码流中各 展现码流的描述信息以及空间时间质量 DTQ标识的对应关系中,确定本客户 端支持播放的展现码流的描述信息对应的 DTQ标识, 从所述 SVC码流中挑 选所确定的 DTQ标识对应的展现码流进行解码显示。  When the multimedia client receives the layered video coded SVC code stream, determining, from the description information of each code stream in the SVC code stream and the correspondence between the spatial time quality DTQ identifiers, determining the presentation code stream supported by the client. Decoding the DTQ identifier corresponding to the information, and selecting a presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding display.
11、 如权利要求 10所述的方法, 其中, 11. The method of claim 10, wherein
所述多媒体客户端从该 SVC码流中各展现码流的描述信息以及 DTQ标 识的对应关系中, 确定本客户端支持播放的展现码流的描述信息对应的 DTQ 标识的过程如下:  The process of determining, by the multimedia client, the DTQ identifier corresponding to the description information of the presentation stream supported by the client from the description information of the presentation stream and the correspondence between the DTQ identifiers in the SVC code stream is as follows:
所述多媒体客户端根据所接收到的 SVC码流中的附加增加信息 SEI生成 该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标识的对应关系, 从中确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识。  The multimedia client generates, according to the additional information SEI in the received SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines that the client supports the playback. The DTQ identifier corresponding to the description information of the code stream is displayed.
12、 如权利要求 10所述的方法, 其中, 12. The method of claim 10, wherein
所述多媒体客户端从该 SVC码流中各展现码流的描述信息以及 DTQ标 识的对应关系中, 确定本客户端支持播放的展现码流的描述信息对应的 DTQ 标识指:  The multimedia client determines, from the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the DTQ identifier corresponding to the description information of the presentation code stream supported by the client:
所述多媒体客户端接收所述多媒体转发服务器发送的该 SVC码流中各 展现码流的描述信息以及 DTQ标识的对应关系,从中确定本客户端支持播放 的展现码流的描述信息对应的 DTQ标识。  Receiving, by the multimedia client, the description information of each presentation stream in the SVC code stream and the correspondence between the DTQ identifiers, and determining the DTQ identifier corresponding to the description information of the presentation stream supported by the client. .
13、 一种多媒体数据的传输设备, 该传输设备包括: 13. A multimedia data transmission device, the transmission device comprising:
第一模块, 其设置为: 根据所要发送的可分层视频编码 SVC码流中的附 加增加信息 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及空间 时间质量 DTQ标识的对应关系;  a first module, configured to: generate, according to the additional addition information SEI in the layered video coded SVC code stream to be sent, description information of each presentation code stream included in the SVC code stream and a correspondence of a spatial time quality DTQ identifier Relationship
第二模块, 其设置为: 在向多媒体客户端发送所述 SVC码流的同时或者 发送所述 SVC码流前, 发送所生成的各展现码流的描述信息以及 DTQ标识 的对应关系。 And a second module, configured to: before sending the SVC code stream to the multimedia client or before sending the SVC code stream, send the generated description information of each presentation code stream and a correspondence relationship of the DTQ identifier.
14、 如权利要求 13所述的传输设备, 其中, 该传输设备还包括: 第三模块, 设置为: 将所述 SVC码流的原始码流按照空间分级分为一个 或多个空间分级码流段, 按照时域分级和质量分级中的一种分级方式分别将 各空间分级码流段进行二次分级, 按照另一种分级方式分别将各二次分级的 码流段进行三次分级, 分为不同的展现码流, 发送给多媒体客户端, 其中, 各展现码流分别对应一个 DTQ标识。 The transmission device according to claim 13, wherein the transmission device further comprises: a third module, configured to: divide the original code stream of the SVC code stream into one or more spatial classification code streams according to a spatial classification Segments, according to a hierarchical manner in time domain grading and quality grading, respectively, each spatial grading code stream segment is subjected to secondary grading, and according to another grading manner, each secondary grading code stream segment is hierarchically divided into three. The different presentation streams are sent to the multimedia client, where each presentation stream corresponds to a DTQ identifier.
15、 一种多媒体数据的传输设备, 该传输设备包括: 15. A multimedia data transmission device, the transmission device comprising:
第一模块, 其设置为: 将可分层视频编码 SVC码流的原始码流按照空间 分级分为一个或多个空间分级码流段;  a first module, configured to: divide the original code stream of the layerable video coding SVC code stream into one or more spatial grading code stream segments according to a spatial grading;
第二模块, 其设置为: 按照时域分级和质量分级中的一种分级方式分别 将各空间分级码流段进行二次分级, 按照另一种分级方式分别将各二次分级 的码流段进行三次分级, 分为不同的展现码流, 发送给多媒体客户端, 其中, 各展现码流分别对应一个空间时间质量 DTQ标识。  The second module is configured to: separately classify each spatial grading code stream segment according to a grading manner in time domain grading and quality grading, and separately code each code segment of the second grading according to another grading manner The three-level grading is divided into different presentation code streams and sent to the multimedia client, where each presentation code stream corresponds to a spatial time quality DTQ identifier.
16、 如权利要求 15所述的传输设备, 其中, 该传输设备还包括: 第三模块, 设置为: 根据所述 SVC码流中的附加增加信息 SEI生成该The transmission device according to claim 15, wherein the transmission device further comprises: a third module, configured to: generate the information according to the additional added information SEI in the SVC code stream
SVC码流中所包含的各展现码流的描述信息以及 DTQ标识的对应关系, 在 向多媒体客户端发送所述 SVC码流的同时或者发送所述 SVC码流前, 发送 所生成的各展现码流的描述信息以及 DTQ标识的对应关系。 Transmitting the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and transmitting the generated presentation codes before transmitting the SVC code stream to the multimedia client or before transmitting the SVC code stream. The description of the flow and the correspondence between the DTQ identifiers.
17、 一种多媒体数据的接收设备, 该设备包括: 17. A receiving device for multimedia data, the device comprising:
第一模块, 其设置为: 接收到可分层视频编码 SVC码流, 从已收到的该 a first module, configured to: receive a layerable video encoded SVC code stream, from the received
SVC码流中各展现码流的描述信息以及空间时间质量 DTQ标识的对应关系 中, 确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识; Determining, in the SVC code stream, the description information of each code stream and the correspondence between the spatial time quality DTQ identifiers, determining a DTQ identifier corresponding to the description information of the presentation code stream supported by the client;
第二模块, 其设置为: 遍历所述 SVC码流以获取所确定的 DTQ标识对 应的展现码流进行解码显示。  And a second module, configured to: traverse the SVC code stream to obtain a decoded display stream corresponding to the determined DTQ identifier.
18、 一种多媒体数据的接收设备, 该设备包括: 18. A receiving device for multimedia data, the device comprising:
第一模块, 其设置为: 接收可分层视频编码 SVC码流, 从该 SVC码流 中各展现码流的描述信息以及空间时间质量 DTQ标识的对应关系中,确定本 客户端支持播放的展现码流的描述信息对应的 DTQ标识; a first module, configured to: receive a layered video encoded SVC code stream, from the SVC code stream Determining, in the correspondence between the description information of the code stream and the spatial time quality DTQ identifier, determining a DTQ identifier corresponding to the description information of the presentation code stream supported by the client;
第二模块, 其设置为: 从所述 SVC码流中挑选所确定的 DTQ标识对应 的展现码流进行解码显示。  And a second module, configured to: select a presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding display.
19、 如权利要求 18所述的设备, 其中, 19. The apparatus according to claim 18, wherein
所述第一模块还设置为: 根据所接收到的 SVC码流中的附加增加信息 SEI生成该 SVC码流中所包含的各展现码流的描述信息以及 DTQ标识的对应 关系, 从中确定本客户端支持播放的展现码流的描述信息对应的 DTQ标识。  The first module is further configured to: generate, according to the added additional information SEI in the received SVC code stream, description information of each presentation code stream included in the SVC code stream and a correspondence relationship of the DTQ identifier, and determine the customer from the The DTQ identifier corresponding to the description information of the presentation code stream supported by the terminal is supported.
20、 如权利要求 18所述的设备, 其中, 20. The apparatus according to claim 18, wherein
所述第一模块还设置为:接收所述多媒体转发服务器发送的该 SVC码流 中各展现码流的描述信息以及 DTQ标识的对应关系,从中确定本客户端支持 播放的展现码流的描述信息对应的 DTQ标识。  The first module is further configured to: receive the description information of each presentation code stream in the SVC code stream sent by the multimedia forwarding server, and the correspondence between the DTQ identifiers, and determine the description information of the presentation code stream that the client supports to play. Corresponding DTQ identifier.
PCT/CN2012/070161 2011-01-11 2012-01-10 Method and device for transmitting and receiving multimedia data WO2012094975A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110004982.8 2011-01-11
CN2011100049828A CN102595203A (en) 2011-01-11 2011-01-11 Method and equipment for transmitting and receiving multi-media data

Publications (1)

Publication Number Publication Date
WO2012094975A1 true WO2012094975A1 (en) 2012-07-19

Family

ID=46483340

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/070161 WO2012094975A1 (en) 2011-01-11 2012-01-10 Method and device for transmitting and receiving multimedia data

Country Status (2)

Country Link
CN (1) CN102595203A (en)
WO (1) WO2012094975A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170104560A1 (en) * 2015-03-09 2017-04-13 Korea Aerospace Research Institute Apparatus and method for coding packet

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105824841A (en) * 2015-01-07 2016-08-03 阿里巴巴集团控股有限公司 Storage and output methods and devices of multimedia information
CN106303537B (en) * 2016-08-30 2019-05-10 北京容联易通信息技术有限公司 A kind of more code stream transmission methods of openh264
CN107959861B (en) * 2016-10-18 2020-08-25 华为技术有限公司 Data processing method, related equipment and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007080223A1 (en) * 2006-01-10 2007-07-19 Nokia Corporation Buffering of decoded reference pictures
CN101056403A (en) * 2007-04-28 2007-10-17 西安交通大学 Design method of P2P network transmission system architecture with the telescopic video coding
CN101189881A (en) * 2005-04-13 2008-05-28 诺基亚公司 Coding of frame number in scalable video coding
CN101621688A (en) * 2009-04-30 2010-01-06 武汉大学 Codec method for realizing AVS video standard time domain classification

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100142613A1 (en) * 2007-04-18 2010-06-10 Lihua Zhu Method for encoding video data in a scalable manner
CN101547356B (en) * 2008-03-24 2011-07-27 展讯通信(上海)有限公司 Video code stream receiving, sending and retransmission method and equipment
KR101099784B1 (en) * 2008-12-05 2011-12-28 한국전자통신연구원 Apparatus for MPEG-2 TS file format using layered encoding of H.264 SVC multimedia data and method thereof

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101189881A (en) * 2005-04-13 2008-05-28 诺基亚公司 Coding of frame number in scalable video coding
WO2007080223A1 (en) * 2006-01-10 2007-07-19 Nokia Corporation Buffering of decoded reference pictures
CN101056403A (en) * 2007-04-28 2007-10-17 西安交通大学 Design method of P2P network transmission system architecture with the telescopic video coding
CN101621688A (en) * 2009-04-30 2010-01-06 武汉大学 Codec method for realizing AVS video standard time domain classification

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170104560A1 (en) * 2015-03-09 2017-04-13 Korea Aerospace Research Institute Apparatus and method for coding packet
US10122503B2 (en) * 2015-03-09 2018-11-06 Korea Aerospace Research Institute Apparatus and method for coding packet

Also Published As

Publication number Publication date
CN102595203A (en) 2012-07-18

Similar Documents

Publication Publication Date Title
TWI473016B (en) Method and apparatus for processing a multi-view video bitstream and computer-readable medium
KR102534899B1 (en) Virtual Reality Video Signaling in Dynamic Adaptive Streaming over HTTP
CN102037731B (en) Signalling and extraction in compressed video of pictures belonging to interdependency tiers
CN103188522B (en) Method and system for providing and delivering a composite condensed stream
EP2540034B1 (en) Method and apparatus for transmitting and receiving data
JP2018186524A (en) Content transmitting device and content reproduction device
CA2967245C (en) Transmission device, transmission method, reception device, and reception method
JP5774652B2 (en) Transmitting apparatus, transmitting method, receiving apparatus, and receiving method
US10820024B2 (en) Communication apparatus, communication data generation method, and communication data processing method
AU2012270417A1 (en) Method and apparatus for transmitting/receiving media contents in multimedia system
US8930442B2 (en) Apparatus and method for playing media content data
WO2014193996A2 (en) Network video streaming with trick play based on separate trick play files
US20130204973A1 (en) Method for transmitting a scalable http stream for natural reproduction upon the occurrence of expression-switching during http streaming
EP2453652B1 (en) Transmission method, receiving method and device for scalable video coding files
WO2012094975A1 (en) Method and device for transmitting and receiving multimedia data
KR101656193B1 (en) MMT-based Broadcasting System and Method for UHD Video Streaming over Heterogeneous Networks
KR101941781B1 (en) Method and Apparatus for Receiving 8K Broadcasting based on MMT
KR102349451B1 (en) The method for transmitting or receiving multimedia and apparatus thereof
US11863767B2 (en) Transporting HEIF-formatted images over real-time transport protocol
JP7230981B2 (en) Receiving device and receiving method
JP2008187368A (en) Content sending out apparatus
KR101943214B1 (en) Method for Generating 8K Broadcasting Stream based on MMT and Broadcasting System applying the same
JP5905148B2 (en) Transmitting apparatus, transmitting method, receiving apparatus, and receiving method
EP4315875A1 (en) Transporting heif-formatted images over real-time transport protocol including overlay images
Liu et al. An HD IPTV system based on scalable video coding

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12734234

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12734234

Country of ref document: EP

Kind code of ref document: A1