WO2012094975A1 - Procédé et dispositif de transmission et de réception de données multimédias - Google Patents

Procédé et dispositif de transmission et de réception de données multimédias Download PDF

Info

Publication number
WO2012094975A1
WO2012094975A1 PCT/CN2012/070161 CN2012070161W WO2012094975A1 WO 2012094975 A1 WO2012094975 A1 WO 2012094975A1 CN 2012070161 W CN2012070161 W CN 2012070161W WO 2012094975 A1 WO2012094975 A1 WO 2012094975A1
Authority
WO
WIPO (PCT)
Prior art keywords
code stream
dtq
svc
presentation
description information
Prior art date
Application number
PCT/CN2012/070161
Other languages
English (en)
Chinese (zh)
Inventor
赵宇
王芳
孙健
刘继年
李加周
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2012094975A1 publication Critical patent/WO2012094975A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8451Structuring of content, e.g. decomposing content into time segments using Advanced Video Coding [AVC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440227Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4621Controlling the complexity of the content stream or additional data, e.g. lowering the resolution or bit-rate of the video stream for a mobile client with a small screen

Definitions

  • the invention relates to the field of multimedia, in particular to a method for transmitting and receiving multimedia data and a transmission and receiving device thereof. Background technique
  • the Real-time Transport Protocol is an example.
  • the packet structure includes two parts, a protocol header and a payload part.
  • the protocol header includes the payload media type, packet sequence number, timestamp, synchronization source identifier, and so on.
  • the payload portion is typically a simple sequential storage of multimedia data.
  • Another common streaming media transfer protocol such as the MPEG (Moving Pictures Experts Group) TS (Transport Stream) packet, is the same for its packet structure.
  • MPEG Motion Picture Experts Group
  • TS Transport Stream
  • the client cannot obtain any auxiliary information from the packet header when extracting a subset of them in the transport stream.
  • the current MPEG-2 TS bears the SVC stream. Only one substream corresponding to one airspace hierarchy is determined, as shown in Figure 1. There is no further subdivision of the quality and time domain grading in the same airspace grading, which makes it easy to find the code stream corresponding to a certain airspace grading by PID (program identification), but in the airspace grading, I want to find a certain Quality grading or time domain grading is difficult, and can only be obtained by traversing the load and comparing the NALU (Network Abstraction Layer Unit) header.
  • NALU Network Abstraction Layer Unit
  • MPEG2-TS is used as a storage file.
  • the client frequently switches the attributes of the viewing stream, for example, frequently switches the size of the viewing stream (spatial grading), frame rate (time domain grading).
  • MPEG2-TS is used as a file, it is not possible to efficiently extract the required code stream from the file.
  • the user extracts the transmitted code stream, it must parse to SEI (Supplemental Enhancement Information) to determine the DTQ (Space Time Quality) corresponding to the code stream to be extracted.
  • SEI Supplemental Enhancement Information
  • the invention provides a method for transmitting and receiving multimedia data and a transmission and receiving device thereof.
  • One of the objectives of the present invention is to provide information of each presentation stream in a multimedia encoded stream and its corresponding identification to simplify processing of multimedia data by the receiving end.
  • the present invention discloses a method for transmitting multimedia data, including:
  • the media forwarding server generates, according to the additional information (SEI) in the layerable video coding (SVC) code stream to be sent, description information of each presentation code stream included in the SVC code stream, and a spatial time quality (DTQ) identifier.
  • SEI additional information
  • SVC layerable video coding
  • DTQ spatial time quality
  • the method further includes: after the multimedia client receives the SVC code stream, determining, according to the received description information of each of the presentation code streams and the correspondence between the DTQ identifiers, that the client supports playing Deriving the DTQ identifier corresponding to the description information of the code stream, traversing the SVC code stream to obtain the display code stream corresponding to the determined DTQ identifier for decoding display.
  • the above method further includes:
  • the original code stream of the SVC code stream is divided into one or more spatial grading code stream segments according to spatial grading, according to time domain grading and quality grading.
  • each spatial grading code stream segment is subjected to secondary grading, and each secondary grading code stream segment is hierarchically classified three times according to another hierarchical manner, to be divided into different presentation code streams, and each presentation is performed.
  • the code streams correspond to a DTQ identifier.
  • the multimedia client receives the description information of each presentation code stream and the correspondence between the DTQ identifiers, and determines a DTQ label corresponding to the description information of the presentation code stream supported by the client. Obtaining, the selected code stream corresponding to the determined DTQ identifier is selected from the received SVC code stream for decoding display.
  • the present invention discloses a method for transmitting multimedia data, including:
  • the media forwarding server divides the original code stream of the received SVC code stream into one or more spatial grading code stream segments according to spatial grading, and respectively classifies each space according to a hierarchical manner in time domain grading and quality grading.
  • the flow segment is subjected to secondary grading, and the code segments of each secondary grading are respectively classified into three times according to another grading manner, and are divided into different presentation code streams, and each of the presentation code streams respectively corresponds to one DTQ identifier; the media forwarding The server sends the SVC code stream that has been hierarchically graded as needed.
  • the multimedia client receives the SVC code stream, according to the
  • the SEI in the SVC code stream generates description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers, and determines a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, from the The display code stream corresponding to the determined DTQ identifier is selected and decoded in the SVC code stream.
  • the method further includes: before the sending, by the media forwarding server, the description information of each presentation stream included in the SVC code stream according to the SEI in the SVC code stream before sending the SVC code stream. And the corresponding relationship between the DTQ identifiers, and the corresponding description information of the generated presentation code streams and the spatial temporal quality DTQ identifiers are sent before the SVC code stream is sent or before the SVC code stream is sent.
  • the multimedia client receives the description information of each of the presentation code streams and the correspondence between the DTQ identifiers, and determines a DTQ identifier corresponding to the description information of the presentation code stream supported by the client, from the received SVC.
  • the code stream corresponding to the determined DTQ identifier is selected and decoded for display.
  • Still another object of the present invention is to simplify the data processing operation when the multimedia client receives the information of each of the presentation streams in the multimedia encoded stream and its corresponding identifier.
  • the present invention discloses a method for receiving multimedia data, including:
  • the multimedia client determines, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the presentation code supported by the client.
  • the DTQ identifier corresponding to the description information of the stream is traversed to the SVC code stream to obtain the display code stream corresponding to the determined DTQ identifier for decoding display.
  • Another object of the present invention is to improve the efficiency of a multimedia client in extracting a multimedia code stream.
  • the present invention discloses a method for receiving multimedia data, including: when a multimedia client receives an SVC code stream, determining the present information from the description information of each code stream and the correspondence between the DTQ identifiers in the SVC code stream.
  • the client supports the DTQ identifier corresponding to the description information of the displayed presentation stream, and selects the presentation stream corresponding to the determined DTQ identifier from the SVC stream to perform decoding and display.
  • the multimedia client determines, according to the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the DTQ corresponding to the description information of the presentation code stream supported by the client.
  • the process of identification is as follows:
  • the multimedia client generates, according to the additional information SEI in the received SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines that the client supports the playback.
  • the DTQ identifier corresponding to the description information of the code stream is displayed.
  • the multimedia client receives the description information of each presentation stream in the SVC stream sent by the multimedia forwarding server and the correspondence between the DTQ identifiers, and determines the description information of the presentation stream supported by the client. Corresponding DTQ identifier.
  • Still another object of the present invention is to provide a transmission apparatus that can provide information of each presentation stream in a multimedia encoded stream and its corresponding identification.
  • the present invention discloses a multimedia data transmission device, including:
  • a first module configured to: generate, according to an SEI in the SVC code stream to be sent, description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers;
  • a second module configured to: after sending the SVC code stream to the multimedia client or before sending the SVC code stream, send the generated description information of each presentation code stream and a correspondence relationship of the DTQ identifier.
  • the foregoing transmission device further includes: a third module, configured to: divide the original code stream of the SVC code stream into one or more spatial grading code stream segments according to spatial grading, according to time domain grading and quality grading a grading method to separately classify each spatial grading code stream segment, according to another
  • the grading mode divides the code streams of each of the two gradings into three times, and divides them into different presentation streams, and sends them to the multimedia client, where each of the presentation streams respectively corresponds to one DTQ identifier.
  • Still another object of the present invention is to provide a transmission device that can implement ordered layering of a multimedia code stream.
  • the present invention discloses a multimedia data transmission device, including:
  • a first module configured to: divide the original code stream of the SVC code stream into one or more spatial grading code stream segments according to a spatial grading;
  • the second module is configured to: separately classify each spatial grading code stream segment according to a grading manner in time domain grading and quality grading, and separately code each code segment of the second grading according to another grading manner Perform three times of grading to divide into different presentation streams and send them to the multimedia client, where each presentation stream corresponds to a DTQ identifier.
  • the foregoing transmission device further includes: a third module, configured to: generate, according to the additional addition information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and correspondence of the DTQ identifier And the relationship between the description information of each generated presentation stream and the DTQ identifier is sent before the SVC code stream is sent to the multimedia client or before the SVC code stream is sent.
  • a third module configured to: generate, according to the additional addition information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and correspondence of the DTQ identifier And the relationship between the description information of each generated presentation stream and the DTQ identifier is sent before the SVC code stream is sent to the multimedia client or before the SVC code stream is sent.
  • Still another object of the present invention is to provide a receiving device for multimedia data which simplifies data processing operations.
  • the present invention discloses a receiving device for multimedia data, including:
  • the first module is configured to: receive the SVC code stream, and determine, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the presentation code stream supported by the client to play Describe the DTQ identifier corresponding to the information;
  • a second module configured to: traverse the SVC code stream to obtain a decoded display stream corresponding to the determined DTQ identifier.
  • Another object of the present invention is to provide a receiving device for extracting multimedia data with high efficiency of a multimedia code stream.
  • the present invention discloses a receiving device for multimedia data, including: a first module, configured to: receive an SVC code stream, and determine, according to the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, a DTQ corresponding to the description information of the presentation code stream supported by the client.
  • a second module configured to: select a presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding display.
  • the first module is further configured to: generate, according to the SEI in the received SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence between DTQ identifiers And determining, from the DTQ identifier corresponding to the description information of the presentation stream supported by the client.
  • the first module is further configured to: receive the description information of each presentation code stream in the SVC code stream sent by the multimedia forwarding server, and the correspondence between the DTQ identifiers, and determine the client support The DTQ identifier corresponding to the description information of the displayed presentation stream.
  • One embodiment of the present invention reorders existing SVC code streams to provide regular storage, i.e., provides an ordered hierarchical multimedia code stream. Therefore, the server or the client can conveniently find the required code stream when the code stream is extracted, switched, etc., and the response speed of the server or the client is improved. Still another embodiment of the present invention provides information of each presentation stream in the SVC code stream and its corresponding DTQ identifier, thereby simplifying processing of the multimedia data by the receiving end side. BRIEF abstract
  • FIG. 1 is a schematic structural diagram of a prior art MPEG2-TS bearer SVC
  • FIG. 3 is a schematic structural diagram of an MPEG2-TS bearer SVC according to Embodiment 3 of the present invention
  • FIG. 4 is a schematic diagram of a SVC code stream multicast networking in Embodiment 4 of the present invention
  • FIG. 5 is a schematic diagram of MPEG2-TS bearer SVC code stream substream division according to Embodiment 4 of the present invention
  • FIG. 6 is a flowchart of a client extracting a substream according to Embodiment 4 of the present invention.
  • the applicant of the present invention considers that when the streaming media server sends the media stream, the media stream carries
  • SVC loads are divided according to airspace, quality, time domain or space, time domain, and quality layer. That is, the outermost level is the airspace rating. Quality and time domain grading are not in order.
  • the media forwarding server or the client parses the SVC payload, the code stream is selected according to these ratings, thereby achieving personalized viewing. If the SVC load in the SVC stream sent by the media forwarding server is directly sorted according to the airspace, quality, time domain or space, time domain, and quality layer, the client of the SVC does not need to traverse the entire load. Determine the DTQ corresponding to the code stream to be extracted.
  • the embodiment provides a multimedia data transmission process, in which the media forwarding server divides the original code stream of the existing SVC code stream into one or more spatial grading code streams according to spatial grading. Segments, according to a hierarchical manner in time domain grading and quality grading, respectively categorize each spatial grading code stream segment, and classify each second grading code stream segment three times according to another grading manner. For different presentation streams, each presentation stream corresponds to one DTQ identifier; the media forwarding server sends the three-stage SVC stream as needed.
  • the media forwarding server sends the SVC code stream after three times of classification according to the requirement:
  • the media forwarding server uses the unicast mode.
  • the D (space class grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream required for the unicast transmission are selected from the three-stage SVC code stream.
  • the corresponding spatial domain grading code stream and its dependent code stream are found according to the D value.
  • the corresponding time domain hierarchical code stream and its dependent code stream are found according to the T value.
  • the corresponding quality grading code stream and its dependent code stream are found according to the Q value.
  • the media forwarding server sends the selected presentation stream.
  • the multimedia client when receiving the SVC code stream, the multimedia client first determines the DTQ identifier corresponding to the description information of the presentation code stream supported by the client in the SVC code stream, and selects the determined DTQ identifier from the SVC code stream.
  • the corresponding presentation code stream is decoded and displayed.
  • the multimedia client may generate, according to the SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream and a correspondence relationship between the DTQ identifiers, thereby determining a description of the presentation code stream supported by the client.
  • the DTQ identifier corresponding to the information.
  • the applicant of the present invention further proposes a method for transmitting multimedia data, including:
  • the media forwarding server Before the SVC code stream is sent, the media forwarding server generates, according to the SEI in the SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers; the media forwarding server needs to send the SVC code. At the time of streaming or before transmitting the SVC code stream, the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifier are transmitted.
  • the multimedia client can receive the description information of each of the foregoing presentation streams and the correspondence between the DTQ identifiers, the DTQ identifier corresponding to the description information of the presentation stream supported by the client is determined, and the received SVC code is received. The stream of the presentation code corresponding to the determined DTQ identifier is found in the stream for decoding and display.
  • This embodiment provides a preferred solution, which combines the technical means of the foregoing Embodiments 1 and 2 to perform multimedia data transmission.
  • the specific transmission process is as shown in FIG. 2, and includes the following process:
  • Step 200 The multimedia forwarding server generates a code stream description that is displayed and a correspondence relationship with the DTQ, that is, an overall description information of the SVC code stream.
  • the multimedia forwarding server traverses the SVC code stream to find the SEI, parses the SEI, and obtains the total number of presentation streams that the SVC code stream can provide; parses the SEI to generate description information for each presentation stream; parses the SEI, and obtains the DTQ corresponding to each presentation stream. That is, the multimedia forwarding server generates an overall description information of the SVC code stream.
  • the multimedia data of the MPEG2-TS is taken as an example, and the SVC code stream overall description information svc_total_descriptor is added to the PMT, which can be defined as a program level information, which can be defined as follows:
  • Table 1 is the svc_total_descriptor definition table
  • PresentationNum indicates the number of views (presentation streams) in the SVC stream;
  • DID the identifier of the airspace hierarchy;
  • TID the identifier of the time domain hierarchy
  • QID the identification of the quality rating
  • PresentationDescription A description of the presentation stream that is provided to the user for reading and selects the stream to watch based on the description.
  • Step 201 The multimedia forwarding server generates a layer-level hierarchical SVC code stream for storage. Specifically, the operation process of the step is as follows:
  • the multimedia forwarding server traverses the SVC code stream generated by the current technology, accesses the extended header of the SVC code stream NALU unit, and divides the original code stream into one or more according to the DID of the spatial hierarchy based on the original code stream of the SVC code stream.
  • DID stream segment The length and DID of the code stream segment are added before the code stream segment.
  • Performing a second grading on the code streams of different DIDs respectively traversing the code segment of the spatial domain grading, and dividing the spatially classified code stream into one or more TID codes according to the time domain grading corresponding TID Flow segment. The length and TID of the code stream segment are added before the code stream segment.
  • Performing a third grading on the code stream segments of different TIDs respectively traversing the code stream segments of the time domain grading, and dividing the code stream of the time domain grading into one or more QID code stream segments according to the QID corresponding to the quality grading.
  • the length and QID of the code stream segment are added to the code stream segment.
  • the multimedia forwarding server may first perform secondary classification according to the QID corresponding to the quality classification, and the spatially graded code stream. Dividing into one or more QID code stream segments; thereafter, traversing the quality-graded code stream segments, performing a third grading according to the TID corresponding to the time domain grading, and dividing the quality grading code stream into one or more TID code stream segments .
  • Step 202 When the multimedia forwarding server uses the multicast sending mode (the multimedia forwarding server may also be referred to as a multicast server), the sending operation is performed according to the large and complete SVC code stream of the good level; the multimedia forwarding server uses the unicast In the case of the transmission mode (the multimedia forwarding server may also be referred to as a unicast server), the corresponding code stream segment is selected from the well-classed code streams and then transmitted.
  • the multimedia forwarding server sends the SVC code stream overall description information generated in step 200 at the same time as or before the SVC code stream is sent.
  • the SVC code stream generated in step 201 is sent as a load, including the newly added length and other fields, according to the large and complete code stream of the good level.
  • the structure of the SVC is as shown in FIG. 3, which is consistent with the current SVC standard.
  • the substream On the basis of the substream, it is divided into one or more mass substreams, and based on the mass substream, it is divided into one or more time substreams.
  • the sending further includes:
  • the media forwarding server determines the D (space domain grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream to be selected from the SVC code stream overall description information generated by the step 200 as needed.
  • the corresponding spatial domain-classified code stream and its dependent code stream are found according to the D value.
  • the corresponding time domain hierarchical code stream and its dependent code stream are found according to the T value.
  • the corresponding quality grading code stream and its dependent code stream are found according to the Q value.
  • the media forwarding server sends the selected presentation stream.
  • Step 203 The multimedia client receives the foregoing SVC code stream, and performs decoding and displaying.
  • the multimedia client When the multimedia client receives the SVC code stream sent by the multicast mode, it selects and displays the corresponding presentation code stream for decoding and display, and the process is as follows:
  • the multimedia client determines the D (space domain grading) value, the T (time domain grading) value, and the Q (quality grading) value of the presentation code stream to be selected according to the received SVC code stream overall description information.
  • the corresponding airspace-classified code stream and its dependent code stream are found according to the Q value.
  • the code stream of the spatial domain grading and the code stream it depends on the corresponding time domain grading code stream and its dependent code stream are found according to the T value.
  • the corresponding quality grading code stream and its dependent code stream are found according to the Q value.
  • the selected code streams are reordered and passed to the decoder for decoding.
  • the SVC code stream overall description information generated by step 200 and the layered SVC code stream generated by step 201 are generated. Just do it once. For the stored hierarchical SVC streams, it does not need to be regenerated.
  • the mobile phone accesses the SVC code stream multicast as an example to describe the method for receiving the multimedia data.
  • the terminal access devices also referred to as multimedia clients
  • the terminal access devices can be classified into three types, one is a mobile phone with a small screen, and the other is a moderate screen.
  • PDA devices one type of laptop or television device with a large screen. These three types of devices receive the multicast stream sent by the server. Each device extracts the code streams supported by each of the multicast streams.
  • the mobile phone device extracts the image size from the code stream to QQVGA (1/4 of the size of QVGA), the frame rate is 15 frames/s, and the code stream of high definition image quality.
  • the PDA device extracts an image size of QVGA from the code stream, a frame rate of 15 frames/s, and a high-definition picture quality stream.
  • the computer device extracts the image size from the code stream to VGA, the frame rate is 30 frames/s, and the high-definition picture quality stream.
  • the multimedia forwarding server sends a stream of PID 201 in the MPEG2-TS code stream carrying the SVC to carry the substream of the QQVGA.
  • a code stream with a PID of 202 carries a substream of QVGA, and a code stream with a PID of 203 carries a substream of VGA.
  • the code stream of each airspace classification it is further divided into quality classification and time domain classification.
  • the process of accessing the SVC multicast by the mobile phone is as shown in FIG. 6, and includes the following steps:
  • Step 601 The client (that is, the mobile phone) receives the TS data.
  • Step 602 Determine whether the received TS data is completed. If the receiving is complete, go to step 610. Otherwise skip to step 603.
  • Step 603 Determine whether the received data is PSI data, and if yes, skip to step 604, otherwise, go to step 605.
  • Step 604 Parse the PSI data, and obtain a PID corresponding to the QQVGA code stream. Obtain the t value corresponding to 15 frames/second to obtain the q value corresponding to HD.
  • Step 605 determining whether the PSI data has been received. If yes, then go to step 606, otherwise skip to step 607.
  • Step 606 Determine whether the PID of the received code stream is the PID of the QQVGA code stream. If yes, go to step 608. Otherwise, go to step 607.
  • Step 607 discarding the data.
  • Step 608 parsing the load, skipping the length field, and determining whether the Q field is less than or equal to the q value corresponding to the high definition. If yes, it is judged whether the T field is equal to the t value corresponding to 15 frames/s. If yes, select the codestream. Otherwise, the stream segment is not selected. After that, the code stream segment is skipped, and the code stream is selected by the above judgment method. Until the end of the code stream, then step 609 is performed.
  • Step 609 Send the selected code stream segment to the decoder.
  • Step 610 ending.
  • the embodiment provides a multimedia data transmission device, which is built in a media forwarding server or other multimedia network element.
  • the device includes:
  • the first module generates, according to the SEI in the SVC code stream to be sent, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers;
  • the second module sends the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifiers before sending the SVC code stream to the multimedia client or before transmitting the SVC code stream.
  • the transmission device may further include a third module, and the original code stream of the SVC code stream is divided into one or more spatial grading code stream segments according to a spatial grading, according to a hierarchical manner in time domain grading and quality grading.
  • the spatial grading code stream segments are separately graded separately, and the code segments of each secondary grading are respectively graded three times according to another grading manner, and are divided into different presentation code streams, and sent to The multimedia client, wherein each of the presentation streams respectively corresponds to a DTQ identifier.
  • the embodiment further provides a multimedia data transmission device, which may also be built in a media forwarding server, or other multimedia network element.
  • the device includes:
  • the first module divides the original code stream of the layerable video coding SVC code stream into one or more spatial grading code stream segments according to a spatial grading;
  • each spatial grading code stream segment is separately categorized according to one of the time domain grading and the quality grading, and the second grading code stream segments are respectively categorized three times according to another grading manner. Divided into different presentation streams, and sent to the multimedia client, where each presentation stream corresponds to a spatial time quality DTQ identifier.
  • the transmission device further includes: a third module, generating, according to the additional information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence relationship between the DTQ identifiers, And before the multimedia client sends the SVC code stream or sends the SVC code stream, sending the generated description information of each presentation code stream and the correspondence relationship of the DTQ identifier.
  • a third module generating, according to the additional information SEI in the SVC code stream, description information of each presentation code stream included in the SVC code stream, and a correspondence relationship between the DTQ identifiers.
  • the embodiment provides a multimedia data receiving device, which can be a multimedia client such as a mobile phone, a PDA device, a computer, and a television.
  • the device can include:
  • the first module receives the SVC code stream, and determines, from the received description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers, the description information corresponding to the presentation code stream supported by the client.
  • DTQ logo
  • the second module traverses the received SVC code stream to obtain the decoded code stream corresponding to the determined DTQ identifier for decoding display.
  • the SVC code stream received by the first module is an SVC code stream that has not been hierarchically sorted. Therefore, the second module needs to traverse the received SVC code stream to obtain the corresponding DTQ identifier corresponding to the display. Code stream.
  • the embodiment provides a receiving device for multimedia data, which may be a mobile phone, a PDA device, or an electric Brain and TV and other multimedia clients.
  • the device can include:
  • the first module the SVC code stream is received, and the DTQ identifier corresponding to the description information of the presentation code stream supported by the client is determined from the description information of each code stream in the SVC code stream and the correspondence between the DTQ identifiers;
  • the first module generates, according to the SEI in the received SVC code stream, the description information of each presentation code stream included in the SVC code stream and the correspondence between the DTQ identifiers, and determines the presentation code stream supported by the client.
  • the description information corresponds to the DTQ identifier.
  • the description information of each presentation stream in the SVC stream sent by the multimedia forwarding server and the correspondence between the DTQ identifiers are received, and the DTQ identifier corresponding to the description information of the presentation stream supported by the client is determined.
  • the second module selects the presentation code stream corresponding to the determined DTQ identifier from the SVC code stream for decoding and display.
  • the SVC code stream received by the first module is the SVC code stream that is hierarchically sorted. Therefore, the second module selects the presentation code stream corresponding to the determined DTQ identifier from the hierarchically sorted SVC code streams. Just fine.
  • One embodiment of the present invention reorders existing SVC code streams to provide regular storage, i.e., provides an ordered hierarchical multimedia code stream. Therefore, the server or the client can conveniently find the required code stream when the code stream is extracted, switched, etc., and the response speed of the server or the client is improved. Still another embodiment of the present invention provides information of each presentation stream in the SVC code stream and its corresponding DTQ identifier, thereby simplifying processing of the multimedia data by the receiving end side.

Abstract

L'invention porte sur un procédé et un dispositif de transmission et de réception de données multimédias, concernant le domaine multimédia. Le procédé de transmission de données multimédias décrit dans la présente invention comprend les opérations suivantes : un serveur de transfert multimédia génère des informations descriptives concernant chaque flux de codes de rendu inclus dans un flux de codes de codage vidéo hiérarchique (SVC) à transmettre et la corrélation identifiée par la qualité spatio-temporelle (DTQ) conformément aux informations d'amélioration supplémentaires (SEI) figurant dans le flux de codes SVC ; et le serveur de transfert multimédia transmet les informations descriptives générées concernant chaque flux de codes de rendu et la corrélation identifiée par DTQ pendant ou avant la transmission du flux de codes SVC à un client multimédia. Le procédé permet au récepteur de simplifier le traitement de données multimédias après réception des données multimédias.
PCT/CN2012/070161 2011-01-11 2012-01-10 Procédé et dispositif de transmission et de réception de données multimédias WO2012094975A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201110004982.8 2011-01-11
CN2011100049828A CN102595203A (zh) 2011-01-11 2011-01-11 一种多媒体数据的传输、接收方法及其传输、接收设备

Publications (1)

Publication Number Publication Date
WO2012094975A1 true WO2012094975A1 (fr) 2012-07-19

Family

ID=46483340

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/070161 WO2012094975A1 (fr) 2011-01-11 2012-01-10 Procédé et dispositif de transmission et de réception de données multimédias

Country Status (2)

Country Link
CN (1) CN102595203A (fr)
WO (1) WO2012094975A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170104560A1 (en) * 2015-03-09 2017-04-13 Korea Aerospace Research Institute Apparatus and method for coding packet

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105824841A (zh) * 2015-01-07 2016-08-03 阿里巴巴集团控股有限公司 一种多媒体信息的存储、输出方法及装置
CN106303537B (zh) * 2016-08-30 2019-05-10 北京容联易通信息技术有限公司 一种openh264多码流传输方法
CN107959861B (zh) * 2016-10-18 2020-08-25 华为技术有限公司 一种数据处理方法、相关设备及系统

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007080223A1 (fr) * 2006-01-10 2007-07-19 Nokia Corporation Methode de mise en tampon d'images de reference decodees
CN101056403A (zh) * 2007-04-28 2007-10-17 西安交通大学 可伸缩视频编码的p2p网络传输体系结构设计方法
CN101189881A (zh) * 2005-04-13 2008-05-28 诺基亚公司 可分级视频编码中的帧号编码
CN101621688A (zh) * 2009-04-30 2010-01-06 武汉大学 一种实现avs视频标准时域分级的编解码方法

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100142613A1 (en) * 2007-04-18 2010-06-10 Lihua Zhu Method for encoding video data in a scalable manner
CN101547356B (zh) * 2008-03-24 2011-07-27 展讯通信(上海)有限公司 视频码流收发、转发方法及设备
KR101099784B1 (ko) * 2008-12-05 2011-12-28 한국전자통신연구원 H.264 svc 데이터의 계층적 부호화를 이용한 mpeg-2 ts 파일 포맷 장치 및 그 방법

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101189881A (zh) * 2005-04-13 2008-05-28 诺基亚公司 可分级视频编码中的帧号编码
WO2007080223A1 (fr) * 2006-01-10 2007-07-19 Nokia Corporation Methode de mise en tampon d'images de reference decodees
CN101056403A (zh) * 2007-04-28 2007-10-17 西安交通大学 可伸缩视频编码的p2p网络传输体系结构设计方法
CN101621688A (zh) * 2009-04-30 2010-01-06 武汉大学 一种实现avs视频标准时域分级的编解码方法

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170104560A1 (en) * 2015-03-09 2017-04-13 Korea Aerospace Research Institute Apparatus and method for coding packet
US10122503B2 (en) * 2015-03-09 2018-11-06 Korea Aerospace Research Institute Apparatus and method for coding packet

Also Published As

Publication number Publication date
CN102595203A (zh) 2012-07-18

Similar Documents

Publication Publication Date Title
TWI473016B (zh) 用以處理多視圖視訊位元串流之方法與裝置及電腦可讀媒體
KR102534899B1 (ko) Http 를 통한 동적 적응형 스트리밍에서의 가상 현실 비디오 시그널링
CN102037731B (zh) 压缩视频中属于互相关性层的图片的通知和抽取
CN103188522B (zh) 提供与传送复合浓缩串流的方法以及系统
EP2540034B1 (fr) Procédé et appareil pour transmettre et recevoir des données
JP2018186524A (ja) コンテンツ送信装置およびコンテンツ再生装置
JP5774652B2 (ja) 送信装置、送信方法、受信装置および受信方法
US20170325000A1 (en) Transmission device, transmission method, reception device, and reception method
US10820024B2 (en) Communication apparatus, communication data generation method, and communication data processing method
AU2012270417A1 (en) Method and apparatus for transmitting/receiving media contents in multimedia system
US8930442B2 (en) Apparatus and method for playing media content data
WO2014193996A2 (fr) Diffusion vidéo en continu sur un réseau, avec trick play basé sur des fichiers trick play séparés
US20130204973A1 (en) Method for transmitting a scalable http stream for natural reproduction upon the occurrence of expression-switching during http streaming
EP2453652B1 (fr) Procédé de transmission, procédé de réception et dispositif pour fichiers de codage vidéo échelonnable
WO2012094975A1 (fr) Procédé et dispositif de transmission et de réception de données multimédias
KR101656193B1 (ko) 이기종 망에서의 uhd 비디오 전송을 위한 mmt 기반 방송 시스템 및 방법
KR102349451B1 (ko) 멀티미디어의 전송 또는 수신 방법 및 그 장치
KR101941781B1 (ko) Mmt 기반 8k 방송 수신 방법 및 장치
US11863767B2 (en) Transporting HEIF-formatted images over real-time transport protocol
JP7230981B2 (ja) 受信装置および受信方法
US20240163461A1 (en) Transporting heif-formatted images over real-time transport protocol
JP2008187368A (ja) コンテンツ送出装置
KR101943214B1 (ko) Mmt 기반 8k 방송 스트림 생성 방법 및 이를 적용한 방송 서비스 시스템
JP5905148B2 (ja) 送信装置、送信方法、受信装置および受信方法
EP4315875A1 (fr) Transport d'images en format heif sur un protocole de transport en temps réel comprenant des images de superposition

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12734234

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12734234

Country of ref document: EP

Kind code of ref document: A1