WO2012060581A2 - 미디어 콘텐트 송수신 방법 및 그를 이용한 송수신 장치 - Google Patents
미디어 콘텐트 송수신 방법 및 그를 이용한 송수신 장치 Download PDFInfo
- Publication number
- WO2012060581A2 WO2012060581A2 PCT/KR2011/008094 KR2011008094W WO2012060581A2 WO 2012060581 A2 WO2012060581 A2 WO 2012060581A2 KR 2011008094 W KR2011008094 W KR 2011008094W WO 2012060581 A2 WO2012060581 A2 WO 2012060581A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- subsegment
- representation
- media content
- representations
- subsegments
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 61
- 230000005540 biological transmission Effects 0.000 claims description 24
- 238000002716 delivery method Methods 0.000 claims 1
- 238000001094 photothermal spectroscopy Methods 0.000 description 20
- 238000010586 diagram Methods 0.000 description 15
- 230000003044 adaptive effect Effects 0.000 description 8
- 239000012634 fragment Substances 0.000 description 7
- 239000012092 media component Substances 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 230000006870 function Effects 0.000 description 6
- 230000008859 change Effects 0.000 description 3
- AWSBQWZZLBPUQH-UHFFFAOYSA-N mdat Chemical compound C1=C2CC(N)CCC2=CC2=C1OCO2 AWSBQWZZLBPUQH-UHFFFAOYSA-N 0.000 description 3
- 239000000284 extract Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000001360 synchronised effect Effects 0.000 description 2
- 102100021597 Endoplasmic reticulum aminopeptidase 2 Human genes 0.000 description 1
- 241000906446 Theraps Species 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000003111 delayed effect Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 108010080821 leucine-rich amelogenin peptide Proteins 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000002688 persistence Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/23424—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for inserting or substituting an advertisement
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/02—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
- H04N21/23439—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/242—Synchronization processes, e.g. processing of PCR [Program Clock References]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44016—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving splicing one content stream with another content stream, e.g. for substituting a video clip
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8455—Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
Definitions
- the present invention relates to a method and apparatus for transmitting and receiving media content via the Internet.
- DMB digital multimedia broadcasting
- TS MPEG-2 Transport Stream
- Digital broadcasting includes stereo 3D video broadcasting, ultra high definition (UHD) broadcasting, and multiview 3D. Increasing amounts of transmission are required, such as video broadcasting and hologram broadcasting.
- MPEG-2 TS having a fixed length packet of 188 bytes may be inefficient to transmit content having a higher resolution than conventional HDTV using an IP network.
- One embodiment of the present invention is to provide a method and apparatus for transmitting and receiving that can efficiently stream media content according to network conditions.
- a method of transmitting media content comprising: organizing media content into a plurality of representations; Dividing the constructed representation into a plurality of segments; Arranging subsegments included in the segment; And transmitting any one of the representations, wherein the subsegments are the earliest presentation time of a k-th subsegment included in a first representation of the representations. It is arranged to be larger than the latest presentation time of the k-1 th subsegment included in the second representation.
- a method of receiving media content includes: receiving a first representation of a plurality of representations including the media content; Switching the received representation to a second representation; And reproducing the received representation, wherein the representation is divided into a plurality of segments and received, and the subsegments included in the segment are included in any one of the first and second representations.
- the earliest playing time of the k-th subsegment included is arranged to be greater than the latest playing time of the k-th subsegment included in another representation.
- an apparatus for transmitting media content includes: a media encoder configured to configure the media content into a plurality of representations; A segmenter for dividing the configured representation into a plurality of segments, wherein the sub-segments included in the segment include a head of a k-th sub-segment included in a first representation of the representations.
- the fast playback time is arranged such that it is greater than the latest playback time of the k-1 th subsegment included in the second representation.
- a media receiving apparatus includes: a receiving unit configured to receive any one of a plurality of representations including the media content; And a controller for switching the received representation, wherein the representation is divided into a plurality of segments and received, and the subsegments included in the segment are assigned to a first representation of the representations.
- the earliest playing time of the k-th subsegment included is arranged to be greater than the latest playing time of the k-th subsegment included in the second representation.
- the media content transmission / reception method may be implemented as a computer-readable recording medium that records a program to be executed in a computer.
- the subsegments included in the segments according to the reproduction time of the representation may be used. By aligning, it is possible to prevent loss or duplicate reception of data that may occur during bitstream switching.
- the client receiving the media content can easily switch representations in any subsegment that includes a Representation Access Point (RAP) without additional searching or parsing.
- RAP Representation Access Point
- the media content can be continuously played back seamlessly.
- FIG. 1 is a block diagram showing the configuration of a media content transmission and reception system according to an embodiment of the present invention.
- FIG. 2 is a diagram illustrating a structure of an MPEG2 TS (Transport Stream).
- FIG. 3 is a flowchart illustrating a method of transmitting media content according to an embodiment of the present invention.
- FIG. 4 is a diagram illustrating an embodiment of a configuration of representations including media content.
- FIG. 5 is a diagram illustrating an embodiment of a configuration of a subsegment.
- 6 and 7 are diagrams for describing an embodiment of a method of determining a location of a RAP included in a subsegment.
- FIGS. 8 and 9 are diagrams for describing an embodiment of a method of aligning subsegments.
- 10 and 11 illustrate embodiments of a method of switching representations.
- FIGS. 12 and 13 are diagrams illustrating a first embodiment of the configuration of a segment index box that provides metadata about media content.
- 14 and 15 are diagrams illustrating a second embodiment of the configuration of the segment index box.
- 16 is a block diagram illustrating a configuration of an apparatus for receiving media content according to an embodiment of the present invention.
- 17 is a flowchart illustrating a method of receiving media content according to an embodiment of the present invention.
- FIG. 18 is a block diagram showing a first embodiment of the configuration of the IPTV receiver according to the present invention.
- FIG. 19 is a block diagram showing a second embodiment of the configuration of the IPTV receiver according to the present invention.
- the transmitting apparatus and the receiving apparatus transmit and receive media content using an IP (Internet Protocol) based transmission system.
- IP Internet Protocol
- the media content is TCP (Transmission Control Protocol) and HTTP (Hypertext Transfer Protocol).
- TCP Transmission Control Protocol
- HTTP Hypertext Transfer Protocol
- the transmitting device may receive content from a content provider, convert the content, transmit the content, or store the content in a media DB (media DataBase).
- a content provider converts the content, transmit the content, or store the content in a media DB (media DataBase).
- media DB media DataBase
- the media content provided from the content provider may be in the MPEG-2 Transport Stream (MPEG-2 TS) format or the ISO Base Media file format, and the transmitting apparatus may transmit the media content in IP-based transmission (IP-). After converting to a format that is easy for based delivery, it may be transmitted through a network such as an internet network or stored in the media DB.
- MPEG-2 Transport Stream MPEG-2 TS
- IP- IP-based transmission
- the reception device may receive media content from a transmission device using an IP-based transmission system, convert the received media content, play the converted media content, or store the content in a local storage.
- the receiving device may convert the media content received from the transmitting device into the MPEG-2 TS format or the ISO Base Media file format, process it, play it, or store it in the local storage medium.
- the bandwidth that can be transmitted to the receiving device is dynamically changed according to network traffic, thus providing stable and fixed bandwidth for transmitting content files. It can be difficult to secure.
- the Internet server which is a transmitting device operating in an open Internet environment, may still have difficulty in guaranteeing QoS because services are transmitted through different networks.
- services such as YouTube may be provided through another network operated by a specific service provider, so that content may be delayed, dropped, or retransmitted as network configuration or traffic volume increases during the transmission process.
- a specific service provider For example, services such as YouTube may be provided through another network operated by a specific service provider, so that content may be delayed, dropped, or retransmitted as network configuration or traffic volume increases during the transmission process.
- content may be delayed, dropped, or retransmitted as network configuration or traffic volume increases during the transmission process.
- the QoS guarantee problem as described above may be a major cause of difficulty for content providers who provide content services in an open Internet environment.
- an adaptive streaming (adaptive streaming) method that can properly transmit and receive the content according to the network bandwidth by using the play list file of the M3U type may be used.
- HTTP Adaptive Streaming content files having different bit rates for one content are made, and a bit rate can be selected and transmitted according to the bandwidth of a receiving device. Since most of the content is transmitted using the HTTP protocol, it is called HTTP Adaptive Streaming.
- FIG. 1 is a block diagram illustrating a configuration of a content transmission and reception system according to an embodiment of the present invention, and illustrates an example of a structure of the system for adaptive streaming.
- a system for adaptive streaming may include a server 10, a distribution server 20, and a client 30.
- the media encoder 51 transmits the AV data to a streaming file in the form of MPEG-2 TS. Convert to Streaming File Format.
- the streaming segmenter 12 may divide the MPEG-2 TS streaming file output from the media encoder 11 into a plurality of segments.
- the streaming segmenter 12 may divide one file or a real-time MPEG-2 TS streaming file into segments of a size that can be played for about 10 seconds and then store the same in the distribution server 20.
- the client 30 may request and receive the divided files stored in the distribution server 20 using the HTTP protocol.
- MPEG-2 TS file segments having different bit rates may be stored in the distribution server 20, and a web server 21 for storing MPEG-2 TS file segments encoded at a specific bit rate for each of them may be stored.
- Multiple distribution servers may exist.
- MPEG-2 TS file segments encoded at a bit rate of 20 kbps and MPEG-2 TS file segments encoded at a bit rate of 500 kbps may be stored in the distribution server 20 under different file names.
- the client 30 may request a file encoded at a bit rate of 20 kbps and play the file for 10 seconds, and periodically check the bandwidth before the 10 second, which is the reproduction time, elapses. .
- the client 30 may request the transmission of the file encoded at a bit rate of 500kbps.
- the client 30 can alternately receive and reproduce contents encoded at bit rates of 20 kbps and 500 kbps adaptively according to the network bandwidth. If the bandwidth is not good, the quality of the video to be played back may be degraded. Breaking can be minimized.
- the media content may be composed of a plurality of representations, and the media content included in each of the representations has a bitrate and a resolution.
- the resolution, language, or codec may be different.
- the media content includes media components such as audio, video or timed text, and the representation is a structured collection of one or more media components. May be).
- the client 30 may request and receive any one of a plurality of representations configured to have different bit rates as described above.
- the server 10 includes a media presentation description (MPD) including metadata required by the client 30 together with the segment data. Can be provided.
- MPD media presentation description
- the client 30 may select and transmit any one of the plurality of representations by using an element included in the MPD.
- the representation is divided into a plurality of segments as described above, and is divided into segments or segments according to a request of the client 30 using the HTTP GET or HTTP partial GET method defined in HTTP 1.1 (RFC 2616). Can be sent.
- the client 30 may switch the received representation to another representation based on the updated information during media playback.
- the client 30 may receive and play a new representation from the time when the previous representation was played.
- the segment may include a plurality of sub-segments, and the sub-segments may mean the smallest unit that can be indexed at the segment level.
- the MPEG-2 TS may include a plurality of MPEG-2 TS packets each having a fixed bit rate of 188 bytes, and the subsegment may be configured as a set of consecutive MPEG-2 TS packets. .
- one sample is an access unit composed of one or more MPEG-2 TS packets having the same PID value, and the subsegment may be defined as one indexed set of successive access units in decoding order. have.
- a "fragmented file” structure can be used to stream the ISO Base Media file format.
- the fragmented file physically divides media tracks into a plurality of fragments, and each fragment may be composed of a moof (Movie Fragment) box and a mdat (media data) box.
- moof Menovie Fragment
- mdat media data
- the mdat box after the moof box includes fragment media data, and thus the combined moof-mdat pairs may constitute one fragment.
- the subsegment may be defined as a self-contained set of one or more contiguous movie fragments, and thus one or more moof (Movie Fragment) boxes and corresponding mdat (media data) boxes. Can include them.
- moof Menovie Fragment
- mdat media data
- the server 10 in the client 30 may be aligned to efficiently perform bitstream switching of the segments.
- the media content of the MPEG2 TS or ISO base media file format is transmitted to the Internet through HTTP.
- the present invention is not limited thereto, and the media content transmission and reception method according to an embodiment of the present invention and The device is applicable to media content in various formats.
- FIG. 3 is a flowchart illustrating a media content transmission method according to an embodiment of the present invention, and descriptions of the same as those described with reference to FIGS. 1 and 2 will be omitted below.
- the transmission apparatus configures media content into a plurality of representations (step S100), and divides the configured representation into a plurality of segments (step S110).
- the plurality of representations may be configured differently according to at least one of a bit rate, a resolution, a language, and a codec of the transmitted media content, and each representation may include a plurality of representations for transmission using an HTTP protocol. It may be divided into segments of.
- the transmitting apparatus may make the media content into Representation A and Representation A having different bit rates, and the Representations A and B are one. Groups can be formed. The group may further include other representations in addition to the representations A and B.
- index information exists for all subsegments included in the representations A and B, and the index information is randomly obtained by the client 30 using the partial HTTP GET method. May be for access.
- each of the subsegments may include a segment index, or an index segment may exist that provides an index for the subsegment.
- the earliest presentation time (T E ) and the latest reproduction time (T L ) of the sub segment 200 may be defined.
- the subsegment 200 may include two media components, video and audio, and any one of the media components (eg, video) may be a reference stream ( reference stream).
- the reference stream may mean a group of consecutive samples of a primary media component among media components.
- the fastest playback time T E of the subsegment 200 may be defined as the fastest playback time of any sample present in the reference stream, and thus the fastest sample 201 of the video samples. It can be play time.
- the latest playback time T L of the subsegment 200 may be the playback time of the latest sample 204 present in the reference stream.
- sample 204 may mean an access unit present in the stream of the subsegment 200 (or segment).
- Subsegments as shown in FIG. 4 may differ in size or duration of media content.
- the latest reproduction time T L of a subsegment is shown in the boundary between successive sub-segments in FIG. 4, the latest reproduction time T L of the sub-segment is between the sub-segments. May be slightly ahead of the boundary.
- the latest reproduction time T L of the K-th subsegment may be slightly earlier than the time corresponding to the boundary between the K-th subsegment and the K + 1th subsegment.
- a segment may include one or more Representation Access Points (RAPs), and the RAPs may be included in a subsegment.
- RAPs Representation Access Points
- the client 30 may start playback forward from the position corresponding to the RAP using only the information included in the subsegment , and the RAP may include a byte index I LRAP and a playback time T RAP .
- the reproduction time T RAP may mean the earliest reproduction time for all access units having a reproduction time greater than or equal to the T RAP to be correctly decoded.
- the client 30 searches for the RAP in the representation B at a reproduction time close to the current reproduction time, and then the K-th subsegment 210. Up to the RAP 211 included in the) may reproduce the representation A, and after the searched RAP 211 may reproduce the representation B.
- the subsegment does not include a RAP or includes one RAP, but the subsegment may include two or more RAPs, in which case the RAP illustrated in FIG. It may mean the first RAP among the fastest.
- the playback time of the first RAP included in the sub-segment (T RAP) can match the earliest reproduction time of the sub-segment (T E).
- the playback time (T RAP ) of the first RAP included in the subsegment is the fastest playback time (T E ) of the corresponding subsegment. It allows the client 30 to switch representations at any boundary between successive subsegments with consecutive indices.
- the client 30 has a boundary or K-th subsegment between the K-th subsegment 220 and the K-th subsegment 210 after bitstream switching is required.
- the presentation can be seamlessly played back by switching the representation at any desired position among the boundaries between the 210 and the K + 1 th subsegments 230.
- the transmitting device arranges the subsegments included in the segment (step S120).
- the server 10 may arrange the subsegments included in the segment such that data provided to the client 30 is not repeatedly received or lost during bitstream switching.
- the earliest reproduction time T E of the K-th subsegment 200 of representation A is the latest reproduction of the K-th sub-segment 230 of representation B.
- FIG. May be less than the time T L.
- the client 30 switches at the boundary between representation K-th subsegment 220 and K-th subsegment 210. After that, the representation B can be reproduced.
- K-th subsegment 210 of Representation A since the client 30 has already received the K-th subsegment 210 of Representation A before the switching, some of the K-th subsegments 200 may be duplicated and provided to the client 30. .
- the redundant data may be some samples having a fast playback time among the reference streams included in the K-th subsegment 200.
- the fastest playing time T E of the K-th subsegment 200 of the representation A is K-1 of the representation B. It should have a value larger than the latest reproduction time T L of the first subsegment 230.
- the earliest reproduction time T E of the K-th subsegment 210 of the representation B is the latest reproduction of the K-th subsegment 240 of the representation A.
- FIG. May be less than the time T L.
- the client 30 switches at the boundary between representation K-th subsegment 220 and K-th subsegment 210. Accordingly, some of the K-th subsegments 240 may be lost without being provided to the client 30.
- the lost data may be some samples having a slow playback time among reference streams included in the K-1 th subsegment 240.
- the fastest reproduction time T E of the K-th subsegment 210 of the representation B is the K-1 th subsegment 240 of the representation A.
- T L the latest playback time
- the fastest playback time T E of the k-th subsegment included in either of the representations A and B is determined.
- the subsegments may be aligned such that they are greater than the latest playing time T L of the k-1 th subsegment included in another representation.
- two subsegments having different indices included in representations A and B may be arranged so as not to overlap each other.
- non-overlapping of the first subsegment and the second subsegment means that, for all media streams of one media component, the fastest playback time T of the first subsegment T E ) is less than the latest reproduction time T L of the second subsegment or the latest reproduction time T L of the first subsegment is the earliest reproduction time T E of the second subsegment. It may mean a smaller case.
- the client 30 includes any subs including the RAP in the representation B.
- FIG. 9 Even when switching to a segment, redundant data may not be provided or data may be lost.
- the data provided to the client 30 may not be duplicated or lost.
- the client 30 searches for the RAP located after the current playback time in Representation B, and searches for the K-1 th subsegment ( Switching the representation at any point of the boundary B1 between 220 and the K-th subsegment 210 and the boundary B2 between the K-th subsegment 210 and the K + 1th subsegment 230. can do.
- the client 30 since the K-1 st subsegment 220 of Representation B does not include a RAP, the client 30 includes the RAP after the K-1 st subsegment 220.
- the representation may be switched at any of the boundaries B1 and B2 of the subsegments.
- the transmitting device transmits any one of the plurality of representations (step S130).
- the client 30 selects one of a plurality of representations having different bit rates, resolutions, languages, or codecs, etc. of media content by using a media presentation description (MPD) provided from the server 10.
- MPD media presentation description
- the segment corresponding to the selected representation may be requested or a part of the segment by using the HTTP GET or HTTP partial GET method defined in HTTP 1.1 (RFC 2616).
- the MPD may include a bitstream switching flag indicating whether to bitstream the representations, and a subsegment alignment flag indicating whether the subsegments are aligned. have.
- the client 30 may perform the bitstream switching operation as described above.
- the subsegment alignment flag included in the MPD is 'true'
- the subsegments are aligned as shown in FIGS. 9 to 11, so that the client 30 can be continuously executed without duplicate or lost data. Representation can be switched at any boundary by subsegments.
- index information for all subsegments, and it is the earliest playing time T E of the corresponding subsegment in the playing time T RAP of the first RAP included in the subsegment.
- the earliest play time T E of the k th subsegment included in any first representation of the plurality of representations is the latest of the k-1 th subsegment included in any second representation.
- the client receiving the media content can easily switch the representation in any subsegment including RAP without searching or parsing, so that even when the bitstream is switched due to a change in network state, etc. Media content can be played back continuously without persistence.
- the media in order to divide MPEG2 TS media into segments for transmission, the media depends on the duration of a reference program element (which may mean the same as the above-mentioned reference stream) as required by the transmission protocol. Can be roughly divided. The boundary between the segments can be aligned with the MPEG2 TS packets.
- any program element belonging to a program is selected as a reference, and the fastest playback time for the playback unit (PU) for the reference program element in the segment is to be delivered. Can be.
- a data offset and a stamp (PTS) for the earliest playback time of that subsegment are delivered, optionally the playback time of the random access point and the PTS discontinuity. Whether or not (PTS discontinuity) occurs may be transmitted.
- the play time of the random access point and whether PTS discontinuity occurs may be used to allow the client 30 to fit the PTS-PCR based timeline to the media play timeline.
- the metadata may be represented in a binary format based on general ISO Base Media file format data types, which may have a structure similar to a 'sidx' box used in Dynamic adaptive streaming over HTTP (DASH).
- DASH Dynamic adaptive streaming over HTTP
- the format of the metadata may allow the client 30 to fit the MPEG2 PTS-PCR based timeline to the media playback timeline.
- This may be done by delivering the earliest PTS of the playback unit (PU) in the subsegment, and information about discontinuities present in the MPEG2 PTS timeline may be conveyed.
- PU playback unit
- FIGS. 12 and 13 illustrate a first embodiment of the configuration of a segment index box that provides metadata about media content.
- the metadata may be composed of an MPEG-2 segment index box ('m2si'), and the 'm2si' box may be an MPEG2 segment index box different from a subsegment in one segment. Indexes can be provided.
- the index may refer directly to a subsegment, or may refer to segment indexes referencing a subsegment, the segment index conveying a time and byte offset for another MPEG2 segment index box within the same segment or subsegment. This can be specified hierarchically or in a simple-chain or other form.
- the earliest reproduction time of the reference program element of the segment may be recorded at the segment index.
- the reference type indicates whether the reference is for a subsegment or an MPEG2 segment index box, and the offset is from the first byte of the first subsegment indexed in the box to the first byte of the referenced entity. You can provide the distance in bytes.
- the earliest playing time (for a reference program element) of the first subsegment recorded at the index is provided, and the segment index can provide one entry for each subsegment recorded by the index. .
- a subsegment duration is provided, wherein the fastest playback time of the subsegment is added by adding the subsegment duration of the previous subsegments and the earliest playback time of the first subsegment. Can be calculated.
- the earliest reproduction time of the subsegment may mean the earliest reproduction time on a reproduction timescale of any reproduction unit PU in the reference program element of the subsegment.
- the earliest PTS of the reference program element may be passed with a flag indicating whether the subsegment starts with a PTS discontinuity.
- segment index box may include a random access point.
- flags are 24-bit integers having flags and may be defined as the following values.
- the timescale-present indicates whether a time scale of a media play time exists in a corresponding box, and a flag value may have '0x000001'.
- the timescale is a timescale for playback, where its value may be the same for the entire playback when provided more than once in one media presentation.
- reference_PID provides a PID of MPEG2 TS packets carrying a reference program element.
- program_element_count represents the number of program elements indexed in the next loop and may be designated as 1 or greater.
- reference_count represents the number of elements indexed by the second loop, and may have a value of 1 or greater.
- PID represents the PID of MPEG2 TS packets that transmit a program element provided with a presentation time (presentation_time), and one PID in a corresponding loop is the same as the reference_PID.
- earlist_presentation_time may indicate the earliest playback time for the playback unit (PU) in the program element identified by the PID.
- reference_type indicates that the reference is for a subsegment, and if it is set to '1', it indicates that the reference is for an MPEG2 segment index ('m2si) box.
- discontinuity may be set to '1' to indicate that the corresponding subsegment starts with a PTS discontinuity.
- contains_PTS may be set to '1' to indicate that the PTS of the first playback unit (PU) of the reference program element is delivered in the corresponding subsegment.
- contains_RAP may be set to '1' to indicate that the reference program element in the referenced subsegment includes a random access point (RAP).
- the reference_offset may indicate the distance from the first byte of the first subsegment indexed in the corresponding box to the first byte of the subsegment referenced by the iteration of the loop in bytes.
- subsegment_duration passes the sum of the subsegment_duration fields present in the box when the reference is for an MPEG2 segment index ('m2si) box, and any of the reference program elements in the next subsegment when the reference is for a subsegment. It is possible to convey the difference between the earliest reproduction time of the reproduction unit PU of and the earliest reproduction time of any reproduction unit PU of the reference program element in the referenced subsegment.
- the duration may be represented by a signaled timescale that is signaled.
- the PTS may indicate the earliest MPEG2 TS PTS of the reference program element of the corresponding subsegment.
- RAP_delta_time may provide a reproduction time offset of the random access point (RAP) with respect to earlist_presentation_time of the reference program element. Meanwhile, when the contains_RAP is '0', the RAP_delta_time may be maintained at a value of '0'.
- the PTS may provide the fastest PTS of the reference program element of the corresponding subsegment.
- Meta data having the configuration as described above may be delivered to the client 30 in various ways.
- the metadata represented in the binary format may be called before the MPEG2 TS as a header part of the segments.
- the boxes may be located before the first byte of the subsegment being indexed, in which case the data offset may include the size of the meta data boxes.
- the first byte of the subsegment may be interpreted as the first byte of the meta data box that indexes the subsegment.
- Meta data expressed in XML or binary format may be delivered to the client 30 in a separate file, which may be an HTTP resource or provided out-of-band for the client 30.
- the client 30 should use the HTTP URL to obtain the resource.
- this may be provided in a Media Presentation Description with a mechanism similar to URL information for media segments.
- an additional indexing mechanism based on super-segments may be needed to index the set of boxes at the segment level of a given representation.
- the MPEG2 Super-Segment Index Box ('mp2s') is an MPEG2 Segment Index Box ('m2si') of one or more segments included in the representation. It can provide an index on a set of).
- reference_count indicates the number of segments indexed by the corresponding loop.
- segment_index is the unique segment index of the segment used by the segment list generation.
- m2si_count represents the number of MPEG2 segment index boxes ('m2si') belonging to one segment identified by the current segment_index.
- offset indicates the distance in bytes from the first byte of the first MPEG2 segment index box 'm2si' indexed by the loop to the first byte of the first MPEG2 segment index box 'm2si' for the indexed segment.
- An apparatus for transmitting media content may perform at least some of operations of the server 10 as described with reference to FIGS. 1 to 15, and for this purpose, a plurality of representations of the media content may be used. It may include a media encoder (11) consisting of a segment (segmenter, 12) for dividing the configured representation into a plurality of segments.
- the receiving device 300 may perform an operation of the client 30 as described with reference to FIGS. 1 to 15.
- the configuration of the media content received by the receiving device 300 for example, the configuration of the representation, the segment, and the subsegment, may be the same as described with reference to FIGS. 3 to 11, a detailed description thereof will be omitted below. Let's do it.
- FIG. 16 is a block diagram illustrating a configuration of a media content receiving apparatus according to an exemplary embodiment of the present invention.
- the illustrated receiving apparatus 300 includes a network interface 310, a decoder 320, and a synchronizer 330. It can be configured to include.
- the network interface 310 of the receiving apparatus 300 receives a media file together with the synchronization information, and the decoding unit 320 decodes the received media file. You can print
- the synchronizer 330 may synchronize the decoded media file to a media playback timeline using the media playback time included in the synchronization information.
- the network interface unit 310 may receive, from the transmitting apparatus 100, the synchronization information (Sync Info.) Having the configuration described with reference to FIGS. 6 to 13 together with the MPEG2 TS.
- the synchronization information Synchronization information
- the decoding unit 320 may include a second decoder 322 for decoding the MPEG2 TS, and the second decoder 322 may be a general MPEG2 TS decoder.
- the second decoder 322 may receive and decode the MPEG2 TS from the network interface 310, and then output the decoded playback units and their PTSs.
- the synchronizer 330 may identify a play unit provided with a media play time through the received sync information. For that purpose, the network interface 310 should recover the PTS of the playback units provided with the playback time and transfer it to the synchronization unit 330, or the PTS of the playback unit together with the playback time should be provided from the transmitting device 100 as well. do.
- Elementary stream time recovery may be used for the PTS recovery.
- the decoder 320 may further include a first decoder 321 for decoding another media received through the network interface 310, and the first decoder 321 may be It may be a decoder for decoding media of another format, such as a 3GPP file format, or the same MPEG2 TS decoder as the second decoder 322.
- the synchronizer 330 removes the MPEG2 TS-based media output from the second decoder 322 by using encoding information (for example, media playback times for the playback units) transmitted from the transmission apparatus 100. 1 may be synchronized with other media output from the decoder (321).
- the reception device 300 may play the MPEG2 TS-based media and the other media synchronized by the synchronization unit 330 together.
- FIG. 17 is a flowchart illustrating a method of receiving media content according to an embodiment of the present invention. A description of the same receiving method as that described with reference to FIGS. 1 to 16 will be omitted.
- the transmitting apparatus 300 receives a plurality of representations including media content (step S400), and plays the received representation (step S420).
- a receiving unit (not shown) provided in the network interface unit 310 of the transmitting device 300 may use the HTTP GET or HTTP partial GET method defined in HTTP 1.1 (RFC 2616) for the segment or part of the segment.
- the transmission can be requested to the transmitting device.
- control unit (not shown) provided in the network interface 310 is a plurality of representations different from each other in the bit rate, resolution, language or codec of the media content by using the Media Presentation Description (MPD) received from the transmitting device Either one can be selected as the representation to receive.
- MPD Media Presentation Description
- the controller checks whether bitstream switching is required (step S410).
- the controller switches the received representation to a new representation (S430).
- the method of switching the representations by the controller of the transmitting device 300 may be the same as the switching operation of the client 30 described with reference to FIGS. 3 to 11.
- the method of receiving and processing media content as described with reference to FIGS. 1 through 17 may be performed by an IPTV receiver according to an embodiment of the present invention.
- FIG. 18 is a block diagram illustrating a configuration of an IPTV receiver according to an embodiment of the present invention.
- the IPTV receiver according to an embodiment of the present invention may include a separate tuner or the like for receiving terrestrial broadcasting, cable broadcasting, satellite broadcasting, or the like.
- a separate tuner or the like for receiving terrestrial broadcasting, cable broadcasting, satellite broadcasting, or the like.
- a description will be given of the configuration of receiving an IPTV service provided using an IP network.
- ITF represents an Open IPTV Terminal Function, which may mean a receiver composed of a function module required to support the IPTV service.
- the IPTV receiver includes a network interface unit 501, a TCP / IP manager 502, a service delivery manager 504, a PVR manager 505, Demultiplexer (508), Data Decoder (507), Audio / Video Decoder (512), A / V Display & OSD Module (515), Application Manager ( Application Manager (513, 514), Service Information Database (SI & Metadata DB) 511, Service Discovery Manager (509), Service Control Manager (503), Metadata Manager (Metadata Manager) 510 and a content database unit (Content DB) 506.
- a network interface unit 501 a TCP / IP manager 502, a service delivery manager 504, a PVR manager 505, Demultiplexer (508), Data Decoder (507), Audio / Video Decoder (512), A / V Display & OSD Module (515), Application Manager ( Application Manager (513, 514), Service Information Database (SI & Metadata DB) 511, Service Discovery Manager (509), Service Control Manager (503), Metadata Manager (Metadata Manager) 510
- the network interface unit 501 receives packets received from a network and transmits the packets to the network. That is, the network interface unit 501 receives a service, media content, etc. from a service provider through a network.
- the TCP / IP manager 502 is involved in packet forwarding from the source to the destination for packets received by the IPTV receiver and packets transmitted by the IPTV receiver. In addition, the TCP / IP manager 502 classifies the received packet to correspond to an appropriate protocol, and sends it to the service delivery manager 504, the service discovery manager 509, the service control manager 503, and the metadata manager 510. Output sorted packets.
- the service delivery manager 504 is responsible for controlling the received service data.
- RTP / RTCP can be used to control real-time streaming data.
- the service delivery manager 504 parses the received data packet according to the RTP and transmits it to the demultiplexer 508 or under the control of the service manager 514.
- the content database unit 506 stores the content.
- the service delivery manager 504 feeds back the network reception information to a server that provides a service using RTCP.
- the demultiplexer 508 demultiplexes the received packet into audio, video, PSI (Program Specific Information) data, and the like, and transmits the demultiplexer 508 to the audio / video decoder 512 and the data decoder 507, respectively.
- PSI Program Specific Information
- the data decoder 507 decodes service information such as, for example, program specific information (PSI). That is, the data decoder 507 receives and decodes the demultiplexed PSI section, the Program and Service Information Protocol (PSIP) section, or the Service Information (DVB-SI) section from the demultiplexer 508.
- PSI program specific information
- PSIP Program and Service Information Protocol
- DVB-SI Service Information
- the data decoder 507 decodes the received sections to create a database of service information, and stores the database of the service information in the service information database unit 511.
- An audio / video decoder 512 decodes the video data and audio data received at the demultiplexer 508.
- the audio data and the video data decoded by the audio / video decoder 512 are provided to the user through the display unit 515.
- the application manager manages the overall state of the IPTV receiver, provides a user interface, and manages other managers.
- the application manager includes a user interface manager 513 and a service manager 514.
- the user interface manager 513 provides a graphical user interface (GUI) for a user using an OSD (On Screen Display), etc., and receives a key input from a user to perform a receiver operation according to the input. For example, upon receiving a key input related to channel selection from a user, the key input signal is transmitted to the service manager 514.
- GUI graphical user interface
- the service manager 514 controls a manager associated with a service, such as a service delivery manager 504, a service discovery manager 509, a service control manager 503, and a metadata manager 510.
- the service manager 514 creates a channel map and selects a channel using the channel map according to a key input received from the user interface manager 513.
- the service manager 514 receives service information of a channel from the data decoder 507 and sets an audio / video packet identifier (PID) of the selected channel to the demultiplexer 508.
- PID audio / video packet identifier
- the service discovery manager 509 provides information necessary to select a service provider that provides a service. Upon receiving a signal regarding channel selection from the service manager 514, the service discovery manager 509 uses the information to find a service.
- the service control manager 503 is responsible for selecting and controlling a service.
- the service control manager 503 uses IGMP or RTSP when the user selects a Live Broadcasting service such as a conventional broadcasting method, and uses RTSP when selecting a service such as Video On Demand (VOD). Can be used to select and control services.
- IGMP Live Broadcasting service
- RTSP Video On Demand
- the RTSP protocol may provide a trick mode for real time streaming.
- the service control manager 503 may initiate and manage a session through the IMC gateway using an IP multimedia subsystem (IMS) or a session initiation protocol (SIP).
- IMS IP multimedia subsystem
- SIP session initiation protocol
- the protocols are one embodiment, and other protocols may be used depending on implementation.
- the metadata manager 510 manages metadata associated with a service and stores the metadata in the service information database unit 511.
- the service information database unit 511 stores service information decoded by the data decoder 507, metadata managed by the metadata manager 510, and information necessary to select a service provider provided by the service discovery manager 509. do. In addition, the service information database unit 511 may store setup data for the system.
- the service information database unit 511 and the content database unit 506 may be implemented using nonvolatile memory (NVRAM) or flash memory, respectively, and are two logically separated areas on the same storage area. Can be implemented.
- NVRAM nonvolatile memory
- flash memory flash memory
- the PVR manager 505 is a module for recording and playing back live streaming content.
- the PVR manager 505 may collect metadata about the recorded content and generate additional information provided to a user such as a thumbnail image or an index.
- Functions of the control unit of the IPTV receiver is the TCP / IP manager 502, service delivery manager 504, PVR manager 505, application managers (513, 514), service discovery shown in FIG.
- the manager 509, the service control manager 503, and the metadata manager 510 may be divided into a plurality of modules and implemented.
- the TCP / IP manager 502 filters the SD & S information by using the target package information as described above, so that the network interface unit 501 corresponds to a specific package (for example, a package to which an IPTV receiver is subscribed). Only the payload or segment may be requested to the server and received.
- the TCP / IP manager 502 filters the SD & S information received by the multicast method using the target package information so that only payloads or segments corresponding to a specific package are parsed and processed by the data decoder 507. can do.
- FIG. 19 is a block diagram showing another embodiment of the configuration of the IPTV receiver according to the present invention, and shows the configuration of the IPTV receiver in functional blocks.
- the solid line arrow shown in FIG. 19 corresponds to the data path, and the dotted line arrow corresponds to the control signal path.
- Cable modem, DSL modem, etc (601) is an interface that allows ITF to be connected to IP network at the physical level, and demodulates the signal transmitted through the physical medium to restore the digital signal.
- the Ethernet NIC 602 restores the signal received through the physical interface to IP data, and the IP Network Stack 607 processes each layer according to the IP Protocol stack.
- the XML Parser 609 parses an XML Document among received IP data
- the File Handler 608 processes data transmitted in the form of File through FLUTE among the received IP data.
- the SI Handler 611 processes the part corresponding to the IPTV service information among the received File type data and stores it in the storage 612, and the EPG Handler 610 corresponds to the IPTV EPG information among the received File type data.
- the portion can be processed and stored in Storage 612.
- the storage 612 stores various data such as the SI and the EPG.
- the SI Decoder 613 receives the SI data from the storage 612 and analyzes it to obtain channel map information.
- the EPG Decoder 614 analyzes the EPG data stored in the storage 612 to restore necessary information for the EPG configuration. do.
- the ITF Operation Controller 615 is a main controller that controls the operation of an ITF such as a channel change or an EPG display.
- the channel service manager 616 performs an operation such as changing a channel according to a user input
- the application manager 617 may perform an application service such as an EPG display according to the user input.
- the MPEG-2 Demultiplexer 603 may extract MPEG-2 Transport Stream data from the received IP datagram and transmit the MPEG-2 Transport Stream data to the corresponding module according to the packet identification information (PID).
- PID packet identification information
- the MPEG-2 PSI / PSIP Parser 604 extracts and parses PSI / PSIP data including packet identification information (PID) of A / V data or access information about a program element from the MPEG-2 Transport Stream. Can be.
- PID packet identification information
- the A / V Decoder 605 decodes the input audio and video data and transmits the decoded audio and video data to the display module 606.
- the display module 606 may output decoded A / V data or an application.
- the above-described method for transmitting and receiving media content according to the present invention can be stored in a computer-readable recording medium that is produced as a program for execution on a computer.
- Examples of the computer-readable recording medium include ROM, RAM, CD-ROM. Magnetic tapes, floppy disks, optical data storage devices, and the like, and also include those implemented in the form of carrier waves (eg, transmission over the Internet).
- the computer readable recording medium can be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion.
- functional programs, codes, and code segments for implementing the method can be easily inferred by programmers in the art to which the present invention belongs.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Computer Networks & Wireless Communication (AREA)
- Business, Economics & Management (AREA)
- Marketing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Mobile Radio Communication Systems (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims (20)
- 인터넷을 통해 미디어 콘텐트(media content)를 전송하는 방법에 있어서,상기 미디어 콘텐트를 복수의 레프리젠테이션들(representations)로 구성하는 단계;상기 구성된 레프리젠테이션을 복수의 세그먼트들(segments)로 분할하는 단계;상기 세그먼트에 포함된 서브세그먼트들(subsegments)을 정렬하는 단계; 및상기 레프리젠테이션들 중 어느 하나를 전송하는 단계를 포함하고,상기 서브세그먼트들은상기 레프리젠테이션들 중 제1 레프리젠테이션에 포함된 k 번째 서브세그먼트의 가장 빠른 재생 시간(the earliest presentation time)이 제2 레프리젠테이션에 포함된 k-1 번째 서브세그먼트의 가장 늦은 재생 시간(the latest presentation time)보다 크도록 정렬되는 미디어 콘텐트 전송 방법.
- 제1항에 있어서, 상기 복수의 레프리젠테이션들은상기 전송되는 미디어 콘텐트의 비트율(bitrate), 해상도(resolution), 언어(language) 및 코덱(codec) 중 적어도 하나에 따라 상이하게 구성되는 미디어 콘텐트 전송 방법.
- 제1항에 있어서,상기 세그먼트에 포함된 모든 서브세그먼트들에 대한 인덱스(index) 정보가 존재하는 미디어 콘텐트 전송 방법.
- 제3항에 있어서,상기 서브세그먼트들은 각각 세그먼트 인덱스를 포함하는 미디어 콘텐트 전송 방법.
- 제1항에 있어서,상기 서브세그먼트들 중 RAP(Representation Access Point)를 포함하는 서브세그먼트에 대하여, 상기 서브세그먼트에 포함된 첫번째 RAP의 재생 시간은 상기 서브세그먼트의 가장 빠른 재생 시간과 일치하는 미디어 콘텐트 전송 방법.
- 제1항에 있어서, 상기 서브세그먼트의 가장 빠른 재생 시간은상기 서브세그먼트의 레퍼런스 스트림에 존재하는 임의의 샘플의 가장 빠른 재생 시간인 미디어 콘텐트 전송 방법.
- 제1항에 있어서, 상기 서브세그먼트의 가장 늦은 재생 시간은상기 서브세그먼트의 레퍼런스 스트림에 존재하는 가장 늦은 샘플의 재생 시간인 미디어 콘텐트 전송 방법.
- 제1항에 있어서, 상기 서브세그먼트는하나 또는 연속하는 2 이상의 MPEG2 TS 패킷들을 포함하는 미디어 콘텐트 전송 방법.
- 제1항에 있어서,클라이언트(client)로부터 수신되는 요청에 따라, 상기 전송되는 레프리젠테이션을 스위칭(switching) 하는 단계를 더 포함하는 미디어 콘텐트 전송 방법.
- 제1항에 있어서,상기 세그먼트들에 액세스하기 위한 메타데이터를 포함하는 MPD(Media Presentation Description)를 클라이언트로 전송하는 단계를 더 포함하고,상기 MPD는 상기 레프리젠테이션들에 대한 비트스트림 스위칭 여부를 나타내는 제1 플래그(frag) 및 상기 서브세그먼트들에 대한 정렬 여부를 나타내는 제2 플래그를 포함하는 미디어 콘텐트 전송 방법.
- 인터넷을 통해 미디어 콘텐트를 수신하는 방법에 있어서,상기 미디어 콘텐트를 포함하는 복수의 레프리젠테이션들 중 제1 레프리젠테이션을 수신하는 단계;상기 수신되는 레프리젠테이션을 제2 레프리젠테이션으로 스위칭하는 단계; 및상기 수신되는 레프리젠테이션을 재생하는 단계를 포함하고,상기 레프리젠테이션은 복수의 세그먼트들로 분할되어 수신되며,상기 세그먼트에 포함된 서브세그먼트들은상기 제1, 2 레프리젠테이션 중 어느 하나에 포함된 k 번째 서브세그먼트의 가장 빠른 재생 시간이 또 다른 레프리젠테이션에 포함된 k-1 번째 서브세그먼트의 가장 늦은 재생 시간보다 크도록 정렬되어 있는 미디어 콘텐트 수신 방법.
- 제11항에 있어서,상기 세그먼트에 포함된 모든 서브세그먼트들에 대한 인덱스 정보가 존재하는 미디어 콘텐트 수신 방법.
- 제11항에 있어서,상기 서브세그먼트들 중 RAP를 포함하는 서브세그먼트에 대하여, 상기 서브세그먼트에 포함된 첫번째 RAP의 재생 시간은 상기 서브세그먼트의 가장 빠른 재생 시간과 일치하는 미디어 콘텐트 수신 방법.
- 제11항에 있어서,상기 스위칭 전후에 수신된 상기 제1, 2 레프리젠테이션들은 중복된 데이터를 포함하지 않는 미디어 콘텐트 수신 방법.
- 제11항에 있어서,상기 스위칭 전후에 수신된 상기 제1, 2 레프리젠테이션들에 손실된 데이터가 존재하지 않는 미디어 콘텐트 수신 방법. 미디어 콘텐트 수신 방법.
- 제11항에 있어서, 상기 스위칭 단계는상기 제2 레프리젠테이션 중 RAP를 포함하는 임의의 서브세그먼트로 스위칭하는 미디어 콘텐트 수신 방법.
- 제11항에 있어서, 상기 스위칭 단계는연속하는 서브세그먼트들 사이의 경계에서 수행되는 미디어 콘텐트 수신 방법.
- 인터넷을 통해 미디어 콘텐트를 전송하는 장치에 있어서,상기 미디어 콘텐트를 복수의 레프리젠테이션들로 구성하는 미디어 인코더;상기 구성된 레프리젠테이션을 복수의 세그먼트들로 분할하는 세그멘터(segmenter)를 포함하고,상기 세그먼트에 포함된 서브세그먼트들은상기 레프리젠테이션들 중 제1 레프리젠테이션에 포함된 k 번째 서브세그먼트의 가장 빠른 재생 시간이 제2 레프리젠테이션에 포함된 k-1 번째 서브세그먼트의 가장 늦은 재생 시간보다 크도록 정렬되는 미디어 콘텐트 전송 장치.
- 제18항에 있어서,상기 서브세그먼트들 중 RAP를 포함하는 서브세그먼트에 대하여, 상기 서브세그먼트에 포함된 첫번째 RAP의 재생 시간은 상기 서브세그먼트의 가장 빠른 재생 시간과 일치하는 미디어 콘텐트 전송 장치.
- 인터넷을 통해 미디어 콘텐트를 수신하는 장치에 있어서,상기 미디어 콘텐트를 포함하는 복수의 레프리젠테이션들 중 어느 하나를 수신하는 수신부; 및상기 수신되는 레프리젠테이션을 스위칭하는 제어부를 포함하고,상기 레프리젠테이션은 복수의 세그먼트들로 분할되어 수신되며,상기 세그먼트에 포함된 서브세그먼트들은상기 레프리젠테이션들 중 제1 레프리젠테이션에 포함된 k 번째 서브세그먼트의 가장 빠른 재생 시간이 제2 레프리젠테이션에 포함된 k-1 번째 서브세그먼트의 가장 늦은 재생 시간보다 크도록 정렬되어 있는 미디어 콘텐트 수신 장치.
Priority Applications (9)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP11838186.2A EP2637414A4 (en) | 2010-11-02 | 2011-10-27 | METHOD FOR TRANSMITTING / RECEIVING MULTIMEDIA CONTENT AND TRANSMITTING / RECEIVING DEVICE USING THE SAME |
KR1020147012386A KR101613941B1 (ko) | 2010-11-02 | 2011-10-27 | 미디어 콘텐트 송수신 방법 및 그를 이용한 송수신 장치 |
US13/882,926 US9781188B2 (en) | 2010-11-02 | 2011-10-27 | Method for transreceiving media content and device for transreceiving using same |
CA2816621A CA2816621A1 (en) | 2010-11-02 | 2011-10-27 | Method for adaptive streaming that transmits/receives media content according to a network bandwidth and device for adaptive streaming using the same |
GB1308663.2A GB2499539B (en) | 2011-10-27 | 2011-10-27 | Method for transreceiving media content and device for transreceiving using same |
KR1020137011179A KR101452669B1 (ko) | 2010-11-02 | 2011-10-27 | 미디어 콘텐트 송수신 방법 및 그를 이용한 송수신 장치 |
DE112011103642T DE112011103642T5 (de) | 2010-11-02 | 2011-10-27 | Verfahren zum Senden/Empfangen von Medieninhalt und Vorrichtung zum Senden/Empfangen, die dieses verwendet |
KR1020167009513A KR101649533B1 (ko) | 2010-11-02 | 2011-10-27 | 미디어 콘텐트 송수신 방법 및 그를 이용한 송수신 장치 |
CN201180060590.1A CN103262556B (zh) | 2010-11-02 | 2011-10-27 | 收发媒体内容的方法和利用该方法进行收发的装置 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US40912610P | 2010-11-02 | 2010-11-02 | |
US61/409,126 | 2010-11-02 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2012060581A2 true WO2012060581A2 (ko) | 2012-05-10 |
WO2012060581A3 WO2012060581A3 (ko) | 2012-07-19 |
Family
ID=46024918
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2011/008094 WO2012060581A2 (ko) | 2010-11-02 | 2011-10-27 | 미디어 콘텐트 송수신 방법 및 그를 이용한 송수신 장치 |
Country Status (7)
Country | Link |
---|---|
US (1) | US9781188B2 (ko) |
EP (1) | EP2637414A4 (ko) |
KR (3) | KR101452669B1 (ko) |
CN (2) | CN106851334A (ko) |
CA (1) | CA2816621A1 (ko) |
DE (1) | DE112011103642T5 (ko) |
WO (1) | WO2012060581A2 (ko) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014028168A1 (en) * | 2012-08-13 | 2014-02-20 | Disney Enterprises, Inc. | Content based partitioning of digital media content |
WO2014171718A1 (ko) * | 2013-04-16 | 2014-10-23 | 엘지전자 주식회사 | 방송 전송 장치, 방송 수신 장치, 방송 전송 장치의 동작 방법 및 방송 수신 장치의 동작 방법 |
CN105210372A (zh) * | 2013-05-22 | 2015-12-30 | 索尼公司 | 内容供应装置、内容供应方法、程序以及内容供应系统 |
EP3018912A4 (en) * | 2013-07-02 | 2017-03-08 | Sony Corporation | Content provision device, content provision method, program, terminal device, and content provision system |
WO2018021616A1 (en) * | 2016-07-29 | 2018-02-01 | Airbroad Inc. | Operating method of client for streaming service |
US11711569B2 (en) * | 2015-11-09 | 2023-07-25 | Interdigital Vc Holdings, Inc. | Method and device for adapting the video content decoded from elementary streams to the characteristics of a display |
Families Citing this family (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103222276B (zh) * | 2010-09-20 | 2017-04-19 | 数码士有限公司 | 将在http流式传输中发生表达切换时实现的处理方法 |
CN103493499B (zh) | 2011-03-16 | 2018-04-06 | 韩国电子通信研究院 | 提供使用表示的流媒体内容的装置及方法 |
US9590814B2 (en) * | 2011-08-01 | 2017-03-07 | Qualcomm Incorporated | Method and apparatus for transport of dynamic adaptive streaming over HTTP (DASH) initialization segment description fragments as user service description fragments |
KR101719998B1 (ko) * | 2011-12-12 | 2017-03-27 | 엘지전자 주식회사 | 미디어 컨텐트를 수신하는 장치 및 방법 |
US9866886B2 (en) * | 2012-10-23 | 2018-01-09 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for distributing a media content service |
US9075619B2 (en) * | 2013-01-15 | 2015-07-07 | Nuance Corporation, Inc. | Method and apparatus for supporting multi-modal dialog applications |
US9854017B2 (en) * | 2013-03-15 | 2017-12-26 | Qualcomm Incorporated | Resilience in the presence of missing media segments in dynamic adaptive streaming over HTTP |
WO2015001985A1 (ja) * | 2013-07-02 | 2015-01-08 | ソニー株式会社 | コンテンツ供給装置、コンテンツ供給方法、プログラム、端末装置、およびコンテンツ供給システム |
WO2015022997A1 (en) * | 2013-08-13 | 2015-02-19 | Canon Kabushiki Kaisha | Information processing apparatus, control method therefor, and program |
US10097294B2 (en) | 2014-01-03 | 2018-10-09 | Lg Electronics Inc. | Apparatus for transmitting broadcast signals, apparatus for receiving broadcast signals, method for transmitting broadcast signals and method for receiving broadcast signals |
US9348495B2 (en) | 2014-03-07 | 2016-05-24 | Sony Corporation | Control of large screen display using wireless portable computer and facilitating selection of audio on a headphone |
US20170055046A1 (en) * | 2014-05-21 | 2017-02-23 | Lg Electronics Inc. | Broadcast signal transmitting/receiving method and device |
EP2978225B1 (en) * | 2014-07-23 | 2017-11-08 | Wildmoka | Method for obtaining in real time a user selected multimedia content part |
WO2016024794A1 (ko) * | 2014-08-12 | 2016-02-18 | 엘지전자 주식회사 | 방송신호 전송방법, 방송신호 수신방법, 방송신호 전송장치, 방송신호 수신장치 |
KR102326282B1 (ko) | 2015-02-09 | 2021-11-15 | 한국전자통신연구원 | 대역폭에 적응적인 대쉬 기반 컨텐츠 제공 장치 및 방법 |
US9973562B2 (en) | 2015-04-17 | 2018-05-15 | Microsoft Technology Licensing, Llc | Split processing of encoded video in streaming segments |
WO2016186426A1 (ko) * | 2015-05-18 | 2016-11-24 | 엘지전자 주식회사 | 방송 신호 송신 장치, 방송 신호 수신 장치, 방송 신호 송신 방법, 및 방송 신호 수신 방법 |
CA2993917A1 (en) * | 2015-08-17 | 2017-02-23 | Sony Corporation | Receiving apparatus, transmitting apparatus, and data processing method |
CN105451073B (zh) * | 2015-11-16 | 2018-11-02 | 深圳Tcl数字技术有限公司 | Mp4视频源的传送方法和装置 |
US11128914B2 (en) * | 2018-04-25 | 2021-09-21 | Roku, Inc. | Client side stitching of content into a multimedia stream |
US11146852B2 (en) * | 2018-05-11 | 2021-10-12 | Qualcomm Incorporated | Signaling missing sections of media data for network streaming in a segment |
US10715882B2 (en) * | 2018-06-29 | 2020-07-14 | Intel Corporation | Timing synchronization between a content source and a display panel |
CN110876084B (zh) * | 2018-08-29 | 2021-01-01 | 浙江大学 | 处理和传输媒体数据的方法和装置 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN100459682C (zh) * | 1999-09-20 | 2009-02-04 | 提维股份有限公司 | 经由电视广播数据流的广告排定电视节目录制时程的装置和方法 |
KR100486713B1 (ko) * | 2002-09-17 | 2005-05-03 | 삼성전자주식회사 | 멀티미디어 스트리밍 장치 및 방법 |
WO2004028156A1 (en) * | 2002-09-19 | 2004-04-01 | Thomson Licensing S.A. | Hybrid video on demand using mpeg 2 transport |
US8650599B2 (en) * | 2004-03-29 | 2014-02-11 | Panasonic Corporation | Accumulation display device, interlocked display method and system |
WO2006115151A1 (ja) * | 2005-04-25 | 2006-11-02 | Sharp Kabushiki Kaisha | 記録装置、再生装置、記録再生装置、記録プログラムとその記録媒体、再生プログラムとその記録媒体 |
JP4534997B2 (ja) | 2006-02-13 | 2010-09-01 | ソニー株式会社 | 送受信システム、受信装置、受信方法 |
US9432433B2 (en) * | 2006-06-09 | 2016-08-30 | Qualcomm Incorporated | Enhanced block-request streaming system using signaling or block creation |
US7860996B2 (en) | 2008-05-30 | 2010-12-28 | Microsoft Corporation | Media streaming with seamless ad insertion |
JP4686587B2 (ja) * | 2008-10-16 | 2011-05-25 | 株式会社東芝 | 映像記録再生装置およびファイル管理方法 |
US9319448B2 (en) * | 2010-08-10 | 2016-04-19 | Qualcomm Incorporated | Trick modes for network streaming of coded multimedia data |
US20120114118A1 (en) | 2010-11-05 | 2012-05-10 | Samsung Electronics Co., Ltd. | Key rotation in live adaptive streaming |
US9462024B2 (en) * | 2011-06-08 | 2016-10-04 | Futurewei Technologies, Inc. | System and method of media content streaming with a multiplexed representation |
-
2011
- 2011-10-27 KR KR1020137011179A patent/KR101452669B1/ko active IP Right Grant
- 2011-10-27 CN CN201611042513.4A patent/CN106851334A/zh active Pending
- 2011-10-27 US US13/882,926 patent/US9781188B2/en active Active
- 2011-10-27 CN CN201180060590.1A patent/CN103262556B/zh active Active
- 2011-10-27 KR KR1020147012386A patent/KR101613941B1/ko active IP Right Grant
- 2011-10-27 EP EP11838186.2A patent/EP2637414A4/en not_active Withdrawn
- 2011-10-27 CA CA2816621A patent/CA2816621A1/en not_active Abandoned
- 2011-10-27 DE DE112011103642T patent/DE112011103642T5/de not_active Ceased
- 2011-10-27 KR KR1020167009513A patent/KR101649533B1/ko active IP Right Grant
- 2011-10-27 WO PCT/KR2011/008094 patent/WO2012060581A2/ko active Application Filing
Non-Patent Citations (1)
Title |
---|
None |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2014028168A1 (en) * | 2012-08-13 | 2014-02-20 | Disney Enterprises, Inc. | Content based partitioning of digital media content |
US9282348B2 (en) | 2012-08-13 | 2016-03-08 | Disney Enterprises, Inc. | Content based partitioning of digital media content |
WO2014171718A1 (ko) * | 2013-04-16 | 2014-10-23 | 엘지전자 주식회사 | 방송 전송 장치, 방송 수신 장치, 방송 전송 장치의 동작 방법 및 방송 수신 장치의 동작 방법 |
CN105210372A (zh) * | 2013-05-22 | 2015-12-30 | 索尼公司 | 内容供应装置、内容供应方法、程序以及内容供应系统 |
US9942619B2 (en) | 2013-05-22 | 2018-04-10 | Saturn Licensing Llc | Content supply device, content supply method, program, and content supply system |
CN105210372B (zh) * | 2013-05-22 | 2019-05-17 | 索尼公司 | 内容供应装置、内容供应方法、程序以及内容供应系统 |
EP3018912A4 (en) * | 2013-07-02 | 2017-03-08 | Sony Corporation | Content provision device, content provision method, program, terminal device, and content provision system |
US11711569B2 (en) * | 2015-11-09 | 2023-07-25 | Interdigital Vc Holdings, Inc. | Method and device for adapting the video content decoded from elementary streams to the characteristics of a display |
WO2018021616A1 (en) * | 2016-07-29 | 2018-02-01 | Airbroad Inc. | Operating method of client for streaming service |
KR20180013298A (ko) * | 2016-07-29 | 2018-02-07 | 주식회사 에어브로드 | 스트리밍 서비스를 위한 클라이언트의 동작 방법 |
KR101863598B1 (ko) * | 2016-07-29 | 2018-06-01 | 주식회사 에어브로드 | 스트리밍 서비스를 위한 클라이언트의 동작 방법 |
Also Published As
Publication number | Publication date |
---|---|
DE112011103642T5 (de) | 2013-09-19 |
CN106851334A (zh) | 2017-06-13 |
KR101649533B1 (ko) | 2016-08-19 |
US9781188B2 (en) | 2017-10-03 |
EP2637414A2 (en) | 2013-09-11 |
US20130238758A1 (en) | 2013-09-12 |
CN103262556A (zh) | 2013-08-21 |
KR20160047585A (ko) | 2016-05-02 |
WO2012060581A3 (ko) | 2012-07-19 |
KR101452669B1 (ko) | 2014-10-23 |
KR101613941B1 (ko) | 2016-04-20 |
CN103262556B (zh) | 2016-12-07 |
KR20130076884A (ko) | 2013-07-08 |
KR20140066265A (ko) | 2014-05-30 |
EP2637414A4 (en) | 2014-10-22 |
CA2816621A1 (en) | 2012-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2012060581A2 (ko) | 미디어 콘텐트 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2012177041A2 (ko) | 미디어 컨텐트 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2012011724A2 (ko) | 미디어 파일 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2011059291A2 (en) | Method and apparatus for transmitting and receiving data | |
WO2013141666A1 (ko) | Mmt 패키지화된 svc 비디오 콘텐츠의 하이브리드 전송 방법 및 수신 방법 | |
WO2013169084A1 (ko) | Mmt 패킷 포맷 확장을 통한 하이브리드 전송 방법 | |
WO2013089437A1 (ko) | 미디어 컨텐트를 수신하는 장치 및 방법 | |
WO2011152675A2 (en) | Method and apparatus for adaptive streaming based on plurality of elements for determining quality of content | |
WO2011059273A2 (en) | Method and apparatus for adaptive streaming using segmentation | |
WO2011071290A2 (en) | Streaming method and apparatus operating by inserting other content into main content | |
WO2011105811A2 (en) | Method and apparatus for transmitting and receiving data | |
WO2012011722A2 (ko) | 미디어 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2013077698A1 (ko) | Mmt 미디어와 dash 미디어와의 연동 방법 | |
WO2011059274A2 (en) | Adaptive streaming method and apparatus | |
WO2015002500A1 (ko) | 실시간 전송 프로토콜 기반의 방송 시스템에서 미디어 방송 신호의 송수신 방법 및 장치 | |
WO2013025035A2 (ko) | 송신 장치, 수신 장치 및 그 송수신 방법 | |
WO2012011735A2 (en) | Method and apparatus for transmitting and receiving adaptive streaming mechanism-based content | |
WO2011132883A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2015012605A1 (ko) | 3차원 콘텐츠의 부호화 방법 및 장치 | |
WO2011115454A2 (en) | Method and apparatus for adaptively streaming content including plurality of chapters | |
WO2011132879A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2010123248A2 (ko) | P2p 전송 방식에 의한 iptv 스트리밍 서비스를 전송하는 방법 및 p2p 전송 방식에 의한 iptv 스트리밍 서비스를 수신하는 방법. | |
WO2011132882A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2011132880A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2011132881A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11838186 Country of ref document: EP Kind code of ref document: A2 |
|
ENP | Entry into the national phase |
Ref document number: 20137011179 Country of ref document: KR Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2816621 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13882926 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1120111036426 Country of ref document: DE Ref document number: 112011103642 Country of ref document: DE |
|
ENP | Entry into the national phase |
Ref document number: 1308663 Country of ref document: GB Kind code of ref document: A Free format text: PCT FILING DATE = 20111027 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1308663.2 Country of ref document: GB |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011838186 Country of ref document: EP |