GB2499539A - Method for transreceiving media content and device for transreceiving using same - Google Patents

Method for transreceiving media content and device for transreceiving using same Download PDF

Info

Publication number
GB2499539A
GB2499539A GB1308663.2A GB201308663A GB2499539A GB 2499539 A GB2499539 A GB 2499539A GB 201308663 A GB201308663 A GB 201308663A GB 2499539 A GB2499539 A GB 2499539A
Authority
GB
United Kingdom
Prior art keywords
subsegment
representation
presentation time
representations
subsegments
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB1308663.2A
Other versions
GB201308663D0 (en
GB2499539B (en
Inventor
Hyeonjae Lee
Waqar Jia
Hotaek Hong
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Priority claimed from PCT/KR2011/008094 external-priority patent/WO2012060581A2/en
Publication of GB201308663D0 publication Critical patent/GB201308663D0/en
Publication of GB2499539A publication Critical patent/GB2499539A/en
Application granted granted Critical
Publication of GB2499539B publication Critical patent/GB2499539B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234309Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by transcoding between formats or standards, e.g. from MPEG-2 to MPEG-4 or from Quicktime to Realvideo
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/23439Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The present invention relates to a method for transreceiving media content and a device for transreceiving using same, the method comprising the following steps: configuring the media content into a plurality of representations; dividing the representations into a plurality of segments; aligning subsegments which are included in the segments; and transmitting one of the representations.

Description

- 1 -
METHOD FOR TRANSRECEIVING MEDIA CONTENT AND DEVICE FOR TRANSRECEIVING USING THE SAME
BACKGROUND
5 [0001] The present disclosure relates to a method and device for transmitting/receiving media content via internet.
[0002] Currently, most of digital broadcastings such as terrestrial, cable, satellite, and Digital Multimedia Broadcasting (DMB) stream AV content by using MPEG-2 Transport
10 Stream (TS).
[0003] Additionally, according to the recent rapid growth of internet, a multimedia service providing content by using an Internet protocol (IP) network as a main transport network has been activated. Digital broadcastings such as stereo 3D video
15 broadcasting, Ultra High Definition (UHD) broadcasting, multiview 3D video broadcasting, and hologram broadcasting are evolving into a direction more requiring the amount of data transferred.
[0004] However, the MPEG-2 TS having a fixed length packet of 188 bytes may be inefficient to transmit a content having a
20 higher resolution than that of an existing HDTV by using the IP
- 2 -
network.
SUMMARY
[0005] Embodiments provide a method and device for 5 transmitting/receiving media content with efficient streaming according to a network state.
[0006] In one embodiment, provided is a method of transmitting media content via internet. The method includes: configuring the media content with a plurality of
10 representations; dividing the configured representation into a plurality segments; aligning subsegments in the segment; and transmitting one of the representations, wherein the subsegments are aligned to allow the earliest presentation time of the kth subsegment in the first representation to be greater than the
15 latest presentation time of the k-lth subsegment in the second representation among the representations.
[0007] In another embodiment, provided is a method of receiving media content via internet. The method includes: receiving a first representation among a plurality of
20 representations including the media content; switching the
- 3 -
received first representation to a second representation; and playing the received first representation, wherein the representation is divided into a plurality of segments and received; and subsegments in the segment are aligned to allow the 5 earliest presentation time of the kth subsegment in one of the first and second representations to be greater than the latest presentation time of the k-lth subsegment in the other representation.
[0008] In further another embodiment, provided is a device 10 for transmitting media content via internet. The device includes:
a media encoder configuring the media content with a plurality of representations; a segmenter segmenting the configured representation into a plurality of segments, wherein subsegments in the segment are aligned to allow the earliest presentation 15 time of the kth subsegment in the first representation to be greater than the latest presentation time of the k-lth subsegment in the second representation among the representations.
[0009] In still further another embodiment, provided is a device for receiving media content via internet. The device
20 includes: a receiving unit receiving one of a plurality of
- 4 -
representations including the media content; and a control unit switching the received representation, wherein the representation is divided into a plurality of segments and received; and subsegments in the segment are aligned to allow the earliest 5 presentation time of the kth subsegment in the first representation to be greater than the latest presentation time of the k-lth subsegment in the second representation among the representations.
[0010] The details of one or more embodiments are set forth 10 in the accompanying drawings and the description below. Other features will be apparent from the description and drawings, and from the claims.
BRIEF DESCRIPTION OF THE DRAWINGS 15 [0011] Fig. 1 is a block diagram illustrating a configuration of a content transceiving system according to an embodiment of the present invention.
[0012] Fig. 2 is a view illustrating a structure of MEPG-2 TS
[0013] Fig. 3 is a flowchart illustrating a media content 20 transmitting method according to an embodiment of the present
invention.
[0014] Fig. 4 is a view illustrating a configuration of representations including media content according to an embodiment of the present invention.
5 [0015] Fig. 5 is a view illustrating a configuration of a subsegment according to an embodiment of the present invention.
[0016] Figs. 6 and 7 are views illustrating a method of determining the position of a RAP in a subsegment according to an embodiment of the present invention.
10 [0017] Figs. 8 and 9 are views illustrating a method aligning subsegments according to an embodiment of the present invention.
[0018] Figs. 10 and 11 are views illustrating a method of switching a representation according to an embodiment of the present invention.
15 [0019] Figs. 12 and 13 are views illustrating a configuration of a segment index box providing metadata on media content according to a first embodiment of the present invention.
[0020] Figs. 14 and 15 are views illustrating a configuration of the segment index box according to a second embodiment of the 20 present invention.
[0021] FIG. 16 is a block diagram illustrating a configuration of a media content receiving device according to an embodiment of the present invention.
[0022] Fig. 17 is a flowchart illustrating a media content 5 receiving method according to an embodiment of the present invention.
[0023] Fig. 18 is a block diagram illustrating a configuration of an IPTV receiver according to a first embodiment of the present invention.
10 [0024] Fig. 19 is a block diagram illustrating a configuration of an IPTV receiver according to a second embodiment of the present invention.
DETAILED DESCRIPTION OF THE EMBODIMENTS 15 [0025] Hereinafter, a method and device for transmitting/receiving media content via internet will be described with reference to Figs. 1 to 19.
[0026] Hereinafter, detailed descriptions related to well-known functions or configurations will be ruled out in order not 20 to unnecessarily obscure subject matters of the present invention.
- 7 -
Moreover, terms described below are defined in consideration of the functions of the present invention and vary according to users, operators' intentions, or customs. Therefore, the definition may be determined based on the entire contents of this 5 specification.
[0027] A transmitting device and a receiving device according to an embodiment of the present invention may transmit/receive media content by using an Internet Protocol (IP) based transmitting system. For example, the media content may be
10 streamed by a web server through a Transmission Control Protocol (TCP) and a Hypertext Transfer Protocol (HTTP).
[0028] The transmitting device may receive a content from a content provider, and then may convert and transmit the received content, or store it in a media DataBase (DB).
15 [0029] For example, the media content provided from the content provider may be in an MPEG-2 TS format or an ISO Base Media file format. The transmitting device may convert the media content into a format easy for IP-based delivery and then may transmit the converted media content via a network such as an
20 internet network or may store it in the media DB.
- 8 -
[0030] Moreover, the receiving device may receive the media content from the transmitting device by using an IP based transmitting system, and then, may convert and play the received media content or may store it in a local storage.
5 [0031] For example, the receiving device may convert the media content received from the transmitting device into an MPEG-2 TS format or an ISO Base Media file format, and then, may process and play it or may store it in the local storage medium.
[0032] In addition, since an IP based open internet 10 environment using HTTP does not guarantee QoS, a bandwidth used for transmitting data to a receiving device according to network traffic changes dynamically. Thus, a stable and fixed bandwidth necessary for transmitting a content file may be hardly obtained.
[0033] For example, when a user installs or uses software for 15 frequent network transmission in a home PC system, due to the increase of network traffic, a stable and fixed bandwidth may not be obtained. In order to address such an issue, a method of allocating a bandwidth by an internet gateway to specific software was devised, but has not been widely used until now. 20 [0034] Moreover, even when a fixed bandwidth is guaranteed in
a receiving device, since an internet server, i.e. a transmitting device operating in an open internet environment, transmits service via different networks, there still may be difficulties in ensuring QoS.
5 [0035] For example, when services such as YouTube are provided via different networks that specific service providers operate, as network setting or the amount of traffic increases during a transmission process, content may be delayed, dropped, or re-transmitted.
10 [0036] The issue on QoS guarantee may be an important factor that causes content providers, which provide content services in an open internet environment, to feel difficulties.
[0037] Moreover, in order to solve the difficulties of the content transmission, provided is an adaptive streaming method
15 that appropriately transmits/receives content according to a network bandwidth by using a play list file in an M3U format.
[0038] The adaptive streaming method creates content files having different bit rates with respect to one content and requests transmission by selecting a bit rate according to a
20 bandwidth of a receiving device. Since contents are mostly
- 10 -
transmitted using an HTTP protocol in an open internet environment, the adaptive streaming method is also referred to as HTTP adaptive streaming.
[0039] Fig. 1 is a block diagram illustrating a configuration 5 of a content transceiving system according to an embodiment of the present invention, and shows one example of a structure of a system for the adaptive streaming.
[0040] Referring to Fig. 1, the system for adaptive streaming may include a server 10, a distribution server 20, and a client
10 30.
[0041] When the server 10 delivers Audio/Video (AV) data as raw data to a media encoder 11, the media encoder 51 converts the delivered AV data into a streaming file format of MPEG-2 TS.
[0042] Moreover, a stream segmenter 12 may divide the MPEG-2 15 TS streaming file outputted from the media encoder 11 into a plurality of segments.
[0043] For example, the streaming segmenter 12 may divide one file or a real-time MPEG-2 TS streaming file into segments having a size that can be played for about 10 sec, and then, may store
20 them in the distribution server 20.
- 11 -
[0044] The client 30 may request the divided files stored in the distribution server 20 through the HTTP protocol, and then, may receive and play them.
[0045] Moreover, MPEG-2 TS file segments having different bit 5 rates may be stored in the distribution server 20, and for this,
a plurality of web servers 21 storing MPEG-2 TS file segments, each encoded at a specific bit rate, may exist in the distribution server 20.
[0046] For example, MPEG-2 TS file segments encoded at a bit 10 rate of about 20 kbps and MPEG-2 TS file segments encoded at a bit rate of about 500 kbps may be stored in the distribution server 20 with different file names.
[0047] Accordingly, when the bandwidth of a reception area is reduced, the client 30 may request a file encoded at a bit rate
15 of about 20 kbps and play it for about 10 sec. Then, the client 30 may periodically check the bandwidth again before the presentation time of about 10 sec elapses.
[0048] At this point, when the bandwidth is increased to the degree at which a high capacity and high quality file is
20 transferred, the client 30 may request the transmission of a file
- 12 -
encoded at a bit rate of about 500 kbps.
[0049] Due to this, the client 30 adaptively and alternately may receive contents encoded at bit rates of about 20 kbps and about 500 kbps according a network bandwidth and then may play
5 them. Also, when the bandwidth is poor, the quality of an image may be deteriorated but the image may be continuously displayed.
[0050] In order for the adaptive streaming using the HTTP protocol, media content may include a plurality of representations. The media content in each of the representations
10 may have different bit rates, resolutions, languages, or codecs.
[0051] The media content may include media components such as audio, video, or timed text, and the representation may be a structured collection of at least one media component.
[0052] In this case, the client 30 may request one of a 15 plurality of representations having the different bit rates to the server 20 and may receive it.
[0053] Moreover, in order for the client 30 to access the segments and provide streaming service to a user, the server 10 may provide a Media Presentation Description (MPD) including
20 metadata requested by the client 30 in addition to segment data.
- 13 -
[0054] The client 30 may select one of the plurality of representations by using an element in the MPD and may request it for transmission.
[0055] The representation is divided into the plurality of 5 segments, and a segment or a partial segment may be transmitted in response to the request of the client 30 through the HTTP GET or HTTP partial GET method defined by HTTP 1.1(RFC 2616).
[0056] Furthermore, if bitstream switching is possible, the client 30 may switch a representation received based on
10 information updated during media presentation to another representation.
[0057] In this case, the client 30 may receive a new representation from the time when a previous representation is played, and may play it.
15 [0058] Additionally, the segment may include a plurality of subsegments, and the subsegment may mean the smallest unit indexed in a segment level.
[0059] Referring to Fig. 2, MPEG-2 TS may include a plurality of MPEG-2 TS packets having the fixed bit rate of 188 bytes, and
20 the subsegment may include continuous MPEG-2 TS packets.
- 14 -
[0060] For example, one sample is an access unit including at least one MPEG-2 TS packet having the same Packet Identifier (PID) value, and the segment is defined as one indexed set of continuous access units in decoding order.
5 [0061] Moreover, in order to stream an ISO Base Media file format, a "fragmented file" structure may be used.
[0062] For example, the fragmented file divides media tracks into a plurality of fragments physically, and each fragment includes a Movie Fragment (moof) box and a media data (mdat) box.
10 [0063] The mdat box after the moof box includes fragment media data, and a pair of the combined moof-mdat may configure one fragment.
[0064] In this case, the subsegment may be defined as a self-contained set of at least one continuous movie fragment, and
15 accordingly, may include at least one moof box and a mdat box corresponding thereto.
[0065] According to an embodiment of the present invention, when the media content in the MPEG-2 TS or ISO base media file format is configured with a plurality of representations and is
20 transmitted by a segment unit, the server 10 may align
- 15 -
subsegments in a segment in order to efficiently perform bitstream switching in the client 30.
[0066] Moreover, the case that the media content in the MPEG-2 TS or ISO base media file format is transmitted to internet
5 through HTTP is illustrated above, but the present invention is not limited thereto. A content transceiving method and device according to an embodiment of the present invention may be applicable to media contents of various formats.
[0067] Hereinafter, a method of transmitting media content by 10 aligning subsegments will be described in more detail with reference to Figs. 3 to 11.
[0068] Fig. 3 is a flowchart illustrating a media content transmitting method according to an embodiment of the present invention. The same method described with reference to Figs. 1 to
15 2 among transmission methods shown will be omitted.
[0069] Referring to Fig. 3, a transmitting device configures a media content with a plurality of representations in operation S100, and divides the configured representation into a plurality of segments in operation S110.
20 [0070] For example, the plurality of representations may be
- 16 -
differently configured depending on at least one of the bit rate, resolution, language, and codec of the transmitted media content, and each representation may be divided into a plurality of segments in order for transmission using the HTTP protocol.
5 [0071] Referring to Fig. 4, the transmitting device may configure a media content with a representation A and a representation B having different bit rates, and the representation A and the representation B may form one group. The group may further include other representations besides the 10 representations A and B.
[0072] Moreover, there is index information on all subsegments in the representations A and B, and the index information may be used for the client 30 to randomly access the subsegments of the segment through the partial HTTP GET method. 15 [0073] For example, each of the subsegments may include a segment index or there may be an index segment for providing an index on the subsegment.
[0074] A method of providing index information on the subsegments will be described in detail with reference to Figs. 20 12 to 15.
- 17 -
[0075] Additionally, in relation to a subsegment 200, the earliest presentation time TE of the subsegment 200 and the latest presentation time TL of the subsegment 200 may be defined.
[0076] Referring to Fig. 5, the subsegment 200 may include 5 two media components such as video and audio, and one of the media components (for example, video) may be designated as reference stream.
[0077] That is, the reference stream may mean a group of continuous samples of a primary media component among media
10 components.
[0078] In this case, the earliest presentation time TE of the subsegment 200 may be defined as the earliest presentation time of an existing arbitrary sample in the reference stream, and accordingly, may be the presentation time of the earliest sample
15 201 among video samples.
[0079] Also, the latest presentation time TL of the subsegment 200 may be the presentation time of the latest sample 204 in the reference stream.
[0080] In addition, the sample 204 may mean an access unit in
20 the stream of the subsegment 200 (or a segment).
- 18 -
[0081] The subsegments shown in Fig. 4 may have different sizes or different durations of the media content.
[0082] Additionally, although the latest presentation time TL of a segment is positioned at the boundary of continuous
5 subsegments as shown in Fig. 4, it may slightly ahead of the boundary between the subsegments.
[0083] That is, in relation to the representation A, the latest presentation time TL of the kth subsegment may be slightly ahead of the time corresponding to the boundary between the kth
10 subsegment and the k+lth subsegment.
[0084] Referring to Fig. 6, a segment includes at least one representation access point (RAP) and the RAP may be included in a subsegment.
[0085] The client 30 may play forward from the position
15 corresponding to the RAP by using only information in a subsegment, and the RAP may include a byte index ILRAP and a presentation time TRAP.
[0086] The presentation time TRAP may mean the fastest presentation time to allow all access units having a presentation
20 time greater than or identical to the TRAP to properly perform
- 19 -
decoding.
[0087] Moreover, when bitstream switching is requested from the representation A to the representation B, the client 30 searches the RAP from the representation B at the presentation
5 time closest to a current presentation time, and then, plays the representation A until the RAP 211 in the kth subsegment 210, and plays the representation B after the searched RAP 211.
[0088] Although a subsegment does not include a RAP or includes one RAP as shown in Fig. 6, it may include at least two
10 RAPs. In this case, the RAP in Fig. 6 may mean the fastest RAP among the RAPs in a subsegment.
[0089] Referring to Fig. 7, the presentation time TRAP of the first RAP in a subsegment may correspond to the fastest presentation time TE of a corresponding subsegment.
15 [0090] Thus, with respect to all segments (contatins_RAP=l)
including the RAP, the presentation time TRAP of the first RAP in the subsegment is allowed to be the fastest presentation of a corresponding subsegment. Therefore, the client 30 may switch representations at an arbitrary boundary between continuous
20 subsegments having continuous indices.
- 20 -
[0091] For example, as shown in Fig. 7, after bitstream switching is requested, the client 30 may switch a representation at a desired arbitrary position from among the boundary between the k-lth subsegment 220 and the kth subsegment 210 or the
5 boundary between the kth subsegment 210 and the k+lth subsegment 230 .
[0092] Hereinafter, as described with reference to Fig. 7, it is assumed that the presentation time TRAP of the first RAP in a subsegment corresponds to the fastest presentation time TE of a
10 corresponding subsegment.
[0093] A transmitting device aligns subsegments in the segment in operation S120.
[0094] In operation S120, the server 10 may align the subsegments in the segment in order to prevent the duplicate
15 reception or loss of data provided to the client 30 during bitstream switching.
[0095] Referring to Fig. 8(a), the fastest presentation time TE of the Kth subsegment 200 of the representation A may be less than the latest presentation time TL of the k-lth subsegment 230
20 of the representation B.
- 21 -
[0096] In this case, when bitstream switching from the representation A to the representation B is requested, the client 30 performs switching at the boundary between the k-lth subsegment 220 and the kth subsegment 210 of the representation B,
5 and then, plays the representation B.
[0097] Moreover, since the client 30 already receives the kth subsegment 210 of the representation A before the switching, part of the kth subsegment 200 may be redundantly provided to the client 30.
10 [0098] For example, the redundant data may be some samples having a fast representation time in the reference stream of the kth subsegment 200.
[0099] Accordingly, in order not to provide the redundant data to the client 30 during the switching process, the fastest 15 presentation time TE of the Kth subsegment 200 of the representation A needs to be greater than the latest presentation time TL of the k-lth subsegment 230 of the representation B.
[00100] Referring to Fig. 8(b), the fastest presentation time TE of the Kth subsegment 210 of the representation B may be less 20 than the latest presentation time TL of the k-lth subsegment 240
- 22 -
of the representation A.
[00101] In this case, when bitstream switching from the representation A to the representation B is requested, the client 30 performs switching at the boundary between the k-lth
5 subsegment 220 and the kth subsegment 210 of the representation B, and accordingly, part of the k-lth subsegment 240 may not be provided to the client 30 and may be lost.
[00102] For example, the missing data may be some samples having a late representation time in the reference stream of the
10 k-lth subsegment 240.
[00103] Accordingly, in order to prevent the missing data during the switching process, the fastest presentation time TE of the Kth subsegment 210 of the representation B needs to be greater than the latest presentation time TL of the k-lth
15 subsegment 240 of the representation A.
[00104] That is, in order to prevent the redundant data or missing data during a bitstream switching process, subsegments may be aligned to allow the fastest presentation time TE of the Kth subsegment 200 in one of the representations A and B to be
20 greater than the latest presentation time TL of the k-lth
- 23 -
subsegment 230 in another representation.
[00105] As shown in Fig. 9, two subsegments having different indices in the representations A and B may be aligned for non-overlapping .
5 [00106] As mentioned above, the non-overlapping of the first and second subsegments may mean that, with respect to all media streams of one media component, the fastest presentation time TE of the first subsegment is less than the latest presentation time TL of the second subsegment, or the latest presentation time TL
10 of the first subsegment is less than the fastest presentation time TE of the second subsegment.
[00107] According to the subsegment aligning method, when the subsegments in the representations A and B are aligned as shown in Fig. 9, even if the client 30 switches to an arbitrary
15 subsegment including the RAP in the representation B, redundant data or missing data may be prevented.
[00108] When subsegments are aligned as shown in Fig. 9, even when switching from the representation B to the representation A is requested, the data provided to client 30 may not be redundant
20 and lost.
- 24 -
[00109] Referring to Fig. 10, when switching from the representation A to the representation B is requested, the client 30 searches the RAP positioned after a current presentation time from the representation B, and switches a representation at an
5 arbitrary point of the boundary between the k-lth subsegment 220 and the kth subsegment 210 or the boundary between the kth subsegment 210 and the k+lth subsegment 230.
[00110] Moreover, as shown in Fig. 11, since the k-lth subsegment 220 of the representation B does not include a RAP,
10 the client 30 may switch a representation at an arbitrary boundary among the boundaries Bl and B2 of subsegments including a RAP after the k-lth subsegment 220.
[00111] When the subsegments are aligned as mentioned above, a transmitting device transmits one of a plurality of presentations
15 in operation S130.
[00112] For example, the client 30 may select one of a plurality of representations having the different rates, resolutions, languages, or codecs of media content by using an MPD provided from the server 10, and then, may request a segment
20 or partial segment corresponding to the selected representation
- 25 -
by using the HTTP GET, or HTTP partial GET method defined by HTTP
I.1 (RFC 2616) .
[00113] Moreover, the MPD may include a bitstream switching flag indicating whether bitstream switching is performed on
5 representations and a subsegment alignment flag indicating whether subsegments are aligned.
[00114] When the bitstream switching flag in the MPD is 'true', the client 30 may perform the bitstream switching operation.
[00115] Additionally, when the subsegment alignment flag in 10 the MPD is 'true', subsegments are aligned as shown in Figs. 9 to
II, and accordingly, the client 30 switches a representation at an arbitrary boundary by continuous subsegments without redundant and missing data.
[00116] According to an embodiment of the present invention, 15 there is index information on all subsegments; the presentation time TRAP of the first RAP in the subsegment is the fastest presentation time TE of a corresponding subsegment; and the kth subsegment in the arbitrary first representation among a plurality of representations is aligned to be greater than the 20 latest presentation time TL of the k-lth subsegment in the
- 26 -
arbitrary second representation. Therefore, the missing data or redundant reception occurring during bitstream switching may be prevented.
[00117] Moreover, a client receiving media content may easily 5 switch a representation in an arbitrary subsegment including a
RAP without additional search of parsing, and accordingly, even when bitstream is switched due to a change in a network state, media content may be seamlessly played.
[00118] Moreover, in order to divide MPEG-2 TS media into 10 segments for transmission, as required by the transport protocol,
the MPEG-2 TS media may be roughly divided according to the duration of a reference program element (this means the reference stream). The boundary between the segments may be aligned according to MPEG-2 TS packets.
15 [00119] In relation to the metadata delivered for the MPEG-2 TS media, one program element in a program is selected as a reference, and the fastest presentation time for Play Unit (PU) of the reference program element may be delivered in a segment. [00120] Additionally, with respect to each subsegment, a data 20 offset and a stamp PTS for the fastest presentation time of a
- 27 -
corresponding subsegment are delivered, and the presentation time of a random access point and whether PTS discontinuity occurs may be delivered selectively.
[00121] Furthermore, the presentation time of the random 5 access point and whether PTS discontinuity occurs may be used when the client 30 synchronizes a PTS-PCR based time line with a media presentation time line.
[00122] The metadata may be expressed with a binary format based on general ISO Base Media file format data types, and this
10 may have a similar structure to a 'sidx' box used in Dynamic adaptive streaming over HTTP (DASH).
[00123] In addition to the function of the 'sidx' bix, the format of metadata may allow the client 30 to synchronize an MPEG-2 PTS-PCR based time line with a media presentation time
15 line.
[00124] This may be accomplished by delivering the fastest PTS of a PU in a subsegment, and additionally, information on discontinuity in the MPEG-2 PTS time line may be delivered.
[00125] Figs. 12 and 13 are views illustrating a configuration 20 of a segment index box providing metadata on media content
- 28 -
according to a first embodiment of the present invention.
[00126] Referring to Fig. 12, the meta data may include an MPEG-2 segment index ('m2si') box. The m2si box may provide indices on m2si boxes different from subsegments in one segment.
5 [00127] The index may directly refer to a subsegment, or may refer to segment indices that reference a subsegment. The segment index may be designated as a class, a daisy-chain or another format in order to deliver the time and byte offset on another m2si box in the same segment or a subsegment.
10 [00128] The fastest presentation time of the reference program element in the segment may be recorded in the segment index.
[00129] The reference type indicates whether a corresponding reference relates to a subsegment or an M2si box, and the offset indicates a distance in a byte unit from the first byte of the
15 first subsegment indexed in a corresponding box to the first byte of a referenced entity.
[00130] The fastest presentation time (with respect to a reference program element) of the first subsegment recorded in the index is provided, and the segment index may provide one
20 entry for each subsegment recorded by a corresponding index.
- 29 -
[00131] With respect to each of the subsegments, subsegment duration is provided, and the fastest presentation time of a subsegment may be calculated by adding the subsegment duration of a previous subsegment to the fastest presentation time of the
5 first subsegment.
[00132] The fastest presentation time of the subsegment may mean the fastest presentation time on the presentation timescale of an arbitrary PU in the reference program element of a corresponding subsegment.
10 [00133] Selectively, the fastest PTS of the reference program element may be delivered together with a flag indicating whether a subsegment starts with a PTS discontinuity.
[00134] Additionally, the segment index box may include a random access point.
15 [00135] Referring to a syntax structure of the m2si box shown in Fig. 13, flags may be defined with the following values as 24-bit integers.
[00136] timescale-present indicates whether there is a time scale of a media presentation time in a corresponding box and its
20 flag value may have '0x000001'.
- 30 -
[00137] timescale is a time scale for presentation and when timescale is provided more than once in one media presentation, its value may be identical to an entire presentation.
[00138] reference_PID provides the PID of MPEG-2 TS packets 5 transmitting a reference program element.
[00139] program_element_count indicates the number of program elements indexed in the next loop and can be set to 1 or greater.
[00140] reference_count indicates the number of elements indexed in the second loop and can be set to 1 or greater.
10 [00141] PID indicates the PID of MPEG-2 TS packets transmitting a program element that provides presentation_time, and one PID in a corresponding loop is identical to reference_PID
[00142] earlist_presentation_time indicates the fastest presentation time on a PU in a program element identified by PID.
15 [00143] When ref erence_type is set to '0', the reference represents a subsegment, and when reference_type is set to ' 1', the reference represents an m2si box.
[00144] Discontinuity can be set to '1' to indicate that a corresponding subsegment starts with PTS discontinuity.
20 [00145] contains_PTS can be set to '1' to indicate that PTS of
- 31 -
the first PU of a reference program element is delivered in a corresponding subsegment.
[00146] contains_RAP can be set to ' 1' to indicate that a reference program element in the referenced subsegment includes a
5 random access point (RAP).
[00147] reference_offset indicates a distance in a byte unit from the first byte of the first subsegment indexed in a corresponding box to the first byte of a subsegment referenced by the iteration of a corresponding loop.
10 [00148] subsegment_duration delivers the sum of subsegment_duration fields in a corresponding box, when the reference represents an m2si box, and also, delivers a difference between the fastest presentation time of an arbitrary PU of a reference program element in the next subsegment and the fastest 15 presentation time of an arbitrary PU of a reference program element in the referenced subsegment, when the reference represents a subsegment. The duration may be represented as a signaled timescale.
[00149] PTS indicates the fastest MPEG-2 TS PTS of a reference 20 program element in a corresponding subsegment.
- 32 -
[00150] RAP_delta_time provides a presentation time offset of an RAP for earlist_presentation_time of a reference program element when contains_RAP is set to '1' . Moreover, when contains_RAP is '0', RAP_delta_time is maintained with a value of
5 J0f .
[00151] Moreover, when contains_PTS is '1', the PTS provides the fastest PTS of a reference program element in a corresponding subsegment.
[00152] The metadata having the above configuration may be 10 delivered to the client 30 through various methods.
[00153] For example, the metadata represented with the binary format may be attached to the front of MPEG-2 TS as a header of segments. The boxes may be positioned before the first byte of an indexed subsegment, and in this case, the data offset may include
15 the size of the metadata boxes.
[00154] Additionally, when a subsegment is referenced, the first byte of a subsegment may be interpreted as the first byte of a metadata box indexing the subsegment.
[00155] The metadata expressed in XML or a binary format may 20 be delivered to the client 30 as an additional file, and this may
- 33 -
be an HTTP resource or provided out-of-band for the client 30.
[00156] In this case, the metadata are provided as an additional resource on HTTP, and the client 30 uses HTTP URL to obtain a corresponding resource. Accordingly, this may be
5 provided from an MPD as a mechanism similar to URL information for media segments.
[00157] However, a set of the metadata boxes may be grouped into one file. Accordingly, the overhead of a storage medium may be reduced and the delivery may be facilitated, and also,
10 different segment sizes may be adaptive.
[00158] For this, in a segment level of a given representation, an additional indexing mechanism based on a super-segment may be required to index a set of the boxes.
[00159] Figs. 14 and 15 are views illustrating a configuration 15 of the segment index box, i.e., a configuration of a super-
segment index box, according to a second embodiment of the present invention.
[00160] Referring to Fig. 14, the MPEG-2 super-segment index box (mp2s) may provide an index on a set of the MPEG-2 segment
20 index boxes (m2si) for at least one segment in a representation.
- 34 -
[00161] Referring to the syntax structure of the mp2s box shown in Fig. 15, reference_count indicates the number of segments indexed by a corresponding loop.
[00162] segment_index is a unique segment index of a segment 5 used by segment list generation.
[00163] m2si_count indicates the number of MPEG-2 segment index boxes (m2si) in one segment identified by current segment_index.
[00164] offset indicates a distance in a byte unit from the
10 first byte of the first MPEG-2 segment index box (m2si) indexed by a corresponding loop to the first byte of the first MPEG-2 segment index box (m2si) for the indexed segment.
[00165] A media content transmitting device according to an embodiment of the present invention may perform at least some of
15 the operations of the server described with reference to Figs. 1 to 15, and for this, may include the media encoder 11 configuring media content with a plurality of representation and the segmenter 12 dividing the configured representation into a plurality of segments.
20 [00166] Hereinafter, a device and method for receiving media
- 35 -
content will be described with reference to Figs. 16 to 17 according to embodiments of the present invention.
[00167] A receiving device 300 may perform an operation of the client 30 described with reference to Figs. 1 to 15.
5 [00168] Additionally, since a configuration of a media content that the receiving device 300 receives, for example, configuration of a representation, a segment, and a subsegment, may be identical to the configuration described with reference to Figs. 3 to 11, its detailed description will be omitted.
10 [00169] Fig. 16 is a block diagram illustrating a configuration of a media content receiving device according to an embodiment of the present invention. The receiving device 300 includes a network interface unit 310, a decoding unit 320, and a synchronization unit 330.
15 [00170] Referring to Fig. 16, the network interface unit 310 of the receiving device 300, for example, an HTTP interface, may receive a media file in addition to the synchronization information, and the decoding unit 320 may decode and output the received media file.
20 [00171] Moreover, the synchronization unit 330 synchronizes
- 36 -
the decoded media file with a media presentation time line by using a media presentation time in the synchronization information.
[00172] For example, the network interface unit 310 may 5 receive the synchronization information (Sync Info.) having the same configuration described with reference to Figs. 6 to 13 in addition to MPET-2 TS.
[00173] The decoding unit 320 may include a second decoder 322 for decoding MPEG-2 TS, and the second decoder 322 may be a
10 general MPEG-2 TS decoder.
[00174] The second decoder 322 may receive MPEG-2 TS from the network interface unit 310, and may decode it, and then, may output the decoded PUs and their PTSs.
[00175] Moreover, the synchronization unit 330 may identify a 15 PU having a presentation time provided through the received synchronization information. For this, the network interface unit 310 may restore the PTSs of PUs having a presentation time provided and may deliver them to the synchronization unit 330, or the PTS of a corresponding PU in addition to the presentation 20 time may need to be provided from the transmitting device 100.
- 37 -
[00176] In order to restore the PTS, elementary stream time recovery may be used.
[00177] Additionally, the decoding unit 320 may further include a first decoder 321 for decoding other media received
5 through the network interface unit 310, and the first decoder 321 may be a decoder for decoding another format of media such as a 3GPP file format, or the same MEPG-2 TS decoder as the second decoder 322.
[00178] The synchronization unit 330 may synchronize the MPEG-10 2 TS based media outputted from the second decoder 322 with other media outputted from the first decoder 321 by using the encoding information (for example, media presentation times on PUs) delivered from the transmitting device 100.
[00179] Moreover, the receiving device 300 may play the MPEG-2 15 TS based media synchronized by the synchronization unit 330 and the other media simultaneously.
[00180] Fig. 17 is a flowchart illustrating a media content receiving method according to an embodiment of the present invention. The same method described with reference to Figs. 1 to
20 16 among receiving methods shown will be omitted.
- 38 -
[00181] Referring to Fig. 17, the transmitting device 300 receives a plurality of representations including media content in operation S400, and plays the received representation in operation S420.
5 [00182] For example, a receiving unit (not shown) in the network interface unit 310 of the transmitting device 300 may request transmission for a segment or a partial segment to a transmitting device by using the HTTP GET or HTTP partial GET method defined by HTTP 1.1(RFC 2616).
10 [00183] In addition, a control unit (not shown) in the network interface unit 310 may select the representation to be received from among a plurality of representations having the different bit rates, resolutions, languages, or codes of media content by using an MPD received from the transmitting device.
15 [00184] While the received representation is played, the control unit confirms whether bitstream switching is required in operation S410.
[00185] When the bitstream switching is required, the control unit switches the received representation to a new representation 20 in operation S430.
- 39 -
[00186] A method of the control unit in the transmitting device 300 to switch representations may be identical to the switching operation of the client 300 described with reference to Figs. 3 to 11.
5 [00187] The method of receiving and processing media content described with reference to Figs. 1 to 17 may be performed by an IPTV receiver.
[00188] Hereinafter, a configuration of an IPTV receiver will be described with reference to Figs. 18 and 19 according to an
10 embodiment of the present invention.
[00189] Fig. 18 is a block diagram illustrating a configuration of an IPTV receiver according to an embodiment of the present invention.
[00190] The IPTV receiver may include an additional tuner to 15 receive terrestrial broadcasting, cable broadcasting, and satellite broadcasting. However, for convenience of description, a configuration of receiving an IPTV service provided using an IP network will be mainly described.
[00191] Moreover, an ITF represents an Open IPTV Terminal 20 Function, and may mean a receiver consisting of a function module
- 40 -
necessary for providing IPTV service.
[00192] The IPTV receiver may include a Network Interface 501, a TCP/IP Manager 502, a Service Delivery Manager 504, a PVR Manager 505, a Demux 508, a Data Decoder 507, an Audio/Video
5 Decoder 512, an A/V Display & OSD Module 515, an Application Managers 513 and 514, a SI & Metadata DB 511, a Service Discovery Manager 509, a Service Control Manager 503, a Metadata Manager 510, and a Content DB 506.
[00193] Referring to Fig.18, the network interface 501 10 receives packets from a network, and transmits packets to a network. That is, the network interface 501 receives service and media content from a service provider via a network.
[00194] The TCP/IP manager 502 is involved in packet delivery from a source to a destination, with respect to packets received
15 by an IPTV receiver and packets transmitted from an IPTV receiver. Additionally, the TCP/IP manager 502 classifies received packets to correspond to an appropriate protocol, and outputs packets classified by the service delivery manager 504, the service discovery manager 509, the service control manager 503, and the 20 metadata manager 510.
- 41 -
[00195] The service delivery manager 504 controls received service data. For example, while controlling realtime streaming data, the service delivery manager 504 may use RTP/RTCP.
[00196] When the realtime streaming data are transmitted using 5 a RTP, the service delivery manager 504 parses the received data packets according to the RTP to transmit them to the demux 508, or store them in the content DB 506 according to a control of the service manager 514. Additionally, the service delivery manager 504 feeds back the network reception information to a server
10 providing service by using RTCP.
[00197] The demux 508 demultiplexes the received packets into audio, video, and Program Specific Information (PSI) data, and then, transmits them to the audio/video decoder 512, and the data decoder 507, respectively.
15 [00198] The data decoder 507 decodes service information such as PSI. That is, the data decoder 507 receives a PSI section, a Program and Service Information Protocol (PSIP) section, or a DVB-Service Information (SI) section, demuliplexed by the demux 508, and then, decode it.
20 [00199] Additionally, the data decoder 507 decodes the
- 42 -
received sections and creates a database on service information, and also stores the database on the service information in the SI & Metadata DB 511.
[00200] The Audio/Video Decoder 512 decodes the video and 5 audio data received from the demux 508. The video and audio data decoded by the Audio/Video Decoder 512 decodes are provided to a user through the A/V Display & OSD Module 515.
[00201] The application manager manages overall states of the IPTV receiver, provides a user interface, and manages another
10 manager. For this, the application manager includes a user interface manager 513 and a service manager 514.
[00202] The user interface manager 513 provides a graphic user interface to a user through on screen display (OSD), and performs an operation of a receiver according to an input after receiving
15 a key input from a user. For example, when receiving a key input relating to a channel selection from a user, the user interface manager 513 transmits the key input signal to the service manager 514 .
[00203] The service manager 514 controls a service related 20 manager such as the service delivery manager 504, the service
- 43 -
discovery manager 509, the service control manager 503, and the metadata manager 510.
[00204] Additionally, the service manager 514 creates a channel map and selects a channel by using the channel map
5 according to a key input received from the user interface manager 513. Also, the service manager 514 receives service information on a channel from the data decoder 507, and sets the audio/video PID of the selected channel to the demux 508.
[00205] The service discovery manager 509 provides information 10 necessary for selecting a service provider that provides service.
When a signal on channel selection is received from the service manager 512, the service discovery manager 509 discovers service by using the information.
[00206] The service control manager 503 is responsible for 15 service selection and control. For example, when a user selects an existing broadcasting method such as live broadcasting service, the service control manager 503 uses IGMP or RTSP. When a user selects video on demand (VOD) service, the service control manager 503 selects and controls the service by using RTSP. 20 [00207] The RTSP may provide a trick mode on realtime
- 44 -
streaming. Additionally, the service control manager 503 may initialize and manage a session through an IMC gateway by using IP Multimedia Subsystem (IMS) and Session Initiation Protocol (SIP) . The above protocols are just one example, and may vary 5 depending on embodiments.
[00208] The metadata manager 510 manages service related metadata and stores the metadata in the SI & Metadata DB 511.
[00209] The SI & Metadata DB 511 stores system information decoded by the data decoder 507, metadata that the metadata
10 manager 510 manages, and information necessary for selecting a service provider provided by the service discovery manager 509. Additionally, the SI & Metadata DB 511 may store setup data for a system.
[00210] The SI & Metadata DB 511 and the content DB 506 may be 15 implemented using nonvolatile RAM (NVRAM) or flash memory, and using two areas that are logically separated on the same storage area.
[00211] The PVR manager 505, as a module for recording and playing live streaming content, collects metadata on recorded
20 content and also generate additional information such as an
- 45 -
thumbnail image or index provided to a user.
[00212] Functions of the control unit in the IPTV receiver may be divided and implemented by a plurality of modules such as the TCP/IP Manager 502, the Service Delivery Manager 504, the PVR
5 Manager 505, the Application Managers 513 and 514, the Service Discovery Manager 509, the service control manager 503, and the metadata manager 510.
[00213] For example, the TCP/IP manager 502 controls the network interface 501 to request only a payload or segment
10 corresponding to a specific package (for example, a package that the IPTV receiver subscribes) to a server by filtering SD&S information with the target package information and receive it.
[00214] Moreover, the TCP/IP manager 502 filters the SD&S information received in a multicast method by using the target
15 package information, and allows the data decoder 507 to parse and process only a payload or segment corresponding to a specific package.
[00215] Fig. 19 is a block diagram illustrating an configuration of an IPTV receiver according to another embodiment
20 of the present invention. The configuration of the IPTV receiver
- 46 -
is represented with functional blocks. The solid line arrow of Fig. 19 corresponds to a data path, and the dotted line arrow corresponds to a control signal path.
[00216] A Cable modem, DSL modem, etc 601, as an interface 5 allowing an ITF to connect to an IP network in a physical level,
restores a digital signal by demodulating a signal transmitted through a physical medium.
[00217] An Ethernet NIC 602 restores a signal received through the physical interface to IP data and an IP Network Stack 607
10 processes each layer according to an IP protocol stack.
[00218] Moreover, an XML Parser 609 parses XML document among the received IP data, and a File Handler 608 processes data in a file format, which are transmitted through FLUTE, among the received IP data.
15 [00219] An SI Handler 611 processes a portion corresponding to IPTV service information in the received data in a file format, and stores it in a storage 612, and also processes a portion corresponding to IPTV EPG information in the received data in a file format, and stores it in the storage 612.
20 [00220] The storage 612 stores various data such as the SI and
- 47 -
EPG.
[00221] An SI Decoder 613 receives SI data from the storage 612 and analyzes them to obtain channel map information. An EPG Decoder 614 analyzes the EPG data stored in the storage 612 and
5 restores information necessary for EPG configuration.
[00222] An ITF Operation Controller 615 is a main controller controlling a channel change or an operation of an ITF such as an EPG display.
[00223] A Channel Service Manager 616 performs an operation 10 such as channel change according to a user input. An Application
Manager 617 performs application service such as an EPG display according to a user input.
[00224] An MPEG-2 Demultiplexer 603 extracts MPEG-2 Transport Stream data from the received IP datagram and delivers them to a
15 corresponding module according to PID.
[00225] Additionally, an MPEG-2 PSI/PSIP Parser 604 extracts the PID of A/V data or PSI/PSIP data including access information on a program element from the MPEG-2 Transport Stream and parses them.
20 [00226] Moreover, an A/V Decoder 605 decodes inputted audio
- 48 -
and video data and delivers them to a display module 606, and the display module 606 outputs the decoded A/V data or application.
[00227] According to an embodiment of the present invention, when a media content configured with a plurality of 5 representations having different bit rates is transmitted/received by a segment unit, the loss or duplicate reception of data, which occur during bit stream switching, may be prevented by aligning subsegments in the segment in synchronization with the presentation time of a representation.
10 [00228] Additionally, a client receiving media content may easily switch a representation in an arbitrary subsegment including a Representation Access Point (RAP) without additional searching or parsing.
[00229] Accordingly, even when bit stream switches due to a
15 change in a network state, media content is seamlessly played.
[00230] The invention can also be embodied as computer readable codes on a computer readable recording medium. The computer readable recording medium is any data storage device that can store data which can be thereafter read by a computer
20 system. Examples of the computer readable recording medium
- 49 -
include read-only memory (ROM), random-access memory (RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage devices , and carrier waves (such as data transmission through the Internet) .
5 [00231] The computer readable recording medium can also be distributed over network coupled computer systems so that the computer readable code is stored and executed in a distributed fashion. Also, functional programs, codes, and code segments for accomplishing the present invention can be easily construed by 10 programmers skilled in the art to which the present invention pertains.
[00232] Although embodiments have been described with reference to a number of illustrative embodiments thereof, it should be understood that numerous other modifications and 15 embodiments can be devised by those skilled in the art that will fall within the spirit and scope of the principles of this disclosure. More particularly, various variations and modifications are possible in the component parts and/or arrangements of the subject combination arrangement within the 20 scope of the disclosure, the drawings and the appended
- 50 -
claims. In addition to variations and modifications in the component parts and/or arrangements, alternative uses will also be apparent to those skilled in the art.

Claims (1)

  1. - 51 -
    What is claimed is:
    1. A method of transmitting media content via internet, the method comprising:
    5 configuring the media content with a plurality of representations;
    dividing the configured representation into a plurality segments;
    aligning subsegments in the segment; and 10 transmitting one of the representations,
    wherein the subsegments are aligned to allow the earliest presentation time of the kth subsegment in the first representation to be greater than the latest presentation time of the k-lth subsegment in the second representation among the 15 representations.
    2. The method according to claim 1, wherein the plurality of representations are differently configured according to at least one of a bit rate, resolution, language, and codec of the
    20 transmitted media content.
    - 52 -
    3. The method according to claim 1, wherein there is index information on all subsegments in the segment.
    5 4. The method according to claim 3, wherein each of the subsegments comprises a segment index.
    5. The method according to claim 1, wherein with respect to a subsegment including a representation access point (RAP)
    10 among the subsegments, a presentation time of the first RAP in the subsegment corresponds to the earliest presentation time of the subsegment.
    6. The method according to claim 1, wherein the earliest 15 presentation time of the subsegment is the earliest presentation time of an arbitrary sample in a reference stream of the subsegment.
    7. The method according to claim 1, wherein the latest 20 presentation time of the subsegment is a presentation time of the
    - 53 -
    latest sample in a reference stream of the subsegment.
    8. The method according to claim 1, wherein the subsegment comprises one or at least two continuous MPEG-2 TS packets.
    5
    9. The method according to claim 1, further comprising, in response to a request received from a client, switching the transmitted representation.
    10 10. The method according to claim 1, further comprising transmitting a media presentation description (MPD) including metadata to access the segments,
    wherein the MPD comprises a first flag indicating whether the bitstream switching can be performed on the representations 15 and a second flag indicating whether the subsegments are aligned.
    11. A method of receiving media content via internet, the method comprising:
    receiving a first representation among a plurality of 20 representations including the media content;
    - 54 -
    switching the received first representation to a second representation; and playing the received first representation,
    wherein
    5 the representation is divided into a plurality of segments and received; and subsegments in the segment are aligned to allow the earliest presentation time of the kth subsegment in one of the first and second representations to be greater than the latest presentation 10 time of the k-lth subsegment in the other representation.
    12. The method according to claim 11, wherein there is index information on all subsegments in the segment.
    15 13. The method according to claim 11, wherein with respect to a subsegment including a representation access point (RAP) among the subsegments, a presentation time of the first RAP in the subsegment corresponds to the earliest presentation time of the subsegment.
    20
    - 55 -
    14. The method according to claim 11, wherein the first and second representations received before/after the switching do not comprise duplicate data.
    5 15. The method according to claim 11, wherein there are no missing data in the first and second representations received before/after the switching.
    16. The method according to claim 11, wherein the switching 10 of the received representation to the second representation comprises switching to an arbitrary subsegment including a RAP in the second representation.
    17. The method according to claim 11, wherein the switching 15 of the received representation to the second representation is performed at a boundary of continuous subsegments.
    18. A device for transmitting media content via internet, the device comprising:
    20 a media encoder configuring the media content with a
    - 56 -
    plurality of representations;
    a segmenter segmenting the configured representation into a plurality of segments,
    wherein subsegments in the segment are aligned to allow the 5 earliest presentation time of the kth subsegment in the first representation to be greater than the latest presentation time of the k-lth subsegment in the second representation among the representations.
    10 19. The device according to claim 18, wherein with respect to a subsegment including a representation access point (RAP) among the subsegments, a presentation time of the first RAP in the subsegment corresponds to the earliest presentation time of the subsegment.
    15
    20. A device for receiving media content via internet, the device comprising:
    a receiving unit receiving one of a plurality of representations including the media content; and
    20 a control unit switching the received representation,
    wherein the representation is divided into a plurality of segments and received; and subsegments in the segment are aligned to allow the earliest presentation time of the kth subsegment in the first representation to be greater than the latest presentation time of the k-lth subsegment in the second representation among the representations.
GB1308663.2A 2011-10-27 2011-10-27 Method for transreceiving media content and device for transreceiving using same Active GB2499539B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/KR2011/008094 WO2012060581A2 (en) 2010-11-02 2011-10-27 Method for transreceiving media content and device for transreceiving using same

Publications (3)

Publication Number Publication Date
GB201308663D0 GB201308663D0 (en) 2013-06-26
GB2499539A true GB2499539A (en) 2013-08-21
GB2499539B GB2499539B (en) 2017-05-03

Family

ID=48703939

Family Applications (1)

Application Number Title Priority Date Filing Date
GB1308663.2A Active GB2499539B (en) 2011-10-27 2011-10-27 Method for transreceiving media content and device for transreceiving using same

Country Status (1)

Country Link
GB (1) GB2499539B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3148197A1 (en) * 2015-09-25 2017-03-29 Comcast Cable Communications, LLC Coordinating content segmentation
EP4391559A1 (en) * 2022-12-20 2024-06-26 Broadpeak Method for delivering an audio and/or video content and transcoding proxy implementing said method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050076136A1 (en) * 2002-09-17 2005-04-07 Samsung Electronics Co., Ltd. Apparatus and method for streaming multimedia data
US20080074545A1 (en) * 2006-02-13 2008-03-27 Kei Matsubayashi Transmitting apparatus, transmitting method, receiving apparatus, receiving method, and transmitting and receiving system
US20090300203A1 (en) * 2008-05-30 2009-12-03 Microsoft Corporation Stream selection for enhanced media streaming

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9432433B2 (en) * 2006-06-09 2016-08-30 Qualcomm Incorporated Enhanced block-request streaming system using signaling or block creation
US20120114118A1 (en) * 2010-11-05 2012-05-10 Samsung Electronics Co., Ltd. Key rotation in live adaptive streaming

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050076136A1 (en) * 2002-09-17 2005-04-07 Samsung Electronics Co., Ltd. Apparatus and method for streaming multimedia data
US20080074545A1 (en) * 2006-02-13 2008-03-27 Kei Matsubayashi Transmitting apparatus, transmitting method, receiving apparatus, receiving method, and transmitting and receiving system
US20090300203A1 (en) * 2008-05-30 2009-12-03 Microsoft Corporation Stream selection for enhanced media streaming

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3148197A1 (en) * 2015-09-25 2017-03-29 Comcast Cable Communications, LLC Coordinating content segmentation
EP4391559A1 (en) * 2022-12-20 2024-06-26 Broadpeak Method for delivering an audio and/or video content and transcoding proxy implementing said method
WO2024132841A1 (en) * 2022-12-20 2024-06-27 Broadpeak Method for delivering an audio and/or video content and transcoding proxy implementing said method

Also Published As

Publication number Publication date
GB201308663D0 (en) 2013-06-26
GB2499539B (en) 2017-05-03

Similar Documents

Publication Publication Date Title
US9781188B2 (en) Method for transreceiving media content and device for transreceiving using same
US10009660B2 (en) Media content transceiving method and transceiving apparatus using same
US10129609B2 (en) Method for transceiving media files and device for transmitting/receiving using same
US10939149B2 (en) Apparatus and method for transmitting/receiving processes of a broadcast signal
GB2499539A (en) Method for transreceiving media content and device for transreceiving using same
KR20140103846A (en) Method for transmitting/receiving media content and transmitting/receiving apparatus thereof