WO2015080414A1 - 트릭 플레이 서비스 제공을 위한 방송 신호 송수신 방법 및 장치 - Google Patents
트릭 플레이 서비스 제공을 위한 방송 신호 송수신 방법 및 장치 Download PDFInfo
- Publication number
- WO2015080414A1 WO2015080414A1 PCT/KR2014/011042 KR2014011042W WO2015080414A1 WO 2015080414 A1 WO2015080414 A1 WO 2015080414A1 KR 2014011042 W KR2014011042 W KR 2014011042W WO 2015080414 A1 WO2015080414 A1 WO 2015080414A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- temporal
- picture
- information
- present
- field
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/238—Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
- H04N21/2387—Stream processing in response to a playback request from an end-user, e.g. for trick-play
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/65—Transmission of management data between client and server
- H04N21/658—Transmission by the client directed to the server
- H04N21/6587—Control parameters, e.g. trick play commands, viewpoint selection
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
- H04N19/31—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability in the temporal domain
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23605—Creation or processing of packetized elementary streams [PES]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/236—Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
- H04N21/23614—Multiplexing of additional data and video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/414—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
- H04N21/4147—PVR [Personal Video Recorder]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/432—Content retrieval operation from a local storage medium, e.g. hard-disk
- H04N21/4325—Content retrieval operation from a local storage medium, e.g. hard-disk by playing back content from the storage medium
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/633—Control signals issued by server directed to the network components or client
- H04N21/6332—Control signals issued by server directed to the network components or client directed to client
- H04N21/6336—Control signals issued by server directed to the network components or client directed to client directed to decoder
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/63—Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
- H04N21/637—Control signals issued by the client directed to the server or network components
- H04N21/6377—Control signals issued by the client directed to the server or network components directed to server
- H04N21/6379—Control signals issued by the client directed to the server or network components directed to server directed to encoder, e.g. for requesting a lower encoding rate
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/60—Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
- H04N21/65—Transmission of management data between client and server
- H04N21/658—Transmission by the client directed to the server
- H04N21/6581—Reference data, e.g. a movie identifier for ordering a movie or a product identifier in a home shopping application
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/816—Monomedia components thereof involving special video data, e.g 3D video
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8453—Structuring of content, e.g. decomposing content into time segments by locking or enabling a set of features, e.g. optional functionalities in an executable program
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8455—Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/83—Generation or processing of protective or descriptive data associated with content; Content structuring
- H04N21/845—Structuring of content, e.g. decomposing content into time segments
- H04N21/8456—Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/85—Assembly of content; Generation of multimedia applications
- H04N21/854—Content authoring
- H04N21/85406—Content authoring involving a specific file format, e.g. MP4 format
Definitions
- the present invention relates to the transmission and reception of broadcast signals. More specifically, the present invention relates to a method and / or apparatus for transmitting and receiving a broadcast signal for providing a trick play service.
- VCEG Video Coding Experts Group
- MPEG Moving Picture Experts Group
- FDIS HEVC
- HEVC is a next-generation video compression standard that shows coding efficiency of about 35% compared to H.264 / AVC, and is attracting attention as a key technology for effectively compressing massive data of HD and UHD video. It is anticipated that optimized HEVC software and hardware standards will enter the market through efforts to commercialize HEVC technology worldwide.
- the trick play refers to a service that provides a random access and a double speed function, such as 2x, 4x, etc., which can play an image of an arbitrary time. Since there is a difference between the random access point of HEVC and the random access point of H.264, it is necessary to newly define the category of the random access point of HEVC. Also, because HEVC provides scalability, it is necessary to use this to provide trick play. In addition, the existing CFF media file format specification defines the format for trick play in H.264 / AVC. However, since there is no definition of HEVC encoded content, it is necessary to provide a new format for decoding and trick play of HEVC encoded content.
- the existing AVC / H.264-based pictures are divided into pictures using the concept of tier according to dependency between pictures.
- the receiver decodes and displays only pictures having a specific tier value to provide a trick play service.
- HEVC basically provides temporal scalability of video streams, it is necessary to study how to provide a trick play service using temporal scalability.
- the user can know the maximum speed information that can be provided, it will be convenient to use the trick play. Therefore, there is a need for a method of providing a user with maximum speed information that can be provided.
- An object of the present invention is to solve the above problems and to provide a method and / or apparatus for transmitting and receiving a broadcast signal for providing a trick play service.
- an object of the present invention is to provide trick play related information for providing a trick play service.
- an object of the present invention is to provide a method for signaling trick play related information.
- an object of the present invention is to provide a signaling method capable of providing maximum speed information to a user.
- an object of the present invention is to provide a method for providing trick play using temporal scalability basically provided by HEVC.
- an object of the present invention is to provide a method of providing a better trick play service while using the existing trick play related information as much as possible.
- the broadcast signal transmission method includes the steps of: encoding a video data to generate a video unit stream, generating a packetized elementary stream (PES) including the video unit stream, the generated PES Generating a transport stream (TS), wherein the TS includes PVR assist information for performing trick play, wherein the PVR assist information includes tier number information and maximum temporal identification information,
- the tier number information indicates a tier number having a value obtained by adding 1 to a temporal identification information value of a picture other than a RAP
- the maximum temporal identification information indicates a maximum temporal identification of a video unit stream including the encoded video data. Indicating an information value and transmitting the generated TS.
- the tier number indicated by the tier number information may have a value of 0 in the RAP picture.
- said maximum temporal identification information may be used to provide information about the speed of trick play.
- the PVR assist information may be included in an adaptation field of the TS.
- the PVR assist information may include segmentation info flag information indicating whether information on a segment to which a picture belongs is present.
- the PVR assist information may include segment identifier information indicating an id of a segment to which a picture belongs.
- the PVR assist information may include program identifier information indicating an id of a program to which a picture belongs.
- the PVR assist information includes at least one of segment start flag information for identifying a picture having a first play time order in each segment and segment end flag information for identifying a picture with a last play time order in each segment. It may include.
- the PVR assist information includes at least one of program start flag information for identifying a picture having a first play time order in each program and program end flag information for identifying a picture with a last play time order in each program. It may include.
- a broadcast signal receiving apparatus includes a receiver for receiving a transport stream (TS), wherein the TS includes PVR assist information for performing a trick play, wherein the PVR assist information is a tier number.
- Information and maximum temporal identification information wherein the tier number information indicates a tier number having a value obtained by adding 1 to a temporal identification information value of a picture that is not a RAP, and the maximum temporal identification information is encoded by the encoded information.
- a first extractor which indicates a maximum temporal identification information value of a video unit stream including video data, extracts a packetized elementary stream (PES) from the received TS, and a second extractor that extracts a video unit stream from the extracted PES
- An extractor may include a decoder to decode the extracted video unit stream.
- the tier number indicated by the tier number information may have a value of 0 in the RAP picture.
- said maximum temporal identification information may be used to provide information about the speed of trick play.
- the PVR assist information may be included in an adaptation field of the TS.
- the PVR assist information may include segmentation info flag information indicating whether information on a segment to which a picture belongs is present.
- the PVR assist information may include segment identifier information indicating an id of a segment to which a picture belongs.
- the PVR assist information may include program identifier information indicating an id of a program to which a picture belongs.
- the PVR assist information includes at least one of segment start flag information for identifying a picture having a first play time order in each segment and segment end flag information for identifying a picture with a last play time order in each segment. It may include.
- the PVR assist information includes at least one of program start flag information for identifying a picture having a first play time order in each program and program end flag information for identifying a picture with a last play time order in each program. It may include.
- an effective trick play service can be provided while maintaining the existing system related to the trick play service to the maximum.
- the present invention by providing a trick play service using a structure based on a temporal ID of an HEVC video stream, it is not necessary to signal a specific picture to be used for trick play in the encoding step, thereby providing a faster encoding speed.
- FIG. 1 is a diagram illustrating a trick play method according to an embodiment of the present invention according to a scenario.
- CFF common file format
- FIG. 3 is a diagram illustrating syntax of an "hvcn" box according to an embodiment of the present invention.
- HDR high dynamic range
- FIG. 5 is a diagram illustrating a picture type for random access in the case of an HEVC stream according to an embodiment of the present invention.
- FIG. 7 is a diagram illustrating a trick play method when an open GOP and a GOP includes a decodable leading picture according to an embodiment of the present invention. (Scenario 1-2)
- FIG 8 is a diagram illustrating a trick play method when an open GOP and a GOP includes a decodable leading picture and a skipped leading picture according to an embodiment of the present invention.
- Scenario 1-2 is a diagram illustrating a trick play method when an open GOP and a GOP includes a decodable leading picture and a skipped leading picture according to an embodiment of the present invention.
- FIG. 9 is a diagram illustrating a trick play method when an open GOP and a GOP includes a skipped leading picture according to an embodiment of the present invention. (Scenario 1-2)
- FIG. 10 illustrates a configuration of a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 according to an embodiment of the present invention. (Scenario 1-2)
- FIG. 11 is a diagram illustrating a configuration of a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 according to another embodiment of the present invention. (Scenario 1-2)
- FIG. 12 illustrates a description of pic_type included in a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 according to an embodiment of the present invention.
- FIG. 13 is a diagram illustrating a configuration of a trick play box for supporting trick play of an HEVC stream having max_temporal_id equal to 0 when pic_type does not include content related to a leading picture according to an embodiment of the present invention.
- Scenario 1-1
- FIG. 14 is a diagram illustrating a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 when pic_type does not include content related to a leading picture according to another embodiment of the present invention. (Scenario 1-1)
- FIG. 15 is a diagram illustrating the configuration of an HEVC stream supporting temporal scalability according to an embodiment of the present invention.
- FIG. 16 illustrates a configuration of a trick play box for supporting trick play by limiting a maximum speed in an HEVC stream supporting temporal scalability according to an embodiment of the present invention.
- scenario 2 the scenario 2
- FIG. 17 is a diagram showing the configuration of a trick play box for supporting trick play by limiting a maximum speed in an HEVC stream supporting temporal scalability according to another embodiment of the present invention. (Scenario 2)
- FIG. 18 is a diagram illustrating a method of changing a frame rate when the temporal sub-layer picture type is TSA according to an embodiment of the present invention.
- FIG. 19 is a diagram illustrating a method of changing a frame rate when the temporal sub-layer picture type is STSA according to an embodiment of the present invention. (Scenario 3)
- FIG. 20 is a diagram illustrating the configuration of a trick play box for supporting trick play at high speed in an HEVC stream supporting temporal scalability according to an embodiment of the present invention. (Scenario 3)
- FIG. 21 is a diagram showing the configuration of a trick play box for supporting trick play at high speed in an HEVC stream supporting temporal scalability according to another embodiment of the present invention.
- 22 is a diagram showing the structure of a broadcast signal receiving system according to an embodiment of the present invention.
- FIG. 23 is a diagram showing the structure of a receiving end according to an embodiment of the present invention.
- FIG. 24 is a diagram illustrating a trick play method using a combination of a temporal id and a tier according to an embodiment of the present invention.
- 25 is a diagram illustrating a trick play method according to a conventional tier concept according to an embodiment of the present invention.
- FIG. 26 is a diagram illustrating a trick play method according to a method of mapping one temporal id to one tier 1: 1 according to an embodiment of the present invention. (Scenario A-a)
- FIG. 27 is a diagram illustrating a trick play method according to a method of mapping one temporal id to one tier 1: 1 according to another embodiment of the present invention. (Scenario A-a)
- FIG. 28 is a diagram illustrating a result of mapping one temporal id to one tier 1: 1 according to an embodiment of the present invention.
- 29 is a diagram illustrating a result of mapping one temporal id to one tier 1: 1 according to another embodiment of the present invention.
- FIG. 30 is a diagram illustrating a trick play method according to a method of mapping one temporal id to several tiers according to an embodiment of the present invention.
- FIG. 31 is a diagram illustrating a configuration of an adaptation field of a TS packet including information for mapping a temporal id and a tier according to an embodiment of the present invention.
- 32 is a diagram illustrating the configuration of HEVC_temporal_id_tier_mapping_info according to an embodiment of the present invention.
- FIG. 33 is a diagram showing the configuration of a trick_play_speed field included in HEVC_temporal_id_tier_mapping_info according to an embodiment of the present invention.
- FIG. 34 is a diagram illustrating a configuration of PVR_assist_information according to an embodiment of the present invention.
- FIG. 36 is a diagram illustrating a configuration of PVR_assist_information to which temporal id frame work is added according to another embodiment of the present invention.
- FIG. 37 illustrates a configuration of PVR_assist_information for supporting trick play using a temporal id according to an embodiment of the present invention.
- 38 is a diagram illustrating a receiving device according to an embodiment of the present invention.
- FIG. 39 illustrates a comparison of a tier framework and an HEVC temporal sub-layer according to an embodiment of the present invention.
- FIG. 40 is a diagram showing the configuration of PVR_assist_information according to another embodiment of the present invention.
- 41 is a view showing a trick play method using an HEVC temporal sub-layer according to an embodiment of the present invention.
- FIG. 43 is a diagram showing the structure of a broadcast signal receiving apparatus according to an embodiment of the present invention.
- High Efficiency Video Coding is a high-efficiency video coding standard that offers the same video quality with approximately twice the compression rate compared to traditional H.265 / AVC technology.
- Temporal scalability refers to temporal scalability and means a method of encoding different frame frequencies at the same spatial resolution.
- Trick play refers to a function that provides ramdom access to play a video after a certain time and provides a double speed function.
- Open GOP refers to a structure that can be encoded by using a picture located before a corresponding GOP as a reference picture when encoding a picture in one GOP. That is, it means a GOP including a leading picture.
- the closed GOP refers to a structure in which only a picture in a corresponding GOP is used as a reference picture when encoding a picture in one GOP. That is, unlike in Open GOP, it does not include a leading picture.
- a leading picture is a picture in which decoding order is slower than that of IRAP in HEVC, but a reproduction order is fast.
- Temporal id is a term introduced to support temporal scalability in HEVC, and may be signaled by nuh_temporal_id_plus1 of NAL_unit_header.
- Tier is a term introduced to support trick play in an AVC stream or an mpeg-2 stream, and may be included in an adaptation field in a TS packet.
- FIG. 1 is a diagram illustrating a trick play method according to an embodiment of the present invention according to a scenario.
- a signaling method for trick play may be defined depending on whether temporal scalability of HEVC is used.
- the video stream according to an embodiment of the present invention is a stream that does not provide temporal scalability, that is, when the video stream includes only pictures with a temporal_id of 0, the video stream does not include the leading picture, so that the CFF media Trick play can be provided by signaling dependency_level and pic_type defined in the file format.
- the CFF media Trick play can be provided by signaling dependency_level and pic_type defined in the file format.
- trick play may be provided by including a leading picture type in pic_type and signaling dependency_level.
- trick play may be provided by signaling dependency_level, pic_type, and temporal_sub_layer_pic_type. More specifically, trick play may be provided by signaling a supportable speed level through temporal_id included in the stream.
- an additional service for double speed may be provided by signaling a support speed level supported through temporal_id and additionally using pictures having a temporal_id of 0.
- the box shown in this figure represents a picture constituting the video stream, and the T_ID described in the box may mean a temporal_id for supporting temporal scalability.
- CFF common file format
- the CFF box structure according to an embodiment of the present invention may include a storage box abbreviated as "hvcn" to support the HEVC-based nal unit.
- FIG. 3 is a diagram illustrating syntax of an "hvcn" box according to an embodiment of the present invention.
- the CFF box structure may include a nal unit storage box abbreviated as "hvcn”.
- a nal unit storage box may be defined for each codec for trick play.
- the HEVC trick box may be defined separately from the existing AVC trick box inside the existing "trick" box without defining a nal unit storage box for each codec for trick play. Also, AVC or HEVC trick play may be selected according to the flag of the "trick" box.
- the HEVCConfig may include a Sequence Parameter Set (SPS) and a Picture Parameter Set (PPS) of the HEVC, and may include video information such as a VUI parameter of the SPS.
- SPS Sequence Parameter Set
- PPS Picture Parameter Set
- HDR high dynamic range
- Common_Metadata referenced in the xml box according to an embodiment of the present invention may include a HighDynamicRange element.
- the HighDynamicRange element according to an embodiment of the present invention may have a string value according to the xml schema and one HighDynamicRange element may or may not exist.
- the HighDynamicRange element may indicate a minimum and / or maximum brightness value (min.luminance and / or max.luminance).
- the HighDynamicRange element may indicate a profile value divided into minimum and / or maximum brightness values.
- the HighDynamicRange element may have a conventional capacity (min: 0.1cd / m2, max: 100cd / m2), Mid capacity (min: 0.001cd / m2, max: 1000cd / m2) or High capacity (min: 0.0001cd / m2 , max: 10000cd / m2).
- FIG. 5 is a diagram illustrating a picture type for random access in the case of an HEVC stream according to an embodiment of the present invention.
- This figure shows the types of pictures that can be the basis for performing random access and trick play among HEVC NAL unit types.
- shaded picture types represent HEVC temporal sub-layer picture types that can be used for trick play when a stream with temporal IDs greater than zero is included, that is, when temporal scalability is provided. Can be.
- the picture types indicated by the above-described shades may be used in scenarios 2 and 3 according to one embodiment of the present invention.
- the random access point pictures may include Instantaneous Decoding Refresh (IDR), Broken Link Access (BLA) and / or Clean Random Access (CRA), and the leading picture is random access decadable leading.
- temporal sub-layer access pictures include Temporal Sub-layer Access (TSA) and / or Step-wise Temporal Sub-layer Access (STSA) can do.
- An Instantaneous Decoding Refresh (IDR) picture may include a case with an associated leading picture and / or a case without an associated leading picture.
- a Broken Link Access (BLA) picture may include a case in which there is a related RADL picture but no related RASL picture, and / or a case in which there is no related leading picture.
- a clean random access (CRA) picture may include a case having an associated leading picture.
- Temporal Sub-layer Access (TSA) pictures may include cases that are not referenced in the same sub-layer and / or cases that are referenced in the same sub-layer.
- Step-wise Temporal Sub-layer Access (STSA) pictures may include cases that are not referenced in the same sub-layer and / or cases that are referenced in the same sub-layer.
- GOP is an abbreviation of Group Of Picture and represents a set of encoded pictures for enabling random access.
- the closed GOP may mean a GOP that does not include a leading picture
- the open GOP may mean a GOP that includes a leading picture.
- the leading picture may represent a picture that is slower in decoding order but faster in display order than IRAP (Intra Random Access Point, which is the same concept as a random access point in AVC codec) in HEVC.
- IRAP Intra Random Access Point
- one square box may represent one picture.
- a picture may be used in the same sense as a frame and / or an image.
- a set of pictures from I1 to P9 may represent one GOP.
- the I picture is one of three picture formats used in the MPEG coded signal and may include all data for configuring one complete picture. That is, an I picture may not refer to another picture.
- the P picture may include only a difference value between the prediction information and the actual information generated by observing the difference between the current picture and the previous picture in the reproduction order. That is, the P picture may refer to a picture existing before the current picture in the reproduction order.
- the B picture may include only prediction information generated by observing a difference between a current picture, a picture existing before the current picture in the reproduction order, and a picture existing behind the current picture. That is, the B picture may refer to pictures existing before and after the current picture. Arrows shown in this figure may indicate whether reference is made between pictures. For example, a B3 picture can refer to an I1 picture and a B5 picture to make a complete picture, and a P9 picture can refer to an I1 picture.
- numbers in boxes may indicate dependency_level of each picture.
- the first picture may indicate a picture having dependency_level of 1
- the second picture may indicate a picture having dependency_level of 5
- the third picture may indicate a picture having dependency_level of 4.
- the double speed trick play may be performed by decoding pictures having 1 to 4 as dependency_level values.
- 4x trick play can be performed by decoding pictures having 1 to 3 as dependency_level values.
- the 8x trick play can be performed by decoding pictures having 1 to 2 as dependency_level values.
- the 16x trick play can be performed by decoding pictures having 1 as the dependency_level value.
- FIG. 7 is a diagram illustrating a trick play method when an open GOP and a GOP includes a decodable leading picture according to an embodiment of the present invention. (Scenario 1-2)
- the first drawing shows the decoding order of the pictures forming the video stream.
- the second drawing shows the display order of the pictures making up the video stream at normal speed.
- the third drawing shows the video stream during double speed trick play.
- displayed pictures B0 through B6 may represent a decodable leading picture.
- the leading picture included in the GOP is a decodable leading picture, it may be displayed from the arrow portion as shown in the second figure, and trick play may be performed as shown in the third figure.
- B1, B3, B5, RAP, and B1 pictures may be displayed according to dependency_level of each picture as shown in the figure.
- the decodable leading picture may include a random access decadable leading picture (RADL).
- FIG 8 is a diagram illustrating a trick play method when an open GOP and a GOP includes a decodable leading picture and a skipped leading picture according to an embodiment of the present invention.
- Scenario 1-2 is a diagram illustrating a trick play method when an open GOP and a GOP includes a decodable leading picture and a skipped leading picture according to an embodiment of the present invention.
- the first drawing shows the decoding order of the pictures forming the video stream.
- the second drawing shows the display order of the pictures making up the video stream at normal speed.
- the third drawing shows the video stream during double speed trick play.
- pictures B0 through B2 may represent a skipped leading picture
- pictures B3 through B6 may represent a decodable leading picture
- the skipped leading picture cannot be displayed, and trick play cannot be performed. Therefore, as shown in the second drawing, the picture can be displayed from the B3 picture, and as shown in the third drawing, the trick play can be performed from the B3 picture.
- pictures B3, B5, RAP, and B1 may be displayed according to dependency_level of each picture as shown in the figure.
- the decodable leading picture may include a random access decadable leading picture (RADL)
- the skipped leading picture may include a random access skipped leading picture (RASL).
- FIG. 9 is a diagram illustrating a trick play method when an open GOP and a GOP includes a skipped leading picture according to an embodiment of the present invention. (Scenario 1-2)
- the first drawing shows the decoding order of the pictures forming the video stream.
- the second drawing shows the display order of the pictures making up the video stream at normal speed.
- the third drawing shows the video stream during double speed trick play.
- pictures B0 to B6 displayed may represent a skipped leading picture.
- the skipped leading picture cannot be displayed, and trick play cannot be performed. Accordingly, the second drawing may be displayed from the RAP, and when the double speed trick play is performed, the RAP and B1 pictures may be displayed as shown in the third drawing.
- the skipped leading picture may include a random access skipped leading picture (RASL).
- FIG. 10 illustrates a configuration of a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 according to an embodiment of the present invention. (Scenario 1-2)
- This figure shows a signaling method of a trick play box to trick play a stream having a maximum temporal_id of 0 in the stream.
- a new box called "trikhvc" can be defined.
- the trikhvc box according to an embodiment of the present invention may include a pic_type field and / or a dependency_level field in a for loop that is repeated by a sample_count value.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may indicate a 4-bit value.
- the dependency_level field may indicate the dependency level of the picture.
- Dependency_level according to an embodiment of the present invention may be used when performing trick play.
- the dependency_level field may be the same as the dependency_level field used in the existing trick play box of AVC. For example, performing a trick play including a sample having dependency_level of 3 may mean that only the samples corresponding to 1, 2, or 3 of dependency_level are decoded and displayed. Therefore, dependency_level may mean the level of the layer that can be discarded in performing the trick play. Even when the leading picture exists, the decodable leading picture has dependency_level, and the receiving side can decode and display only the corresponding picture while skipping the picture that is not the same as the conventional trick play method.
- the dependency_level field may indicate a value of 6 bits.
- FIG. 11 is a diagram illustrating a configuration of a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 according to another embodiment of the present invention. (Scenario 1-2)
- This figure shows a signaling method of a trick play box to trick play a stream having a maximum temporal_id of 0 in the stream.
- trick play of the HEVC stream may be performed by using a box "trik" previously defined.
- AVC or HEVC trick play may be selected using a flag.
- a trik box can signal by separating a stream according to a video codec using a flag.
- a pic_type field and / or dependency_level field may be included in a for loop repeated by a sample_count value.
- the trik box may signal H.264 / AVC trick play, and when the flag value is 1, the trik box may signal HEVC trick play.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may represent a 2-bit value, and when the flag value is 1, it may represent a 4-bit value.
- the dependency_level field may indicate the dependency level of the picture.
- Dependency_level according to an embodiment of the present invention may be used when performing trick play.
- the dependency_level field may be the same as the dependency_level field used in the existing trick play box of AVC. For example, performing a trick play including a sample having dependency_level of 3 may mean that only the samples corresponding to 1, 2, or 3 of dependency_level are decoded and displayed. Therefore, dependency_level may mean the level of the layer that can be discarded in performing the trick play. Even when the leading picture exists, the decodable leading picture has dependency_level, and the receiving side can decode and display only the corresponding picture while skipping the picture that is not the same as the conventional trick play method.
- the dependency_level field may indicate a value of 6 bits.
- FIG. 12 illustrates a description of pic_type included in a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 according to an embodiment of the present invention.
- Pic_type may have a value of 0 to 15.
- pic_type When pic_type is 0, pic_type may indicate an IDR picture that does not have an associated leading picture.
- nal_unit_type may indicate IDR_N_LP.
- pic_type When pic_type is 1, pic_type may indicate an IDR picture having an associated decodable leading picture.
- nal_unit_type may indicate IDR_W_RADL.
- pic_type When pic_type is 2, pic_type may indicate a BLA picture having an associated leading picture.
- nal_unit_type may indicate BLA_N_LP.
- pic_type When pic_type is 3, pic_type may indicate a BLA picture having an associated RADL picture but not having an associated RASL picture. In this case, nal_unit_type may indicate BLA_W_RADL.
- pic_type may indicate a BLA picture having an associated RADL picture and a RASL picture.
- nal_unit_type may indicate BLA_W_LP.
- pic_type When pic_type is 5, pic_type may indicate a CRA picture having an associated leading picture. In this case, nal_unit_type may indicate CRA_NUT.
- pic_type When pic_type is 7, pic_type may indicate a RADL (Random Access Decodable Leading Picture) picture.
- nal_unit_type may indicate RADL_N or RADL_R.
- pic_type When pic_type is 8, pic_type may indicate a RASL (Random Access Skipped Leading Picture) picture.
- nal_unit_type may indicate RASL_N or RASL_R.
- pic_type may indicate an unspecified I picture.
- pic_type 10 may correspond to an unknown value.
- pic_type 11 to 15 may correspond to a reserved value.
- FIG. 13 is a diagram illustrating a configuration of a trick play box for supporting trick play of an HEVC stream having max_temporal_id equal to 0 when pic_type does not include content related to a leading picture according to an embodiment of the present invention.
- Scenario 1-1
- This figure shows a signaling method of a trick play box to trick play a stream having a maximum temporal_id of 0 in the stream.
- a new box called "trikhvc" can be defined.
- the trikhvc box according to an embodiment of the present invention may include a pic_type field and / or a dependency_level field in a for loop that is repeated by a sample_count value.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may indicate a value of 3 bits.
- the dependency_level field may indicate the dependency level of the picture.
- Dependency_level according to an embodiment of the present invention may be used when performing trick play.
- the dependency_level field may be the same as the dependency_level field used in the existing trick play box of AVC. For example, performing a trick play including a sample having dependency_level of 3 may mean that only the samples corresponding to 1, 2, or 3 of dependency_level are decoded and displayed. Therefore, dependency_level may mean the level of the layer that can be discarded in performing the trick play. Even when the leading picture exists, the decodable leading picture has dependency_level, and the receiving side can decode and display only the corresponding picture while skipping the picture that is not the same as the conventional trick play method.
- the dependency_level field may indicate a 5-bit value.
- FIG. 14 is a diagram illustrating a trick play box for supporting trick play of an HEVC stream having max_temporal_id of 0 when pic_type does not include content related to a leading picture according to another embodiment of the present invention. (Scenario 1-1)
- This figure shows a signaling method of a trick play box to trick play a stream having a maximum temporal_id of 0 in the stream.
- trick play of the HEVC stream may be performed by using a box "trik" previously defined.
- AVC or HEVC trick play may be selected using a flag.
- a trik box may include a pic_type field and / or a dependency_level field in a for loop that is repeated by falg and sample_count.
- the trik box may signal H.264 / AVC trick play, and when the flag value is 1, the trik box may signal HEVC trick play.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may represent a 2-bit value, and when the flag value is 1, it may represent a 3-bit value.
- the dependency_level field may indicate the dependency level of the picture.
- Dependency_level may be used when performing trick play.
- the dependency_level field may be the same as the dependency_level field used in the existing trick play box of AVC. For example, performing a trick play including a sample having dependency_level of 3 may mean that only the samples corresponding to 1, 2, or 3 of dependency_level are decoded and displayed. Therefore, dependency_level may mean the level of the layer that can be discarded in performing the trick play. Even when the leading picture exists, the decodable leading picture has dependency_level, and the receiving side can decode and display only the corresponding picture while skipping the picture that is not the same as the conventional trick play method.
- the dependency_level field may represent a 6-bit value, and when the flag value is 1, it may represent a 5-bit value.
- the pic_type when performing the trick play, if the constraint that the leading picture is not displayed, the pic_type may be configured as follows. If pic_type is 0, pic_type can indicate that the sample is an unknown sample. If pic_type is 1, pic_type can indicate that the sample is an IDR sample. If pic_type is 2, pic_type indicates that the sample is a CRA sample. If pic_type is 3, pic_type may indicate that the sample is a BLA sample, and if pic_type is 4, pic_type may indicate that the sample is an unconstrained I sample. In this case, the number of bits allocated to the pic_type and / or dependency_level fields for the HEVC stream may be reduced. Here, the sample may represent the same meaning as the picture.
- FIG. 15 is a diagram illustrating the configuration of an HEVC stream supporting temporal scalability according to an embodiment of the present invention.
- a square box may indicate a picture included in a stream, and T_ID may indicate a temporal id.
- FIG. 16 illustrates a configuration of a trick play box for supporting trick play by limiting a maximum speed in an HEVC stream supporting temporal scalability according to an embodiment of the present invention.
- scenario 2 the scenario 2
- This figure illustrates a method of signaling so that max_temporal_id is greater than 0, that is, the maximum support speed can be limited to support trick play in an HEVC stream supporting temporal scalability. For example, when the maximum temporal_id is 2, only up to 4 times the speed may be provided.
- a new box called "trikhvc" can be defined.
- the trikhvc box may include a pic_type field, a temporal_sub_layer_pic_type field, a max_temporal_id field, a temporal_id field, a constraint_trick_play_mode field and / or a next_temporal_id field in a for loop repeated by a sample_count value.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may indicate a 4-bit value.
- the temporal_sub_layer_pic_type field may indicate whether a corresponding picture is a Temporal Sub-layer Access (TSA) picture or a Step-wise Temporal Sub-layer Access (STSA) picture.
- TSA Temporal Sub-layer Access
- STSA Step-wise Temporal Sub-layer Access
- temporal_sub_layer_pic_type may indicate that the picture is a TSA picture
- temporal_sub_layer_pic_type may indicate that the picture is an STSA picture
- temporal_sub_layer_pic_type_pic_type is 3 .
- HEVC can distinguish temporal_sub_layer_access_picture to provide temporal scalability and to adaptively change frame rate.
- HEVC can dynamically change the frame rate based on the TSA picture and the STSA picture in a layer where temporal_id is not zero.
- TSA and STSA can differ in how and how they can change the fram rate. That is, there may be a difference in how many temporal_ids the TSA and the STSA can skip at a time. For example, if temporal_id is 0, the frame rate is 15p, if temporal_id is 1, the frame rate is 30p, and if temporal_id is 2 (max_temporal_id) If the frame rate is 60p, the TSA is temporal_id in the layer where temporal_id is 0.
- the broadcasting system can change from providing a service with a frame rate of 15p to providing a service with a 60p.
- the layer with temporal_id of 0 can be accessed from the layer with temporal_id of 1 and then with the layer with temporal_id of 2. Accordingly, in this case, the broadcast system may gradually provide a service having a frame rate of 15p, and then provide a service having a 30p and finally a service having a 60p.
- the max_temporal_id field may indicate a maximum temporal_id value included in a stream.
- the temporal_id field may indicate a temporal id value calculated using the nuh_temporal_id_plus1 value of HEVC.
- the value of the temporal_id field may represent a value obtained by subtracting 1 from the nuh_temporal_id_plus1 value.
- the constraint_trick_play_mode field may indicate double speed information that can be provided at maximum.
- the present invention may set a limit to assign max_trick_play_mode to a value smaller than the value calculated through the above-described equation. For example, if the value of max_trick_play_mode is 1, 2x, 4x, 3x, 8x, and 16x.
- constraint_trick_play_mode may have the same meaning as max_trick_play_mode.
- a faster speed than max_trick_play_mode may not be supported, and a same speed number as max_trick_play_mode may be supported.
- an embodiment of the present invention may signal a constraint to assign a value smaller than the calculated max_trick_play_mode value.
- next_temporal_id field may indicate a movable temporal_id to inform the maximum changeable frame rate according to the temporal_sub_layer_pic_type. For example, in a stream with max_temporal_id of 2, if you want to provide trick play at 4x that displays only pictures with temporal_id 0, and then return to normal speed (1x), next_temporal_id can have a max_temporal_id value if temporal_sub_layer_pic_type is TSA. have. On the other hand, in STSA, next_temporal_id may have a value obtained by adding 1 to the temporal_id value.
- FIG. 17 is a diagram showing the configuration of a trick play box for supporting trick play by limiting a maximum speed in an HEVC stream supporting temporal scalability according to another embodiment of the present invention. (Scenario 2)
- This figure illustrates a method of signaling so that max_temporal_id is greater than 0, that is, the maximum support speed can be limited to support trick play in an HEVC stream supporting temporal scalability. For example, when the maximum temporal_id is 2, only up to 4 times the speed may be provided.
- trick play of the HEVC stream may be performed by using a box "trik" previously defined.
- AVC or HEVC trick play may be selected using a flag.
- a trik box may include a pic_type field, a dependency_level field, a temporal_sub_layer_pic_type field, a max_temporal_id field, a temporal_id field, a constraint_trick_play_mode field, and / or a next_temporal_id field in a for loop repeated by a flag and a sample_count value.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may represent a 2-bit value, and when the flag value is 1, it may represent a 4-bit value.
- the dependency_level field may indicate the dependency level of the picture.
- Dependency_level according to an embodiment of the present invention may be used when performing trick play.
- the dependency_level field may be the same as the dependency_level field used in the existing trick play box of AVC. For example, performing a trick play including a sample having dependency_level of 3 may mean that only the samples corresponding to 1, 2, or 3 of dependency_level are decoded and displayed. Therefore, dependency_level may mean the level of the layer that can be discarded in performing the trick play. Even when the leading picture exists, the decodable leading picture has dependency_level, and the receiving side can decode and display only the corresponding picture while skipping the picture that is not the same as the conventional trick play method.
- the dependency_level field may indicate a value of 6 bits.
- the temporal_sub_layer_pic_type field may indicate whether a corresponding picture is a Temporal Sub-layer Access (TSA) picture or a Step-wise Temporal Sub-layer Access (STSA) picture.
- TSA Temporal Sub-layer Access
- STSA Step-wise Temporal Sub-layer Access
- temporal_sub_layer_pic_type may indicate that the picture is a TSA picture
- temporal_sub_layer_pic_type may indicate that the picture is an STSA picture
- temporal_sub_layer_pic_type_pic_type is 3 .
- HEVC can distinguish temporal_sub_layer_access_picture to provide temporal scalability and to adaptively change frame rate.
- HEVC can dynamically change the frame rate based on the TSA picture and the STSA picture in a layer where temporal_id is not zero.
- TSA and STSA can differ in how and how they can change the fram rate. That is, there may be a difference in how many tempora_ids can be skipped at a time between the TSA and the STSA. For example, if temporal_id is 0, the frame rate is 15p, if temporal_id is 1, the frame rate is 30p, and if temporal_id is 2 (max_temporal_id) If the frame rate is 60p, the TSA is temporal_id in the layer where temporal_id is 0.
- the broadcasting system can change from providing a service with a frame rate of 15p to providing a service with a 60p.
- the layer with temporal_id of 0 can be accessed from the layer with temporal_id of 1 and then with the layer with temporal_id of 2. Accordingly, in this case, the broadcast system may gradually provide a service having a frame rate of 15p, and then provide a service having a 30p and finally a service having a 60p.
- the max_temporal_id field may indicate a maximum temporal_id value included in a stream.
- the temporal_id field may indicate a temporal id value calculated using the nuh_temporal_id_plus1 value of HEVC.
- the value of the temporal_id field may represent a value obtained by subtracting 1 from the nuh_temporal_id_plus1 value.
- the constraint_trick_play_mode field may indicate double speed information that can be provided at maximum.
- the present invention may set a limit to assign max_trick_play_mode to a value smaller than the value calculated through the above-described equation. For example, if the value of max_trick_play_mode is 1, 2x, 4x, 3x, 8x, and 16x.
- constraint_trick_play_mode may have the same meaning as max_trick_play_mode.
- next_temporal_id field may indicate a movable temporal_id to inform the maximum changeable frame rate according to the temporal_sub_layer_pic_type. For example, in a stream with max_temporal_id of 2, if you want to provide trick play at 4x that displays only pictures with temporal_id 0, and then return to normal speed (1x), next_temporal_id can have a max_temporal_id value if temporal_sub_layer_pic_type is TSA. have. On the other hand, in STSA, next_temporal_id may have a value obtained by adding 1 to the temporal_id value.
- FIG. 18 is a diagram illustrating a method of changing a frame rate when the temporal sub-layer picture type is TSA according to an embodiment of the present invention.
- the receiver displays only the stream of the layer having temporal_id of 0, that is, performs trick play at 4x speed and then displays it at 1x speed (normal speed). Can be.
- the receiver may decode and display a picture having a temporal_id of 0 and then decode and display a picture having a temporal_id of 2.
- FIG. 19 is a diagram illustrating a method of changing a frame rate when the temporal sub-layer picture type is STSA according to an embodiment of the present invention. (Scenario 3)
- the receiving side displays only the stream of the layer having temporal_id of 0, that is, performs trick play at 4x speed and then directly displays at 1x speed (normal speed). It is not possible to display at 1x speed after going through 2x display process in the middle. Therefore, when the temporal sub-layer picture type according to an embodiment of the present invention is STSA, a method for informing a limitation on a double speed that can be converted may be needed. In other words, it may be necessary to signal the next_temporal_id.
- the receiving side decodes and displays the picture with temporal_id of 0 and then decodes and After decoding and displaying a picture having a temporal_id of 1, a picture having a temporal_id of 2 may be decoded and displayed.
- FIG. 20 is a diagram illustrating the configuration of a trick play box for supporting trick play at high speed in an HEVC stream supporting temporal scalability according to an embodiment of the present invention. (Scenario 3)
- a new box called "trikhvc" can be defined.
- a trikhvc box according to an embodiment of the present invention may include a pic_type field, a temporal_sub_layer_pic_type field, a max_temporal_id field, a temporal_id field, a next_temporal_id field, and / or a dependency_level field in a for loop repeated by a sample_count value.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may indicate a 4-bit value.
- the temporal_sub_layer_pic_type field may indicate whether a corresponding picture is a Temporal Sub-layer Access (TSA) picture or a Step-wise Temporal Sub-layer Access (STSA) picture.
- TSA Temporal Sub-layer Access
- STSA Step-wise Temporal Sub-layer Access
- temporal_sub_layer_pic_type may indicate that the picture is a TSA picture
- temporal_sub_layer_pic_type may indicate that the picture is an STSA picture
- temporal_sub_layer_pic_type_pic_type is 3 .
- HEVC can distinguish temporal_sub_layer_access_picture to provide temporal scalability and to adaptively change frame rate.
- HEVC can dynamically change the frame rate based on the TSA picture and the STSA picture in a layer where temporal_id is not zero.
- TSA and STSA can differ in how and how they can change the fram rate. That is, there may be a difference in how many tempora_ids can be skipped at a time between the TSA and the STSA. For example, if temporal_id is 0, the frame rate is 15p, if temporal_id is 1, the frame rate is 30p, and if temporal_id is 2 (max_temporal_id) If the frame rate is 60p, the TSA is temporal_id in the layer where temporal_id is 0.
- the broadcasting system can change from providing a service with a frame rate of 15p to providing a service with a 60p.
- the layer with temporal_id of 0 can be accessed from the layer with temporal_id of 1 and then with the layer with temporal_id of 2. Accordingly, in this case, the broadcast system may gradually provide a service having a frame rate of 15p, and then provide a service having a 30p and finally a service having a 60p.
- the max_temporal_id field may indicate a maximum temporal_id value included in a stream.
- the temporal_id field may indicate a temporal id value calculated using the nuh_temporal_id_plus1 value of HEVC.
- the value of the temporal_id field may represent a value obtained by subtracting 1 from the nuh_temporal_id_plus1 value.
- next_temporal_id field may indicate a movable temporal_id to inform the maximum changeable frame rate according to the temporal_sub_layer_pic_type. For example, in a stream with max_temporal_id of 2, if you want to provide trick play at 4x that displays only pictures with temporal_id 0, and then return to normal speed (1x), next_temporal_id can have a max_temporal_id value if temporal_sub_layer_pic_type is TSA. have. On the other hand, in STSA, next_temporal_id may have a value obtained by adding 1 to the temporal_id value.
- the dependency_level field may indicate the dependency level of the picture.
- Dependency_level according to an embodiment of the present invention may be used when performing trick play.
- the dependency_level field may be the same as the dependency_level field used in the existing trick play box of AVC. For example, performing a trick play including a sample having dependency_level of 3 may mean that only the samples corresponding to 1, 2, or 3 of dependency_level are decoded and displayed. Therefore, dependency_level may mean the level of the layer that can be discarded in performing the trick play. Even when the leading picture exists, the decodable leading picture has dependency_level, and the receiving side can decode and display only the corresponding picture while skipping the picture that is not the same as the conventional trick play method.
- the dependency_level field may indicate a value of 6 bits.
- the trick play box shown in this figure may include a max_trick_play_mode field indicating the maximum double speed that can be supported in trick play.
- the receiving side may decode only pictures having temporal_id of 0 and 1 in order to provide a double speed trick play service.
- the receiver may decode and display only pictures having a temporal_id of 0 in order to provide a 4x trick play service.
- the receiver classifies pictures according to dependency_level among pictures having a temporal_id of 0 and decodes and displays only pictures having a corresponding dependency_level to play a trick play service with a higher speed than 4 times. Can provide.
- FIG. 21 is a diagram showing the configuration of a trick play box for supporting trick play at high speed in an HEVC stream supporting temporal scalability according to another embodiment of the present invention.
- trick play of the HEVC stream may be performed by using a box "trik" previously defined.
- AVC or HEVC trick play may be selected using a flag.
- a trik box may include a pic_type field, a dependency_level field, a temporal_sub_layer_pic_type field, a max_temporal_id field, a temporal_id field, and / or a next_temporal_id field in a for loop repeated by a flag and a sample_count value.
- sample_count may mean the total number of pictures included in the stream.
- one sample may mean one picture.
- the pic_type field may indicate the type of picture defined in NAL_unit_type of HEVC. Some NAL_unit_type that can be used for trick play may be selected and used as pic_type according to an embodiment of the present invention.
- the pic_type field may represent a 2-bit value, and when the flag value is 1, it may represent a 4-bit value.
- the dependency_level field may indicate the dependency level of the picture.
- Dependency_level according to an embodiment of the present invention may be used when performing trick play.
- the dependency_level field may be the same as the dependency_level field used in the existing trick play box of AVC. For example, performing a trick play including a sample having dependency_level of 3 may mean that only the samples corresponding to 1, 2, or 3 of dependency_level are decoded and displayed. Therefore, dependency_level may mean the level of the layer that can be discarded in performing the trick play. Even when the leading picture exists, the decodable leading picture has dependency_level, and the receiving side can decode and display only the corresponding picture while skipping the picture that is not the same as the conventional trick play method.
- the dependency_level field may indicate a value of 6 bits.
- the temporal_sub_layer_pic_type field may indicate whether a corresponding picture is a Temporal Sub-layer Access (TSA) picture or a Step-wise Temporal Sub-layer Access (STSA) picture.
- TSA Temporal Sub-layer Access
- STSA Step-wise Temporal Sub-layer Access
- temporal_sub_layer_pic_type may indicate that the picture is a TSA picture
- temporal_sub_layer_pic_type may indicate that the picture is an STSA picture
- temporal_sub_layer_pic_type_pic_type is 3 .
- HEVC can distinguish temporal_sub_layer_access_picture to provide temporal scalability and to adaptively change frame rate.
- HEVC can dynamically change the frame rate based on the TSA picture and the STSA picture in a layer where temporal_id is not zero.
- TSA and STSA can differ in how and how they can change the fram rate. That is, there may be a difference in how many tempora_ids can be skipped at a time between the TSA and the STSA. For example, if temporal_id is 0, the frame rate is 15p, if temporal_id is 1, the frame rate is 30p, and if temporal_id is 2 (max_temporal_id) If the frame rate is 60p, the TSA is temporal_id in the layer where temporal_id is 0.
- the broadcasting system can change from providing a service with a frame rate of 15p to providing a service with a 60p.
- the layer with temporal_id of 0 can be accessed from the layer with temporal_id of 1 and then with the layer with temporal_id of 2. Accordingly, in this case, the broadcast system may gradually provide a service having a frame rate of 15p, and then provide a service having a 30p and finally a service having a 60p.
- the max_temporal_id field may indicate a maximum temporal_id value included in a stream.
- the temporal_id field may indicate a temporal id value calculated using the nuh_temporal_id_plus1 value of HEVC.
- the value of the temporal_id field may represent a value obtained by subtracting 1 from the nuh_temporal_id_plus1 value.
- next_temporal_id field may indicate a movable temporal_id to inform the maximum changeable frame rate according to the temporal_sub_layer_pic_type. For example, in a stream with max_temporal_id of 2, if you want to provide trick play at 4x that displays only pictures with temporal_id 0, and then return to normal speed (1x), next_temporal_id can have a max_temporal_id value if temporal_sub_layer_pic_type is TSA. have. On the other hand, in STSA, next_temporal_id may have a value obtained by adding 1 to the temporal_id value.
- the trick play box shown in this figure may include a max_trick_play_mode field indicating the maximum double speed that can be supported in trick play.
- the receiving side may decode only pictures having temporal_id of 0 and 1 in order to provide a double speed trick play service.
- the receiver may decode and display only pictures having a temporal_id of 0 in order to provide a 4x trick play service.
- the receiver classifies pictures according to dependency_level among pictures having a temporal_id of 0 and decodes and displays only pictures having a corresponding dependency_level to play a trick play service with a higher speed than 4 times. Can provide.
- 22 is a diagram showing the structure of a broadcast signal receiving system according to an embodiment of the present invention.
- the broadcast signal receiving system may include a player device 22010, a storage device 22020, a KIC server 22030, a license server 22040, and / or a download server 22050.
- the player device 22010 may include a UHD TV.
- the player device may include an SCSA application and a traditional file system.
- the storage device 22020 may include an SD card, USB and / or SSD memory.
- the storage device may include a traditional file system.
- KIC server 22030 may include information identifying personal information.
- the license server 22040 may include license related information of the content.
- the download server 22050 may include content and information related to the content.
- the player device may go through the following process to obtain a license for the content.
- the storage device includes a license file
- the license file includes a content key can be obtained from the license file.
- the bulk content may be checked to obtain a content key from the license file if the bulk content includes a license file.
- a license may be obtained from a license server using a content key.
- the receiver may need to acquire a license for the corresponding content in order to display the downloaded content.
- FIG. 23 is a diagram showing the structure of a receiving end according to an embodiment of the present invention.
- the receiving end may include a UHD display unit 23010, a second device 23020, a UHD decoding unit 23030, a USB 23040, and / or a remote controller 23050.
- the UHD display unit 23010 may include a UHD decoding unit 23030 and may represent a UHD TV.
- the second device 23020 may represent a mobile phone, a tablet, a notebook, or the like.
- the UHD decoding unit 23030 may include a UHD display unit 23010 and may represent a UHD TV.
- USB 23040 may represent another memory device.
- the USB according to an embodiment of the present invention may store metadata for a second screen, a URL, and / or a playlist.
- the remote controller 23050 may represent a controller suitable for a UHD TV.
- the UHD TV may transmit content metadata included in the USB to the second device and display it on the display of the second device.
- the user can store metadata, URLs, and / or playlists on the USB for display on the second screen.
- the UHD TV and the second device can be automatically paired, and both devices can be connected via a UPnP based SSDP.
- the UHD TV may transmit content information to be displayed on the second screen, that is, information included in the USB, to the second device.
- the second device may display information received from the UHD TV.
- a user may store information on trick play on a USB and information on trick play may be displayed through a second device connected to a UHD TV.
- FIG. 24 is a diagram illustrating a trick play method using a combination of a temporal id and a tier according to an embodiment of the present invention.
- the tier value may be assigned only to a picture having a temporal id of zero.
- the receiver according to an embodiment of the present invention can perform trick play at normal speed by decoding and displaying pictures having a temporal id of 0, 1, 2 or 3, and having a temporal id of 0.
- the present invention can perform trick play faster than 8x speed by assigning different tier values to pictures having a temporal id of zero.
- the following method may be used to provide a trick play.
- Trick play can be provided by mapping tier and temporal id to use the PVR_assist_info descriptor.
- Trick play can be provided by including a trick play using a temporal id in the PVR_assist_info descriptor.
- Trick play can be provided by parsing nuh_temporal_id_plus1 information of NAL_unit_header and selecting only packets necessary for actual trick play.
- scenario C is
- the temporal sub-layer or temporal id is a term introduced to support temporal scalability in HEVC and may be signaled by nuh_temporal_id_plus1 of NAL_unit_header.
- a tier according to an embodiment of the present invention is a term introduced to support trick play in an AVC stream or an mpeg-2 stream, and may be included in an adaptation field in a TS packet.
- 25 is a diagram illustrating a trick play method according to a conventional tier concept according to an embodiment of the present invention.
- An existing tier according to an embodiment of the present invention may indicate dependency between layers.
- pictures with a temporal id of 3 may have a value of tier 6
- pictures with a temporal id of 2 may have a value of tier of 4, and pictures having a temporal id of 1 have a value of tier.
- pictures with a temporal id of 0 an I picture may have 1 as a tier value, and a P picture may have 2 as a tier value. That is, pictures having a temporal id of 0 may have the same temporal id but different tier values.
- pictures having max_temporal_id that is, highest dependency_level, may have 6 or 7 as a tier value.
- pictures having other temporal_ids may have 1 to 5 as tier values.
- pictures corresponding to a layer having a temporal_id of 0 may have a value of 1 or 2 as a tier value according to the type of the picture.
- FIG. 26 is a diagram illustrating a trick play method according to a method of mapping one temporal id to one tier 1: 1 according to an embodiment of the present invention. (Scenario A-a)
- a method of mapping a temporal id and a tier may be used to provide a trick play based on a HEVC stream including a temporal id but not tier information.
- a method of mapping one temporal id to one tier 1: 1 may be used, and (scenario Aa) to map one temporal id to several tiers.
- the method can be used.
- the value of the temporal id may be mapped 1: 1 to the tier regardless of the meaning of the existing tier.
- the value of temporal id is mapped to tier 1: 1, but tier 6,7 is the maximum temporal by maintaining the meaning of the existing tier of discardable picture. id can be mapped to tier 6 or 7.
- the PVR_assist_tier_m_cumulative_frames field included in the PVR_assist_info descriptor may transmit a value of the minimum number of extractable frames per 1.28 seconds from tier 1 through the PVR_assist_tier_m field. (This field conveys the value of the intended minimum number of extractable frames per 1.28 sec from tier 1 through "PVR_assist_teir_m").
- This figure illustrates a trick play method according to scenario A-a which is an embodiment of the present invention.
- pictures with a temporal id of 3 may be mapped to tier 4
- pictures with a temporal id of 2 may be mapped to tier 3
- pictures with a temporal id of 1 may be mapped to tier 2.
- the pictures with a temporal id of 0 may be mapped to tier 1.
- Scenario A which is an embodiment of the present invention, cannot provide trick play at 8x speed or more.
- FIG. 27 is a diagram illustrating a trick play method according to a method of mapping one temporal id to one tier 1: 1 according to another embodiment of the present invention. (Scenario A-a)
- the temporal id value is mapped to tier 1: 1 differently from the previous drawing, but tier 6 and 7 retain the meaning of the existing tier of discardable picture, so that the maximum temporal id is mapped to tier 6 or 7. You can.
- pictures with a temporal id of 3 may be mapped to tier 6
- pictures with a temporal id of 2 may be mapped to tier 3
- pictures with a temporal id of 1 may be mapped to tier 2.
- the pictures with a temporal id of 0 may be mapped to tier 1.
- This drawing differs from the previous drawing in that the pictures corresponding to temporal id 3, the maximum temporal id, may be mapped to tier 6.
- FIG. 28 is a diagram illustrating a result of mapping one temporal id to one tier 1: 1 according to an embodiment of the present invention.
- the figure shows the result of mapping the value of the temporal id to the tier 1: 1 as it is regardless of the meaning of the existing tier.
- the nuh_temporal_id plus1 field shown in this figure is a field included in NAL_unit_header and may indicate a value obtained by adding 1 to a temporal id. For example, if the nuh_temporal_id plus1 field value is 1, the temporal id may be 0. Thus, as shown in this figure, pictures having a temporal id of 0 may be mapped to tier 1, pictures having a temporal id of 1 may be mapped to tier 2, and pictures having a temporal id of 2 may be mapped to tier 3 Pictures having a temporal id of 3 may be mapped to tier 4.
- the new tier mapped with the temporal id may be different from the meaning of the tier previously used.
- tier 6 and 7 meant discardable pictures
- tier 7 meant pictures not used as a reference.
- the newly defined tier through mapping may not have the meaning of the existing tier 6 and 7.
- the temporal sub-layer and the tier may be mapped to have the same number, and the mapping information described above may be used to perform the trick play.
- 29 is a diagram illustrating a result of mapping one temporal id to one tier 1: 1 according to another embodiment of the present invention.
- a value of a temporal id is mapped to a tier 1: 1, but tier 6 and 7 maintain the meaning of the existing tier of discardable picture.
- the nuh_temporal_id plus1 field shown in this figure is a field included in NAL_unit_header and may indicate a value obtained by adding 1 to a temporal id. For example, if the nuh_temporal_id plus1 field value is 1, the temporal id may be 0. Thus, as shown in this figure, pictures having a temporal id of 0 may be mapped to tier 1, pictures having a temporal id of 1 may be mapped to tier 2, and pictures having a temporal id of 2 may be mapped to tier 3 Pictures with a temporal id of 3 may be mapped to tier 6.
- the new tier mapped with the temporal id may have the same meaning as the tier previously used.
- Tiers 6 and 7 may refer to discardable pictures
- tier 6 may refer to pictures used as a reference
- tier 7 may refer to pictures not used as a reference. Accordingly, as described above, pictures having nuh_temporal_id plus1 of 4 may be mapped to tier 6.
- pictures not used as a reference it may be mapped to tier 7.
- FIG. 30 is a diagram illustrating a trick play method according to a method of mapping one temporal id to several tiers according to an embodiment of the present invention.
- pictures having a temporal id of 1, which is a maximum temporal id may be mapped to tier 4 or 6, and the remaining five pictures having a temporal id of 0 may be mapped to tier 0 to 3.
- the new tier mapped with the temporal id may be different from the meaning of the tier previously used.
- tier 6 and 7 meant discardable pictures
- tier 7 meant pictures not used as a reference.
- the newly defined tier through mapping may not have the meaning of the existing tier 6 and 7.
- pictures with a temporal id of 1 may be mapped to tier 4. (30010)
- a new tier mapped with a temporal id may have the same meaning as a tier previously used.
- Tiers 6 and 7 may refer to discardable pictures
- tier 6 may refer to pictures used as a reference
- tier 7 may refer to pictures not used as a reference.
- pictures with a temporal id of 1 may be mapped to tier 6. (30020)
- FIG. 31 is a diagram illustrating a configuration of an adaptation field of a TS packet including information for mapping a temporal id and a tier according to an embodiment of the present invention.
- An embodiment of the present invention may provide a descriptor that maps values of tier and nuh_temporal_id_plus1 to speed information for trick play.
- HEVC_temporal_id_tier_mapping_info which is information for trick play of the HEVC stream including the temporal id, may also be included in the above-described adaptation field.
- This figure illustrates the configuration of a data field included in an adaptation field of a TS packet.
- data_field_tag is 0x00
- this data field may be reserved. If 0x01, this data field may indicate Announcement switching data field. If 0x02, this may indicate data field for AU_information, and 0x03. In this case, it may represent a data field for PVR_assist_information, 0x04 may indicate a data field indicating a TSAP time line, and 0x05 may indicate a data field for HEVC_temporal_id_tier_mapping_info.
- An embodiment of the present invention may map a tier value to a temporal id using the HEVC_temporal_id_tier_mapping_info descriptor and provide trick play based on the tier using the PVR_assist_information descriptor.
- 32 is a diagram illustrating the configuration of HEVC_temporal_id_tier_mapping_info according to an embodiment of the present invention.
- the HEVC_temporal_id_tier_mapping_info may include an included_temporal_id_flag field, a temporal_sub_layer_dependency_flag field, a max_temporal_id_plus1 field, a temporal_id_plus1 field, a curr_tier_num field, and / or a trick_play_speed field.
- the included_temporal_id_flag field may signal whether or not encoding is performed using a temporal id.
- the temporal_sub_layer_dependency_flag field may indicate whether there is a dependency between temporal sub-layers. That is, the temporal_sub_layer_dependency_flag field has 1 when the lower temporal sub-layer picture does not refer to the upper temporal sub-layer picture.
- the max_temporal_id_plus1 field may indicate a maximum temporal id value, and a value indicated by the max_temporal_id_plus1 field is a value obtained by adding 1 to the temporal id.
- the temporal_id_plus1 field may have the same value as that indicated by nuh_temporal_id_plus1 included in the NAL unit header.
- the curr_tier_num field may indicate a tier value mapped to temporal_id_plus1.
- the trick_play_speed field may indicate the maximum trick play speed that can be provided according to the temporal id value.
- the if (max_temporal_id_plus1> 1) conditional statement may indicate a case where max_temporal_id_plus1 is greater than one. That is, the temporal scalability may indicate that the stream is used, and in this case, since the trick play may be provided using the temporal id, an embodiment of the present invention may map a temporal id to a tier to use the existing PVR_assist_information. have.
- the temporal_id_plus1 field may be located in the for loop at the same level as the curr_tier_num field and the trick_play_speed field, and the tier number and the speed for trick play according to each temporal_id may be signaled.
- a look up table may be generated using the descriptor including the above-described HEVC_temporal_id_tier_mapping_info, and tier related information of PVR_assist_information may be interpreted and used as temporal_id using the above look up table.
- the for loop may be deleted from the above-described HEVC_temporal_id_tier_mapping_info, and the curr_tier_num field and the trick_play_speed field may be located at the level where the temporal_id_plus1 field is located.
- the descriptor including the aforementioned HEVC_temporal_id_tier_mapping_info may be signaled for each picture.
- FIG. 33 is a diagram showing the configuration of a trick_play_speed field included in HEVC_temporal_id_tier_mapping_info according to an embodiment of the present invention.
- the speed of trick play that can be provided may be 1x. If the trick_play_speed field is 0, the speed of trick play that can be provided may be 2x. 4 times, 3 times the speed of the available trick play can be 8 times, 4 times the speed of the available trick play can be 16 times, 5 times the speed of the provided trick play can be 32 times If 6, the speed of the trick play that can be provided may be 64 times.
- FIG. 34 is a diagram illustrating a configuration of PVR_assist_information according to an embodiment of the present invention.
- PVR_assist_information in accordance with one embodiment of the present invention data_field_tag field, data_field_length field, PVR_assist_tier_pic_num field, PVR_assist_block_trick_mode_present_flag field, PVR_assist_pic_struct_present_flag field, PVR_assist_tier_next_pic_in_tier_present_flag field, PVR_assist_substream_info_present_flag field, PVR_assist_extension_present_flag field, PVR_assist_segmentation_info_present_flag field, PVR_assist_tier_m_cumulative_frames_present_flag field, PVR_assist_tier_n_mmco_present_flag field, PVR_assist_reserved_0 field, PVR_assist_seg_id field, PVR_assist_prg_id Field, PVR_
- the data_field_tag field may represent that the corresponding data field is PVR_assist_information.
- the data_field_tag field may have 0x03.
- the data_field_length field may indicate the length of the PVR_assist_information excluding the data_field_tag field and the data_field_length field.
- the PVR_assist_tier_pic_num field may indicate a tier number of a picture related to PVR_assist_information.
- the minimum tier number may be zero and the maximum tier number may be seven. Tier number 0 can be reserved for future use.
- the tier number of the HEVC RAP picture may be 0, and the tier number of all pictures other than the HEVC RAP picture may be a value obtained by adding 1 to the temporal id.
- this field may indicate the tier number of pictures included in the video stream, and may be called tier number information.
- the tier number may be used for signaling the temporal sublayer.
- PVR_assist_tier_pic_num may be named tier number information.
- the PVR_assist_block_trick_mode_present_flag field may have a value of 1 in a non-RAP picture when the value of this field is 1 in a previous RAP picture.
- the PVR_assist_pic_struct_present_flag field may have a value of 1 when the video stream is an AVC or HEVC stream and the PVR_assist_pict_struct field is present.
- the PVR_assist_tier_next_pic_in_tier_present_flag field may have a value of 1 when the PVR_assist_tier_next_pic_in_tier field exists.
- the PVR_assist_substream_info_present_flag field may have a value of 1 when the PVR_assist_substream_info field exists.
- the PVR_assist_extension_present_flag field may have a value of 1 in any one of a PVR_assist_segmentation_info_present_flag field, PVR_assist_tier_m_cumulative_frames_present_flag field, PVR_assist_tier_n_mmco_present_flag field, and PVR_assist_temporal_id_info_present_present_flag field.
- the PVR_assist_segmentation_info_present_flag field may have a value of 1 when the PVR_assist_segmentation_info field exists.
- This field may be named segmentation info flag information and may indicate whether information on a segment to which a picture belongs is present.
- the PVR_assist_tier_m_cumulative_frames_present_flag field may have a value of 1 when the PVR_assist_tier_m field and the PVR_assist_tier_m_cumulative_frames field exist. In the case of HEVC, this field may be recommended to have a value of zero.
- the PVR_assist_tier_n_mmco_present_flag field may have a value of 1 when the PVR_assist_tier_n_mmco field exists. In the case of HEVC, this field may have a value of zero.
- the PVR_assist_seg_id field may transmit an id of a segment to which a picture belongs. This field may be named segment identifier information and may indicate an id of a segment to which a picture belongs.
- the PVR_assist_prg_id field may transmit the ID of a program to which a picture belongs. This field may be named program identifier information and may indicate an ID of a program to which a picture belongs.
- the PVR_assist_seg_start_flag field may have a value of 1 when the picture has a first reproduction order in one segment.
- This field may be named segment start flag information, and may identify a picture in which a play time order is first in each segment.
- the PVR_assist_seg_end_flag field may have a value of 1 when the picture has the last reproduction order in one segment. This field may be named segment and flag information, and may identify the picture in which the playback time order is last in each segment.
- the PVR_assist_prg_start_flag field may have a value of 1 when the picture has the first playback order in one program.
- This field may be named program start flag information, and may identify a picture in which a playback time order is first in each program.
- the PVR_assist_prg_stop_flag field may have a value of 1 when the picture has the last playback order in one program. This field may be named program end flag information, and may identify a picture in which each play time sequence is last in each program.
- the PVR_assist_scene_change_flag field may have a value of 1 when the first picture is in the playback order of a new scene.
- the PVR_assist_tier_m field may indicate a tier number associated with the PVR_assist_tier_m_cumulative_frames field. In the case of HEVC, this field may not exist.
- the PVR_assist_tier_m_cumulative_frames field may deliver a value of the minimum number of frames extractable from tier 1 per second through the PVR_assist_tier_m field.
- the PVR_assist_tier_n_mmco field may indicate the minimum tier number below MMCOs that can be ignored by the decoder while performing trick play. In the case of HEVC, this field may not exist.
- PVR_assist_information may further include a PVR_assist_tier_next_pic_tier field.
- the PVR_assist_tier_next_pic_tier field may indicate a relative position of a next order picture in decoding order among pictures having a tier number equal to the value indicated by the PVR_assist_tier_pic_num field, and may be referred to as tier next picture information.
- PVR_assist_information includes metadata for performing a trick play of video data, and may be referred to as PVR assist information.
- An embodiment of the present invention may provide a method including trick play using a temporal id in the existing PVR_assist_information included in the adaptation field of the TS packet. That is, one embodiment of the present invention may provide temporal id frame work.
- One embodiment of the present invention may signal PVR_assist_temporal_id_plus1 together with the existing PVR_assist_tier_pic_num to support temporal scalability based on temporal id.
- the PVR_assist_information may include all the aforementioned fields, the PVR_assist_temporal_id_plus1 field, the PVR_assist_temporal_id_info_present_flag field, and / or the PVR_assist_max_temporal_id_plus1 field in the previous drawing showing the configuration of the PVR_assist_information.
- a field having the same name as the aforementioned field in the previous drawing showing the configuration of PVR_assist_information may have the same meaning as the aforementioned meaning in the previous drawing.
- the PVR_assist_temporal_id_plus1 field may indicate a temporal id value of the current frame and may actually indicate the same value as the nuh_temporal_id_plus1 value included in the NAL unit header.
- the PVR_assist_temporal_id_info_present_flag field may indicate whether or not the temporal id related information is included. This field may indicate a value of 1 if the PVR_assist_max_temporal_id_plus1 field exists. This field may be provided per RAP picture.
- the PVR_assist_max_temporal_id_plus1 field may indicate a maximum temporal id value and, in fact, may indicate a value obtained by adding 1 to the maximum temporal id value. This field may have a value of any one of 0 to 6. This field can be used to provide information about the speed of trick play.
- the double speed of the trick play may be calculated using the values of the PVR_assist_max_temporal_id_plus1 field and the PVR_assist_temporal_id_plus1 field.
- FIG. 36 is a diagram illustrating a configuration of PVR_assist_information to which temporal id frame work is added according to another embodiment of the present invention.
- An embodiment of the present invention may provide a method including trick play using a temporal id in the existing PVR_assist_information included in the adaptation field of the TS packet. That is, one embodiment of the present invention may provide temporal id frame work.
- the PVR_assist_tier_pic_num field may be used as it is and the meaning of PVR_assist_temporal_id_plus1 may be contained in the PVR_assist_tier_pic_num field.
- the PVR_assist_tier_pic_num_to_temporal_id_flag field may be used to inform that the meaning of the tier has been changed.
- the PVR_assist_information may include all the above-described fields, the PVR_assist_tier_pic_num_to_temporal_id_flag field, the PVR_assist_temporal_id_info_present_flag field, and / or the PVR_assist_id_temporal field in the previous drawing showing the configuration of the PVR_assist_information.
- a field having the same name as the aforementioned field in the previous drawing showing the configuration of PVR_assist_information may have the same meaning as the aforementioned meaning in the previous drawing.
- the PVR_assist_tier_pic_num field may be used as it is for temporal id frame work according to an embodiment of the present invention. That is, the PVR_assist_tier_pic_num field indicates the temporal id value of the current frame and may actually have the same value as the nuh_temporal_id_plus1 value included in the NAL unit header.
- the PVR_assist_tier_pic_num_to_temporal_id_flag field may have a value of 1 if the PVR_assist_tier_pic_num field is used as a field indicating a temporal id.
- the PVR_assist_temporal_id_info_present_flag field may indicate whether or not the temporal id related information is included.
- the PVR_assist_max_temporal_id_plus1 field may indicate a maximum temporal id value and, in fact, may indicate a value obtained by adding 1 to the maximum temporal id value.
- the double speed of the trick play may be calculated using PVR_assist_tier_pic_num field values having the meaning of the PVR_assist_max_temporal_id_plus1 field and the PVR_assist_temporal_id_plus1 field.
- a type of a framework for providing a PVR may be distinguished by adding two or more fields of PVR_assist_framework to PVR_assist_information.
- One embodiment of the present invention can distinguish the existing tier, substream framework and temporal id framework. In this case, flag values previously included in PVR_assist_information may not be used, and an embodiment of the present invention may configure a conditional statement in place of a tag value corresponding to each framework.
- FIG. 37 illustrates a configuration of PVR_assist_information for supporting trick play using a temporal id according to an embodiment of the present invention.
- PVR_assist_information in accordance with one embodiment of the present invention may include data_field_tag field, data_field_length field, PVR_assist_temporal_id_plus1 field, PVR_assist_substream_info_present_flag field, PVR_assist_extension_present_flag field, PVR_assist_temporal_id_present_flag field, PVR_assist_temporal_sub_layer_dependency_flag field, PVR_assist_max_temporal_id_plus1 field, PVR_assist_curr_tier_num fields and / or field PVR_assist_trick_play_speed.
- the data_field_tag field may represent that the corresponding data field is PVR_assist_information.
- the data_field_tag field may have 0x03.
- the data_field_length field may indicate the length of the PVR_assist_information excluding the data_field_tag field and the data_field_length field.
- the PVR_assist_temporal_id_plus1 field may indicate a temporal id value of the current frame and may actually indicate the same value as the nuh_temporal_id_plus1 value included in the NAL unit header.
- the PVR_assist_substream_info_present_flag field may have a value of 1 when the PVR_assist_substream_info field exists.
- the PVR_assist_extension_present_flag field may have a value of 1 when any one of the PVR_assist_segmentation_info_present_flag field, the PVR_assist_tier_m_cumulative_frames_present_flag field, and the PVR_assist_tier_n_mmco_present_flag field is 1.
- the PVR_assist_temporal_id_info_present_flag field may indicate whether or not the temporal id related information is included.
- the PVR_assist_temporal_sub_layer_dependency_flag field may indicate whether there is a dependency between temporal sub-layers. That is, the temporal_sub_layer_dependency_flag field has 1 when the lower temporal sub-layer picture does not refer to the upper temporal sub-layer picture.
- the PVR_assist_max_temporal_id_plus1 field may indicate a maximum temporal id value and, in fact, may indicate a value obtained by adding 1 to the maximum temporal id value.
- the PVR_assist_curr_tier_num field may indicate a tier value corresponding to temporal_id_plus1.
- the trick_play_speed field may indicate the maximum trick play speed that can be provided according to the temporal id value.
- conditional statement may indicate a case where PVR_assist_max_temporal_id_plus1 is greater than one. That is, it may indicate that temporal scalability is a used stream, and in this case, trick play may be provided using a temporal id.
- the PVR_assist_temporal_id_plus1 field may be located in the for loop at the same level as the PVR_assist_curr_tier_num field and the PVR_assist_trick_play_speed field, and the tier number and speed for trick play according to each temporal_id may be signaled.
- the existing PVR_assist_tier_pic_num field may be left as it is, and the above-described field value may be changed to represent the PVR_assist_temporal_id_plus1 field value.
- the meaning of the PVR_assist_tier_pic_num field has been changed using the PVR_assist_tier_pic_num_to_temporal_id_flag field.
- Another embodiment of the present invention may provide a trick play using only a temporal id without a tier. That is, trick play may be provided by parsing nuh_temporal_id_plus1 information of NAL_unit_header and selecting only packets necessary for actual trick play.
- scenario C For example, when trying to play a stream composed of temporal ids from 0 to 3 at double speed, an embodiment of the present invention transmits only TS packets having a nuh_temporal_id_plus1 value of 1, 2, or 3 to the system decoder. Can provide 2x trick play.
- 38 is a diagram illustrating a receiving device according to an embodiment of the present invention.
- a receiving apparatus includes a receiver 38010, a demodulator 38020, a trick play performer 38030, a system decoder / demux 38040 And / or a video decoder 38050.
- the receiver 38010 may receive a broadcast signal transmitted through a broadcast network, a cable network, and / or an internet network.
- the receiver may receive a transport stream (TS).
- the TS may include PVR assist information for performing trick play
- the PVR assist information may include tier number information and / or maximum temporal identification information.
- the above-described tier number information may indicate a tier number having a value obtained by adding 1 to the temporal identification information value of the picture other than the RAP, and the above-mentioned maximum temporal identification information indicates the maximum tempo of the video stream including the encoded video data. May indicate a central identification information value.
- the demodulator 38020 may demodulate a broadcast signal modulated according to a modulation technique.
- the trick play performer 38030 may select a TS packet for trick play by a method according to each scenario. Details of scenarios 1, 2, and 3, according to one embodiment of the present invention, are described above.
- the system decoder and demux 38040 may decode system information and may separate the multiplexed broadcast signal for each unit stream.
- the demultiplexer may demultiplex the received broadcast signal to extract a video stream.
- the demultiplexer according to an embodiment of the present invention may include a first extractor and / or a second extractor according to an embodiment of the present invention.
- the first extractor may extract a packetized elementary stream (PES) from the received TS.
- the second extractor may extract a video unit stream from the extracted PES.
- PES packetized elementary stream
- the video decoder 38050 may decode the video stream.
- the video decoder may include a system decoder and a trick play performer, and the video decoder may perform trick play of video data by decoding a video stream selected for trick play based on the PVR assist information.
- the video decoder may perform trick play based on temporal identification information and maximum temporal identification information included in the PVR assist information.
- the video stream according to an embodiment of the present invention may represent a video unit stream.
- FIG. 39 illustrates a comparison of a tier framework and an HEVC temporal sub-layer according to an embodiment of the present invention.
- the hierarchical structure of the HEVC temporal sub-layer according to an embodiment of the present invention is similar to the tier system framework.
- the temporal id according to an embodiment of the present invention may match the tier number.
- the HEVC temporal sub-layer according to an embodiment of the present invention may support the PVR by a method similar to the tier system framework.
- An HEVC compliant encoder / decoder may support an HEVC temporal sub-layer.
- a temporal id according to an embodiment of the present invention may exist in a HEVC encoded stream. If the stream is encoded by a temporal sub-layer structure, no special encoding structure for trick play may be needed. Therefore, the HEVC temporal sub-layer according to an embodiment of the present invention can reduce the encoding burden when supporting trick play.
- tier 7 means discarded pictures that are not referenced
- tier 6 means discarded pictures that are referenced.
- the HEVC temporal sub-layer according to an embodiment of the present invention may not distinguish whether the picture is a reference picture or not.
- tier 1 represents a RAP picture
- tier 2 represents a P picture.
- a temporal id 0 may represent all of an I picture, a P picture, and a B picture including an IRAP picture.
- other sublayers may be assigned to temporal ids 6 and 7. Therefore, when compared with the tier numbers 6 and 7, it is possible to support additional double speed when using the temporal id according to an embodiment of the present invention.
- frame division may be necessary in the base sublayer having a temporal id of 0 to support faster speed.
- FIG. 40 is a diagram showing the configuration of PVR_assist_information according to another embodiment of the present invention.
- a temporal id given at the video level to filter the access units (AUs) before the decoding process may be signaled at the system level.
- an intra frame may be signaled to support higher double speed in the base sublayer having a temporal id of zero.
- PVR_assist_information in accordance with one embodiment of the present invention may include data_field_tag field, data_field_length field, PVR_assist_temporal_id_plus1 field, PVR_assist_temporal_id_info_present_flag field, PVR_assist_intra_picture_flag field, PVR_assist_max_temporal_id_plus1 field, PVR_assist_PB_numbers_in_temporalid_zero field, PVR_assist_reserved_0 fields and / or field PVR_assist_reserved_byte.
- the data_field_tag field may represent that the corresponding data field is PVR_assist_information.
- the data_field_tag field may have 0x03.
- the data_field_length field may indicate the length of the PVR_assist_information excluding the data_field_tag field and the data_field_length field.
- the PVR_assist_temporal_id_plus1 field may indicate a temporal id of a picture.
- the temporal id can have the value minus one in this field.
- the minimum value of this field may be 1 and the maximum value may be 7.
- the value of this field may have the same value as the nuh_temporal_id_plus1 value.
- the PVR_assist_temporal_id_info_present_flag field may have a value of 1 if the PVR_assist_max_temporal_id_plus1 field exists. This field may be provided only in the picture corresponding to the RAP picture.
- the PVR_assist_intra_picture_flag field may have a value of 1 if the current access unit is an intra picture.
- the PVR_assist_max_temporal_id_plus1 field may indicate a maximum temporal id.
- the maximum temporal id may represent a value obtained by subtracting 1 from this field. This field may have a value from 1 to 7.
- the PVR_assist_PB_numbers_in_temporalId_zero field may indicate the number of non-intra frames present between intra frames in the base sublayer having a temporal id of zero. This field can be used to guess the speed of trick play.
- the PVR_assist_reserved_0 field is reserved for future use.
- the PVR_assist_reserved_byte field may indicate a field left for later use.
- 41 is a view showing a trick play method using an HEVC temporal sub-layer according to an embodiment of the present invention.
- the first picture 41010 is a picture in which pictures corresponding to one GOP are arranged in a playback order.
- I may represent an I picture
- B may represent a B picture
- P may represent a P picture.
- numbers below the alphabet indicating the type of picture may indicate the reproduction order.
- the arrow can indicate the reference relationship between the pictures.
- a second figure 40220 is a diagram illustrating a method of providing trick play using an HEVC temporal sub-layer according to an embodiment of the present invention.
- One square box may represent one picture. The number in the box can represent the temporal id.
- An x in the square may represent the picture to be decoded and played back during trick play. As shown in the second figure, 2x to 8x trick play can be provided by the HEVC temporal sub-layer.
- a third figure 40030 is a diagram showing a method of providing trick play using base sub-layer signaling according to an embodiment of the present invention.
- One embodiment of the present invention can provide trick play at 12x, 24x, and 48x speeds by decoding and playing only intra pictures.
- An embodiment of the present invention may transmit a broadcast signal by the following procedure.
- an embodiment of the present invention may generate a video unit stream by encoding video data.
- the video unit stream may be encoded by the AVC or the HEVC codec.
- an embodiment of the present invention may generate a packetized elementary stream (PES) including a video unit stream.
- PES packetized elementary stream
- an embodiment of the present invention may generate a transport stream (TS) including a PES.
- the TS may represent an MPEG-2 TS.
- the TS according to an embodiment of the present invention may include PVR assist information for performing trick play.
- the PVR assist information may mean information necessary for performing trick play of video data in a receiver or a PVR device. Detailed description of the PVR assist information has been described above with reference to FIG.
- PVR assist information may include tier number information and / or maximum temporal identification information.
- the tier number information may indicate a tier number having a value obtained by adding 1 to the temporal identification information value of a non-RAP, and the maximum temporal identification information is a maximum temporal identification value of a video stream including encoded video data. Can be represented.
- the maximum temporal identification information according to an embodiment of the present invention may be named max_temporal_id or PVR_assist_max_temporal_id. Detailed description of the maximum temporal identification information has been described above with reference to FIGS. 32, 35, 36, 37, and 40.
- the tier number may be used to signal the temporal sublayer.
- the tier number may indicate a dependency between pictures.
- the tier number may be used to signal the temporal sublayer similarly to the temporal identification information.
- Tier number information may be named PVR_assist_tier_pic_num. Detailed description of the tier number information described above has been described with reference to FIGS. 34, 35, 36, 39, and 40.
- an embodiment of the present invention may transmit the generated TS.
- an embodiment of the present invention may transmit the generated broadcast signal through at least one of a terrestrial broadcasting network, a cable network, and an internet network.
- the video unit stream may include one or more temporal sublayers, where the temporal sublayer may represent a set of pictures.
- the header of the NAL unit including the encoded video data may include temporal identification plus information.
- the temporal identification plus information may indicate a value obtained by adding 1 to the temporal identification information, and the temporal identification information may include information for identifying a temporal sublayer.
- Temporal identification plus information according to an embodiment of the present invention may be used to identify a temporal sublayer.
- the temporal sublayer may be named a temporal sub-layer
- the temporal identification information may be named a temporal id
- the temporal identification plus information may be named nuh_temporal_id_plus1.
- the PVR assist information may include maximum temporal identification information indicating a maximum temporal identification information value of the video unit stream.
- An HEVC encoded video stream may have several temporal sublayers and each temporal sublayer may be identified by temporal identification information.
- the maximum temporal identification information may mean temporal identification information of a temporal sublayer having the maximum temporal identification information among a plurality of temporal sublayers.
- the PVR assist information may include temporal identification flag information indicating whether the maximum temporal identification information is included, wherein the temporal identification flag information may be provided for each RAP.
- the temporal identification flag information may be named PVR_assist_temporal_id_info_present_flag.
- the temporal identification flag information may have a value of 1 when the maximum temporal identification information is included in the PVR assist information.
- the temporal identification flag information may have a value of zero.
- the maximum temporal identification information may be signaled for each RAP by providing temporal identification flag information for each RAP. Accordingly, an embodiment of the present invention may signal the maximum speed information of trick play for each RAP. Detailed description thereof has been given above with reference to FIGS. 35 and 40.
- the tier number indicated by the tier number information described above may have a value of 0 in the RAP picture.
- the tier number according to an embodiment of the present invention may be determined for each picture constituting the video stream.
- the tier number in the RAP picture may have zero.
- a RAP picture according to an embodiment of the present invention may mean an HEVC DVB_RAP picture.
- the maximum temporal identification information may be used to provide information about the speed of trick play.
- the maximum temporal identification information may signal information about the maximum speed of the trick play.
- An embodiment of the present invention may signal a trick play speed corresponding to each temporal identification information.
- an embodiment of the present invention can inform the user of the maximum speed information that can be serviced, and can determine the speed of trick play in response to a user's request and provide the determined speed.
- the maximum temporal identification information may have a value of any one of 0 to 6.
- the temporal identification information value when the temporal identification information value is one-to-one matched with the tier number according to another embodiment of the present invention, the temporal identification information value may be matched within a range of a previously defined tier number.
- tier numbers were defined from 0 to 7. Detailed description thereof has been given above with reference to FIGS. 35 and 40.
- the above-described PVR assist information may be included in the adaptation field of the TS.
- the adaptation field may include data as a field existing between the header of the TS packet and the payload.
- the adaptation field may include a private data byte field, and the private data byte field may include PVR assist information.
- the private data byte field may be included in the adaptation field and may include several data fields. One data field of the aforementioned several data fields may include PVR assist information.
- the above-described PVR assist information may include segmentation info flag information indicating whether information about a segment to which a picture belongs is present. Detailed description thereof has been provided above with reference to FIG. 34.
- the above-described PVR assist information may include segment identifier information indicating an id of a segment to which a picture belongs. Detailed description thereof has been provided above with reference to FIG. 34.
- the above-described PVR assist information may include program identifier information indicating an id of a program to which a picture belongs. Detailed description thereof has been provided above with reference to FIG. 34.
- the above-described PVR assist information includes segment start flag information for identifying a picture having a first play time order in each segment and a segment for identifying a picture with a last play time order in each segment. It may include at least one of the end flag information. Detailed description thereof has been provided above with reference to FIG. 34.
- the above-described PVR assist information includes program start flag information for identifying a picture having a first play time order in each program and a program for identifying a picture having a last play time order in each program. It may include at least one of the end flag information. Detailed description thereof has been provided above with reference to FIG. 34.
- FIG. 43 is a diagram showing the structure of a broadcast signal receiving apparatus according to an embodiment of the present invention.
- the broadcast signal receiving apparatus 4210 may include a receiver 43020, a first extractor 43030, a second extractor 43040, and / or a decoder 43050.
- the receiver may receive a transport stream (TS).
- the TS may include PVR assist information for performing trick play
- the PVR assist information may include tier number information and / or maximum temporal identification information.
- the above-described tier number information may indicate a tier number having a value obtained by adding 1 to the temporal identification information value of the picture other than the RAP, and the above-mentioned maximum temporal identification information indicates the maximum tempo of the video stream including the encoded video data. May indicate a central identification information value. Detailed description thereof has been provided above with reference to FIG. 42.
- the first extractor may extract a packetized elementary stream (PES) from the received TS.
- PES packetized elementary stream
- the second extractor may extract a video unit stream from the extracted PES.
- the decoder may decode the extracted video unit stream.
- FIG. 38 The configuration having the same name as that of the broadcast signal receiving apparatus shown in FIG. 38 among the configurations of the broadcast signal receiving apparatus shown in this figure is shown in FIG. 38. Can perform the same function as
- the configuration corresponding to the process of the broadcast signal transmitting method shown in FIG. 42 among the components of the broadcast signal receiving apparatus shown in this figure corresponds to the process of the broadcast signal transmitting method shown in FIG. 42. Function can be performed.
- Apparatus and method according to the present invention is not limited to the configuration and method of the embodiments described as described above, the above-described embodiments may be selectively all or part of each embodiment so that various modifications can be made It may be configured in combination.
- the image processing method of the present invention can be implemented as a processor-readable code on a processor-readable recording medium provided in the network device.
- the processor-readable recording medium includes all kinds of recording devices that store data that can be read by the processor. Examples of the processor-readable recording medium include ROM, RAM, CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like, and may also be implemented in the form of a carrier wave such as transmission over the Internet. .
- the processor-readable recording medium can also be distributed over network coupled computer systems so that the processor-readable code is stored and executed in a distributed fashion.
- the present invention can be used throughout the broadcasting industry.
Abstract
Description
Claims (18)
- 비디오 데이터를 인코딩하여 비디오 단위 스트림을 생성하는 단계;상기 비디오 단위 스트림을 포함하는 PES (Packetized Elementary Stream)를 생성하는 단계;상기 생성된 PES를 포함하는 TS (Transport Stream)를 생성하는 단계,여기서, 상기 TS는 트릭 플레이 수행을 위한 PVR 어시스트 정보를 포함하고,여기서, 상기 PVR 어시스트 정보는 티어 넘버 정보 및 최대 템포럴 식별 정보를 포함하고,여기서, 상기 티어 넘버 정보는 RAP가 아닌 픽처의 템포럴 식별 정보 값에 1을 더한 값을 가지는 티어 넘버를 나타내고, 상기 최대 템포럴 식별 정보는 상기 인코딩된 비디오 데이터를 포함하는 비디오 단위 스트림의 최대 템포럴 식별 정보 값을 나타내고;상기 생성된 TS를 전송하는 단계;를 포함하는 방송 신호 송신 방법.
- 제 1 항에 있어서,상기 티어 넘버 정보가 나타내는 티어 넘버는 RAP 픽처에서는 0 값을 갖는 방송 신호 송신 방법.
- 제 1 항에 있어서,상기 최대 템포럴 식별 정보는 트릭 플레이의 속도에 대한 정보를 제공하는데 사용되는 방송 신호 송신 방법.
- 제 1 항에 있어서,상기 PVR 어시스트 정보는 상기 TS의 어댑테이션 필드에 포함되는 방송 신호 송신 방법.
- 제 1 항에 있어서,상기 PVR 어시스트 정보는 픽처가 속한 세그먼트에 대한 정보의 존재 여부를 나타내는 세그먼테이션 인포 플래그 정보를 포함하는 방송 신호 송신 방법.
- 제 5 항에 있어서,상기 PVR 어시스트 정보는 픽처가 속한 세그먼트의 id를 나타내는 세그먼트 식별자 정보를 포함하는 방송 신호 송신 방법.
- 제 5 항에 있어서,상기 PVR 어시스트 정보는 픽처가 속한 프로그램의 id를 나타내는 프로그램 식별자 정보를 포함하는 방송 신호 송신 방법.
- 제 5 항에 있어서,상기 PVR 어시스트 정보는 각 세그먼트에서 재생 시간 순서가 첫 번째인 픽처를 식별하는 세그먼트 스타트 플래그 정보 및 각 세그먼트에서 재생 시간 순서가 마지막인 픽처를 식별하는 세그먼트 엔드 플래그 정보 중 적어도 어느 하나를 포함하는 방송 신호 송신 방법.
- 제 5 항에 있어서,상기 PVR 어시스트 정보는 각 프로그램에서 재생 시간 순서가 첫 번째인 픽처를 식별하는 프로그램 스타트 플래그 정보 및 각 프로그램에서 재생 시간 순서가 마지막인 픽처를 식별하는 프로그램 엔드 플래그 정보 중 적어도 어느 하나를 포함하는 방송 신호 송신 방법.
- TS (Transport Stream)을 수신하는 수신부,여기서, 상기 TS는 트릭 플레이 수행을 위한 PVR 어시스트 정보를 포함하고,여기서, 상기 PVR 어시스트 정보는 티어 넘버 정보 및 최대 템포럴 식별 정보를 포함하고,여기서, 상기 티어 넘버 정보는 RAP가 아닌 픽처의 템포럴 식별 정보 값에 1을 더한 값을 가지는 티어 넘버를 나타내고, 상기 최대 템포럴 식별 정보는 상기 인코딩된 비디오 데이터를 포함하는 비디오 단위 스트림의 최대 템포럴 식별 정보 값을 나타내고;상기 수신한 TS에서 PES (Packetized Elementary Stream)을 추출하는 제 1 추출부;상기 추출된 PES에서 비디오 단위 스트림을 추출하는 제 2 추출부;상기 추출된 비디오 단위 스트림을 디코딩하는 디코더;를 포함하는 방송 신호 수신 장치.
- 제 10 항에 있어서,상기 티어 넘버 정보가 나타내는 티어 넘버는 RAP 픽처에서는 0 값을 갖는 방송 신호 수신 장치.
- 제 10 항에 있어서,상기 최대 템포럴 식별 정보는 트릭 플레이의 속도에 대한 정보를 제공하는데 사용되는 방송 신호 수신 장치.
- 제 10 항에 있어서,상기 PVR 어시스트 정보는 상기 TS의 어댑테이션 필드에 포함되는 방송 신호 수신 장치.
- 제 10 항에 있어서,상기 PVR 어시스트 정보는 픽처가 속한 세그먼트에 대한 정보의 존재 여부를 나타내는 세그먼테이션 인포 플래그 정보를 포함하는 방송 신호 수신 장치.
- 제 14 항에 있어서,상기 PVR 어시스트 정보는 픽처가 속한 세그먼트의 id를 나타내는 세그먼트 식별자 정보를 포함하는 방송 신호 수신 장치.
- 제 14 항에 있어서,상기 PVR 어시스트 정보는 픽처가 속한 프로그램의 id를 나타내는 프로그램 식별자 정보를 포함하는 방송 신호 수신 장치.
- 제 14 항에 있어서,상기 PVR 어시스트 정보는 각 세그먼트에서 재생 시간 순서가 첫 번째인 픽처를 식별하는 세그먼트 스타트 플래그 정보 및 각 세그먼트에서 재생 시간 순서가 마지막인 픽처를 식별하는 세그먼트 엔드 플래그 정보 중 적어도 어느 하나를 포함하는 방송 신호 수신 장치.
- 제 14 항에 있어서,상기 PVR 어시스트 정보는 각 프로그램에서 재생 시간 순서가 첫 번째인 픽처를 식별하는 프로그램 스타트 플래그 정보 및 각 프로그램에서 재생 시간 순서가 마지막인 픽처를 식별하는 프로그램 엔드 플래그 정보 중 적어도 어느 하나를 포함하는 방송 신호 수신 장치.
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CA2891664A CA2891664C (en) | 2013-12-01 | 2014-11-18 | Method and apparatus for transmitting and receiving broadcast signal for providing trick play service |
EP14843154.7A EP3076676B1 (en) | 2013-12-01 | 2014-11-18 | Method and device for transmitting and receiving broadcast signal for providing trick play service |
MX2015003113A MX343061B (es) | 2013-12-01 | 2014-11-18 | Metodo y aparato para transmitir y recibir señal de difusion para proveer un servicio de reproduccion trucada. |
US14/423,980 US9860607B2 (en) | 2013-12-01 | 2014-11-18 | Method and apparatus for transmitting and receiving broadcast signal for providing trick play service |
CN201480003247.7A CN104823450B (zh) | 2013-12-01 | 2014-11-18 | 发送和接收广播信号以便提供特技播放服务的方法和装置 |
DE112014000261.5T DE112014000261T5 (de) | 2013-12-01 | 2014-11-18 | Verfahren und Gerät zum Senden und Empfangen eines Übertragungssignals zum Bereitstellen eines Trickmodusdienstes |
JP2015550347A JP6126240B2 (ja) | 2013-12-01 | 2014-11-18 | トリックプレイサービスを提供する放送信号送受信方法および装置 |
KR1020157004495A KR102182166B1 (ko) | 2013-12-01 | 2014-11-18 | 트릭 플레이 서비스 제공을 위한 방송 신호 송수신 방법 및 장치 |
Applications Claiming Priority (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201361910416P | 2013-12-01 | 2013-12-01 | |
US61/910,416 | 2013-12-01 | ||
US201461952140P | 2014-03-13 | 2014-03-13 | |
US61/952,140 | 2014-03-13 | ||
US201461954615P | 2014-03-18 | 2014-03-18 | |
US61/954,615 | 2014-03-18 | ||
US201461970910P | 2014-03-27 | 2014-03-27 | |
US61/970,910 | 2014-03-27 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015080414A1 true WO2015080414A1 (ko) | 2015-06-04 |
Family
ID=53199318
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2014/011042 WO2015080414A1 (ko) | 2013-12-01 | 2014-11-18 | 트릭 플레이 서비스 제공을 위한 방송 신호 송수신 방법 및 장치 |
Country Status (9)
Country | Link |
---|---|
US (1) | US9860607B2 (ko) |
EP (1) | EP3076676B1 (ko) |
JP (1) | JP6126240B2 (ko) |
KR (1) | KR102182166B1 (ko) |
CN (1) | CN104823450B (ko) |
CA (1) | CA2891664C (ko) |
DE (1) | DE112014000261T5 (ko) |
MX (1) | MX343061B (ko) |
WO (1) | WO2015080414A1 (ko) |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20150009424A (ko) | 2013-07-15 | 2015-01-26 | 한국전자통신연구원 | 시간적 서브 레이어 정보에 기반한 계층간 예측을 이용한 영상 부, 복호화 방법 및 그 장치 |
EP3076677A4 (en) * | 2013-12-01 | 2017-06-14 | LG Electronics Inc. | Method and device for transmitting and receiving broadcast signal for providing trick play service in digital broadcasting system |
KR101792518B1 (ko) | 2013-12-16 | 2017-11-02 | 엘지전자 주식회사 | 트릭 플레이 서비스 제공을 위한 신호 송수신 장치 및 신호 송수신 방법 |
KR101809969B1 (ko) * | 2014-03-18 | 2017-12-18 | 엘지전자 주식회사 | Hevc 스트림의 트릭 플레이 서비스 제공을 위한 방송 신호 송수신 방법 및 장치 |
EP3254471A1 (en) * | 2015-02-05 | 2017-12-13 | Cisco Technology, Inc. | Pvr assist information for hevc bitstreams |
US9510062B1 (en) * | 2015-08-13 | 2016-11-29 | This Technology, Inc. | In-band trick mode control |
MX2018003687A (es) * | 2015-09-23 | 2018-04-30 | Arris Entpr Llc | Alto rango dinamico de señalizacion y contenido de amplia gama de colores en corrientes de transporte. |
EP4258673A3 (en) | 2015-10-07 | 2023-12-13 | Panasonic Intellectual Property Management Co., Ltd. | Video transmission method, video reception method, video transmission device, and video reception device |
US10129574B2 (en) | 2016-05-24 | 2018-11-13 | Divx, Llc | Systems and methods for providing variable speeds in a trick-play mode |
CN114466238B (zh) * | 2020-11-09 | 2023-09-29 | 华为技术有限公司 | 帧解复用方法、电子设备及存储介质 |
CN114845152B (zh) * | 2021-02-01 | 2023-06-30 | 腾讯科技(深圳)有限公司 | 播放控件的显示方法、装置、电子设备及存储介质 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20010036269A1 (en) * | 2000-04-28 | 2001-11-01 | Takeo Morinaga | Information transmitting method, information processing method and apparatus, and information recording and reproducing method and apparatus |
US20020118680A1 (en) * | 2001-02-28 | 2002-08-29 | Sang Yong Lee | Media router and method for recording/reproducing broadcasting signal by using the same |
KR20040011819A (ko) * | 2002-07-30 | 2004-02-11 | 엘지전자 주식회사 | Pvr 지원 비디오 디코딩 시스템 |
KR20070119351A (ko) * | 2006-06-15 | 2007-12-20 | 엘지전자 주식회사 | 방송 시스템 및 방법 |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008294512A (ja) * | 2007-05-22 | 2008-12-04 | Panasonic Corp | デジタルテレビ受信装置及びデジタルテレビ送信装置 |
US20090323822A1 (en) | 2008-06-25 | 2009-12-31 | Rodriguez Arturo A | Support for blocking trick mode operations |
EP2665259A1 (en) * | 2012-05-17 | 2013-11-20 | Samsung Electronics Co., Ltd | Recording medium, reproducing device for performing trick play for data of the recording medium, and method thereof |
EP3039861A1 (en) * | 2013-08-28 | 2016-07-06 | Cisco Technology, Inc. | Support for trick modes in hevc streams |
-
2014
- 2014-11-18 CA CA2891664A patent/CA2891664C/en active Active
- 2014-11-18 MX MX2015003113A patent/MX343061B/es active IP Right Grant
- 2014-11-18 CN CN201480003247.7A patent/CN104823450B/zh active Active
- 2014-11-18 JP JP2015550347A patent/JP6126240B2/ja active Active
- 2014-11-18 US US14/423,980 patent/US9860607B2/en active Active
- 2014-11-18 KR KR1020157004495A patent/KR102182166B1/ko active IP Right Grant
- 2014-11-18 WO PCT/KR2014/011042 patent/WO2015080414A1/ko active Application Filing
- 2014-11-18 DE DE112014000261.5T patent/DE112014000261T5/de not_active Withdrawn
- 2014-11-18 EP EP14843154.7A patent/EP3076676B1/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20010036269A1 (en) * | 2000-04-28 | 2001-11-01 | Takeo Morinaga | Information transmitting method, information processing method and apparatus, and information recording and reproducing method and apparatus |
KR100816603B1 (ko) * | 2000-04-28 | 2008-03-24 | 소니 가부시끼 가이샤 | 정보 송신 방법, 정보 처리 방법 및 장치, 정보 기록 재생방법 및 장치 |
US20020118680A1 (en) * | 2001-02-28 | 2002-08-29 | Sang Yong Lee | Media router and method for recording/reproducing broadcasting signal by using the same |
KR20040011819A (ko) * | 2002-07-30 | 2004-02-11 | 엘지전자 주식회사 | Pvr 지원 비디오 디코딩 시스템 |
KR20070119351A (ko) * | 2006-06-15 | 2007-12-20 | 엘지전자 주식회사 | 방송 시스템 및 방법 |
Non-Patent Citations (1)
Title |
---|
See also references of EP3076676A4 * |
Also Published As
Publication number | Publication date |
---|---|
EP3076676A4 (en) | 2016-10-19 |
EP3076676A1 (en) | 2016-10-05 |
JP2016506182A (ja) | 2016-02-25 |
CN104823450A (zh) | 2015-08-05 |
JP6126240B2 (ja) | 2017-05-10 |
CA2891664A1 (en) | 2015-06-01 |
CN104823450B (zh) | 2019-07-12 |
MX2015003113A (es) | 2016-01-12 |
KR20160091814A (ko) | 2016-08-03 |
CA2891664C (en) | 2016-12-13 |
KR102182166B1 (ko) | 2020-11-24 |
MX343061B (es) | 2016-10-21 |
US9860607B2 (en) | 2018-01-02 |
DE112014000261T5 (de) | 2015-09-24 |
EP3076676B1 (en) | 2019-05-01 |
US20160261924A1 (en) | 2016-09-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2015080414A1 (ko) | 트릭 플레이 서비스 제공을 위한 방송 신호 송수신 방법 및 장치 | |
WO2015126144A1 (ko) | 파노라마 서비스를 위한 방송 신호 송수신 방법 및 장치 | |
WO2015102449A1 (ko) | 컬러 개멋 리샘플링을 기반으로 하는 방송 신호 송수신 방법 및 장치 | |
WO2015152635A1 (ko) | 신호 송수신 장치 및 신호 송수신 방법 | |
WO2015034306A1 (ko) | 디지털 방송 시스템에서 고화질 uhd 방송 컨텐츠 송수신 방법 및 장치 | |
WO2014003379A1 (ko) | 영상 디코딩 방법 및 이를 이용하는 장치 | |
WO2009125961A1 (en) | Method of transmitting and receiving broadcasting signal and apparatus for receiving broadcasting signal | |
WO2015065037A1 (ko) | Hevc 기반의 ip 방송 서비스 제공을 위한 방송 신호 송수신 방법 및 장치 | |
WO2010058958A2 (ko) | 비실시간 서비스 처리 방법 및 방송 수신기 | |
WO2014003515A1 (ko) | 멀티미디어 시스템에서 적응적 미디어 구조 송신 방법 및 장치 | |
WO2015076616A1 (ko) | 신호 송수신 장치 및 신호 송수신 방법 | |
WO2010068033A2 (ko) | 비실시간 서비스 처리 방법 및 방송 수신기 | |
WO2015093811A1 (ko) | 트릭 플레이 서비스 제공을 위한 신호 송수신 장치 및 신호 송수신 방법 | |
WO2014073927A1 (ko) | 신호 송수신 장치 및 신호 송수신 방법 | |
WO2015115869A1 (ko) | 트릭 플레이 서비스 제공을 위한 신호 송수신 장치 및 신호 송수신 방법 | |
WO2011112053A2 (ko) | 비실시간 방송 서비스 처리 시스템 및 그 처리방법 | |
WO2017135673A1 (ko) | 방송 신호 송신 장치, 방송 신호 수신 장치, 방송 신호 송신 방법, 및 방송 신호 수신 방법 | |
WO2016171518A2 (ko) | 방송 신호 송신 장치, 방송 신호 수신 장치, 방송 신호 송신 방법, 및 방송 신호 수신 방법 | |
WO2011132883A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2012050405A2 (ko) | 디지털 수신기 및 디지털 수신기에서의 3d 컨텐트 처리방법 | |
WO2015126117A1 (ko) | 방송 신호 송수신 방법 및 장치 | |
WO2017061796A1 (ko) | 방송 신호 송신 장치, 방송 신호 수신 장치, 방송 신호 송신 방법, 및 방송 신호 수신 방법 | |
WO2011132879A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 | |
WO2018062641A1 (ko) | 관심 영역을 고려한 가상 현실 서비스 제공 | |
WO2011132882A2 (ko) | 인터넷 기반 컨텐츠 송수신 방법 및 그를 이용한 송수신 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 20157004495 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 14423980 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2891664 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2015/003113 Country of ref document: MX |
|
REEP | Request for entry into the european phase |
Ref document number: 2014843154 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2014843154 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2015550347 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 112014000261 Country of ref document: DE Ref document number: 1120140002615 Country of ref document: DE |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14843154 Country of ref document: EP Kind code of ref document: A1 |