EP3013065B1 - Informationsverarbeitungsvorrichtung und -verfahren - Google Patents

Informationsverarbeitungsvorrichtung und -verfahren Download PDF

Info

Publication number
EP3013065B1
EP3013065B1 EP14826667.9A EP14826667A EP3013065B1 EP 3013065 B1 EP3013065 B1 EP 3013065B1 EP 14826667 A EP14826667 A EP 14826667A EP 3013065 B1 EP3013065 B1 EP 3013065B1
Authority
EP
European Patent Office
Prior art keywords
image
tile
data
file
track
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP14826667.9A
Other languages
English (en)
French (fr)
Other versions
EP3013065A4 (de
EP3013065A1 (de
Inventor
Shinobu Hattori
Mitsuhiro Hirabayashi
Tatsuya Igarashi
Mikita Yasuda
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Publication of EP3013065A1 publication Critical patent/EP3013065A1/de
Publication of EP3013065A4 publication Critical patent/EP3013065A4/de
Application granted granted Critical
Publication of EP3013065B1 publication Critical patent/EP3013065B1/de
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/231Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234345Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements the reformatting operation being performed only on part of the stream, e.g. a region of the image or a time segment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/23439Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements for generating different versions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • H04N21/2353Processing of additional data, e.g. scrambling of additional data or processing content descriptors specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N21/2385Channel allocation; Bandwidth allocation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/262Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists
    • H04N21/26258Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists for generating a list of items to be played back in a given order, e.g. playlist, or scheduling item distribution according to such list
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8455Structuring of content, e.g. decomposing content into time segments involving pointers to the content, e.g. pointers to the I-frames of the video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/85406Content authoring involving a specific file format, e.g. MP4 format

Definitions

  • the present disclosure relates to an information processing device and method, and more particularly, to an information processing device and method of adaptively supplying data of a partial image.
  • MPEG-DASH Moving Picture Experts Group-Dynamic Adaptive Streaming over HTTP
  • ABS adaptive bitrate streaming
  • selecting a partial image which is a part of an image instead of the entire image and delivering it adaptively has been considered.
  • delivering a partial image which is a part selected in the entire image on a terminal side receiving image data, or controlling the size of the partial image to be delivered according to the performance of the terminal for example, a processing ability of a CPU or the like or the size of a display
  • a transmission path, a load situation of a server, or the like has been considered.
  • Non-Patent Literature 1 MPEG-DASH (Dynamic Adaptive Streaming over HTTP) (URL:http://mpeg.chiariglione.org/standards/mpeg-dash/media-presentation-description-and-segment-formats/text-isoiec-23009-12012-dam-1)
  • the MPEG-DASH standard of the related art relates only to the concept of switching bit rates (Bitrates), and no selection of any partial image or supply of the data performed using tile structures described above, that is, adaptive supply of data of partial images, may be performed.
  • information can be processed.
  • it is possible to adaptively supply data of partial images.
  • MPEG-DASH adopts an adaptive bitrate streaming (ABS) technology in which a plurality of pieces of encoded data in which the same content is expressed at different bit rates are stored in a content server and a client selects and reproduces one piece of encoded data among the plurality of pieces of encoded data according to a network bandwidth.
  • ABS adaptive bitrate streaming
  • a procedure of transmission of content by DASH will be described with reference to FIG. 1 .
  • software for controlling streaming data selects a media presentation description (MPD) file of desired content and acquires the MPD file from a web server.
  • the MPD is metadata for managing content such as a moving image or audio to be delivered.
  • the software for controlling streaming data of the moving image reproduction terminal analyzes the MPD and performs control such that data (a DASH segment) of the desired content appropriate for the quality of a communication line, the performance of the moving image reproduction terminal, or the like is acquired from the web server.
  • Client software for HTTP access acquires the DASH segment using HTTP from the web server under the control. The content acquired in this way is reproduced by moving image reproduction software.
  • the MPD has, for example, the configuration illustrated in FIG. 2 .
  • a client selects an optimum representation from attributes of representations (Representation) included in periods (Period) of the MPD (Media Presentation in FIG. 2 ).
  • the client reads the beginning segment (Segment) of the selected representation (Representation) to acquire and process an initialization segment (Initialization Segment). Subsequently, the client acquires and reproduces subsequent segments (Segment).
  • FIG. 3 A relation among the period (Period), the representation (Representation), and the segment (Segment) in the MPD is illustrated in FIG. 3 . That is, one piece of media content can be managed for each period (Period) which is a unit of data in a time direction and each period (Period) can be managed for each segment (Segment) which is a unit of data in the time direction. For each period (Period), a plurality of representations (Representation) with different attributes such as bit rates can be configured.
  • a file of the MPD (also referred to as an MPD file) has the layered structure illustrated in FIG. 4 below the period (Period).
  • the MPD has the structure illustrated in the example of FIG. 5 .
  • a plurality of representations (Representation) are present as is apparent from the example of FIG. 5 .
  • the client can acquire proper stream data according to a communication environment, a decoding ability of the client, or the like by adaptively selecting any of the representations to reproduce the stream data.
  • DASH of the related art delivery of data of an entire image is adaptively controlled, but selecting a partial image which is a part of an image instead of the entire image and delivering it adaptively has been considered. For example, delivering a partial image which is a part selected in the entire image on a terminal side receiving image data, or controlling the size of the partial image to be delivered according to the performance of the terminal (for example, a processing ability of a CPU or the like or the size of a display), a transmission path, a load situation of a server, or the like has been considered.
  • a tile is a partial region obtained by dividing an entire image in a pre-decided layout (a size, a shape, a number, or the like).
  • a tile image an image of one tile is referred to as a tile image.
  • a partial image is configured by a single tile image or a plurality of tile images.
  • image data is encoded and a bit stream of the image data is filed to be delivered (published as a file).
  • image data is encoded independently for each tile image. At this time, as in the example illustrated in FIG. 6A , each piece of encoded data of each tile may be configured in one bit stream.
  • an entire image with a 640 ⁇ 480 size, an entire image with a 1980 ⁇ 1080 size, and each of tile images (four partial images) with a 960 ⁇ 540 size obtained by dividing the entire image into two in the vertical and horizontal directions are prepared as images for delivery.
  • Data of the entire image with the 640 ⁇ 480 size is encoded and considered to be one bit stream (bitstream1) and the data of the entire image with the 1980 ⁇ 1080 size is also encoded and considered to be one bit stream (bitstream2).
  • bitstream3 to bitstream6 data of each tile image with the 960 ⁇ 540 size is independently encoded and considered to be one bit stream.
  • header information such as a video parameter set (VPS), a sequence parameter set (SPS), supplemental enhancement information (SEI), and a picture parameter set (PPS) is added, and the bit stream of the image data is arranged for each slice (Slice).
  • VPS video parameter set
  • SPS sequence parameter set
  • SEI supplemental enhancement information
  • PPS picture parameter set
  • a tile image to be delivered can be selected by selecting the bit stream to be delivered selected from bitstream3 to bitstream6.
  • each tile image can be delivered as the entire image.
  • a structure called tiles (Tile) into which an entire image is divided is supported, and thus encoding can be independently performed for each tile.
  • decoding can be performed so that only the image of some of the tiles is obtained. That is, the decoding can be performed so that only a partial image which is a part of the entire image is obtained.
  • encoded data of a plurality of tile images can also be configured as one bit stream (bitstream7) using the function of such a coding scheme. That is, in this case, the tiles (Tile) for delivery described above are handled as tiles (Tile) supported by the coding scheme to be encoded. In this case, in the bit stream, the data of the tiles is arranged as slices (Slice).
  • the bit stream for delivery is filed in accordance with, for example, an MP4 file format.
  • the bit stream of each tile can be set to be a separate file, as in the example illustrated in FIG. 7 .
  • the bit stream of each tile is managed in units called tracks (Track).
  • header (Header) information regarding each tile and a base track (Base Track) in which reference to each track is described are provided and filed as a different file from the bit stream of each tile.
  • the base track is reproduced.
  • the base track is referred to in the header information.
  • the bit streams of the tiles can also be collected and configured in one file.
  • data of the tiles can also be collected and managed in one track as in FIG. 8A and the tiles can also be managed as mutually different tracks as in FIG. 8B .
  • the header (Header) information regarding each tile and the base track (Base Track) in which the reference to each track is described are provided.
  • the tiles may be tiles obtained by equally dividing an entire image as in the example of FIG. 9A or may be tiles obtained by unequally dividing an entire image as in the example of FIG. 9B . That is, the image sizes of the tile images forming the entire image may be the same as or different from one another.
  • an application controlling the size of a partial image to be displayed can be considered.
  • An entire image 10 illustrated in FIG. 9A is assumed to be tiled and divided into a plurality of tile images 11 with the same size.
  • an application displays partial images 12 which are 4 tile images of 2 ⁇ 2.
  • an application displays partial images 13 which are 30 tile images of 6 ⁇ 5. In this way, an application controlling the image sizes of partial images displayed according to the performance or the like of a terminal displaying an image is considered.
  • the image sizes of the tile images are unequal.
  • the application can display an image with an HD resolution by displaying an image of a tile 3 (Tile 3), can display an image with a cinema resolution by displaying images of tile 2 (Tile 2) to tile 4 (Tile 4), and can further display an image with a further extended size (EXT) by displaying images of tile 1 (Tile1) to tile 5 (Tile5).
  • the application controlling a resolution or an aspect ratio of a display image by controlling the image sizes of partial images to be displayed is considered.
  • a load of a server, a terminal, a transmission path, or the like can be adaptively controlled, and thus it is possible to suppress an increase in an unnecessary load.
  • the MPEG-DASH standard of the related art relates only to the concept of switching bit rates (Bitrates), and no selection of any partial image or supply of the data performed using tile structures described above, that is, adaptive supply of data of partial images, may be performed.
  • partial image information which is information regarding a partial image which is a part of an entire image is generated as extended data of the MPD, and an extended MPD which is extended to include metadata used for supply of a bit stream of the entire image and supply of a bit stream of the partial image, that is, the partial image information, is generated using the generated partial image information.
  • the partial image to be supplied may be any partial image as long as the partial image is a part of the entire image, and the shape, size, etc. are arbitrary.
  • the partial image may be a part which can be encoded independently from other portions.
  • the partial image is assumed to be an image in units of tiles described above. That is, the partial image is assumed to be formed by a single tile image or a plurality of tile images.
  • the MPD has a layered structure, for example, layers of an adaptation set (AdaptationSet), a representation (Representation), a sub-representation (Sub-Representation), and a sub-segment (Sub-Segment). Any of these layers may be extended.
  • AdaptationSet adaptation set
  • Representation representation
  • Sub-Representation sub-representation
  • Sub-Segment sub-segment
  • a description for a tile is defined utilizing a descriptor type element (DescriptorType element) of the MPD.
  • DescriptorType element a descriptor type element of the MPD.
  • a description for a tile called a viewpoint (Viewpoint) is defined as in FIG. 10A .
  • the viewpoint is an element which is present in the adaptation set (AdaptationSet).
  • the viewpoint is a description that defines what the view is. For example, the viewpoint defines whether the view is a right (R) image or a left (L) image of a stereo image.
  • an element of the related art is used (extended).
  • the element of the related art it is possible to suppress a reduction in affinity to an MPD of the related art (it is possible to suppress an increase in a description which may not be analyzed by a decoder of the related art).
  • the representation (Representation) or the sub-representation (Sub-Representation) is extended, a new element is defined.
  • a schema for storing the partial image information is defined.
  • (urn:mpeg:DASH:tile:2013) is defined as a schema for a tile.
  • the extension of the schema is performed when any of the adaptation set, the representation, and the sub-representation is extended.
  • values of schema for the new tile are defined.
  • the above-described partial image information is defined.
  • a view type (1) viewtype
  • information (2) the width and the height of an entire image
  • information (3) the x coordinate and the y coordinate of the image indicated by the element
  • group identification information (4) TilegroupID
  • the view type is information indicating, for example, whether the image is a tile image, as illustrated in FIG. 10B .
  • a value when the image is an entire image is assumed to be "0”
  • a value when the image is a tile image and a bit stream is divided for each tile as in the example of FIG. 6A is assumed to be "1,”
  • a value when the image is a tile image and data of all the tiles is collected in one bit stream as in the example of FIG. 6B is assumed to be "2.”
  • These values and states (definitions of the values) indicated by the values are decided in advance.
  • the method of defining these values is arbitrary and an example other than this example may be used.
  • the information (the width and the height of the entire image) regarding the size of the entire image is information indicating the size (the horizontal width and the height) of an image in which all of the tile images belonging to the same group as the image (the tile image) are unified, as illustrated in FIG. 10B .
  • the sizes of images of bit streams are the same as the size of a display image.
  • the sizes of the images of the bit streams are different from the size of the display image in some cases. For example, when a plurality of tile images of mutually different bit streams are unified to be displayed, the size of the display image can be larger than the sizes of the images of the bits streams in some cases.
  • the size of an image in which all of the tile images belonging to the same group as the image (the tile image) are unified is indicated. That is, by referring to this value, it is possible to easily comprehend a maximum processing load when all of the tile images belonging to the same group of the image (the tile image) are decoded.
  • the size (1920 ⁇ 1080) of an image in which 4 (2 ⁇ 2) tile images with a 960 ⁇ 540 size are unified is indicated as information regarding the size of the entire image.
  • the information (the x coordinate and the y coordinate of the image indicated by the element) indicating the position of the partial image in the entire image is information indicating where the image in which all of the tile images belonging to the same group as the image (tile image) are unified is located, as illustrated in FIG. 10B .
  • Expression of the position (indicating with which value) is arbitrary.
  • the position may be expressed with the coordinates of the upper left of the image.
  • the position may be expressed with another piece of information such as identification information regarding the tile or the coordinates of another location other than the upper left.
  • the group identification information is identification information indicating a group of the tile images to which the image belongs, as illustrated in FIG. 10B .
  • the same value can be assigned to the tile images of the same group. In contrast, different values can be assigned to respective groups.
  • the same value can be assigned as group identification information to the tile images.
  • the group identification information may be defined not as the value of the viewpoint but as an attribute of another element, for example, as follows.
  • an attribute called a group is already present.
  • a meaning can be assigned as a set (Tilegroup) of tiles (Tile) to the group.
  • an attribute called group is not present in the representation or the sub-representation. That is, when the representation or the sub-representation is extended, a new attribute called (group) is set.
  • the above-described extension method can also be applied when a bit stream is filed (in particular, MP4 filing) as in the example of FIG. 7 or 8 .
  • a bit stream is filed (in particular, MP4 filing) as in the example of FIG. 7 or 8 .
  • the header information or the like of the bit stream assigned to other tracks is assigned to the base track (Base Track)
  • positional information regarding the segment is not necessary.
  • a value which is not the actual coordinates may be defined as information regarding the position of the image.
  • NULL, empty, space, or the like may be set.
  • a considerably large value or a negative value may be set as the coordinates.
  • identification (a flag or the like) indicating the base track may be separately provided.
  • segments are necessarily present under the representation (Representation). That is, a URL of an MP4 file is described in segments immediately under the representation.
  • the sub-representation is, for example, information that is used to reproduce only trickplay or music and designates data of a part in the MP4 file of the segment immediately under the representation.
  • the MPD When the MPD is extended so that the partial image information can be included, the MPD may be extended so that segments are present under the sub-representation (Sub-Representation). That is, a tile image may be assigned to the sub-representation so that the URL of the MP4 file can be referred to.
  • Sub-Representation a sub-representation
  • tags of a base URL ( ⁇ BaseURL>), a segment base ( ⁇ SegmentBase>), a segment list ( ⁇ SegmentList>), a segment template ( ⁇ SegmentTemplate>), and the like are additionally defined in the sub-representation.
  • segment information indicating that the information regarding the bit stream is present under the sub-representation (Sub-Representation) as the partial image information and store the segment information in the MPD.
  • a flag (@SegmentInSubRepresentation: true or false) indicating whether the information regarding the bit stream is present under the sub-representation is defined as the segment information.
  • the representation can be configured by the sub-representations of the plurality of tile images.
  • a segment expresses a concept of time, and thus the segments of the same time are not permitted to be present in one representation (Representation).
  • the MPD When the MPD is extended so that the partial image information is included, the MPD may be extended so that a plurality of segments of the same time can be present in one representation by assigning the tile images to the segments.
  • multi-segment information indicating that the plurality of segments to which the tile images of the same time are assigned are present as partial image information under the representation and store the multi-segment information in the MPD.
  • a flag (@multiSegmentInRepresentation: true or false) indicating whether the plurality of pieces of information regarding the bit streams of the same time are present under the representation is defined as the multi-segment information.
  • the segment can be designated only in access units (AU) in the related art, but the sub-segment (Sub-Segment) assigning an ssix box extended so that data in units of tiles can be designated may be defined under the segment to which an MP4 file storing a bit stream of a single tile image or a plurality of tile images is assigned. That is, under segment to which an MP4 file is assigned, one sub-segment or a plurality of sub-segments including an ssix designating the tile corresponding to the segment from the MP4 file may be present.
  • AU access units
  • Dedicated flag information may be separately defined to clarify that the tile image is expressed in accordance with the sub-segment (that the MP4 file is extended).
  • the partial image information is not limited to the above-described examples, but any partial image information can be used.
  • information other than the information a view type ((1) viewtype), the information ((2) the width and the height of an entire image) regarding the size of the entire image, the information ((3) the x coordinate and the y coordinate of the image indicated by the element) indicating the position of a partial image in the entire image, and the group identification information ((4) TilegroupID) identifying a group to which the partial image belongs and which is a group of the partial images displayable as one image) indicated in the above-described example may be defined.
  • flag information other than the above-described flag information may be defined as partial information.
  • FIG. 11 is a diagram illustrating a delivery system which is a kind of the system to which the present technology is applied.
  • a delivery system 100 illustrated in FIG. 11 is a system that can adaptively deliver data of a partial image which is a part of an entire image.
  • the delivery system 100 includes a delivery data generation device 101, a delivery server 102, and a terminal device 103.
  • the delivery data generation device 101 generates, for example, files of content such as an image and audio delivered by the delivery server 102 and MPD files of the files and supplies the content files and the MPD files to the delivery server 102.
  • the delivery server 102 publishes the content files and the MPD files supplied from the delivery data generation device 101 on a network 104 and performs adaptive delivery of partial images.
  • the terminal device 103 accesses the delivery server 102 via the network 104 and acquires the MPD file of desired content published by the delivery server 102.
  • the terminal device 103 accesses the delivery server 102 via the network 104 according to the MPD file, adaptively selects a proper content file corresponding to the MPD file, and acquires the content file by an HTTP protocol.
  • the terminal device 103 reproduces the acquired content file.
  • FIG. 12 is a block diagram illustrating an example of a main configuration of the delivery data generation device 101.
  • the delivery data generation device 101 includes a screen division processing unit 121, an image encoding unit 122, a file generation unit 123, a tile type image information generation unit 124, an MPD generation unit 125, and a server upload processing unit 126.
  • the screen division processing unit 121 edits (processes) image data supplied from the outside to divide the entire image of the image data for each tile and generates the image data of the tile images.
  • the screen division processing unit 121 supplies the image data of each tile generated in this way to the image encoding unit 122.
  • the screen division processing unit 121 supplies, for example, information regarding the tile structure such as the size, the position, or the like of each tile to the tile type image information generation unit 124.
  • the image encoding unit 122 encodes the image data of each tile supplied from the screen division processing unit 121 to generate a bit stream.
  • the image encoding unit 122 includes a plurality of encoding processing units such as an encoding processing unit 131, an encoding processing unit 132, an encoding processing unit 133, etc. and can encode the image data of each tile of the supplied tiles in parallel.
  • the image encoding unit 122 can generate any number of bit streams from one piece of image data.
  • the image encoding unit 122 can also collect the plurality of pieces of image data into one bit stream.
  • the image encoding unit 122 can also generate the bit stream for each tile image and can also collect the plurality of tile images into one bit stream.
  • the image encoding unit 122 supplies the generated bit stream to the file generation unit 123.
  • the encoding method of the image encoding unit 122 is arbitrary.
  • the encoding processing units perform the same encoding method or may perform mutually different encoding methods.
  • the file generation unit 123 files the supplied bit stream in accordance with a predetermined format such as an MP4 file format to generate the content file. As described with reference to FIGS. 7 and 8 and the like, the file generation unit 123 can file one bit stream into any number of files. The file generation unit 123 can also collect the plurality of bit streams into one file. The file generation unit 123 supplies the generated content file to the MPD generation unit 125. The file generation unit 123 supplies information regarding the filing such as how to file each bit stream to the tile type image information generation unit 124.
  • a predetermined format such as an MP4 file format
  • the file generation unit 123 can perform the filing in accordance with any format.
  • the tile type image information generation unit 124 generates tile type image information (that is, partial image information) to match the MPD to the tile structure based on the information regarding the tile structure supplied from the screen division processing unit 121, the information regarding the filing supplied from the file generation unit 123, or the like.
  • the tile type image information (the partial image information) is information including the content described in the first embodiment and is generated as, for example, the values of the viewpoint or the flag information.
  • the tile type image information generation unit 124 supplies the generated tile type image information to the MPD generation unit 125.
  • the MPD generation unit 125 generates the MPD regarding the content file supplied from the file generation unit 123, extends the MPD using the tile type image information (the partial image information) supplied from the tile type image information generation unit 124, and generates the tile type MPD corresponding to the tile structure.
  • the MPD generation unit 125 supplies the file (MPD file) of the generated tile type MPD and the content file to the server upload processing unit 126.
  • the server upload processing unit 126 uploads the supplied MPD file or content file to the delivery server 102 ( FIG. 11 ) to publish the MPD file or the content file.
  • the delivery data generation device 101 generates the tile type MPD corresponding to the tile structure in this way, and thus the delivery server 102 can adaptively deliver (supply) the data of the partial images which are based on the DASH standard. That is, the delivery system 100 can realize the adaptive supply of the data of the partial images.
  • the above-described processing units may be configured as independent devices.
  • the tile type image information generation unit 124 or the MPD generation unit 125 may be configured as independent devices. That is, the configuration related to the generation of the content file is not requisite and only the generation of the tile type image information (the partial image information) may be performed.
  • the tile type image information (the partial image information) may also be generated based on information supplied from another device.
  • the generated tile type image information (the partial image information) may be supplied to another device.
  • the tile type MPD corresponding to the content file generated in another device may be generated using the tile type image information (the partial image information) supplied from the other device.
  • the generated MPD file may also be supplied to another device.
  • the tile type image information generation unit 124 and the MPD generation unit 125 may be integrated.
  • the tile type MPD generation unit 141 may be configured as one independent device.
  • FIG. 13 is a block diagram illustrating an example of a main configuration of the terminal device 103.
  • the terminal device 103 includes an MPD acquisition unit 151, a parsing processing unit 152, a tile image selection unit 153, a file acquisition unit 154, an image decoding unit 155, a tile image combination unit 156, and a display unit 157.
  • the MPD acquisition unit 151 acquires the MPD file of desired content from the delivery server 102 via the network 104 based on, for example, an instruction of a control program or a user of the terminal device 103.
  • the MPD acquisition unit 151 supplies the acquired MPD file to the parsing processing unit 152.
  • the parsing processing unit 152 analyzes (parses) the supplied MPD file.
  • the parsing processing unit 152 also analyzes (parses) the tile type image information (the partial image information) included in the MPD file.
  • the parsing processing unit 152 supplies an analysis result to the tile image selection unit 153.
  • the tile image selection unit 153 acquires tile image designation information which is supplied from the outside and used to designate a partial image (an image formed from a single tile image or a plurality of tile images) to be reproduced, the tile image selection unit 153 selects the tile image designated by the tile image designation information among the tile images included in the tile type image information based on the analysis result of the MPD file (the tile type image information) in the parsing processing unit 152.
  • the tile image selection unit 153 supplies the URL (delivery address) of the file of the selected tile image to the file acquisition unit 154.
  • the file acquisition unit 154 accesses the delivery address of the delivery server 102 supplied from the tile image selection unit 153 via the network 104 to acquire the desired content file.
  • the file acquisition unit 154 acquires the bit stream from the acquired content file and supplies the bit stream to the image decoding unit 155.
  • the image decoding unit 155 decodes the bit stream supplied from the file acquisition unit 154 to obtain the image data of the tile image. As illustrated in FIG. 13 , the image decoding unit 155 includes a plurality of decoding processing units such as a decoding processing unit 161, a decoding processing unit 162, a decoding processing unit 163, etc. and can decode the plurality of supplied bit streams in parallel. The image decoding unit 155 supplies the image data of the tile image obtained by decoding the bit stream to the tile image combination unit 156.
  • the image decoding unit 155 can perform the decoding in accordance with any decoding method that corresponds to the encoding method of the image encoding unit 122. Accordingly, each decoding processing unit may also perform the decoding in accordance with the same method or may also perform the decoding in accordance with mutually different methods.
  • the tile image combination unit 156 When the image data of the plurality of tile images belonging to the same group is supplied from the image decoding unit 155, the tile image combination unit 156 combines (unifies) the tile images and combines the image data so that one image is formed. That is, the tile image combination unit 156 generates the image data of an image for display. When the images are not combined (for example, when a single tile image is displayed or when a plurality of tile images are already formed as one bit stream at the time of delivery), the supplied images are considered to be images for display. The tile image combination unit 156 supplies the image data for display to the display unit 157.
  • the display unit 157 reproduces the supplied image data for display and displays the image for display on a display.
  • the terminal device 103 can correctly analyze the tile type MPD corresponding to the tile structure and can gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, the data of the partial image can be correctly acquired from the delivery server 102 and can be reproduced. That is, the delivery system 100 can realize the adaptive supply of the data of the partial image.
  • the terminal device 103 can display the image with a different image size from the image size at the time of the delivery. That is, the terminal device 103 can control the data delivery more adaptively according to a load situation or the like of the delivery server 102 or the network 104. For example, since whether to acquire the entire image or acquire the tile image can be controlled, the number of acquired content files can be appropriately increased or decreased without changing the size of the display image. Therefore, it is possible to appropriately perform control such as distribution or concentration of a delivery source or a path.
  • the above-described processing units may be configured as independent devices.
  • the parsing processing unit 152 or the tile image selection unit 153 may be configured as independent devices. That is, the configuration related to the acquisition or reproduction (decoding) of the content file is not requisite and only the analysis of the tile type MPD or the tile type image information (the partial image information) may be performed. For example, the MPD file acquired from the delivery server 102 by another device may be analyzed. For example, the analysis result may be supplied to another device.
  • the parsing processing unit 152 and the tile image selection unit 153 may be integrated.
  • the tile type image information processing unit 171 may be configured as one independent device.
  • the image data for display output from the tile image combination unit 156 may be supplied to another device or may be recorded on a recording medium. At this time, the image data may be encoded.
  • the screen division processing unit 121 of the delivery data generation device 101 edits (processes) the image data so that a screen (that is, an entire image) is divided into tiles in step S101.
  • step S102 the image encoding unit 122 encodes the image data of each tile image generated in step S101.
  • step S103 the file generation unit 123 files the encoded data (bit stream) generated in step S102 (that is, generates the content file).
  • step S104 the tile type MPD generation unit 141 generates the file of the tile type MPD according to the processing result such as the division of step S101 or the filing of step S103.
  • step S105 the server upload processing unit 126 uploads the MPD file and the content file generated in this way to the delivery server 102.
  • step S105 ends, the delivery data generation process ends.
  • the tile type image information generation unit 124 sets the schema (for example, urn:mpeg:DASH:tile:2013) of the tile type image information, for example, in the element of the viewpoint in step S121.
  • step S122 the tile type image information generation unit 124 sets a view type (viewtype) in the value of the schema as the tile type image information.
  • step S123 the tile type image information generation unit 124 sets the size (width and height) of the entire image in the value of the schema as the tile type image information.
  • step S124 the tile type image information generation unit 124 sets the position (x and y) of the tile image in the value of the schema as the tile type image information.
  • step S125 the tile type image information generation unit 124 sets the group identification information (TilegroupID) in the value of the schema as the tile type image information.
  • the tile type image information generation unit 124 sets the segment information (@SegmentInSubRepresentation), as necessary, as the tile type image information. For example, when the MPD is extended so that the segment is present under the sub-representation (Sub-Representation), the tile type image information generation unit 124 generates the segment information indicating that the information regarding the bit stream is present under the sub-representation (Sub-Representation).
  • the tile type image information generation unit 124 sets the multi-segment information (@multiSegmentInRepresentation), as necessary, as the tile type image information. For example, when the tile images are assigned to the segments and the MPD is extended so that the plurality of segments of the same time are present in one representation, the tile type image information generation unit 124 generates the multi-segment information indicating that the plurality of segments to which the tile images of the same time are assigned are present under the representation.
  • step S127 When the process of step S127 ends, the tile type MPD file generation process ends and the process returns to FIG. 14 .
  • the delivery data generation device 101 can allow the delivery server 102 to adaptively deliver (supply) the data of the partial images which are based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial images.
  • the MPD acquisition unit 151 acquires the MPD file corresponding to the desired content from the delivery server 102 in step S141.
  • step S142 the parsing processing unit 152 analyzes (parses) the MPD file acquired in step S141.
  • step S143 the parsing processing unit 152 analyzes (parses) the tile type image information (the partial image information) included in the MPD file.
  • step S144 the tile image selection unit 153 selects the tile images designated by the tile image designation information supplied from the outside among the tile images indicated in the tile type image information.
  • step S145 the file acquisition unit 154 acquires the file of the tile images selected in step S144.
  • step S146 the image decoding unit 155 decodes the bit stream of the tile images included in the file acquired in step S145.
  • step S147 the tile image combination unit 156 edits (processes) the image data of the tile images obtained by decoding the bit stream in step S146 so that the tile images are combined, as necessary.
  • step S148 the display unit 157 displays the image for display such as the combined image of the tile images obtained in step S147 on a display.
  • step S148 ends, the delivery data reproduction process ends.
  • the terminal device 103 can correctly analyze the tile type MPD corresponding to the tile structure and can gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, the data of the partial image can be correctly acquired from the delivery server 102 and can be reproduced. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the above-described adaptive delivery (supply) of the partial images can be used together with the delivery (supply) of the entire image. That is, for example, the server may adaptively deliver the entire image or any partial image according to a request or the like from the terminal.
  • FIG. 17 A main configuration example of the extended MPD is illustrated in FIG. 17 .
  • the encoded data of each tile of the image data to be delivered is configured in one bit stream (MP4 file) (bitstream3.mp4 to bitstream6.mp4).
  • MP4 file bit stream3.mp4 to bitstream6.mp4
  • the adaptation set AdaptationSet
  • MP4 file bit stream of each tile image
  • a viewpoint Viewpoint which is a description for a tile is defined in the adaptation set and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in the segment (Segment) under the representation (Representation) under the adaptation set.
  • the partial image information regarding the plurality of partial images belonging to the same group is stored in the mutually different adaptation sets, and the bit streams of the plurality of partial images are assigned to the mutually different adaptation sets.
  • the adaptation sets of the tile images arranged with the adaptation set of the entire images can be provided, and thus the delivery of the entire images and the adaptive delivery of the partial images can be managed in a unified manner.
  • images with different displayed content such as R and L images of a stereo image are defined in mutually different adaptation sets in many cases.
  • the tile images are defined in the mutually different adaptation sets in imitation of such a way. Therefore, even in the delivery control of the partial images, it is possible to realize a natural way close to the related art. Therefore, development can be facilitated.
  • the entire images with different resolutions are defined in the same adaptation set, but these entire images may be defined in mutually different adaptation sets.
  • FIG. 18 A specific description example of the MPD of this example is illustrated in FIG. 18 .
  • FIG. 19 Another configuration example of the extended MPD is illustrated in FIG. 19 .
  • all of the encoded data of the tiles of the image data to be delivered is configured in one bit stream (MP4) (bitstream3.mp4 to bitstream6.mp4).
  • MP4 bit stream3.mp4 to bitstream6.mp4
  • the adaptation set (AdaptationSet) is extended and the bit stream (MP4 file) of each tile image is defined in a different adaptation set from the adaptation set in which the entire image is defined.
  • the bit streams (MP4 files) of the tile images are defined in the same adaptation set.
  • a viewpoint (Viewpoint) which is a description for a tile is defined in a representation (Representation) under the adaptation set and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in the segment (Segment) under the representation.
  • the partial image information regarding the plurality of partial images belonging to the same group is stored in the mutually different representations belonging to one adaptation set of metadata, and the bit streams of the plurality of partial images are assigned to the mutually different representations.
  • the adaptation sets of the tile images arranged with the adaptation set of the entire images can be provided, and thus the delivery of the entire images and the adaptive delivery of the partial images can be managed in a unified manner.
  • bitstream1.mp4 and bitstream2.mp4 are defined in the same adaptation set, but these entire images may be defined in mutually different adaptation sets.
  • FIG. 20 Another configuration example of the extended MPD is illustrated in FIG. 20 .
  • the encoded data of the tiles of the image data to be delivered is collected in one bit stream.
  • the bit stream is filed as an MP4 file for each tile (bitstream7_Tile1.mp4 to bitstream7_Tile4.mp4).
  • bitstream7_base.mp4 As described with reference to FIG. 7 , a base track in which the header information or the like of the tiles is collected is filed separately from the bit streams of the tiles (bitstream7_base.mp4).
  • the adaptation set (AdaptationSet) is extended and the bit streams (MP4 files) (bitstream7_Tile1.mp4 to bitstream7_Tile4.mp4) of the tile images are defined in mutually different adaptation sets.
  • a viewpoint (Viewpoint) which is a description for a tile is defined in the adaptation set and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in the segment (Segment) under the representation (Representation) under the adaptation set.
  • the partial image information regarding the plurality of partial images belonging to the same group is stored in the mutually different adaptation sets of the metadata, and the plurality of files for which one bit stream including the plurality of partial images is divided for each partial image are assigned to the mutually different adaptation sets.
  • FIG. 21 A specific description example of the MPD of this example is illustrated in FIG. 21 .
  • FIG. 22 Another configuration example of the extended MPD is illustrated in FIG. 22 .
  • the extension method is the same as that of ⁇ Example 3>.
  • the tiles are set such that the sizes are unequal, as illustrated in FIG. 22 (corresponding to FIG. 9B ).
  • an image with a desired size can be obtained by adding tiles, as shown with quadrangles.
  • each piece of encoded data of each tile of the image data to be delivered is configured in one bit stream (MP4 file) (tile1.mp4 to tile5.mp4). Therefore, no base track is present as in ⁇ Example 3>.
  • partial image information regarding control information included in the bit stream is further generated, the partial image information regarding the control information is stored in a different adaptation set from the partial image information regarding each partial image, and a file of the control information is assigned to the adaptation set.
  • each piece of encoded data of each tile of the image data to be delivered is configured in one bit stream (MP4 file) (bitstream3.mp4 to bitstream6.mp4).
  • MP4 file bit stream3.mp4 to bitstream6.mp4
  • the representation (Representation) is extended and the bit streams (MP4 files) of the tile images are defined in mutually different representations under the same adaptation set as the bit streams (MP4 files) (bitstream1.mp4 and bitstream2.mp4) of the entire images.
  • a viewpoint (Viewpoint) which is a description for a tile is defined in the representation and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in the segment (Segment) under the representation.
  • the partial image information regarding the plurality of partial images belonging to the same group is stored in the mutually different representations belonging to the same adaptation set of the entire images of the metadata and the bit streams of the plurality of partial images are assigned to the mutually different representations.
  • the representations of the tile images arranged with the representations of the entire images can be provided, and thus the delivery of the entire images and the adaptive delivery of the partial images can be managed in a unified manner.
  • FIG. 24 A specific description example of the MPD of this example is illustrated in FIG. 24 .
  • FIG. 25 Another configuration example of the extended MPD is illustrated in FIG. 25 .
  • the encoded data of the tiles of the image data to be delivered is collected in one bit stream.
  • the bit stream is filed as an MP4 file for each tile (bitstream7_Tile1.mp4 to bitstream7_Tile4.mp4).
  • bitstream7_base.mp4 As described with reference to FIG. 7 , a base track in which the header information or the like of the tiles is collected is filed separately from the bit streams of the tiles (bitstream7_base.mp4).
  • bit streams MP4 files
  • bitstream7_Tile1.mp4 to bitstreams7_Tile4.mp4 bitstream7_Tile1.mp4 to bitstreams7_Tile4.mp4
  • a viewpoint (Viewpoint) which is a description for a tile is defined in the representation and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in the segment (Segment) under the representation.
  • partial image information regarding control information included in one bit stream including the plurality of partial images belonging to the same group is further generated, the partial image information regarding the plurality of partial images is stored in the mutually different representations belonging to one adaptation set of the metadata, the plurality of files for which the bit stream is divided for each partial image are assigned to the mutually different representations, the partial image information regarding the control information is stored in the different representation from the partial image information regarding each partial image, and the file of the control information is assigned to the representation.
  • FIG. 26 A specific description example of the MPD of this example is illustrated in FIG. 26 .
  • each piece of encoded data of each tile of the image data to be delivered is configured in one bit stream (MP4 file) (bitstream3.mp4 to bitstream6.mp4).
  • MP4 file bit stream3.mp4 to bitstream6.mp4
  • the sub-representation (Sub-Representation) is extended and the bit streams (MP4 files) of the tile images are defined in mutually different sub-representations under the same adaptation set as the bit streams (MP4 files) (bitstream1.mp4 and bitstream2.mp4) of the entire images and under different representations from the bit streams (MP4 files) of the entire images.
  • a viewpoint (Viewpoint) which is a description for a tile is defined in the sub-representation and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in the segment (Segment) under the sub-representation.
  • the partial image information regarding the plurality of partial images belonging to the same group is stored in mutually different sub-representations belonging to one representation belonging to one adaptation set of the metadata, and the bit streams of the plurality of partial images are assigned to the mutually different sub-representations.
  • the representations of the tile images arranged with the representations of the entire images can be provided, and thus the delivery of the entire images and the adaptive delivery of the partial images can be managed in a unified manner.
  • FIG. 28 A specific description example of the MPD of this example is illustrated in FIG. 28 .
  • FIG. 29 Another configuration example of the extended MPD is illustrated in FIG. 29 .
  • the encoded data of the tiles of the image data to be delivered is collected in one bit stream.
  • the bit stream is filed as an MP4 file for each tile (bitstream7_Tile1.mp4 to bitstream7_Tile4.mp4).
  • bitstream7_base.mp4 As described with reference to FIG. 7 , a base track in which the header information or the like of the tiles is collected is filed separately from the bit streams of the tiles (bitstream7_base.mp4).
  • the sub-representation (Sub-Representation) is extended, and the bit streams (MP4 files) (bitstream7_Tile1.mp4 to bitstream7_Tile4.mp4) of the tile images are defined in the mutually different sub-representations under the same representation (Representation) under the same adaptation set (AdaptationSet).
  • a viewpoint (Viewpoint) which is a description for a tile is defined in the sub-representation and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in the segment (Segment) under the sub-representation.
  • the viewpoint of a base track is defined in the representation above the sub-representation and the URL of the bit stream (MP4 file) (bitstream7_base.mp4) of the base track is set in the segment under the representation.
  • the partial image information regarding the control information included in one bit stream including the plurality of partial images belonging to the same group and the segment information indicating that the information regarding the bit stream is present under the sub-representation (Sub-Representation) are further generated, the segment information and the partial image information of the control information are stored in one representation belonging to one adaptation set of the metadata, a file of the control information is assigned to the representation, the partial image information regarding the plurality of partial images is stored in the mutually different sub-representations belonging to the representation, and the plurality of files in which the bit stream is divided for each partial image are assigned to the mutually different sub-representations.
  • FIG. 30 A specific description example of the MPD of this example is illustrated in FIG. 30 .
  • FIG. 31 Another configuration example of the extended MPD is illustrated in FIG. 31 .
  • the encoded data of the tiles of the image data to be delivered is collected in one bit stream.
  • the bit stream is filed as one MP4 file as in the example of FIG. 8 (bitstream7.mp4).
  • the sub-representation is extended and the bit stream (MP4 file) (bitstream7.mp4) of the tile image is defined under the representation (Representation) under the adaptation set (AdaptationSet).
  • the viewpoint of each tile is set and the location of the data of each tile in (bitstream7.mp4) is designated with a byte in the segment under the representation.
  • the segment information indicating that the information regarding the bit stream is present under the sub-representation and the partial image information of the control information included in one bit stream including the plurality of partial images belonging to the same group are further generated, the partial image information of the control information and the segment information are stored in one representation belonging to one adaptation set of the metadata, the bit stream is assigned to the representation, the partial image information regarding the plurality of partial images is stored in mutually different sub-representations belonging to the representation, and the information indicating the location of the data of the partial images in the bit stream is assigned to the mutually different sub-representations.
  • FIG. 32 Another configuration example of the extended MPD is illustrated in FIG. 32 .
  • the encoded data of the tiles of the image data to be delivered is configured in one bit stream (MP4 file) (bitstream3.mp4 to bitstream6.mp4).
  • MP4 file bitstream3.mp4 to bitstream6.mp4
  • the segments (Segment) are extended and the plurality of segments (Segment) are defined under the representations under the adaptation set.
  • bit streams (MP4 files) of the tile images are defined in mutually different segments under the different representation from the bit streams (MP4 files) of the entire images and under the same adaptation set as the bit streams (MP4 files) (bitstream1.mp4 and bitstream2.mp4) of the entire images.
  • the viewpoint (Viewpoint) which is a description for a tile is defined in the segment (Segment) and the URL of the bit stream (MP4 file) of the tile corresponding to the viewpoint is set in each segment (Segment).
  • the multi-segment information indicating that the plurality of pieces of information regarding the bit streams of the same time are present under the representation is further generated, the multi-segment information is stored in one representation belonging to one adaptation set of the metadata, the partial image information regarding the plurality of partial images belonging to the same group is stored in the mutually different segments belonging to the representation, and the bit streams of the plurality of partial images are assigned to the mutually different segments.
  • the representation of the tile image arranged with the representations of the entire images (bitstream1.mp4 and bitstream2.mp4) can be provided, and thus the delivery of the entire images and the adaptive delivery of the partial images can be managed in a unified manner.
  • FIG. 33 A specific description example of the MPD of this example is illustrated in FIG. 33 .
  • FIG. 34 Another configuration example of the extended MPD is illustrated in FIG. 34 .
  • the encoded data of the tiles of the image data to be delivered is collectively configured in one bit stream (MP4 file) (bitstream7.mp4).
  • bitstream7.mp4 bit stream 7.mp4
  • the sub-segments (Sub-Segment) are extended and the plurality of sub-segments (Sub-Segment) are defined under the segment under the representation under the adaptation set.
  • the viewpoint of the combined image of all the tile images is defined and the data of each tile image is shown in accordance with the ssix in the sub-segment under the segment.
  • the segment information indicating that the information regarding the bit stream is not present under the sub-representation and the partial image information regarding one bit stream including the plurality of partial images belonging to the same group are further generated, the segment information is stored in one representation belonging to one adaptation set of the metadata, the partial image information is stored in one segment belonging to the representation, the bit stream is assigned to the segment, and the information indicating the location of the data of each partial image in the bit stream is assigned to the mutually different sub-segments belonging to the segment.
  • the MPD extension method is arbitrary and methods other than the above-described methods may be used.
  • a mobile device 221 is assumed to acquire a partial image 212 with a 1920 ⁇ 1080 size formed by four tile images 211 of an entire image 210 from a server 220 using a 3G line and reproduce the partial image 212.
  • TV television signal receiver
  • information regarding a reproduction environment (network bandwidth), a reproduction ability (resolution and a decoder ability), or the like of the TV 222 of a switching destination is acquired from the TV 222.
  • the method of acquiring the information is arbitrary.
  • the mobile device 221 may acquire the information by performing direct communication with the TV 222.
  • the mobile device 221 may acquire the information via the server 220.
  • the mobile device 221 selects optimum tile images for the TV 222 of the switching destination from the information regarding the MPD. In the case of the example of FIG. 35 , a partial image 213 formed by the 5 ⁇ 5 tile images 211 is selected.
  • the TV222 of the switching destination acquires a bit stream of the tile images selected in this way and reproduces the bit stream.
  • the above-described selection or acquisition of the optimum stream may be performed by the mobile device 221 to be pushed to the TV 222 of the switching destination, or such selection or acquisition may be performed by the TV 222.
  • a mobile device 221 is assumed to reproduce a part of an entire image (state 221A of the mobile device)
  • a user of the mobile device 221 shifts the region with his or her finger on a touch panel to move an image (as indicated by an arrow 233) so that a direction desired to be reproduced is displayed on a screen. For example, when the user desires to display an upper right region (partial image 232) of the currently displayed region (partial image 231) as indicated by an arrow 234, the user traces his or her finger in the lower left direction from the upper right of the screen.
  • the mobile device 221 calculates a movement destination of the image based on the input finger motion or the like and selects a stream of tile images to be displayed from the information regarding the MPD.
  • the mobile device 221 acquires the selected bit stream from the server 220 and performs the reproduction and display (state 221B of the mobile device).
  • the selection of the tile images may be performed by an application executed in the mobile device 221, or the direction of the movement destination of the image acquired from the finger motion may be sent to the server 220 and the images may be selected by the server 220.
  • a display region may be switched abruptly or the display region may be gradually shifted and switched to perform smooth switching.
  • FIG. 37 is a diagram illustrating another example of an application using the tile image delivery.
  • a menu is generated by encoding images of the plurality of channels as one image (HD).
  • a combined image combined so that such different images are arranged is defined as a mosaic video.
  • displays of the mobile device are small, and thus can display only images with small image sizes (low resolutions), such as images in HD or lower. That is, only images with 1920 ⁇ 1080 can be delivered to such mobile devices.
  • the image is configured to be switched to another HD image in which images of fewer programs are displayed.
  • the user can easily display only a desired program by repeating such zooming (image switching).
  • the delivered file for example, by tapping an upper left portion of the mosaic video
  • the delivered file bit stream
  • an upper left tile image with an image size of 1920 ⁇ 1080 in the mosaic video is displayed, as illustrated on the right in FIG. 37 .
  • 1 program (A) is displayed. That is, the number of displayed programs is further reduced and the display region per program is spread.
  • the switching of the delivered data described above is realized by extending the DASH standard, as described above. That is, for example, the structure of the mosaic video forming one screen is defined in the MPD so that the mosaic video can be used as a user interface (UI/UX).
  • UI/UX user interface
  • a relation between a screen structure and positional information selected by the user is obtained and a stream to be subsequently switched is selected.
  • Coordinates touched on the screen by the user and coordinates on the mosaic video are obtained and a mosaic video of a subsequent Layer (extension) in which the coordinate position is included is obtained to be switched.
  • New schemeIdUri (urn:mpeg:DASH:mosaic:2013) is defined using an element (Viewpoint element) of the viewpoint.
  • Viewpoint element the following information is defined in content (partial image information) of the value of the new schemeIdUri.
  • a viewpoint is defined as follows. Then, the MPD is extended using such partial image information.
  • the element of the viewpoint is an element corresponding to the mosaic video (urn:mpeg:DASH:mosaic:2013).
  • the element of the viewpoint for a tile As illustrated in FIG. 10A . That is, the element of the viewpoint for mosaic video described above is positioned as an extension element of the elements of the viewpoint for a tile.
  • the positional information regarding the image is handled optionally. Writing may not be performed. When the writing is performed, it is necessary to write all of the images. Further, information other than the above-described information may be defined as a value.
  • FIG. 39 is a diagram illustrating an example of the configuration of an MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bit streams of tiles are collected and considered as one file and the data of the tiles is further managed as one track.
  • Parameter sets such as a video parameter set (VPS), a sequence parameter set (SPS), and a picture parameter set (PPS) are managed for a sample by a sample entry (Sample Entry).
  • Each tile is defined by a tile region group entry (TileRegionGroupEntry) in a sample group description (Sample Group Description). As illustrated in FIG.
  • the values of 5 parameters, GroupID which is identification information identifying the tile, H_offset indicating the position (offset) of the tile in the horizontal direction, V_offset indicating the position (offset) of the tile in the vertical direction, H_width indicating the size (width) of the tile in the horizontal direction, and V_height indicating the size (height) of the tile in the vertical direction, are defined as the tile region group entry (TileRegionGroupEntry).
  • tile region group entry (TileRegionGroupEntry) of tile 1 Tile 1
  • tile region group entry (TileRegionGroupEntry) of tile 4 (Tile 4)
  • GroupID 4
  • H_offset 960
  • V_offset 540
  • H_width 960
  • an entire image (1920 ⁇ 1080) is formed by 4 tiles (960 ⁇ 540), 2 vertical tiles ⁇ 2 horizontal tiles.
  • the file name of this MP4 file is assumed to be bitstream.mp4.
  • an MPD of an MPEG-DASH standard of the related art is extended, as in FIG. 40 .
  • AdaptationSet an entire image and each tile are defined in mutually different adaptation sets.
  • AdaptationSet In the topmost adaptation set in the drawing defined in the entire image, as illustrated in FIG. 40 , a supplemental property (SupplementalProperty) is defined as a description for a tile instead of the viewpoint (Viewpoint) described in the first embodiment.
  • the supplemental property is an element of the related art.
  • the supplemental property is defined in the adaptation set in which the bit stream decodable even in a decoder of the related art is defined.
  • the supplemental property is defined in the adaptation set defined in regard to an entire image which can be decoded even in the decoder of the related art.
  • the supplemental property is extended and defined as follows.
  • schema for storing image information is defined.
  • “urn:mpeg:dash:srd:2013” is defined as the schema.
  • source id is identification information indicating whether a content source of the adaptation set is the same as a content source of another adaptation set. In the case of FIG. 40 , since the content source of each adaptation set is common (bitstream.mp4), "1" is defined as “source id.”
  • x, y is information indicating the position (x and y coordinates of the upper left) of the tile defined by the adaptation set.
  • "0, 0" is defined as "x, y.”
  • width, height is information indicating the size (the width and the height) of the tile defined by the adaptation set. In the case of FIG. 40 , since the adaptation set defines the entire image, "1920, 1080" is defined as "width, height.”
  • width_all, height_all is information indicating the size (the width and the height) of the entire image. In the case of FIG. 40 , “1920, 1080" is defined as “width_all, height_all.”
  • stream type is identification information indicating whether the adaptation set defines an entire bit stream or a part of the bit stream. In the case of FIG. 40 , "0" indicating that the adaptation set defines the entire bit stream is defined as "stream type.”
  • the supplemental property is defined as follows, for example.
  • an essential property is defined instead of the viewpoint (Viewpoint) described as the description for a tile in the first embodiment.
  • the essential property is an element of the related art. By using the element of the related art, it is possible to suppress a reduction in affinity to an MPD of the related art (it is possible to suppress an increase in a description in which a decoder of the related art is not analyzable).
  • the essential property is defined in the adaptation set in which the bit stream undecodable in a decoder of the related art is defined. For example, in the case of FIG. 40 , the essential property is defined in the adaptation set defined in regard to each tile image which cannot be decoded in the decoder of the related art.
  • the essential property is extended as follows and is defined. That is, the essential property is defined as in the supplemental property (SupplementalProperty).
  • the essential property is further extended as information indicating the part of the bit stream.
  • the adaptation set corresponding to the tile corresponds to the part of the bit stream.
  • the essential property in regard to the part of the bit stream is further extended and defined as follows, for example.
  • a schema for storing information indicating a part of the file is defined.
  • "urn:mpeg:dash:hevc:2013” is defined as the schema.
  • Sub-Sample-Type is information indicating by which information a part of the bit stream to which the adaptation set corresponds is configured. For example, when the value of the information is "0,” it is indicated that the part of the bit stream is configured by Nal based. For example, when the value of the information is "1,” it is indicated that the part of the bit stream is configured by Decoding-unit-based. Further, for example, when the value of the information is "2,” it is indicated that the part of the bit stream is configured by Tile-based. For example, when the value of the information is "3,” it is indicated that the part of the bit stream is configured by CTU-row-based.
  • Sub-Sample-is-extracted is information indicating whether a part of the bit stream to which the adaptation set corresponds is divided (extracted) into tracks. For example, when the value of the information is "0,” it is indicated that the part of the bit stream is not divided (false). When the value of the information is "1,” it is indicated that the part of the bit stream is divided into the tracks (true). In the case of the second adaptation set from the top of the drawing in the example of FIG. 40 , the number of tracks is 1 (not divided), as described with reference to FIG. 39 , and "0" is defined as “Sub-Sample-is-extracted.”
  • ID is identification information.
  • "2" is defined as "Sub-Sample-Type,” that is, in the case of Tile, GroupID of The tile region group entry (TileRegionGroupEntry) of the MP4 file is defined.
  • the part of the bit stream is data of tile 1 (Tile 1), and thus "1" is defined as "ID.”
  • the essential property is defined as follows, for example.
  • the essential property is defined as follows, for example.
  • the essential property is defined as follows, for example.
  • the essential property is defined as follows, for example.
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • FIG. 41 is a diagram illustrating an example of the configuration of an MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bit stream7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the bit streams of the tiles are collected and considered as one file and the data of the tiles is further managed as one track.
  • track 1 manages data of an entire image (1920 ⁇ 1080), and thus the entire image can be reproduced by reproducing track 1 (Track 1).
  • track 2 manages data of tile 1 (Tile 1), and thus the image of tile 1 (Tile 1) can be reproduced by reproducing track 2 (Track 2).
  • track 3 manages data of tile 2 (Tile 2), and thus the image of tile 2 (Tile 2) can be reproduced by reproducing track 3 (Track 3).
  • track 4 manages data of tile 3 (Tile 3), and thus the image of tile 3 (Tile 3) can be reproduced by reproducing track 4 (Track 4).
  • track 5 (Track 5) manages data of tile 4 (Tile 4), and thus the image of tile 4 (Tile 4) can be reproduced by reproducing track 5 (Track 5).
  • the parameter sets such as the video parameter set (VPS), the sequence parameter set (SPS), and the picture parameter set (PPS), an entity (also referred to as actual data) such as supplemental enhancement information (SEI), and reference information (also referred to as extractors) of the bit streams of the tiles are stored.
  • VPS video parameter set
  • SPS sequence parameter set
  • PPS picture parameter set
  • SEI supplemental enhancement information
  • SEI reference information
  • the extractor (Track 2) is information (reference information) used to refer to the actual data (Slice 1) of tile 1 (Tile 1) stored in track 2 (Track 2). For example, the extractor indicates the storage location of the actual data (Slice 1).
  • an extractor (Track 3) is reference information regarding the actual data (Slice 2) of tile 2 (Tile 2) stored in track 3 (Track 3)
  • an extractor (track 4) is reference information regarding the actual data (Slice 3) of tile 3 (Tile 3) stored in track 4 (Track 4)
  • an extractor (Track 5) is reference information regarding the actual data (Slice 4) of tile 4 (Tile 4) stored in track 5 (Track 5).
  • the parameter sets, the extractor, and the like are managed for each sample by the sample entry (Sample Entry).
  • the extractor (Track 1) such as the parameter set, the actual data (Slice 1) of tile 1 (Tile 1), and the like are stored.
  • the extractor (Track 1) of the parameter set is reference information of the actual data (the VPS, the SPS, the SEI, the PPS, and the like) such as the parameter sets stored in track 1 (Track 1).
  • the extractor indicates the storage location of the actual data.
  • track 3 the extractor (Track 1) such as the parameter sets, the actual data (Slice 2) of tile 2 (Tile 2), and the like are stored.
  • track 4 the extractor (Track 1) such as the parameter sets, the actual data (Slice 3) of tile 3 (Tile 3), and the like are stored.
  • track 5 the extractor (Track 1) such as the parameter sets, the actual data (Slice 4) of tile 4 (Tile 4), and the like are stored.
  • the tile region group entry (TileRegionGroupEntry) is defined in each of track 2 (Track 2) to track 5 (Track 5). That is, one tile is defined in each track.
  • the extractor indicating a reference relation is defined for each sample. That is, the reference relation can be set for each sample. Accordingly, by using the extractor, it is possible to construct a freer reference relation, for example, a change in the reference relation in the bit stream. More specifically, for example, it is possible to easily realize a change or the like in the size or the shape of the tile in the bit stream.
  • the file name of this MP4 file is assumed to be bitstream.mp4.
  • the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is extended.
  • the example is illustrated in FIG. 42 .
  • AdaptationSet an entire image and each tile are defined in mutually different adaptation sets.
  • the supplemental property (SupplementalProperty) is defined as a description for a tile, instead of the viewpoint (Viewpoint) described in the first embodiment.
  • the supplemental property of the topmost adaptation set in the drawing is defined as follows, for example.
  • the essential property (EssentialProperty) is defined as a description for a tile, instead of the viewpoint (Viewpoint) described in the first embodiment.
  • the essential property in regard to a part of the bit stream is further extended and defined.
  • the essential property of the second adaptation set from the top of the drawing is defined as follows, for example.
  • the essential property of the third adaptation set from the top of the drawing in the example of FIG. 42 is defined as follows, for example.
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • FIG. 43 is a diagram illustrating an example of the configuration of an MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bit stream7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the bit streams of tiles are managed as mutually different files. Since the tracks of the files are mutually different, the bit streams of the tiles can also be said to be managed as mutually different tracks.
  • the topmost MP4 file (MP4 File) in FIG. 43 (that is, track 1 (Track 1)) stores (manages) data of an entire image (1920 ⁇ 1080). By reproducing the MP4 file (that is, track 1), it is possible to reproduce the entire image.
  • the second MP4 file (MP4 File) (that is, track 2 (Track 2)) from the top of FIG. 43 stores (manages) data of tile 1 (Tile 1). By reproducing the MP4 file (that is, track 2), it is possible to reproduce an image of tile 1 (Tile 1).
  • the third MP4 file (MP4 File) (that is, track 3 (Track 3)) from the top of FIG. 43 stores (manages) data of tile 2 (Tile 2). By reproducing the MP4 file (that is, track 3), it is possible to reproduce an image of tile 2 (Tile 2).
  • the fourth MP4 file (MP4 File) (that is, track 4 (Track 4)) from the top of FIG. 43 stores (manages) data of tile 3 (Tile 3).
  • the bottommost MP4 file (MP4 File) (that is, track 5 (Track 5)) in FIG. 43 stores (manages) data of tile 4 (Tile 4).
  • the MP4 file that is, track 5
  • the parameter sets such as the video parameter set (VPS), the sequence parameter set (SPS), and the picture parameter set (PPS), the actual data such as the SEI, extractors (Track 2, Track 3, Track 4, and Track 5) of the bit streams of the tiles, and the like are stored.
  • the parameter sets, the extractors, and the like are managed for each sample by the sample entry (Sample Entry).
  • the extractor (Track 1) such as the parameter sets, the actual data (Slice 1) of tile 1 (Tile 1), and the like are stored. Further, in the third MP4 file (track 3) from the upper side of FIG. 43 , the extractor (Track 1) such as the parameter sets, the actual data (Slice 2) of tile 2 (Tile 2), and the like are stored. In the fourth MP4 file (track 4) from the top of FIG. 43 , the extractor (Track 1) such as the parameter sets, the actual data (Slice 3) of tile 3 (Tile 3), and the like are stored. Further, in the bottommost MP4 file (track 5) in FIG. 43 , the extractor (Track 1) such as the parameter sets, the actual data (Slice 4) of tile 4 (Tile 4), and the like are stored.
  • a tile region group entry (TileRegionGroupEntry) is defined in each of the MP4 files (tracks 2 to 5). That is, one tile is defined in each track.
  • the extractor is used as information indicating the reference relation. Accordingly, for example, it is possible to construct a freer reference relation, such as a change in the reference relation in the bit stream.
  • the file name of the topmost MP4 file in FIG. 43 is assumed to be bitstream_base.mp4
  • the file name of the second MP4 file from the top of FIG. 43 is assumed to be bitstream_tile1.mp4
  • the file name of the third MP4 file from the top of FIG. 43 is assumed to be bitstream_tile2.mp4
  • the file name of the fourth MP4 file from the top of FIG. 43 is assumed to be bitstream_tile3.mp4
  • the file name of the bottommost MP4 file in FIG. 43 is assumed to be bitstream_tile4.mp4
  • the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is extended.
  • the example is illustrated in FIG. 44 .
  • AdaptationSet an entire image and each tile are defined in mutually different adaptation sets.
  • the supplemental property (SupplementalProperty) is defined as a description for a tile, instead of the viewpoint (Viewpoint) described in the first embodiment.
  • the supplemental property of the topmost adaptation set in the drawing is defined as follows, for example.
  • the representation (Representation) belonging to the adaptation set is extended and information indicating dependency between files (tiles) is additionally defined.
  • bitstream_base.mp4 is defined.
  • the essential property (EssentialProperty) is defined as a description for a tile, instead of the viewpoint (Viewpoint) described in the first embodiment.
  • the essential property in regard to a part of the bit stream is further extended and defined.
  • the essential property of the second adaptation set from the top of the drawing is defined as follows, for example.
  • bit stream to which the adaptation set corresponds is an HEVC Tile divided (extracted) into tracks (that is, a plurality of tracks (plurality of files) are formed), "1 (true)" is defined as "Sub-Sample-is-extracted.”
  • bitstream_tile1.mp4 is defined.
  • bitstream_tile2.mp4 is defined.
  • bitstream_tile3.mp4 is defined.
  • bitstream_tile4.mp4 is defined.
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • FIG. 45 is a diagram illustrating an example of the configuration of an MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bit stream7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the bit streams of the tiles are collected and considered as one file and the data of the tiles is further managed as one track.
  • the reference relation of the data between the tracks is defined using the extractor.
  • the reference relation is defined using track reference (Track Reference).
  • the track reference is information indicating a reference relation (which track refers to which track (or from which track reference is made)) between tracks. That is, the track reference is information in units of tracks and is defined once for 1 track. "dpnd” is information that defines a track (that is, a reference source) referring to the track and "prnt” is information that defines a track (that is, a reference destination) referred to by the track.
  • the degree of freedom for setting the reference relation is improved since the extractor is defined for each sample.
  • redundancy of the extractor increases, and thus there is a possibility of the amount of information being unnecessarily increasing. For example, when the sizes or shapes of the tiles are uniform in the bit stream, one time suffices for the reference relation.
  • the track reference (Track Reference) is defined only once for 1 track, as described above. Accordingly, by using the track reference, it is possible to reduce the definition redundancy of the reference relation and suppress an increase in the amount of unnecessary information.
  • track 1 (Track 1) is present for storing the parameter sets and the reproduction of track 1 (reproduction of an entire image (1920 ⁇ 1080)) may not be performed.
  • track 1 reproduction of an entire image (1920 ⁇ 1080)
  • the tile region group entry (TileRegionGroupEntry) is defined in each of track 2 (Track 2) to track 5 (Track 5). That is, one tile is defined in each track.
  • the file name of this MP4 file is assumed to be bitstream.mp4.
  • the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is also extended, as in the above-described case of the reference by the extractor. An example of this is illustrated in FIG. 46 .
  • the MP4 file can be managed by the MPD as in the example of FIG. 42 .
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • FIG. 47 is a diagram illustrating an example of the configuration of an MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bitstream7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the bit streams of tiles are managed as mutually different files. Since the tracks of the files are mutually different, the bit streams of the tiles can also be said to be managed as mutually different tracks.
  • the topmost MP4 file (MP4 File) (that is, track 1 (Track 1)) in FIG. 47 stores (manages) the parameter sets and the like (the VPS, the SPS, the PPS, the SEI, and the like).
  • the second to fifth MP4 files (MP4 File) (that is, track 2 (Track 2) to track 5 (Track)) from the top of FIG. 47 store (manage) the data of tile 1 (Tile 1) to tile 4 (Tile 4). By reproducing any MP4 file (that is, any track) among the files, it is possible to reproduce the image of any tile.
  • the reference relation of the data between the tracks is defined using the extractor.
  • the reference relation is defined using track reference (Track Reference) in a way similar to the case of FIG. 45 .
  • the tile region group entry (TileRegionGroupEntry) is defined in each of track 2 (Track 2) to track 5 (Track 5). That is, one tile is defined in each track.
  • the track reference is used as the information indicating the reference relation. Accordingly, it is possible to reduce the definition redundancy of the reference relation and suppress the increase in the amount of unnecessary information.
  • the file names of the MP4 files in FIG. 47 are assumed to be bitstream_base.mp4, bitstream_tile1.mp4, bitstream_tile2.mp4, bitstream_tile3.mp4, and bitstream_tile4.mp4 in order from the top.
  • the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is also extended, as in the above-described case of the reference by the extractor. An example of this is illustrated in FIG. 48 .
  • the MP4 file can be managed by the MPD as in the example of FIG. 44 .
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • FIG. 49 is a diagram illustrating an example of the configuration of an MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bit stream7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the bit streams of the tiles are collected and considered as one file and the data of the tiles is further managed as one track.
  • the reference relation of the data between the tracks is defined using the extractor.
  • the reference relation of the data between the tracks is defined using the track reference.
  • the reference relation is defined using both of the extractor and the track reference.
  • track 1 refers to the information regarding track 2 (Track 2) to track 5 (Track 5) using the extractor as in the case of FIG. 41 .
  • track 2 (Track 2) to track 5 (Track 5) refer to the information regarding track 1 (Track 1) using the track reference as in the case of FIG. 45 .
  • Track 1 Track 1
  • the parameter sets such as the video parameter set (VPS), the sequence parameter set (SPS), and the picture parameter set (PPS), the actual data such as the SEI, the extractor for referring to the data of the tiles of tracks 2 to 5, and the like are stored.
  • the tile region group entry (TileRegionGroupEntry) is defined in each of track 2 (Track 2) to track 5 (Track 5). That is, one tile is defined in each track.
  • the file name of this MP4 file is assumed to be bitstream.mp4.
  • the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is extended.
  • the example is illustrated in FIG. 50 .
  • the MP4 file can be managed by the MPD as in the examples of FIG. 42 and FIG. 46 .
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • FIG. 51 is a diagram illustrating an example of the configuration of an MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bitstream7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the bit streams of tiles are managed as mutually different files. Since the tracks of the files are mutually different, the bit streams of the tiles can also be said to be managed as mutually different tracks.
  • the reference relation of the data between the tracks is defined using the extractor.
  • the reference relation of the data between the tracks is defined using the track reference.
  • the reference relation is defined using both of the extractor and the track reference.
  • the topmost MP4 file (track 1 (Track 1)) in FIG. 51 refers to the information regarding the second to fifth MP4 files (track 2 (Track 2) to track 5 (Track 5)) from the top of FIG. 51 using the extractor as in the case of FIG. 43 .
  • the second to fifth MP4 files (track 2 (Track 2) to track 5 (Track 5)) from the top of FIG. 51 refer to the information regarding the topmost MP4 file (track 1 (Track 1)) in FIG. 51 using the track reference as in the case of FIG. 47 .
  • the parameter sets such as the video parameter set (VPS), the sequence parameter set (SPS), and the picture parameter set (PPS), the actual data such as the SEI, extractors (Track 2, Track 3, Track 4, and Track 5) of the bit streams of the tiles, and the like are stored.
  • the parameter sets, the extractors, and the like are managed for each sample by the sample entry (Sample Entry).
  • the tile region group entry (TileRegionGroupEntry) is defined. That is, one tile is defined in each track.
  • the file names of the MP4 files in FIG. 51 are assumed to be bitstream_base.mp4, bitstream_tile1.mp4, bitstream_tile2.mp4, bitstream_tile3.mp4, and bitstream_tile4.mp4 in order from the top.
  • the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is extended.
  • the example is illustrated in FIG. 52 .
  • the MP4 file can be managed by the MPD as in the examples of FIG. 44 and FIG. 48 .
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • one MP4 file includes the plurality of tracks
  • the slice which is the actual data is stored for each tile in the different track.
  • the slices of the tiles can be collected and disposed in one track. The example of this case will be described below with reference to FIG. 53 .
  • FIG. 53 is a diagram illustrating an example of the configuration of the MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bit stream7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the bit streams of the tiles are collected and considered as one MP4 file. Further, the tiles are managed in mutually different tracks.
  • the slices which are the actual data of the tiles are collected and stored in one track.
  • the reference relation of the data between the tracks is defined using the extractor.
  • the reference relation of the data between the tracks is defined using the track reference.
  • both of the extractor and the track reference are used. However, a method of using the extractor and the track reference differs from the case of FIG. 49 .
  • track 1 which is a base track
  • the parameter sets such as the video parameter set (VPS), the sequence parameter set (SPS), and the picture parameter set (PPS) and the actual data such as the SEI are stored.
  • the parameter sets such as the video parameter set (VPS), the sequence parameter set (SPS), and the picture parameter set (PPS) are managed for each sample by a sample entry (Sample Entry).
  • samples 1 to 4 which are actual data of the tiles of HEVC and the like are stored.
  • track 2 (Track 2) to track 5 (Track 5) have both of the extractor and the track reference for referring to the information regarding track 1 (Track 1).
  • slice 1 of track 1 is referred to in accordance with the extractor, for example, the tile of track 2 is reproduced.
  • slice 2 of track 1 is referred to when the tile of track 3 is reproduced.
  • slice 3 of track 1 is referred to when the tile of track 4 is reproduced, and slice 4 of track 1 is referred to when the tile of track 5 is reproduced.
  • the tile region group entry (TileRegionGroupEntry) is defined in each of track 2 (Track 2) to track 5 (Track 5). That is, one tile is defined for each track.
  • the definition is the same as the case of each track in FIGS. 41 , 43 , 45 , 47 , 49 , and 51 (the case of each tile in FIG. 39 ).
  • the file name of this MP4 file is assumed to be bitstream.mp4.
  • the MPD of the MP4 in FIG. 53 is illustrated in FIG. 54 . Even in the MPD, the same extension as the MPDs in FIGS. 42 , 46 , and 50 corresponding to the MP4 files in FIGS. 41 , 45 , and 49 is performed. That is, the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is extended.
  • the MPD in FIG. 54 has basically the same configuration as the MPDs in FIGS. 42 , 46 , and 50 . However, the MPD in FIG. 54 differs from the MPDs in that an ID is stored in each representation (Representation).
  • an ID In the representation (Representation) located at the top in FIG. 54 , an ID (bs) indicating a base track is stored.
  • an ID (tl1) In the second representation (Representation) from the top, an ID (tl1) indicating the ID of tile 1 is stored.
  • IDs (tl2 to tl4) indicating the IDs of tiles 2 to 4 are stored.
  • the MP4 file in FIG. 53 can be managed by the MPD in FIG. 54 .
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • FIG. 55 is a diagram illustrating an example of the configuration of the MP4 file obtained by filing the bit stream (bitstream7) having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • bit stream 7 having, for example, the tile (Tile) structure illustrated in FIG. 6B .
  • the tracks of the tiles are considered to be different MP4 files.
  • the slices which are the actual data of the tiles are collected and stored in track 1 (Track 1) which is a base track.
  • the reference relation of the data between the tracks is defined using the extractor.
  • the reference relation of the data between the tracks is defined using the track reference.
  • both of the extractor and the track reference are used.
  • a method of using the extractor and the track reference differs from the case of FIG. 49 in a way similar to the case of FIG. 53 .
  • track 1 (Track 1) as illustrated in FIG. 55 , the parameter sets such as the video parameter set (VPS), the sequence parameter set (SPS), and the picture parameter set (PPS) and the actual data such as the SEI are stored. Further, in track 1 (Track 1), slices 1 to 4 which are actual data of the tiles of HEVC are stored.
  • track 2 (Track 2) to track 5 (Track 5) have both of the extractor and the track reference for referring to the information regarding track 1 (Track 1).
  • slice 1 of track 1 is referred to in accordance with the extractor, for example, the tile of track 2 is reproduced.
  • slice 2 of track 1 is referred to when the tile of track 3 is reproduced.
  • slice 3 of track 1 is referred to when the tile of track 4 is reproduced, and slice 4 of track 1 is referred to when the tile of track 5 is reproduced.
  • the tile region group entry (TileRegionGroupEntry) is defined in each of track 2 (Track 2) to track 5 (Track 5). That is, one tile is defined for each track. Its content is the same as FIG. 39 . etc.
  • the MP4 file in FIG. 55 has the same basic configuration as the MP4 file in FIG. 53 except that the MP4 files separated in the example of FIG. 53 are collected as one MP4 file.
  • the file names of the MP4 files in FIG. 55 are assumed to be bitstream_base.mp4, bitstream_tile1.mp4, bitstream_tile2.mp4, bitstream_tile3.mp4, and bitstream_tile4.mp4 in order from the top.
  • the supplemental property (SupplementalProperty) or the essential property (EssentialProperty) of the adaptation set (AdaptationSet) is extended.
  • the example is illustrated in FIG. 56 .
  • the MPD in FIG. 56 has the same configuration as the MPD in FIG. 54 .
  • the MP4 file in FIG. 55 can be managed by the MPD in FIG. 56 .
  • the generation of the extended MPD can be performed as in the case of the first embodiment.
  • the delivery data generation device 101 FIG. 12
  • the tile type MPD generation unit 141 the tile type image information generation unit 124)
  • FIG. 12 performs the tile type MPD file generation process ( FIG. 15 )
  • the extended MPD can be generated (the MPD is extended). Accordingly, even in this case, the delivery data generation device 101 can adaptively deliver (supply) the data of the partial image to the delivery server 102 based on the DASH standard. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the reproduction of the delivery data using the extended MPD can also be performed as in the case of the first embodiment.
  • the terminal device 103 FIG. 13
  • the terminal device 103 can correctly analyze the extended MPD by performing the delivery data generation process ( FIG. 16 ) and gain the adaptive delivery (supply) of the data of the partial image by the delivery server 102 which is based on the DASH standard. That is, it is possible to correctly acquire the data of the partial image from the delivery server 102 and reproduce the data of the partial image. That is, it is possible to realize the adaptive supply of the data of the partial image.
  • the partial image information includes the track reference and the extractor, the track reference and the extractor are stored in the tracks corresponding to the plurality of partial images, and the tracks storing the slices of the partial images are referred to.
  • the application scope of the present technology can be applied to any information processing devices that supply or receive partial images.
  • the above-described series of processes can also be performed by hardware and can also be performed by software.
  • a program of the software is installed in a computer.
  • the computer includes a computer embedded in dedicated hardware and, for example, a general personal computer capable of various functions through installation of various programs.
  • FIG. 57 is a block diagram illustrating an example of a hardware configuration of the computer performing the above-described series of processes according to a program.
  • a central processing unit (CPU) 501 a read-only memory (ROM) 502, and a random access memory (RAM) 503 are connected mutually via a bus 504.
  • CPU central processing unit
  • ROM read-only memory
  • RAM random access memory
  • An input and output interface 510 is also connected to the bus 504.
  • An input unit 511, an output unit 512, a storage unit 513, a communication unit 514, and a drive 515 are connected to the input and output interface 510.
  • the input unit 511 is formed by, for example, a keyboard, a mouse, a microphone, a touch panel, or an input terminal.
  • the output unit 512 is formed by, for example, a display, a speaker, or an output terminal.
  • the storage unit 513 is formed by, for example, a hard disk, a RAM disk, or a non-volatile memory.
  • the communication unit 514 is formed by, for example, a network interface.
  • the drive 515 drives a removable medium 521 such as a magnetic disk, an optical disc, a magneto-optical disc, or a semiconductor memory.
  • the CPU 501 performs the above-described processes by loading a program stored in the storage unit 513 to the RAM 503 via the input and output interface 510 and the bus 504 and executing the program.
  • the RAM 503 also appropriately stores data necessary for the CPU 501 to perform various processes.
  • a program executed by the computer can be recorded in the removable medium 521 such as a package medium to be applied.
  • the program can be installed in the storage unit 513 via the input and output interface 510.
  • the program can also be supplied via a wired or wireless transmission medium such as a local area network, the Internet, or digital satellite broadcast.
  • the program can be received by the communication unit 514 to be installed in the storage unit 513.
  • program can also be installed in advance in the ROM 502 or the storage unit 513.
  • Programs executed by the computer may be programs which are processed chronologically in the order described in the present specification or may be programs which are processed at necessary timings, for example, in parallel or when called.
  • steps describing a program recorded in a recording medium include not only processes which are performed chronologically in the described order but also processes which are performed in parallel or individually but not chronologically.
  • Multi-view image encoding and multi-view image decoding can be applied as schemes for image encoding and image decoding related to the above-described series of processes.
  • FIG. 58 illustrates an example of a multi-view image coding scheme.
  • a multi-view image includes images having a plurality of views.
  • the plurality of views of the multi-view image include a base view for which encoding/decoding is performed using only the image of its own view without using images of other views and non-base views for which encoding/decoding is performed using images of other views.
  • a non-base view the image of the base view may be used, and the image of the other non-base view may be used.
  • information necessary to encode and decode the flags or the parameters may be shared between encoding and decoding of each view. In this way, it is possible to suppress transmission of redundant information and suppress a reduction in coding efficiency.
  • FIG. 59 is a diagram illustrating a multi-view image encoding device which performs the above-described multi-view image encoding.
  • the multi-view image encoding device 600 has an encoding unit 601, an encoding unit 602, and a multiplexing unit 603.
  • the encoding unit 601 encodes a base view image to generate a base view image encoded stream.
  • the encoding unit 602 encodes a non-base view image to generate a non-base view image encoded stream.
  • the multiplexing unit 603 multiplexes the base view image encoded stream generated by the encoding unit 601 and the non-base view image encoded stream generated by the encoding unit 602 to generate a multi-view image encoded stream.
  • the multi-view image encoding device 600 may be applied as the image encoding unit 122 (which is one encoding processing unit of the image encoding unit) ( FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the image encoding unit 122 which is one encoding processing unit of the image encoding unit
  • FIG. 12 the delivery data generation device 101
  • FIG. 60 is a diagram illustrating a multi-view image decoding device which performs the above-described multi-view image decoding.
  • the multi-view image decoding device 610 has a demultiplexing unit 611, a decoding unit 612, and another decoding unit 613.
  • the demultiplexing unit 611 demultiplexes the multi-view image encoded stream obtained by multiplexing the base view image encoded stream and the non-base view image encoded stream to extract the base view image encoded stream and the non-base view image encoded stream.
  • the decoding unit 612 decodes the base view image encoded stream extracted by the demultiplexing unit 611 to obtain the base view image.
  • the decoding unit 613 decodes the non-base view image encoded stream extracted by the demultiplexing unit 611 to obtain the non-base view image.
  • the multi-view image decoding device 610 may be applied as the image decoding unit 155 (one decoding processing unit of the image decoding unit) of the terminal device 103 ( FIG. 11 ). In this way, it is also possible to apply the method of each embodiment described above to delivery of the multi-view image, and thus it is possible to realize adaptive supply of the data of the partial images.
  • FIG. 61 illustrates an example of a layered image coding scheme.
  • Layered image encoding involves dividing an image into a plurality of layers (multi-layered) and performing encoding for each layer so that image data can have scalability with respect to a predetermined parameter.
  • Layered image decoding is decoding that corresponds to the layered image encoding.
  • the layering of the image is a parameter related to the image and is performed by changing predetermined parameters having scalability. That is, as illustrated in FIG. 61 , an image subjected to the layering (layered image) is configured to include a plurality of images of which the values of the predetermined parameters with the scalability are mutually different. Each image of the plurality of images is considered to be a layer.
  • the plurality of layers of the layered image include a base layer in which only information regarding the own layer is used without using information regarding other layers at the time of encoding and decoding and non-base layers (also referred to as enhancement layers) in which the information regarding the other layers can be used at the time of encoding and decoding.
  • non-base layers also be used and the information regarding the other non-base layers can also be used.
  • the layered image encoding is a process of encoding such a layered image.
  • the image of the base layer is encoded using only the information regarding the base layer to generate encoded data of the base layer.
  • the images of the non-base layers are encoded using the information regarding the base layer and the information regarding the non-base layers, and encoded data of the non-base layers is generated.
  • the layered image decoding is a process of decoding the encoded data subjected to the layered image encoding and generating a decoded image of any layer.
  • the encoded data of the base layer is decoded to generate a decoded image of the base layer.
  • the encoded data of the base layer is decoded, and the encoded data of the non-base layers is decoded using the information regarding the base layer to generate decoded images of the non-base layers.
  • the encoded data is divided and generated for each layer through the layered encoding. Therefore, at the time of decoding, the encoded data of all the layers may not necessarily be necessary, and only the encoded data of a layer necessary to obtain a desired decoded image may be obtained. Accordingly, it is possible to suppress an increase in a transmission amount of the data from an encoding side to a decoding side.
  • any information of another layer used for the encoding and the decoding can be used.
  • an image for example, a decoded image
  • prediction between layers may be performed using the image of another layer. In this way, it is possible to reduce redundancy between the layers. In particular, it is possible to suppress an increase in the encoding amount of the non-base layer.
  • the use of the information between the layers may be performed in all of the pictures of a moving image. As illustrated in FIG. 61 , the use of the information may be performed in some of the pictures.
  • the qualities of the images of the layers of the layered image are mutually different for the predetermined parameters having the scalability. That is, by performing the layered image encoding and the layered image decoding on the layered image, it is possible to easily obtain images with various qualities according to situations. Any setting can be performed on the quality of each layer. However, in general, the quality of the image of the base layer is set to be lower than the quality of the image of the enhancement layer using the information regarding the base layer.
  • image compression information (encoded data) regarding only the base layer may be transmitted to a terminal such as a mobile telephone with a low processing ability
  • image compression information (encoded data) regarding the enhancement layer in addition to the base layer may be transmitted to a terminal such as a television or a personal computer with a high processing ability.
  • the load of a process of reproducing an image with low quality is less than that of a process of reproducing an image with high quality. Accordingly, by performing the transmission in this way, it is possible to allow each terminal to perform a reproduction process according to the ability, for example, to allow a terminal with a low processing ability to reproduce a moving image with low quality and allow a terminal with a high processing ability to reproduce a moving image with high quality. That is, it is possible to allow terminals with more varied processing abilities to reproduce a moving image normally (without failure).
  • only the encoded data of a necessary layer may be transmitted to each terminal. Therefore, it is possible to suppress an increase in a data amount (transmission amount) of the encoded data to be transmitted.
  • the delivery of the data according to a terminal can be realized without a transcoding process.
  • the method of each embodiment described above may be applied. In this way, it is possible to realize the adaptive supply of the data of the partial image even in the layered image.
  • the information necessary to encode and decode the flags or the parameters (for example, the VPS, the SPS, and the like as coding information) used in the method of each embodiment described above may be shared between encoding and decoding of each layer. In this way, it is possible to suppress transmission of redundant information and suppress a reduction in coding efficiency.
  • any parameter having the scalability can be used.
  • a spatial resolution illustrated in FIG. 62 may be assumed to be the parameter (spatial scalability).
  • a spatial resolution that is, the number of pixels of a picture
  • each picture is layered into two layers, a base layer with a low resolution and an enhancement layer with a high resolution.
  • this number of layers is an example and each picture can be layered into any number of layers.
  • a temporal resolution may be applied, as illustrated in FIG. 63 (temporal scalability).
  • a temporal resolution that is, a frame rate
  • a picture is layered into three layers, a layer with a low frame rate (7.5 fps), a layer with an intermediate frame rate (15 fps), and a layer with a high frame rate (30 fps).
  • this number of layers is an example and each picture can be layered into any number of layers.
  • a signal-to-noise ratio may be applied, as illustrated in FIG. 64 (SNR scalability).
  • SNR scalability the SN ratio differs for each layer.
  • each picture is layered into two layers, a base layer with a low SNR and an enhancement layer with a high SNR.
  • this number of layers is an example and each picture can be layered into any number of layers.
  • the parameter having such a scalable property may, of course, be a parameter other than the above-described examples.
  • a bit depth can also be used as the parameter having such a scalable property (bit-depth scalability).
  • bit-depth scalability a bit depth differs for each layer.
  • the base layer may be formed by an 8-bit image and the enhancement layer may be formed by a 10-bit image.
  • this number of layers is an example and each picture can be layered into any number of layers. Any bit depth of each layer can also be used and is not limited to the above-described example.
  • the base layer may be assumed to be a standard dynamic range (SDR) image with a standard dynamic range and the enhancement layer may be assumed to be a high dynamic range (HDR) image with a broader dynamic range.
  • SDR image may be assumed to be, for example, image data with integer precision of 8 bits or 16 bits and the HDR image may be assumed to be, for example, image data with floating-point precision of 32 bits.
  • a chroma format can also be used (chroma scalability).
  • the chroma format differs for each layer.
  • the base layer may be formed by a component image with a 4:2:0 format and the enhancement layer may be formed by a component image with a 4:2:2 format.
  • this number of layers is an example and each picture can be layered into any number of layers.
  • Any chroma format of each layer can also be used and is not limited to the above-described example.
  • a color gamut may be used.
  • the color gamut of the enhancement layer may be configured to include the color gamut of the base layer (that is, broader than the color gamut of the base layer).
  • FIG. 65 is a diagram illustrating a layered image encoding device which performs the above-described layered image encoding.
  • the layered image encoding device 620 has an encoding unit 621, another encoding unit 622, and a multiplexing unit 623 as illustrated in FIG. 65 .
  • the encoding unit 621 encodes a base layer image to generate a base layer image encoded stream.
  • the encoding unit 622 encodes a non-base layer image to generate a non-base layer image encoded stream.
  • the multiplexing unit 623 multiplexes the base layer image encoded stream generated by the encoding unit 621 and the non-base layer image encoded stream generated by the encoding unit 622 to generate a layered image encoded stream.
  • the layered image encoding device 620 may be applied as the image encoding unit 122 (which is one encoding processing unit of the image encoding unit) ( FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the image encoding unit 122 which is one encoding processing unit of the image encoding unit
  • FIG. 12 the delivery data generation device 101
  • FIG. 66 is a diagram illustrating a layered image decoding device which performs the above-described layered image decoding.
  • the layered image decoding device 630 has a demultiplexing unit 631, a decoding unit 632, and another decoding unit 633 as illustrated in FIG. 66 .
  • the demultiplexing unit 631 demultiplexes the layered image encoded stream obtained by multiplexing the base layer image encoded stream and the non-base layer image encoded stream to extract the base layer image encoded stream and the non-base layer image encoded stream.
  • the decoding unit 632 decodes the base layer image encoded stream extracted by the demultiplexing unit 631 to obtain the base layer image.
  • the decoding unit 633 decodes the non-base layer image encoded stream extracted by the demultiplexing unit 631 to obtain the non-base layer image.
  • the layered image decoding device 630 may be applied as the image decoding unit 155 (one decoding processing unit of the image decoding unit) of the terminal device 103 ( FIG. 11 ). In this way, it is also possible to apply the method of each embodiment described above to delivery of the layered image, and thus it is possible to realize adaptive supply of the data of the partial images.
  • the image encoding device and the image decoding device according to the above-described embodiments can be applied to various electronic devices such as a transmitter or a receiver in delivery of satellite broadcast, a wired broadcast such as a cable TV, or the Internet and delivery to a terminal by cellular communication, a recording device recording an image in a medium such as an optical disc, a magnetic disk, or a flash memory, or a reproduction device reproducing an image from the storage medium.
  • a transmitter or a receiver in delivery of satellite broadcast such as a wired broadcast such as a cable TV, or the Internet and delivery to a terminal by cellular communication
  • a recording device recording an image in a medium such as an optical disc, a magnetic disk, or a flash memory
  • a reproduction device reproducing an image from the storage medium such as an optical disc, a magnetic disk, or a flash memory
  • FIG. 67 is a block diagram illustrating an example of a schematic configuration of a television device to which the above-described embodiments are applied.
  • a television device 900 includes an antenna 901, a tuner 902, a demultiplexer 903, a decoder 904, a video signal processing unit 905, a display unit 906, an audio signal processing unit 907, and a speaker 908.
  • the television device 900 further includes an external interface (I/F) unit 909, a control unit 910, a user interface (I/F) unit 911, and a bus 912.
  • the television device 900 further includes an MP4 processing unit 914 and an MPEG-DASH processing unit 915.
  • the tuner 902 extracts a signal of a desired channel (tuned channel) from a broadcast wave signal received via the antenna 901 and demodulates the extracted signal.
  • the tuner 902 outputs an encoded bit stream obtained through the demodulation to the demultiplexer 903.
  • the demultiplexer 903 demultiplexes a video stream and an audio stream of a viewing target program from the encoded bit stream and outputs the demultiplexed streams to the decoder 904.
  • the demultiplexer 903 extracts auxiliary data such as an electronic program guide (EPG) from the encoded bit stream and supplies the extracted data to the control unit 910.
  • EPG electronic program guide
  • the demultiplexer 903 may perform descrambling on the encoded bit stream.
  • the decoder 904 decodes the video stream and the audio stream input from the demultiplexer 903.
  • the decoder 904 performs the decoding using the MP4 processing unit 914 or the MPEG-DASH processing unit 915, as necessary.
  • the decoder 904 outputs video data generated through the decoding process to the video signal processing unit 905.
  • the decoder 904 outputs audio data generated through the decoding process to the audio signal processing unit 907.
  • the video signal processing unit 905 reproduces the video data input from the decoder 904 and causes the display unit 906 to display an image.
  • the video signal processing unit 905 can also reproduce video data supplied from the outside via a reception unit 913 and cause the display unit 906 to display the image.
  • the video signal processing unit 905 can also generate an image by executing an application supplied from the outside via the reception unit 913 and cause the display unit 906 to display the image.
  • the video signal processing unit 905 can also perform, for example, an additional process such as noise removal on the image displayed by the display unit 906.
  • the video signal processing unit 905 can also generate an image of a graphical user interface (GUI) such as a menu, a button, or a cursor and superimpose the image on an image displayed by the display unit 906.
  • GUI graphical user interface
  • the audio signal processing unit 907 performs a reproduction process such as D-to-A conversion and amplification on the audio data input from the decoder 904 and outputs audio from the speaker 908.
  • the audio signal processing unit 907 can also reproduce audio data supplied from the outside via the reception unit 913 and output the audio from the speaker 908.
  • the audio signal processing unit 907 can also generate audio by executing an application supplied from the outside via the reception unit 913 and output the audio from the speaker 908.
  • the audio signal processing unit 907 can also perform, for example, an additional process such as noise removal on the audio to be output from the speaker 908.
  • the external interface unit 909 is an interface for connecting the television device 900 to an external device or a network.
  • the external device may be any electronic device, such as a computer, an externally attached hard disk drive (HDD) connected via a communication cable of a predetermined standard such as Universal Serial Bus (USB) or IEEE1394, an externally attached optical disc drive, or a network attached storage (NAS), as long as the device can transmit and receive information to and from the television device 900.
  • a computer an externally attached hard disk drive (HDD) connected via a communication cable of a predetermined standard such as Universal Serial Bus (USB) or IEEE1394, an externally attached optical disc drive, or a network attached storage (NAS), as long as the device can transmit and receive information to and from the television device 900.
  • HDMI hard disk drive
  • NAS network attached storage
  • a network is a communication network serving as a communication medium.
  • the network may be any communication network, a wired communication network, a wireless communication network, or both.
  • the network may be a wired local area network (LAN), a wireless LAN, a public telephone line network, a wide area communication network for a wireless moving object such as a so-called 3G network or 4G network, or the Internet for wireless moving objects, or a combination thereof.
  • the network may be a single communication network or a plurality of communication networks.
  • the network may be configured by a plurality of communication networks mutually connected via servers, relay devices, or the like.
  • a part or all of the network may be configured by a communication cable of a predetermined standard, such as a Universal Serial Bus (USB) cable or a High-Definition Multimedia Interface (HDMI: registered trademark) cable.
  • a part or all of the network may be configured by a method which is based on a predetermined standard such as an ad hoc mode of Institute of Electrical and Electronic Engineers (IEEE) 802.11 wireless LAN, optical communication of infrared rays such as InfraRed Data Association (IrDA) or the like, or Bluetooth (registered trademark), or may be configured by wireless communication of a unique communication scheme.
  • IEEE Institute of Electrical and Electronic Engineers
  • IrDA InfraRed Data Association
  • Bluetooth registered trademark
  • the television device 900 can perform communication (transmit and receive information) with the external device via the network.
  • the external interface unit 909 can receive an encoded bit stream supplied from an external device via a communication cable or a network. When the external interface unit 909 receives the encoded bit stream, the external interface unit 909 supplies the encoded bit stream to the demultiplexer 903 via the bus 912.
  • the demultiplexer 903 processes the encoded bit stream as in the encoded bit stream supplied from the tuner 902 to demultiplex a video stream and an audio stream, extract auxiliary data such as EPG, or perform descrambling.
  • the television device 900 can receive a broadcast wave signal including the encoded bit stream and can also receive the encoded bit stream transmitted via a network, decode the encoded bit stream, and output the video or the audio.
  • the antenna 901 or the external interface unit 909 functions as a reception unit in the television device 900.
  • the television device 900 can also transmit information to an external device via the external interface unit 909.
  • This information is arbitrary.
  • the information may be a request for content such as a video or audio, information regarding a communication function of the television device 900 necessary to establish communication, or information regarding a decoding function, an image display function, an audio output function of the television device 900.
  • the television device 900 may also transmit an encoded bit stream received via the antenna 901 to an external device via the external interface unit 909. That is, the external interface unit 909 may function as a transmission unit in the television device 900.
  • the control unit 910 is connected with the user interface unit 911.
  • the user interface unit 911 is configured as a manipulating switch or a remotely controlled signal reception unit, and supplies an operation signal to the control unit 910 according to a user operation.
  • the control unit 910 is configured using a CPU, a memory, and the like.
  • the memory stores programs executed by the CPU, various kinds of data necessary for the CPU to perform processes, EPG data, data acquired through the external interface unit 909.
  • the programs stored in the memory are read and executed by the CPU at predetermined timings such as when the television device 900 is turned on. By executing the programs, the CPU controls the respective units so that the television device 900 is operated according to user operations.
  • a bus 912 is provided to connect the tuner 902, the demultiplexer 903, the video signal processing unit 905, the audio signal processing unit 907, the external interface unit 909, and the like with the control unit 910.
  • the decoder 904 supplies the MP4 file to the MP4 processing unit 914.
  • the MP4 processing unit 914 analyzes the supplied MP4 file and decodes encoded data included in the MP4 file.
  • the MP4 processing unit 914 supplies the image data obtained through the decoding to the decoder 904.
  • the decoder 904 supplies the image data to the video signal processing unit 905.
  • the MP4 processing unit 914 may include the file acquisition unit 154, the image decoding unit 155, and the tile image combination unit 156 ( FIG. 13 ) of the terminal device 103 ( FIG. 11 ).
  • the MP4 processing unit 914 acquires an MP4 file including the data of the tiles included in a desired range via the decoder 904 or the like, extracts and decodes the encoded data of the tiles, appropriately combines the acquired image data (tile images) of the tiles to generate image data in the desired range, and supplies the image data to the decoder 904.
  • the MP4 processing unit 914 can process the various MP4 files described above in the embodiments to obtain desired image data. That is, the television device 900 can realize the adaptive supply of the data of the partial images.
  • the decoder 904 supplies the MPD file to the MPEG-DASH processing unit 915.
  • the MPEG-DASH processing unit 915 analyzes the supplied MPD and acquires desired image data based on the MPD. For example, when the MP4 file including the encoded data obtained by encoding the image data is managed by the MPD, the MPEG-DASH processing unit 915 acquires the MP4 file corresponding to a desired image based on the MPD, decodes the encoded data included in the MP4 file, and supplies the image data obtained through the decoding to the decoder 904.
  • the decoder 904 supplies the image data to the video signal processing unit 905.
  • the MPEG-DASH processing unit 915 may include the MPD acquisition unit 151 to the tile image combination unit 156 (each processing unit other than the display unit 157 in FIG. 13 ) of the terminal device 103 ( FIG. 11 ).
  • the MPEG-DASH processing unit 915 analyzes the MPD, acquires the MP4 file including the data of the tiles included in a desired range via the decoder 904 or the like, extracts and decodes the encoded data of the tiles, appropriately combines the obtained image data (tile images) of the tiles to generate image data in the desired range, and supplies the image data to the decoder 904.
  • the MPEG-DASH processing unit 915 can process the various MP4 files described in the embodiments to obtain desired image data. That is, the television device 900 can realize the adaptive supply of the data of the partial images.
  • FIG. 68 illustrates a schematic configuration of a mobile telephone to which the present disclosure is applied.
  • the mobile telephone 920 has a communication unit 922, an audio codec 923, a camera unit 926, an image processing unit 927, a demultiplexing unit 928, a recording and reproduction unit 929, a display unit 930, and a control unit 931.
  • the constituent elements are connected to one another by a bus 933.
  • an antenna 921 is connected to the communication unit 922, and a speaker 924 and a microphone 925 are connected to the audio codec 923. Further, an operation unit 932 is connected to the control unit 931.
  • the mobile telephone 920 includes an MP4 processing unit 934 and an MPEG-DASH processing unit 935.
  • the MP4 processing unit 934 and the MPEG-DASH processing unit 935 are connected to the bus 933.
  • the communication unit 922 performs processes related to transmission and reception of radio signals via the antenna 921.
  • the audio codec 923 performs processes related to encoding of audio data and decoding of audio encoded data obtained by encoding the audio data.
  • the camera unit 926 images a subject and performs processes related to the imaging, such as generation of image data.
  • the image processing unit 927 performs a process on the image data.
  • the image processing unit 927 can perform any image processing on the image data.
  • the image processing unit 927 can also encode the image data or decode the encoded data obtained by encoding the image data.
  • the demultiplexing unit 928 performs, for example, processes related to multiplexing of a plurality of pieces of data such as image data or audio data or demultiplexing of the multiplexed data.
  • the recording and reproduction unit 929 includes any storage medium capable of performing reading and writing and performs processes related to writing (recording) of data to the storage medium or reading (reproducing) of data stored in the storage medium.
  • the storage medium may be an internal type storage medium such as a RAM or a flash memory or may be an externally mounted type storage medium such as a hard disk, a magnetic disk, a magneto-optical disc, an optical disc, a USB memory, or a memory card.
  • the display unit 930 includes a display device (for example, a liquid crystal display, a plasma display, or an organic electroluminescence display (OELD) (organic EL display)) and performs processes related to image display.
  • a display device for example, a liquid crystal display, a plasma display, or an organic electroluminescence display (OELD) (organic EL display)
  • OELD organic electroluminescence display
  • the control unit 931 includes a processor such as a CPU and memories such as a RAM and a ROM.
  • the memories store programs executed by the CPU, program data, EPG data, data acquired via a network, and the like.
  • the programs stored in the memories are read and executed by the CPU, for example, when the mobile telephone 920 is activated.
  • the CPU controls an operation of each processing unit of the mobile telephone 920, for example, according to an operation signal input from the operation unit 932 by executing a program.
  • the MP4 processing unit 934 performs processes related to the MP4 file.
  • the MPEG-DASH processing unit 935 performs a process related to generation of delivery data delivered in a method which is based on the MPEG-DASH standard or the control information, such as generation of the MPD or the MP4 file.
  • the MPEG-DASH processing unit 935 also performs a process related to reproduction of the delivery data delivered in a method which is based on the MPEG-DASH standard, such as the analysis of the MPD or processing of the MP4 file.
  • the mobile telephone 920 performs various operations such as transmission and reception of audio signals, transmission and reception of electronic mail or image data, capturing of images, and recording of data in various operation modes such as an audio calling mode, a data communication mode, a photographing mode, and a video phone mode.
  • an analog audio signal generated by the microphone 925 is supplied to the audio codec 923.
  • the audio codec 923 performs A-to-D conversion to convert the analog audio signal into digital audio data and encodes (compresses) the digital audio data.
  • the audio codec 923 outputs the audio data (audio encoded data) after the compression to the communication unit 922.
  • the communication unit 922 further encodes or modulates the audio encoded data to generate a transmission signal. Then, the communication unit 922 transmits the generated transmission signal to a base station (not illustrated) via the antenna 921.
  • the communication unit 922 performs amplification or frequency conversion on a radio signal received via the antenna 921 to acquire a received signal, demodulates or decodes the received signal to generate audio encoded data, and outputs the audio encoded data to the audio codec 923.
  • the audio codec 923 decodes (decompresses) the supplied audio encoded data or performs D-to-A conversion to generate the analog audio signal.
  • the audio codec 923 supplies the analog audio signal to the speaker 924 to output the audio.
  • the control unit 931 receives a text input via the operation unit 932 by a user and causes the display unit 930 to display the input text.
  • the control unit 931 receives a mail transmission instruction from the user via the operation unit 932, generates electronic mail data according to the instruction, and supplies the electronic mail data to the communication unit 922.
  • the communication unit 922 encodes or modulates the supplied electronic mail data to generate a transmission signal, performs frequency conversion or amplification on the transmission signal, and transmits the signal to a base station (not illustrated) via the antenna 921.
  • the communication unit 922 when mail reception is performed in the data communication mode, the communication unit 922 performs amplification or frequency conversion on a radio signal received via the antenna 921 to acquire a received signal, demodulates or decodes the received signal to restore the electronic mail data, and supplies the restored electronic mail data to the control unit 931.
  • the control unit 931 causes the display unit 930 to display content of the electronic mail and stores the electronic mail data in a storage medium of the recording and reproduction unit 929.
  • the camera unit 926 images a subject to generate image data.
  • the camera unit 926 supplies the generated image data to the image processing unit 927 via the bus 933.
  • the image processing unit 927 performs image processing on the image data.
  • the camera unit 926 supplies the image data subjected to the image processing to the display unit 930 via the bus 933 to display the image.
  • the image processing unit 927 encodes the image data subjected to the image processing to generate encoded data, supplies the encoded data (image encoded data) to the recording and reproduction unit 929 via the bus 933, and stores the encoded data in the storage medium based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the camera unit 926 When sound collection is also performed along with photographing in the photographing mode, the camera unit 926 images a subject and generates image data, the microphone 925 collects sound, and an analog audio signal is generated.
  • the image processing unit 927 performs image processing on the image data generated by the camera unit 926 and causes the display unit 930 to display an image of the image data subjected to the image processing.
  • the audio codec 923 outputs the audio of the analog audio signal generated by the microphone 925 from the speaker 924.
  • the image processing unit 927 encodes the image data to generate image encoded data and supplies the encoded data to the demultiplexing unit 928 via the bus 933 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the audio codec 923 performs A-to-D conversion on the analog audio signal to generate audio data, further encodes the audio data to generate audio encoded data, and supplies the audio encoded data to the demultiplexing unit 928 via the bus 933 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the demultiplexing unit 928 multiplexes the supplied image encoded data and audio encoded data to generate multiplexed data.
  • the demultiplexing unit 928 supplies the multiplexed data to the recording and reproduction unit 929 via the bus 933 and stores the multiplexed data in the storage medium.
  • the communication unit 922 acquires the image encoded data from the image processing unit 927 or the recording and reproduction unit 929 via the bus 933, encodes or modulates the image encoded data to generate the transmission signal, performs frequency conversion or amplification on the transmission signal, and transmits the signal to a base station (not illustrated) via the antenna 921 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the communication unit 922 acquires the multiplexed data in which the data of the image and the audio (for example, the image encoded data and the audio encoded data) are multiplexed from the demultiplexing unit 928 via the bus 933, encodes or modulates the multiplexed data to generate the transmission signal, performs frequency conversion or amplification on the transmission signal, and transmits the signal to a base station (not illustrated) via the antenna 921 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the data of the image and the audio for example, the image encoded data and the audio encoded data
  • the communication unit 922 acquires the multiplexed data in which the data of the image and the audio (for example, the image encoded data and the audio encoded data) are multiplexed from the demultiplexing unit 928 via the bus 933, encodes or modulates the multiplexed data to generate the transmission signal, performs frequency conversion or amplification on the transmission signal, and transmits the signal
  • the MP4 processing unit 934 acquires image data from the camera unit 926, the image processing unit 927, the recording and reproduction unit 929, or the like via the bus 933 (may acquire the multiplexed data from the demultiplexing unit 928), encodes the image data to generate encoded data, further generates an MP4 file in which the encoded data is stored, and supplies the MP4 file to the communication unit 922 via the bus 933 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • control a user instruction or the like input via the operation unit 932
  • the communication unit 922 encodes or modulates the supplied MP4 file to generate the transmission signal, performs frequency conversion or amplification on the transmission signal, and transmits the signal to a base station (not illustrated) via the antenna 921 based on control of the control unit 931.
  • the MP4 processing unit 934 may include the screen division processing unit 121, the image encoding unit 122, the file generation unit 123, and the server upload processing unit 126 ( FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the MP4 processing unit 934 divides and encodes an image for each tile, generates an MP4 file in which data of each tile is stored, and uploads the MP4 file to the delivery server 102.
  • the MP4 processing unit 934 can generate the various MP4 files described above in the embodiments. That is, the mobile telephone 920 can realize the adaptive supply of the data of the partial images.
  • the MPEG-DASH processing unit 935 acquires the image data from the camera unit 926, the image processing unit 927, the recording and reproduction unit 929, or the like via the bus 933 (may acquire the multiplexed data from the demultiplexing unit 928), generates the MPD managing the image data, and supplies the MPD file to the communication unit 922 via the bus 933 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the communication unit 922 encodes or modulates the supplied MPD file to generate the transmission signal, performs frequency conversion or amplification on the transmission signal, and supplies the signal to a base station (not illustrated) via the antenna 921 based on control of the control unit 931.
  • the MPEG-DASH processing unit 935 may transmit the image data along with the MPD file via the communication unit 922.
  • the MPEG-DASH processing unit 935 may encode the image data to generate the MPD managing the encoded data and transmit the MPD file via the communication unit 922. Further, the MPEG-DASH processing unit 935 may transmit the encoded data along with the MPD file via the communication unit 922.
  • the MPEG-DASH processing unit 935 may encode the image data to generate the MP4 file in which the encoded data is stored, generate the MPD managing the MP4 file, and transmit the MPD file via the communication unit 922. Further, the MPEG-DASH processing unit 935 may transmit the MP4 file along with the MPD file via the communication unit 922.
  • the MPEG-DASH processing unit 935 may include the screen division processing unit 121 to the server upload processing unit 126 (including the tile type MPD generation unit 141 in FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the MPEG-DASH processing unit 935 divides and encodes an image for each tile, generates the MP4 files in which the data of each tile is stored, generates the MPDs managing the MP4 files, and uploads them to the delivery server 102.
  • the MPEG-DASH processing unit 935 can generate the various MPDs (or the MP4 files) described above in the embodiments. That is, the mobile telephone 920 can realize the adaptive supply of the data of the partial images.
  • the communication unit 922 receives a radio signal via the antenna 921, performs amplification or frequency conversion on the received signal to generate the received signal, demodulates or decodes the signal to generate the image encoded data, and supplies the image encoded data to the image processing unit 927 or the recording and reproduction unit 929 via the bus 933 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the image processing unit 927 decodes the supplied image encoded data and supplies the obtained image data to the display unit 930 to display the image.
  • the recording and reproduction unit 929 stores the supplied image encoded data in the storage medium.
  • the communication unit 922 receives a radio signal via the antenna 921, performs amplification or frequency conversion on the radio signal to generate a received signal, and demodulates or decodes the signal to generate multiplexed data in which data of the image and the audio (for example, the image encoded data and the audio encoded data) are multiplexed based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • the communication unit 922 supplies the multiplexed data to the demultiplexing unit 928 via the bus 933.
  • the demultiplexing unit 928 demultiplexes the image encoded data and the audio encoded data included in the supplied multiplexed data, supplies the image encoded data to the image processing unit 927 or the recording and reproduction unit 929 via the bus 933, and supplies the audio encoded data to the audio codec 923 via the bus 933.
  • the image processing unit 927 decodes the supplied image encoded data and supplies the obtained image data to the display unit 930 to display the image.
  • the recording and reproduction unit 929 stores the supplied image encoded data in the storage medium.
  • the audio codec 923 decodes the supplied audio encoded data and performs D-to-A conversion on the obtained audio data to generate an analog audio signal and outputs the audio of the analog audio signal from the speaker 924.
  • the MP4 processing unit 934 acquires the MP4 file from the communication unit 922 via the bus 933, analyzes the MP4 file to extract the encoded data, further decodes the encoded data, and supplies the obtained image data to the image processing unit 927, the recording and reproduction unit 929, the display unit 930, and the like via the bus 933 based on control (a user instruction or the like input via the operation unit 932) of the control unit 931.
  • control a user instruction or the like input via the operation unit 932
  • the MP4 processing unit 934 supplies the obtained multiplexed data to the demultiplexing unit 928.
  • the MP4 processing unit 934 may include the file acquisition unit 154, the image decoding unit 155, and the tile image combination unit 156 ( FIG. 13 ) of the terminal device 103 ( FIG. 11 ).
  • the MP4 processing unit 934 acquires the MP4 file including the data of the tiles included in a desired range via the communication unit 922 and the like, extracts and decodes the encoded data of the tiles, appropriately combines the obtained image data (tile images) of the tiles to generate image data in the desired range, and supplies the image data to the image processing unit 927, the recording and reproduction unit 929, the display unit 930, and the like via the bus 933.
  • the MP4 processing unit 934 can generate the various MP4 files described above in the embodiments. That is, the mobile telephone 920 can realize the adaptive supply of the data of the partial images.
  • the MPEG-DASH processing unit 935 acquires the MPD file from the communication unit 922 via the bus 933 and analyzes the MPD file based on control (a user instruction or the like input via the operation unit 932) of the control unit 931 to acquire desired image data based on the MPD.
  • the MPEG-DASH processing unit 935 acquires the MP4 file corresponding to a desired image via the communication unit 922 based on the MPD, decodes the encoded data included in the MP4 file, and supplies the image data obtained through the decoding to the image processing unit 927, the recording and reproduction unit 929, the display unit 930, or the like via the bus 933.
  • the MPEG-DASH processing unit 935 supplies the obtained multiplexed data to the demultiplexing unit 928.
  • the MPEG-DASH processing unit 935 may include the MPD acquisition unit 151 to the tile image combination unit 156 (each processing unit other than the display unit 157 in FIG. 13 ) of the terminal device 103 ( FIG. 11 ).
  • the MPEG-DASH processing unit 935 analyzes the MPD, acquires the MP4 file including the data of the tiles included in a desired range via the communication unit 922 or the like, extracts and decodes the encoded data of the tiles, appropriately combines the obtained image data (tile images) of the tiles to generate image data in the desired range, and supplies the image data to the image processing unit 927, the recording and reproduction unit 929, the display unit 930, and the like.
  • the MPEG-DASH processing unit 935 can process the various MP4 files described in the embodiments to obtain desired image data. That is, the mobile telephone 920 can realize the adaptive supply of the data of the partial images.
  • the present technology is not limited thereto, and can be implemented as any configuration mounted in the devices or devices constituting the systems, for example, processors in the form of system large scale integration (LSI), modules that use a plurality of processors, units that use a plurality of modules, sets obtained by further adding other functions to the units (i.e., a partial configuration of the devices), and the like.
  • LSI system large scale integration
  • FIG. 69 illustrates an example of a schematic configuration of a video set to which the present disclosure is applied.
  • the video set 1300 illustrated in FIG. 69 is configured to be multifunctional as described above by combining devices having functions of encoding and decoding (which may have either or both of the functions) of images with devices having other functions relating to the foregoing functions.
  • the video set 1300 has a module group including a video module 1311, an external memory 1312, a power management module 1313, a frontend module 1314 and the like, and devices having relevant functions such as connectivity 1321, a camera 1322, a sensor 1323, and the like.
  • a module is a form of a component in which several related componential functions are gathered to provide a cohesive function.
  • a specific physical configuration is arbitrary; however, it is considered to be an integration in which, for example, a plurality of processors each having functions, electronic circuit elements such as a resistor and a capacitor, and other devices are disposed on a circuit board.
  • making a new module by combining a module with another module, a processor, or the like is also considered.
  • the video module 1311 is a combination of configurations with functions relating to image processing, and has an application processor, a video processor, a broadband modem 1333, and an RF module 1334.
  • a processor is a semiconductor chip integrated with a configuration having predetermined functions using System-On-Chip (SoC), and is also referred to as, for example, system large scale integration (LSI), or the like.
  • the configuration having a predetermined function may be a logic circuit (hardware configuration), may be, along with CPU, a ROM, and a RAM, a program that is executed by using the elements (software configuration), or may be a combination of both configurations.
  • a processor may have a logic circuit, a CPU, a ROM, a RAM, and the like and may realize some functions with the logic circuit (hardware configuration), or may realize the other functions with a program executed by the CPU (software configuration).
  • the application processor 1331 of FIG. 69 is a processor that executes an application relating to image processing.
  • the application executed by the application processor 1331 can not only perform an arithmetic process but can also control a configuration internal and external to the video module 1311, for example, the video processor 1332 when necessary in order to realize predetermined functions.
  • the video processor 1332 is a processor having a function relating to (one or both of) encoding and decoding of images.
  • the broadband modem 1333 is a processor (or a module) which performs a process relating to wired or wireless (or both) broadband communication performed through a broadband line such as the Internet or a public telephone line network.
  • the broadband modem 1333 converts data (a digital signal) to be transmitted into an analog signal by performing digital modulation or the like, or converts a received analog signal into data (a digital signal) by performing demodulation.
  • the broadband modem 1333 can digitally modulate/demodulate arbitrary information such as image data to be processed by the video processor 1332, a stream obtained by encoding image data, an application program, or setting data.
  • the RF module 1334 is a module which performs frequency conversion, modulation and demodulation, amplification, a filtering process, and the like on a radio frequency (RF) signal transmitted and received via an antenna.
  • RF radio frequency
  • the RF module 1334 generates an RF signal by performing frequency conversion and the like on a baseband signal generated by the broadband modem 1333.
  • the RF module 1334 for example, generates a baseband signal by performing frequency conversion and the like on an RF signal received via the frontend module 1314.
  • the application processor 1331 and the video processor 1332 may be integrated to constitute one processor.
  • the external memory 1312 is a module that is provided outside the video module 1311, having a storage device used by the video module 1311.
  • the storage device of the external memory 1312 may be realized with any physical configuration, but is generally used when large amounts of data such as image data in units of frames are stored, and thus it is desirable to realize the storage device with a relatively inexpensive and high-capacity semiconductor memory, for example, a dynamic random access memory (DRAM).
  • DRAM dynamic random access memory
  • the power management module 1313 manages and controls power supply to the video module 1311 (each constituent element inside the video module 1311).
  • the frontend module 1314 is a module which provides the RF module 1334 with a frontend function (serving as a circuit of a transmitting and receiving end on an antenna side).
  • the frontend module 1314 has, for example, an antenna unit 1351, a filter 1352, and an amplifying unit 1353 as illustrated in FIG. 38 .
  • the antenna unit 1351 is configured with an antenna which transmits and receives wireless signals and peripherals thereof.
  • the antenna unit 1351 transmits a signal supplied from the amplifying unit 1353 as a radio signal and supplies a received radio signal to the filter 1352 as an electric signal (RF signal).
  • the filter 1352 performs a filtering process or the like on the RF signal received via the antenna unit 1351 and supplies the processed RF signal to the RF module 1334.
  • the amplifying unit 1353 amplifies an RF signal supplied from the RF module 1334, and supplies the signal to the antenna unit 1351.
  • the connectivity 1321 is a module having a function relating to connection to the outside.
  • a physical configuration of the connectivity 1321 is arbitrary.
  • the connectivity 1321 has, for example, a configuration with a communication function other than that of a communication standard to which the broadband modem 1333 corresponds, an external input and output terminal, or the like.
  • the connectivity 1321 may have a communicating function that is based on a wireless communication standard such as Bluetooth (a registered trademark), IEEE 802.11 (for example, Wireless Fidelity (Wi-Fi; a registered trademark), near field communication (NFC), or Infrared Data Association (IrDA), an antenna which transmits and receives signals based on the standard, or the like.
  • the connectivity 1321 may have, for example, a module having a communicating function based on a wired communication standard such as Universal Serial Bus (USB), or High-Definition Multimedia Interface (HDMI; a registered trademark), or a terminal based on the standard.
  • the connectivity 1321 may have, for example, another data (signal) transmitting function of an analog input and output terminal or the like.
  • the connectivity 1321 may be set to include a device serving as a data (signal) transmission destination.
  • the connectivity 1321 may be set to have a drive (including a drive not only of a removable medium but also of a hard disk, a solid-state drive (SSD), a network-attached storage (NAS), or the like) which reads and writes data with respect to a recording medium such as a magnetic disk, an optical disc, a magneto-optical disc, or a semiconductor memory.
  • the connectivity 1321 may be set to have an image or audio output device (a monitor, a speaker, or the like).
  • the camera 1322 is a module having a function of capturing a subject and obtaining image data of the subject.
  • Image data obtained from capturing by the camera 1322 is, for example, supplied to and encoded by the video processor 1332.
  • the sensor 1323 is a module having arbitrary sensing functions of, for example, a sound sensor, an ultrasound sensor, a light sensor, an illuminance sensor, an infrared sensor, an image sensor, a rotation sensor, an angle sensor, an angular velocity sensor, a speed sensor, an acceleration sensor, an inclination sensor, a magnetic identification sensor, a shock sensor, a temperature sensor, and the like.
  • Data detected by the sensor 1323 is, for example, supplied to the application processor 1331 and used by an application or the like.
  • the present technology can be applied to the video processor 1332 as will be described below.
  • the video set 1300 can be implemented as a set to which the present technology is applied.
  • the video processor 1332 may perform a process related to the MP4 file or a process related to generation or reproduction of the delivery data or the control information delivered in the method which is based on the MPEG-DASH standard. The details of the video processor 1332 will be described below.
  • the application processor 1331 may execute an application to perform a process related to the MP4 file or the process related to generation or reproduction of the delivery data or the control information delivered in the method which is based on the MPEG-DASH standard. As a process of the application processor 1331, the method of each embodiment described above may be applied.
  • the application processor 1331 may execute an application to have the functions of the screen division processing unit 121 to the server upload processing unit 126 (including the tile type MPD generation unit 141 in FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the application processor 1331 divides and encodes an image for each tile, generates the MP4 files in which the data of each tile is stored, and uploads the MP4 files to the delivery server 102.
  • the application processor 1331 can also generate the MPD managing the generated MP4 file and upload them to the delivery server 102. In this way, the application processor 1331 can generate the various MPDs or MP4 files described above in the embodiments. That is, the video set 1300 can realize the adaptive supply of the data of the partial images.
  • the application processor 1331 may execute an application to have the functions of the MPD acquisition unit 151 to the tile image combination unit 156 (each processing unit other than the display unit 157 in FIG. 13 ) of the terminal device 103 ( FIG. 11 ).
  • the application processor 1331 can acquire the MP4 file including the data of the tiles included in a desired range, extract and decode the encoded data of the tiles, and appropriately combine the obtained image data (tile images) of the tiles to generate image data in the desired range.
  • the application processor 1331 can also acquire the MPD, analyze the acquired MPD, acquire the MP4 file including the data of the tiles included in a desired range based on the analysis result, extract and decode the encoded data of the tiles, and appropriately combine the obtained image data (tile images) of the tiles to generate image data in the desired range.
  • the application processor 1331 can process the various MPDs or the MP4 files described above in the embodiments to obtain the desired image data. That is, the video set 1300 can realize the adaptive supply of the data of the partial images.
  • FIG. 70 illustrates an example of a schematic configuration of the video processor 1332 (of FIG. 69 ) to which the present disclosure is applied.
  • the video processor 1332 has a video input processing unit 1401, a first image enlarging and reducing unit 1402, a second image enlarging and reducing unit 1403, a video output processing unit 1404, a frame memory 1405, and a memory control unit 1406.
  • the video processor 1332 has an encoding/decoding engine 1407, video elementary stream (ES) buffers 1408A and 1408B, and audio ES buffers 1409A and 1409B.
  • the video processor 1332 has an audio encoder 1410, an audio decoder 1411, a multiplexer (MUX) 1412, a demultiplexer (DMUX) 1413, and a stream buffer 1414.
  • the video processor 1332 includes an MP4 processing unit 1415 and an MPEG-DASH processing unit 1416.
  • the video input processing unit 1401 acquires a video signal input from, for example, the connectivity 1321, and converts the signal into digital image data.
  • the first image enlarging and reducing unit 1402 performs format conversion, an image enlarging or reducing process or the like on image data.
  • the second image enlarging and reducing unit 1403 performs an image enlarging or reducing process on the image data according to the format of a destination to which the data is output via the video output processing unit 1404, or performs format conversion, an image enlarging or reducing process or the like in the same manner as the first image enlarging and reducing unit 1402.
  • the video output processing unit 1404 performs format conversion, conversion into an analog signal, or the like on image data, and outputs the data to, for example, the connectivity 1321 as a reproduced video signal.
  • the frame memory 1405 is a memory for image data shared by the video input processing unit 1401, the first image enlarging and reducing unit 1402, the second image enlarging and reducing unit 1403, the video output processing unit 1404, and the encoding/decoding engine 1407.
  • the frame memory 1405 is realized as a semiconductor memory, for example, a DRAM, or the like.
  • the memory control unit 1406 receives a synchronization signal from the encoding/decoding engine 1407 and controls access to the frame memory 1405 for writing and reading according to an access schedule to the frame memory 1405 which is written in an access management table 1406A.
  • the access management table 1406A is updated by the memory control unit 1406 according to processes executed in the encoding/decoding engine 1407, the first image enlarging and reducing unit 1402, the second image enlarging and reducing unit 1403, and the like.
  • the encoding/decoding engine 1407 performs an encoding process of image data and a decoding process of a video stream that is data obtained by encoding image data. For example, the encoding/decoding engine 1407 encodes image data read from the frame memory 1405, and sequentially writes the data in the video ES buffer 1408A as video streams. In addition, for example, the encoding/decoding engine 1407 sequentially reads video streams from the video ES buffer 1408B, and sequentially writes the data in the frame memory 1405 as image data. The encoding/decoding engine 1407 uses the frame memory 1405 as a work area for such encoding and decoding.
  • the encoding/decoding engine 1407 outputs a synchronization signal to the memory control unit 1406 at a timing at which, for example, a process on each micro block is started. Further, the encoding/decoding engine 1407 performs encoding of the image data or decoding of the encoded data obtained by encoding the image data using the MP4 processing unit 1415 or the MPEG-DASH processing unit 1416, as necessary.
  • the video ES buffer 1408A buffers a video stream generated by the encoding/decoding engine 1407 and supplies the stream to the multiplexer (MUX) 1412.
  • the video ES buffer 1408B buffers a video stream supplied from the demultiplexer (DMUX) 1413 and supplies the stream to the encoding/decoding engine 1407.
  • the audio ES buffer 1409A buffers an audio stream generated by an audio encoder 1410 and supplies the stream to the multiplexer (MUX) 1412.
  • the audio ES buffer 1409B buffers an audio stream supplied from the demultiplexer (DMUX) 1413 and supplies the stream to an audio decoder 1411.
  • the audio encoder 1410 digitally converts an audio signal input from, for example, the connectivity 1321 or the like, and encodes the signal in a predetermined scheme, for example, an MPEG audio scheme, an AudioCode number 3 (AC3) scheme, or the like.
  • the audio encoder 1410 sequentially writes audio streams that are data obtained by encoding audio signals in the audio ES buffer 1409A.
  • the audio decoder 1411 decodes an audio stream supplied from the audio ES buffer 1409B, performs conversion into an analog signal, for example, and supplies the signal to, for example, the connectivity 1321 or the like as a reproduced audio signal.
  • the multiplexer (MUX) 1412 multiplexes a video stream and an audio stream.
  • a method for this multiplexing i.e., a format of a bit stream generated from multiplexing
  • the multiplexer (MUX) 1412 can also add predetermined header information or the like to a bit stream. That is to say, the multiplexer (MUX) 1412 can convert the format of a stream through multiplexing.
  • the multiplexer (MUX) 1412 converts the streams into a transport stream that is a bit stream of a format for transport.
  • the multiplexer (MUX) 1412 converts the streams into data of a file format for recording (file data).
  • the demultiplexer (DMUX) 1413 demultiplexes a bit stream obtained by multiplexing a video stream and an audio stream using a method which corresponds to the multiplexing performed by the multiplexer (MUX) 1412. That is to say, the demultiplexer (DMUX) 1413 extracts a video stream and an audio stream from a bit stream read from the stream buffer 1414 (separates the bit stream into the video stream and the audio stream).
  • the demultiplexer (DMUX) 1413 can convert the format of a stream through demultiplexing (inverse conversion to conversion by the multiplexer (MUX) 1412).
  • the demultiplexer (DMUX) 1413 can acquire a transport stream supplied from, for example, the connectivity 1321, the broadband modem 1333, or the like via the stream buffer 1414, and convert the stream into a video stream and an audio stream through demultiplexing.
  • the demultiplexer (DMUX) 1413 can acquire file data read from various recording media by, for example, the connectivity 1321 via the stream buffer 1414, and convert the data into a video stream and an audio stream through demultiplexing.
  • the stream buffer 1414 buffers bit streams.
  • the stream buffer 1414 buffers a transport stream supplied from the multiplexer (MUX) 1412, and supplies the stream to, for example, the connectivity 1321, the broadband modem 1333, or the like at a predetermined timing or based on a request from outside or the like.
  • MUX multiplexer
  • the stream buffer 1414 buffers file data supplied from the multiplexer (MUX) 1412, and supplies the data to, for example, the connectivity 1321 or the like at a predetermined timing or based on a request from outside or the like to cause the data to be recorded on any of various kinds of recording media.
  • MUX multiplexer
  • the stream buffer 1414 buffers a transport stream acquired via, for example, the connectivity 1321, the broadband modem 1333, or the like, and supplies the stream to the demultiplexer (DMUX) 1413 at a predetermined timing or based on a request from outside or the like.
  • DMUX demultiplexer
  • the stream buffer 1414 buffers file data read from any of various kinds of recording media via, for example, the connectivity 1321 or the like, and supplies the data to the demultiplexer (DMUX) 1413 at a predetermined timing or based on a request from outside or the like.
  • DMUX demultiplexer
  • the MP4 processing unit 1415 performs a process related to the MP4 file, such as generation or reproduction of the MP4 file.
  • the MPEG-DASH processing unit 1416 performs a process related to generation or reproduction of the delivery data delivered in a method which is based on the MPEG-DASH standard or the control information, such as generation or reproduction of the MPD or the MP4 file.
  • a video signal input to the video processor 1332 from the connectivity 1321 or the like is converted into digital image data in a predetermined format such as a YCbCr format of 4:2:2 of in the video input processing unit 1401, and sequentially written in the frame memory 1405.
  • This digital image data is read by the first image enlarging and reducing unit 1402 or the second image enlarging and reducing unit 1403, undergoes format conversion and an enlarging or reducing process in a predetermined format such as a YCbCr format of 4:2:0, and then is written in the frame memory 1405 again.
  • This image data is encoded by the encoding/decoding engine 1407, and written in the video ES buffer 1408A as a video stream.
  • an audio signal input to the video processor 1332 from the connectivity 1321 is encoded by the audio encoder 1410, and then written in the audio ES buffer 1409A as an audio stream.
  • the video stream of the video ES buffer 1408A and the audio stream of the audio ES buffer 1409A are read and multiplexed by the multiplexer (MUX) 1412 to be converted into a transport stream, file data, or the like.
  • the transport stream generated by the multiplexer (MUX) 1412 is buffered in the stream buffer 1414, and then output to an external network via, for example, the connectivity 1321, the broadband modem 1333, or the like.
  • the file data generated by the multiplexer (MUX) 1412 is buffered in the stream buffer 1414, and output to, for example, the connectivity 1321 (of FIG. 29 ) to be recorded in any of various kinds of recording media.
  • a transport stream input to the video processor 1332 from an external network via, for example, the connectivity 1321, the broadband modem 1333, or the like is buffered in the stream buffer 1414, and then demultiplexed by the demultiplexer (DMUX) 1413.
  • DMUX demultiplexer
  • file data read from any of various kinds of recording media via the connectivity 1321 and input to the video processor 1332 is buffered in the stream buffer 1414, and then demultiplexed by the demultiplexer (DMUX) 1413. That is to say, the transport stream or the file data input to the video processor 1332 is separated into a video stream and an audio stream by the demultiplexer (DMUX) 1413.
  • the audio stream is supplied to the audio decoder 1411 via the audio ES buffer 1409B to be decoded, and an audio signal is reproduced.
  • the video stream is written in the video ES buffer 1408B, then sequentially read by the encoding/decoding engine 1407 to be decoded, and written in the frame memory 1405.
  • the decoded image data undergoes an enlarging and reducing process by the second image enlarging and reducing unit 1403, and is written in the frame memory 1405.
  • the decoded image data is read by the video output processing unit 1404, undergoes format conversion in a predetermined format such as the YCbCr format of 4:2:2, and is further converted into an analog signal, and a video signal is reproduced to be output.
  • a predetermined format such as the YCbCr format of 4:2:2
  • the MP4 processing unit 1415 acquires the image data stored in, for example, the frame memory 1405 via the encoding/decoding engine 1407, encodes the image data to generate the encoded data, and further generates the MP4 file in which the encoded data is stored.
  • the MP4 processing unit 1415 supplies the generated MP4 file to the encoding/decoding engine 1407.
  • the encoding/decoding engine 1407 outputs the supplied MP4 file to the outside of the video processor 1332 via, for example, the video ES buffer 1408A, the multiplexing unit (MUX) 1412, the stream buffer 1414, and the like and outputs the MP4 file to an external network via the connectivity 1321, the broadband modem 1333, or the like.
  • MUX multiplexing unit
  • the MP4 processing unit 1415 acquires, via the encoding/decoding engine 1407, the MP4 file acquired from an external network via the connectivity 1321, the broadband modem 1333, or the like and stored in the video ES buffer 1408B, analyzes the MP4 file to extract the encoded data, and further decodes the encoded data.
  • the MP4 processing unit 1415 supplies the obtained image data to the encoding/decoding engine 1407.
  • the encoding/decoding engine 1407 supplies the supplied image data to the video output processing unit 1404 via the frame memory 1405 and outputs the image data as a video signal to the outside of the video processor 1332.
  • the MP4 processing unit 1415 may include the screen division processing unit 121, the image encoding unit 122, the file generation unit 123, and the server upload processing unit 126 ( FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the MP4 processing unit 1415 divides and encodes an image for each tile, generates the MP4 files in which the data of each tile is stored, and uploads the MP4 files to the delivery server 102 via the connectivity 1321 or the like. In this way, the MP4 processing unit 1415 can generate the various MP4 files described above in the embodiments.
  • the MP4 processing unit 1415 may include the file acquisition unit 154, the image decoding unit 155, the tile image combination unit 156 ( FIG. 13 ) of the terminal device 103 ( FIG. 11 ). In this case, the MP4 processing unit 1415 downloads the MP4 file including the data of the tiles included in a desired range from the delivery server 102 via the connectivity 1321 or the like, extracts and decodes the encoded data of the tiles from the MP4 file, appropriately combines the obtained image data (tile images) of the tiles to generate image data in the desired range, and outputs the image data as a video signal to the outside of the video processor 1332. In this way, the MP4 processing unit 1415 can process the various MP4 files described above in the embodiments to obtain desired image data.
  • the video processor 1332 (that is, the video set 1300) can realize the adaptive supply of the data of the partial images.
  • the MPEG-DASH processing unit 1416 acquires the image data stored in the frame memory 1405 via the encoding/decoding engine 1407, generates the MPD managing the image data, and supplies the MPD file to the encoding/decoding engine 1407.
  • the encoding/decoding engine 1407 outputs the supplied MPD file to the outside of the video processor 1332 via the video ES buffer 1408A, the multiplexing unit (MUX) 1412, the stream buffer 1414, and the like and outputs the MPD file to an external network via the connectivity 1321, the broadband modem 1333, or the like.
  • MUX multiplexing unit
  • the MPEG-DASH processing unit 1416 may encode the image data to generate the MP4 file in which the encoded data is stored and to generate the MPD managing the MP4 file and output the MPD file to an external network.
  • the MPEG-DASH processing unit 1416 may output the MP4 file along with the MPD file to an external network.
  • the MPEG-DASH processing unit 1416 acquires, via the encoding/decoding engine 1407, the MPD file acquired from an external network via the connectivity 1321, the broadband modem 1333, or the like and stored in the video ES buffer 1408B, analyzes the MPD file, and acquires desired image data based on the MPD. For example, when the MP4 file including the encoded data obtained by encoding the image data is managed by the MPD, the MPEG-DASH processing unit 1416 acquires the MP4 file corresponding to a desired image based on the MPD from an external network, decodes the encoded data included in the MP4 file, and supplies the image data obtained through the decoding to the encoding/decoding engine 1407. The encoding/decoding engine 1407 supplies the supplied image data to the video output processing unit 1404 via the frame memory 1405 and outputs the image data as a video signal to the outside of the video processor 1332.
  • the MPEG-DASH processing unit 1416 may include the screen division processing unit 121 to the server upload processing unit 126 (including the tile type MPD generation unit 141 in FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the MPEG-DASH processing unit 1416 divides and encodes an image for each tile, generates the MP4 files in which the data of each tile is stored, generates the MPDs managing the MP4 file, and uploads them to the delivery server 102 via the connectivity 1321 or the like. In this way, the MPEG-DASH processing unit 1416 can generate the various MPDs described in the embodiments.
  • the MPEG-DASH processing unit 1416 may include the MPD acquisition unit 151 to the tile image combination unit 156 (each processing unit other than the display unit 157 in FIG. 13 ) of the terminal device 103 ( FIG. 11 ).
  • the MPEG-DASH processing unit 1416 analyzes the MPD, downloads the MP4 file including the data of the tiles included in a desired range from the delivery server 102 via the connectivity 1321 or the like, extracts and decodes the encoded data of the tiles from the MP4 file, appropriately combines the obtained image data (tile images) of the tiles to generate image data in the desired range, and outputs the image data as a video signal to the outside of the video processor 1332.
  • the MPEG-DASH processing unit 1416 can process the various MPDs described above in the embodiments to obtain desired image data.
  • the video processor 1332 (that is, the video set 1300) can realize the adaptive supply of the data of the partial images.
  • the present technology (the function of the delivery data generation device 101 or the terminal device 103 described above) may be realized by hardware such as a logic circuit, may be realized by software such as an embedded program, or may be realized by both.
  • FIG. 71 illustrates another example of a schematic configuration of the video processor 1332 to which the present disclosure is applied.
  • the video processor 1332 has functions of encoding and decoding video data in a predetermined scheme.
  • the video processor 1332 includes a control unit 1511, a display interface 1512, a display engine 1513, an image processing engine 1514, and an internal memory 1515.
  • the video processor 1332 includes a codec engine 1516, a memory interface 1517, a multiplexing and demultiplexing unit (MUX DMUX) 1518, a network interface 1519, and a video interface 1520.
  • MUX DMUX multiplexing and demultiplexing unit
  • the control unit 1511 controls an operation of each processing unit in the video processor 1332, such as the display interface 1512, the display engine 1513, the image processing engine 1514, and the codec engine 1516.
  • the control unit 1511 includes a main CPU 1531, a sub-CPU 1532, and a system controller 1533.
  • the main CPU 1531 executes a program or the like to control an operation of each processing unit in the video processor 1332.
  • the main CPU 1531 generates a control signal according to the program or the like and supplies the control signal to each processing unit (that is, controls the operation of each processing unit).
  • the sub-CPU 1532 serves as an auxiliary role of the main CPU 1531.
  • the sub-CPU 1532 executes an offspring process or a sub-routine of a program or the like executed by the main CPU 1531.
  • the system controller 1533 controls operations of the main CPU 1531 and the sub-CPU 1532, for example, designates programs executed by the main CPU 1531 and the sub-CPU 1532.
  • the display interface 1512 outputs the image data to, for example, the connectivity 1321 under the control of the control unit 1511.
  • the display interface 1512 converts the image data of digital data into an analog signal and outputs the image data as the reproduced video signal or the image data of the digital data to a monitor device or the like of the connectivity 1321.
  • the display engine 1513 performs various conversion processes such as format conversion, size conversion, and color gamut conversion on the image data to match a hardware specification of the monitor device or the like displaying the image under the control of the control unit 1511.
  • the image processing engine 1514 performs predetermined image processing such as filter processing on the image data, for example, to improve image quality under the control of the control unit 1511.
  • the internal memory 1515 is a memory shared by the display engine 1513, the image processing engine 1514, and the codec engine 1516 and provided inside the video processor 1332.
  • the internal memory 1515 is used to transmit and receive data among the display engine 1513, the image processing engine 1514, and the codec engine 1516.
  • the internal memory 1515 stores data supplied from the display engine 1513, the image processing engine 1514, or the codec engine 1516 and supplies the data to the display engine 1513, the image processing engine 1514, or the codec engine 1516, as necessary (for example, according to a request).
  • the internal memory 1515 may be realized by any storage device, but the internal memory 1515 is generally used to store data with a small capacity such as parameters or image data in units of blocks in many cases. Therefore, the internal memory 1515 is preferably realized by, for example, a semiconductor memory with a relatively small capacity (compared to, for example, the external memory 1312) and a fast response speed, such as a static random access memory (SRAM).
  • SRAM static
  • the codec engine 1516 performs a process related to encoding or decoding of the image data. Any encoding and decoding schemes to which the codec engine 1516 corresponds can be used, and the number of schemes may be singular or plural.
  • the codec engine 1516 may include codec functions of a plurality of encoding and decoding schemes, and may encode the image data using the codec function selected therefrom and decode the encoded data.
  • the codec engine 1516 includes, for example, an MPEG-2 video 1541, an AVC/H.264 1542, an HEVC/H.265 1543, an HEVC/H.265 (scalable) 1544, and an HEVC/H.265 (multi-view) 1545 and includes an MPEG-DASH 1551 and an MP4 processing unit 1552.
  • the MPEG-2 video 1541 is a functional block that encodes or decodes the image data in an MPEG-2 scheme.
  • the AVC/H.264 1542 is a functional block that encodes or decodes the image data in an AVC scheme.
  • the HEVC/H.265 1543 is a functional block that encodes or decodes the image data in an HEVC scheme.
  • the HEVC/H.265 (scalable) 1544 is a functional block that performs scalable encoding or scalable decoding on the image data in an HEVC scheme.
  • the HEVC/H.265 (multi-view) 1545 is a functional block that performs multi-view encoding or multi-view decoding on the image data in an HEVC scheme.
  • the MPEG-DASH 1551 performs processes related to generation or reproduction of the delivery data or the control information delivered in a method which is based on the MPEG-DASH standard, such as generation or reproduction of the MPD or the MP4 file.
  • the MP4 processing unit 1552 performs a process related to the MP4 file, such as generation or reproduction of the MP4 file.
  • the MPEG-DASH 1551 and the MP4 processing unit 1552 use the MPEG-2 video 1541 to the HEVC/H.265 (multi-view) 1545 described above.
  • the memory interface 1517 is an interface for the external memory 1312.
  • the data supplied from the image processing engine 1514 or the codec engine 1516 is supplied to the external memory 1312 via the memory interface 1517.
  • the data read from the external memory 1312 is supplied to the video processor 1332 (the image processing engine 1514 or the codec engine 1516) via the memory interface 1517.
  • the multiplexing and demultiplexing unit (MUX DMUX) 1518 multiplexes or demultiplexes various kinds of data related to images such as image data, video signals, bit streams of encoded data. Any multiplexing and demultiplexing methods can be used. For example, at the time of multiplexing, the multiplexing and demultiplexing unit (MUX DMUX) 1518 can collect a plurality of pieces of data into one piece of data and can also add predetermined header information or the like to the data. At the time of demultiplexing, the multiplexing and demultiplexing unit (MUX DMUX) 1518 divides one piece of data into a plurality of pieces of data and can also add predetermined header information or the like to each of the pieces of divided data.
  • the multiplexing and demultiplexing unit (MUX DMUX) 1518 can convert the format of the data through the multiplexing and the demultiplexing.
  • the multiplexing and demultiplexing unit (MUX DMUX) 1518 can convert data into a transport stream which is a bit stream with a transmission format or data (file data) with a file format for recording by multiplexing the bit stream.
  • the reverse conversion can also be performed through the demultiplexing.
  • the network interface 1519 is, for example, an interface for the broadband modem 1333, the connectivity 1321, or the like.
  • the video interface 1520 is, for example, an interface for the connectivity 1321, the camera 1322, or the like.
  • the transport stream is supplied to the multiplexing and demultiplexing unit (MUX DMUX) 1518 via the network interface 1519 to be demultiplexed, and then is decoded by the codec engine 1516.
  • MUX DMUX multiplexing and demultiplexing unit
  • the image data obtained through the decoding of the codec engine 1516 is subjected to predetermined image processing by the image processing engine 1514, is subjected to predetermined conversion by the display engine 1513, and is supplied to, for example, the connectivity 1321 via the display interface 1512, and then the image is displayed on a monitor.
  • the image data obtained through the decoding of the codec engine 1516 is re-encoded by the codec engine 1516, is multiplexed by the multiplexing and demultiplexing unit (MUX DMUX) 1518 to be converted into file data, is output to, for example, the connectivity 1321 via the video interface 1520, and is recorded in various recording media.
  • MUX DMUX multiplexing and demultiplexing unit
  • the file data of the encoded data read from a recording medium (not illustrated) by the connectivity 1321 or the like and obtained by encoding the image data is supplied to the multiplexing and demultiplexing unit (MUX DMUX) 1518 via the video interface 1520 to be demultiplexed, and then is decoded by the codec engine 1516.
  • the image data obtained through the decoding of the codec engine 1516 is subjected to predetermined image processing by the image processing engine 1514, is subjected to predetermined conversion by the display engine 1513, and is supplied to, for example, the connectivity 1321 via the display interface 1512, and then the image is displayed on a monitor.
  • the image data obtained through the decoding of the codec engine 1516 is re-encoded by the codec engine 1516, is multiplexed by the multiplexing and demultiplexing unit (MUX DMUX) 1518 to be converted into a transport stream, is supplied to, for example, the connectivity 1321 or the broadband modem 1333 via the network interface 1519, and is transmitted to another device (not illustrated).
  • MUX DMUX multiplexing and demultiplexing unit
  • Transmission and reception of the image data or other data between the processing units in the video processor 1332 are performed using, for example, the internal memory 1515 or the external memory 1312.
  • the power management module 1313 controls power supply to, for example, the control unit 1511.
  • the MP4 processing unit 1552 of the codec engine 1516 acquires the image data read from, for example, the external memory 1312, encodes the image data using any of the MPEG-2 video 1541 to the HEVC/H.265 (multi-view) 1545 to generate the encoded data, and further generates the MP4 file in which the encoded data is stored.
  • the MP4 processing unit 1552 supplies the generated MP4 file to the external memory 1312 via, for example, the memory interface 1517 to store the MP4 file.
  • the MP4 file is read by the memory interface 1517, is output to the outside of the video processor 1332 via the multiplexing and demultiplexing unit (MUX DMUX) 1518 or the network interface 1519, and is output to an external network via the connectivity 1321, the broadband modem 1333, or the like.
  • MUX DMUX multiplexing and demultiplexing unit
  • the MP4 processing unit 1552 acquires, via the memory interface 1517, the MP4 file acquired from an external network via the connectivity 1321, the broadband modem 1333, or the like, supplied to the external memory 1312 via the network interface 1519, the multiplexing and demultiplexing unit (MUX DMUX) 1518, the memory interface 1517, and the like, and stored.
  • the MP4 processing unit 1552 analyzes the acquired MP4 file, extracts the encoded data, and further decodes the encoded data using any of the MPEG-2 video 1541 to the HEVC/H.265 (multi-view) 1545.
  • the MP4 processing unit 1552 supplies the obtained image data to the external memory 1312 via, for example, the memory interface 1517 to store the image data.
  • the image data is read by the memory interface 1517 and is supplied to, for example, the connectivity 1321 via the image processing engine 1514, the display engine 1513, the display interface 1512, and the like, so that the image is displayed on a monitor.
  • the MP4 processing unit 1552 may include the screen division processing unit 121, the image encoding unit 122, the file generation unit 123, and the server upload processing unit 126 ( FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the MP4 processing unit 1552 divides and encodes an image for each tile, generates the MP4 files in which the data of each tile is stored, and uploads the MP4 files to the delivery server 102 via the connectivity 1321 or the like. In this way, the MP4 processing unit 1552can generate the various MP4 files described above in the embodiments.
  • the MP4 processing unit 1552 may include the file acquisition unit 154, the image decoding unit 155, the tile image combination unit 156 ( FIG. 13 ) of the terminal device 103 ( FIG. 11 ). In this case, the MP4 processing unit 1552downloads the MP4 file including the data of the tiles included in a desired range from the delivery server 102 via the connectivity 1321 or the like, extracts and decodes the encoded data of the tiles from the MP4 file, appropriately combines the obtained image data (tile images) of the tiles to generate image data in the desired range, and outputs the image data as a video signal to the outside of the video processor 1332. In this way, the MP4 processing unit 1552can process the various MP4 files described above in the embodiments to obtain desired image data.
  • the video processor 1332 (that is, the video set 1300) can realize the adaptive supply of the data of the partial images.
  • the MPEG-DASH 1551 acquires the image data read from, for example, the external memory 1312 and generates the MPD managing the image data.
  • the MPEG-DASH 1551 supplies the generated MPD file to the external memory 1312 via, for example, the memory interface 1517 to store the MPD file.
  • the MP4 file is read by the memory interface 1517, is output to the outside of the video processor 1332 via the multiplexing and demultiplexing unit (MUX DMUX) 1518 or the network interface 1519, and is output to an external network via the connectivity 1321, the broadband modem 1333, or the like.
  • MUX DMUX multiplexing and demultiplexing unit
  • the MPEG-DASH 1551 may encode the image data to generate the MP4 file in which the encoded data is stored and to generate the MPD managing the MP4 file and output the MPD file to an external network.
  • the MPEG-DASH 1551 may output the MP4 file along with the MPD file to an external network.
  • the MPEG-DASH 1551 acquires, via the memory interface 1517, the MPD file acquired from an external network via the connectivity 1321, the broadband modem 1333, or the like, supplied to the external memory 1312 via the network interface 1519, the multiplexing and demultiplexing unit (MUX DMUX) 1518, the memory interface 1517, and the like, and stored.
  • the MPEG-DASH 1551 analyzes the acquired MPD and acquires desired image data based on the MPD.
  • the MPEG-DASH 1551 acquires the MP4 file corresponding to a desired image from an external network based on the MPD, extracts the encoded data included in the MP4 file, further decodes the encoded data using any of the MPEG-2 video 1541 to the HEVC/H.265 (multi-view) 1545.
  • the MPEG-DASH 1551 supplies the obtained image data to the external memory via, for example, the memory interface 1517 to store the image data.
  • the image data is read by the memory interface 1517 and is supplied to, for example, the connectivity 1321 via the image processing engine 1514, the display engine 1513, the display interface 1512, and the like, so that the image is displayed on a monitor.
  • the MPEG-DASH 1551 may include the screen division processing unit 121 to the server upload processing unit 126 (including the tile type MPD generation unit 141 in FIG. 12 ) of the delivery data generation device 101 ( FIG. 11 ).
  • the MPEG-DASH 1551 divides and encodes an image for each tile, generates the MP4 files in which the data of each tile is stored, generates the MPDs managing the MP4 file, and uploads them to the delivery server 102 via the connectivity 1321 or the like. In this way, the MPEG-DASH 1551 can generate the various MPDs described in the embodiments.
  • the MPEG-DASH 1551 may include the MPD acquisition unit 151 to the tile image combination unit 156 (each processing unit other than the display unit 157 in FIG. 13 ) of the terminal device 103 ( FIG. 11 ).
  • the MPEG-DASH 1551 analyzes the MPD, downloads the MP4 file including the data of the tiles included in a desired range from the delivery server 102 via the connectivity 1321 or the like, extracts and decodes the encoded data of the tiles from the MP4 file, appropriately combines the obtained image data (tile images) of the tiles to generate image data in the desired range, and outputs the image data as a video signal to the outside of the video processor 1332.
  • the MPEG-DASH 1551 can process the various MPDs described above in the embodiments to obtain desired image data.
  • the video processor 1332 (that is, the video set 1300) can realize the adaptive supply of the data of the partial images.
  • the present technology (the function of the delivery data generation device 101 or the terminal device 103 described above) may be realized by hardware such as a logic circuit, may be realized by software such as an embedded program, or may be realized by both.
  • the two configurations of the video processor 1332 have been exemplified, but the configuration of the video processor 1332 is arbitrary and may be a configuration other than the two configurations described above.
  • the video processor 1332 may be configured as a single semiconductor chip or may be configured as a plurality of semiconductor chips. For example, a 3-dimensional laminated LSI in which a plurality of semiconductors are laminated may be used.
  • the video processor 1332 may be realized by a plurality of LSIs.
  • the video set 1300 can be embedded in various devices that process image data.
  • the video set 1300 can be embedded in the television device 900 ( FIG. 67 ) or the mobile telephone 920 ( FIG. 68 ).
  • the device can obtain the same advantages as the advantages described with reference to FIGS. 1 to 66 .
  • a part of each configuration of the above-described video set 1300 can also be implemented as a configuration to which the present technology is applied, as long as the part of the configuration includes the video processor 1332.
  • the video processor 1332 can be implemented as a video processor to which the present technology is applied.
  • the video module 1331 or the processor indicated by the dashed line 1341, as described above can be implemented as a processor, a module, or the like to which the present technology is applied.
  • the video module 1311, the external 1312, the power management module 1313, and the frontend module 1314 can be combined to be implemented as a video unit 1361 to which the present technology is applied. It is possible to obtain the same advantages as the advantages described with reference to FIGS. 1 to 66 regardless of the configuration.
  • any configuration can be embedded in various devices processing image data, as in the case of the video set 1300, as long as the configuration includes the video processor 1332.
  • the video processor 1332 or the processor indicated by the dashed line 1341, the video module 1311, or the video unit 1361 can be embedded in the television device 900 ( FIG. 67 ), the mobile telephone 920 ( FIG. 68 ), and the like.
  • the device can obtain the same advantages as the advantages described with reference to FIGS. 1 to 66 , as in the video set 1300.
  • a system means a set of a plurality of constituent elements (devices, modules (components), and the like) and all of the constituent elements may be included or may not be included in the same casing. Accordingly, a plurality of devices accommodated in separate casings and connected via networks and a single device in which a plurality of modules are accommodated in a single casing are all systems.
  • a configuration described above as a single device (or processing unit) may be divided and configured as a plurality of devices (or processing units).
  • a configuration described above as a plurality of devices (or processing units) may be collected and configured as a single device (or processing unit).
  • Configurations other than the above-described configurations may, of course, be added to the configurations of the devices (or the processing units). Further, as long as configurations or operations are substantially the same in the entire system, parts of the configurations of certain devices (or processing units) may be included in the configurations of the other devices (or other processing units).
  • the plurality of processes included in the single step can be performed by a single device and can also be shared and performed by a plurality of devices.
  • the information processing device can be applied to various electronic devices such as a transmitter or a receiver in delivery of satellite broadcast, a wired broadcast such as a cable TV, or the Internet and delivery to a terminal by cellular communication, a recording device recording an image in a medium such as an optical disc, a magnetic disk, or a flash memory, or a reproduction device reproducing an image from the storage medium.
  • the examples in which the various kinds of metadata are multiplexed in the bit stream and are transmitted from the encoding side to the decoding side have been described.
  • the methods of transmitting the information are not limited to the examples.
  • the information may be transmitted or recorded as separate pieces of data associated with the bit stream without being multiplexed in the bit stream.
  • the term "associated" means that an image (which may be a part of an image, such as a slice or a block) included in a bit stream and information corresponding to the image can be linked at the time of decoding. That is, the information may be transmitted along a different transmission path from the bit stream of the image.
  • the information may be recorded in a different recording medium (or a different recording area of the same recording medium) from the bit stream of the image. Further, the bit stream of the information and the image may be mutually associated, for example, in any unit such as a plurality of frames, a single frame, or a part of a frame.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Library & Information Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
  • Information Transfer Between Computers (AREA)

Claims (8)

  1. Datenverarbeitungsvorrichtung, die Folgendes umfasst:
    eine Analyseeinheit, die konfiguriert ist, eine Mediendarstellungsbeschreibung gemäß MPEG-DASH bereitstellenden Metadaten, die Teilbildinformationen enthalten, die Informationen bezüglich jedes Teilbildes sind, das ein Teil eines gesamten Bildes ist, zu analysieren, wobei die Metadaten für eine Lieferung eines Bitstroms des gesamten Bildes und eine Lieferung eines Bitstroms des Teilbildes verwendet werden, wobei die Teilbildinformationen bezüglich der mehreren Teilbilder in voneinander verschiedenen Unterrepräsentationen gespeichert werden, die zu einer Repräsentation gehören, die zu einer Anpassungsgruppe gehört, und Bitströme der mehreren Teilbilder den voneinander verschiedenen Unterrepräsentationen zugewiesen werden, und konfiguriert ist, die Teilbildinformationen zu erhalten;
    eine Auswahleinheit, die konfiguriert ist, den Bitstrom eines gewünschten Teilbildes unter Verwendung von Teilbildinformationen, die durch die Analyseeinheit erhalten werden, auszuwählen; und
    eine Bitstromerfassungseinheit, die konfiguriert ist, von einem Server den Bitstrom zu erfassen, der durch die Auswahleinheit ausgewählt wird, ohne von dem Server einen Bitstrom eines verbleibenden Teils des gesamten Bildes, der von dem gewünschten Teilbild verschieden ist, zu erfassen.
  2. Datenverarbeitungsvorrichtung nach Anspruch 1,
    wobei die Metadaten Informationen umfassen, die angeben, dass Informationen bezüglich des Bitstroms unter der Unterrepräsentation vorhanden sind.
  3. Datenverarbeitungsvorrichtung nach Anspruch 1,
    wobei die Teilbildinformationen Positionsinformationen enthalten, die eine Position des Teilbildes in dem gesamten Bild angeben.
  4. Datenverarbeitungsvorrichtung (101) nach Anspruch 3,
    wobei die Positionsinformationen eine Position einer oberen Linken des Teilbildes angeben.
  5. Datenverarbeitungsvorrichtung nach Anspruch 1,
    wobei die Teilbildinformationen eines der Folgenden enthalten:
    Informationen bezüglich einer Größe des gesamten Bildes;
    Gruppenidentifikationsinformationen, die eine Gruppe identifizieren, die eine Gruppe ist, zu der die Teilbilder gehören, und die eine Gruppe der Teilbilder ist, die als ein Bild angezeigt werden können;
    einen Sichttyp, der angibt, ob ein Bild das Teilbild ist; und
    Informationen, die die Anzahl von Teilbildern angeben, die das gesamte Bild bilden, wobei die Identifikationsinformationen angeben, dass Größen der Teilbilder gleich sind, und Informationen angeben, die eine Position und eine Größe jedes Teilbildes angeben, wenn die Größen der Teilbilder nicht gleich sind.
  6. Datenverarbeitungsvorrichtung nach Anspruch 1,
    wobei das Teilbild eine Kachel in einer Hocheffizienzvideocodierung (HEVC) ist.
  7. Datenverarbeitungsvorrichtung nach Anspruch 1,
    wobei jeder der Bitströme der mehreren Teilbilder in einem TRACK einer MP4-Datei gespeichert wird.
  8. Datenverarbeitungsverfahren, das Folgendes umfasst:
    Analysieren einer Mediendarstellungsbeschreibung gemäß MPEG-DASH bereitstellenden Metadaten, die Teilbildinformationen enthalten, die Informationen bezüglich jedes Teilbildes sind, das ein Teil eines gesamten Bildes ist, wobei die Metadaten für eine Lieferung eines Bitstroms des gesamten Bildes und eine Lieferung eines Bitstroms des Teilbildes verwendet werden, wobei die Teilbildinformationen bezüglich der mehreren Teilbilder in voneinander verschiedenen Unterrepräsentationen gespeichert werden, die zu einer Repräsentation gehören, die zu einer Anpassungsgruppe gehört, und Bitströme der mehreren Teilbilder den voneinander verschiedenen Unterrepräsentationen zugewiesen werden, und Erhalten der Teilbildinformationen;
    Auswählen des Bitstroms eines gewünschten Teilbildes unter Verwendung der erhaltenen Teilbildinformationen; und
    Erfassen von einem Server des ausgewählten Bitstroms, ohne von dem Server einen Bitstrom eines verbleibenden Teils des gesamten Bildes, der von dem gewünschten Teilbild verschieden ist, zu erfassen.
EP14826667.9A 2013-07-19 2014-07-16 Informationsverarbeitungsvorrichtung und -verfahren Active EP3013065B1 (de)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2013150977 2013-07-19
JP2014002046 2014-01-08
JP2014058762 2014-03-20
PCT/JP2014/068861 WO2015008775A1 (ja) 2013-07-19 2014-07-16 情報処理装置および方法

Publications (3)

Publication Number Publication Date
EP3013065A1 EP3013065A1 (de) 2016-04-27
EP3013065A4 EP3013065A4 (de) 2016-11-16
EP3013065B1 true EP3013065B1 (de) 2019-10-16

Family

ID=52346223

Family Applications (3)

Application Number Title Priority Date Filing Date
EP14825915.3A Active EP3013064B1 (de) 2013-07-19 2014-07-16 Informationsverarbeitungsvorrichtung und -verfahren
EP14826667.9A Active EP3013065B1 (de) 2013-07-19 2014-07-16 Informationsverarbeitungsvorrichtung und -verfahren
EP18201028.0A Active EP3461142B1 (de) 2013-07-19 2014-07-16 Informationsverarbeitungsvorrichtung und -verfahren

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP14825915.3A Active EP3013064B1 (de) 2013-07-19 2014-07-16 Informationsverarbeitungsvorrichtung und -verfahren

Family Applications After (1)

Application Number Title Priority Date Filing Date
EP18201028.0A Active EP3461142B1 (de) 2013-07-19 2014-07-16 Informationsverarbeitungsvorrichtung und -verfahren

Country Status (12)

Country Link
US (3) US10038922B2 (de)
EP (3) EP3013064B1 (de)
JP (3) JPWO2015008775A1 (de)
KR (1) KR102224332B1 (de)
CN (3) CN110035300A (de)
AU (2) AU2014291253B2 (de)
CA (1) CA2916878A1 (de)
MX (2) MX364810B (de)
MY (1) MY182338A (de)
RU (2) RU2671946C2 (de)
SG (1) SG11201600223UA (de)
WO (2) WO2015008775A1 (de)

Families Citing this family (62)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3025525B1 (de) 2013-07-25 2018-12-12 Convida Wireless, LLC Ende-zu-ende m2m dienstschicht-sitzungen
WO2015014773A1 (en) 2013-07-29 2015-02-05 Koninklijke Kpn N.V. Providing tile video streams to a client
US20160269759A1 (en) * 2013-10-22 2016-09-15 Sharp Kabushiki Kaisha Display processing device, distribution device, and metadata
CN105900401B (zh) * 2014-01-07 2020-03-06 佳能株式会社 用于对层间依赖性进行编码的方法、装置和计算机程序
KR101953679B1 (ko) 2014-06-27 2019-03-04 코닌클리즈케 케이피엔 엔.브이. Hevc-타일드 비디오 스트림을 기초로 한 관심영역 결정
EP3162075B1 (de) 2014-06-27 2020-04-08 Koninklijke KPN N.V. Hevc-mosaikbasiertes video-streaming
SG11201609457UA (en) * 2014-08-07 2016-12-29 Sonic Ip Inc Systems and methods for protecting elementary bitstreams incorporating independently encoded tiles
WO2016092759A1 (ja) 2014-12-09 2016-06-16 パナソニックIpマネジメント株式会社 送信方法、受信方法、送信装置および受信装置
JP6741975B2 (ja) * 2014-12-09 2020-08-19 パナソニックIpマネジメント株式会社 送信方法および送信装置
EP3035326B1 (de) * 2014-12-19 2019-07-17 Alcatel Lucent Codierung, Übertragung, Decodierung und Anzeige von ausgerichteten Bildern
KR101980721B1 (ko) * 2015-02-12 2019-05-21 후아웨이 테크놀러지 컴퍼니 리미티드 멀티미디어 스트리밍 서비스 프레젠테이션 방법, 관련 장치 및 관련 시스템
EP3249873B1 (de) * 2015-02-15 2018-09-12 Huawei Technologies Co., Ltd. Medienpräsentationsführungsverfahren auf der grundlage von http-mediendatenstrom und zugehörige vorrichtung
BR112017017792A2 (pt) 2015-02-27 2018-04-10 Sony Corporation dispositivos e métodos de transmissão e de recepção.
US10412422B2 (en) * 2015-04-23 2019-09-10 Lg Electronics Inc. Apparatus for transmitting broadcasting signal, apparatus for receiving broadcasting signal, method for transmitting broadcasting signal, and method for receiving broadcasting signal
WO2016204712A1 (en) * 2015-06-16 2016-12-22 Intel IP Corporation Adaptive video content for cellular communication
JP6675475B2 (ja) * 2015-08-20 2020-04-01 コニンクリーケ・ケイピーエヌ・ナムローゼ・フェンノートシャップ メディア・ストリームに基づくタイルド・ビデオの形成
US10715843B2 (en) 2015-08-20 2020-07-14 Koninklijke Kpn N.V. Forming one or more tile streams on the basis of one or more video streams
WO2017060423A1 (en) 2015-10-08 2017-04-13 Koninklijke Kpn N.V. Enhancing a region of interest in video frames of a video stream
WO2017122543A1 (ja) * 2016-01-13 2017-07-20 ソニー株式会社 情報処理装置および情報処理方法
EP3405865A1 (de) * 2016-01-21 2018-11-28 Playgiga S.L. Modifizierung des softwareverhaltens in der laufzeit
JP6944131B2 (ja) * 2016-02-22 2021-10-06 ソニーグループ株式会社 ファイル生成装置およびファイル生成方法、並びに、再生装置および再生方法
WO2017145756A1 (ja) * 2016-02-22 2017-08-31 ソニー株式会社 ファイル生成装置およびファイル生成方法、並びに、再生装置および再生方法
US9992517B2 (en) * 2016-02-23 2018-06-05 Comcast Cable Communications, Llc Providing enhanced content based on user interactions
US10390071B2 (en) * 2016-04-16 2019-08-20 Ittiam Systems (P) Ltd. Content delivery edge storage optimized media delivery to adaptive bitrate (ABR) streaming clients
JP2017199994A (ja) * 2016-04-26 2017-11-02 日本放送協会 映像配信装置及び映像配信方法
WO2017196670A1 (en) 2016-05-13 2017-11-16 Vid Scale, Inc. Bit depth remapping based on viewing parameters
CN109076262B (zh) * 2016-05-13 2022-07-12 索尼公司 文件生成装置和文件生成方法以及再现装置和再现方法
EP3466083B1 (de) 2016-05-25 2020-09-16 Koninklijke KPN N.V. Räumlich gekacheltes omnidirektionales video-streaming
EP3466076A1 (de) * 2016-05-26 2019-04-10 VID SCALE, Inc. Verfahren und vorrichtung für blickpunktadaptive 360-grad-videobereitstellung
US10491963B1 (en) * 2016-06-28 2019-11-26 Amazon Technologies, Inc. Use video codecs to deliver images
EP4336850A3 (de) 2016-07-08 2024-04-17 InterDigital Madison Patent Holdings, SAS Systeme und verfahren zur tonumabbildung in einer region von interesse
US10805614B2 (en) 2016-10-12 2020-10-13 Koninklijke Kpn N.V. Processing spherical video data on the basis of a region of interest
EP3520243A2 (de) 2016-11-03 2019-08-07 Convida Wireless, LLC Rahmenstruktur in nr
KR102130429B1 (ko) * 2016-11-07 2020-07-07 한화테크윈 주식회사 멀티미디어 수신 장치에서 디코딩을 수행하는 방법 및 멀티미디어 장치
DE112017006610T5 (de) * 2016-12-27 2019-09-12 Sony Corporation Bildverarbeitungsvorrichtung und Verfahren
CN110301136B (zh) 2017-02-17 2023-03-24 交互数字麦迪逊专利控股公司 在流传输视频中进行选择性感兴趣对象缩放的系统和方法
US11139000B2 (en) * 2017-03-07 2021-10-05 Mediatek Inc. Method and apparatus for signaling spatial region information
CN110383848B (zh) 2017-03-07 2022-05-06 交互数字麦迪逊专利控股公司 用于多设备呈现的定制视频流式传输
WO2018169139A1 (ko) * 2017-03-17 2018-09-20 엘지전자 주식회사 360도 비디오의 영역 정보 전달 방법 및 장치
WO2018180511A1 (ja) * 2017-03-27 2018-10-04 ソニー株式会社 画像生成装置および画像生成方法、並びに画像再生装置および画像再生方法
GB2560921B (en) * 2017-03-27 2020-04-08 Canon Kk Method and apparatus for encoding media data comprising generated content
BR112019024597A2 (pt) * 2017-05-30 2020-06-09 Sony Corp aparelho e método de processamento de imagem, programa para fazer com que um computador execute processamento, e, aparelho e método de geração de arquivo
WO2019002662A1 (en) 2017-06-26 2019-01-03 Nokia Technologies Oy APPARATUS, METHOD AND COMPUTER PROGRAM FOR OMNIDIRECTIONAL VIDEO
US10587883B2 (en) * 2017-07-14 2020-03-10 Qualcomm Incorporated Region-wise packing, content coverage, and signaling frame packing for media content
JP7035401B2 (ja) * 2017-09-15 2022-03-15 ソニーグループ株式会社 画像処理装置およびファイル生成装置
RU2020122086A (ru) 2018-01-12 2022-01-04 Сони Корпорейшн Способ и устройство обработки информации
JP7246855B2 (ja) * 2018-02-16 2023-03-28 キヤノン株式会社 撮像装置、記録装置及び表示制御装置
US11323757B2 (en) * 2018-03-29 2022-05-03 Sony Group Corporation Information processing apparatus, information processing method, and program
WO2019199024A1 (ko) * 2018-04-10 2019-10-17 엘지전자 주식회사 360 영상 데이터의 서브픽처 기반 처리 방법 및 그 장치
GB2575074B (en) * 2018-06-27 2022-09-28 Canon Kk Encapsulating video content with an indication of whether a group of tracks collectively represents a full frame or a part of a frame
US11871451B2 (en) 2018-09-27 2024-01-09 Interdigital Patent Holdings, Inc. Sub-band operations in unlicensed spectrums of new radio
US11481961B2 (en) * 2018-10-02 2022-10-25 Sony Corporation Information processing apparatus and information processing method
US20200296462A1 (en) 2019-03-11 2020-09-17 Wci One, Llc Media content presentation
US20200296316A1 (en) 2019-03-11 2020-09-17 Quibi Holdings, LLC Media content presentation
US11523185B2 (en) 2019-06-19 2022-12-06 Koninklijke Kpn N.V. Rendering video stream in sub-area of visible display area
US20220247991A1 (en) * 2019-06-28 2022-08-04 Sony Group Corporation Information processing apparatus, information processing method, reproduction processing device, and reproduction processing method
CN110381331A (zh) * 2019-07-23 2019-10-25 深圳市道通智能航空技术有限公司 一种图像处理方法、装置、航拍设备及存储介质
CN111340683B (zh) * 2020-02-08 2020-11-13 朱明华 图像数据处理方法、装置、图像处理系统及服务器
CN113824958A (zh) * 2020-06-18 2021-12-21 中兴通讯股份有限公司 视频分块方法、传输方法、服务器、适配器和存储介质
US11276206B2 (en) 2020-06-25 2022-03-15 Facebook Technologies, Llc Augmented reality effect resource sharing
US11888913B2 (en) * 2021-04-28 2024-01-30 Lemon Inc. External stream representation properties
CN114998112B (zh) * 2022-04-22 2024-06-28 广州市天誉创高电子科技有限公司 基于自适应频域滤波的图像去噪方法及系统

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3532332B2 (ja) * 1995-11-10 2004-05-31 パイオニア株式会社 画像情報再生装置
EP1202577B1 (de) * 2000-10-10 2006-05-17 Koninklijke Philips Electronics N.V. Verfahren zur Verarbeitung von Videodateien
JP4099973B2 (ja) * 2001-10-30 2008-06-11 松下電器産業株式会社 映像データ送信方法及び映像データ受信方法、並びに映像監視システム
US7613727B2 (en) * 2002-02-25 2009-11-03 Sont Corporation Method and apparatus for supporting advanced coding formats in media files
JP4443181B2 (ja) * 2003-10-15 2010-03-31 株式会社日立製作所 コミュニケーションシステム及び方法
CN101389021B (zh) * 2007-09-14 2010-12-22 华为技术有限公司 视频编解码方法及装置
JP5239423B2 (ja) * 2008-03-17 2013-07-17 株式会社リコー 情報処理装置,情報処理方法,プログラム,および記録媒体
JP4539754B2 (ja) * 2008-04-11 2010-09-08 ソニー株式会社 情報処理装置及び情報処理方法
RU2504917C2 (ru) * 2008-10-07 2014-01-20 Телефонактиеболагет Лм Эрикссон (Пабл) Файл медиаконтейнера
JP5222227B2 (ja) * 2009-05-22 2013-06-26 キヤノン株式会社 画像処理方法、画像処理装置およびプログラム
JP5037574B2 (ja) * 2009-07-28 2012-09-26 株式会社ソニー・コンピュータエンタテインメント 画像ファイル生成装置、画像処理装置、画像ファイル生成方法、および画像処理方法
WO2011087449A1 (en) * 2010-01-18 2011-07-21 Telefonaktiebolaget L M Ericsson (Publ) Methods and arrangements for http media stream distribution
US9185439B2 (en) * 2010-07-15 2015-11-10 Qualcomm Incorporated Signaling data for multiplexing video components
CN102130936B (zh) * 2010-08-17 2013-10-09 华为技术有限公司 一种在动态http流传输方案中支持时移回看的方法和装置
KR101206698B1 (ko) 2010-10-06 2012-11-30 한국항공대학교산학협력단 스트리밍 콘텐츠 제공 장치 및 방법
CN102136948B (zh) * 2011-03-15 2014-04-02 华为技术有限公司 用于统计用户体验的方法、终端设备和系统
US9860293B2 (en) * 2011-03-16 2018-01-02 Electronics And Telecommunications Research Institute Apparatus and method for providing streaming content using representations
KR101633239B1 (ko) * 2011-06-08 2016-06-23 코닌클리즈케 케이피엔 엔.브이. 공간적으로-세그먼트된 콘텐츠 전달
US9843844B2 (en) 2011-10-05 2017-12-12 Qualcomm Incorporated Network streaming of media data
US9584819B2 (en) * 2011-10-24 2017-02-28 Qualcomm Incorporated Grouping of tiles for video coding
EP2793479A4 (de) 2011-12-12 2015-07-01 Lg Electronics Inc Vorrichtung und verfahren zum empfangen von medieninhalten
JP6214235B2 (ja) * 2012-07-02 2017-10-18 キヤノン株式会社 ファイル生成方法、ファイル生成装置、及びプログラム
CN110139130B (zh) * 2012-10-12 2022-09-20 佳能株式会社 流传输数据的方法、发送和接收视频数据的方法和设备
US10616573B2 (en) * 2013-01-07 2020-04-07 Nokia Technologies Oy Method and apparatus for video coding and decoding
CN109618235B (zh) * 2013-01-18 2021-03-16 佳能株式会社 生成设备和方法、处理设备和方法以及存储介质
GB2513140B (en) * 2013-04-16 2016-05-04 Canon Kk Methods, devices, and computer programs for streaming partitioned timed media data
JP2014230055A (ja) 2013-05-22 2014-12-08 ソニー株式会社 コンテンツ供給装置、コンテンツ供給方法、プログラム、およびコンテンツ供給システム
CN109842613B (zh) 2013-07-12 2021-11-19 佳能株式会社 用于提供和接收媒体数据的方法和装置以及存储介质
MX2017008774A (es) * 2014-12-31 2018-02-13 Nokia Technologies Oy Prediccion inter-capa para codificacion y decodificacion de video escalable.

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
None *

Also Published As

Publication number Publication date
CN105519131A (zh) 2016-04-20
CN105519131B (zh) 2019-05-03
EP3013065A4 (de) 2016-11-16
CA2916878A1 (en) 2015-01-22
JP6493765B2 (ja) 2019-04-03
CN105519130B (zh) 2019-03-08
US20160156949A1 (en) 2016-06-02
EP3013064B1 (de) 2019-03-13
MY182338A (en) 2021-01-20
KR102224332B1 (ko) 2021-03-08
EP3013064A4 (de) 2016-11-16
EP3461142B1 (de) 2020-12-30
JP2019088023A (ja) 2019-06-06
RU2016100862A (ru) 2017-07-17
AU2014291253A1 (en) 2016-02-11
JPWO2015008774A1 (ja) 2017-03-02
US10038922B2 (en) 2018-07-31
US10306273B2 (en) 2019-05-28
CN110035300A (zh) 2019-07-19
MX2016000335A (es) 2016-05-05
WO2015008774A1 (ja) 2015-01-22
WO2015008775A1 (ja) 2015-01-22
KR20160034282A (ko) 2016-03-29
EP3013064A1 (de) 2016-04-27
CN105519130A (zh) 2016-04-20
MX2019004446A (es) 2019-07-15
JP6658931B2 (ja) 2020-03-04
US20160156943A1 (en) 2016-06-02
AU2018241185A1 (en) 2018-11-01
AU2014291253B2 (en) 2018-07-05
EP3013065A1 (de) 2016-04-27
EP3461142A1 (de) 2019-03-27
JPWO2015008775A1 (ja) 2017-03-02
AU2018241185B2 (en) 2019-08-08
RU2018135725A (ru) 2018-11-21
MX364810B (es) 2019-05-07
RU2671946C2 (ru) 2018-11-08
SG11201600223UA (en) 2016-02-26
US20190191194A1 (en) 2019-06-20

Similar Documents

Publication Publication Date Title
AU2018241185B2 (en) Information processing device and method
AU2017228638B2 (en) Information processing device, content requesting method, and computer program
EP2988516B1 (de) Servervorrichtung, clientvorrichtung, inhaltsverteilungsverfahren und computerprogramm

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20160120

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

AX Request for extension of the european patent

Extension state: BA ME

DAX Request for extension of the european patent (deleted)
A4 Supplementary search report drawn up and despatched

Effective date: 20161014

RIC1 Information provided on ipc code assigned before grant

Ipc: H04N 21/845 20110101AFI20161010BHEP

Ipc: H04N 21/434 20110101ALI20161010BHEP

Ipc: H04N 21/236 20110101ALI20161010BHEP

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

17Q First examination report despatched

Effective date: 20171117

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

INTG Intention to grant announced

Effective date: 20190517

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE PATENT HAS BEEN GRANTED

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602014055359

Country of ref document: DE

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: AT

Ref legal event code: REF

Ref document number: 1192484

Country of ref document: AT

Kind code of ref document: T

Effective date: 20191115

REG Reference to a national code

Ref country code: NL

Ref legal event code: MP

Effective date: 20191016

REG Reference to a national code

Ref country code: LT

Ref legal event code: MG4D

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 1192484

Country of ref document: AT

Kind code of ref document: T

Effective date: 20191016

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200117

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: NO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200116

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200116

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200217

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200224

Ref country code: HR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: RS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602014055359

Country of ref document: DE

PG2D Information on lapse in contracting state deleted

Ref country code: IS

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20200216

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SM

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

26N No opposition filed

Effective date: 20200717

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: BE

Ref legal event code: MM

Effective date: 20200731

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200716

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200731

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200731

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200731

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20210623

Year of fee payment: 8

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200716

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20191016

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20220731

P01 Opt-out of the competence of the unified patent court (upc) registered

Effective date: 20230527

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20230620

Year of fee payment: 10

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20230620

Year of fee payment: 10