WO2010125812A1 - 画像復号方法、画像符号化方法、画像復号装置および画像符号化装置 - Google Patents
画像復号方法、画像符号化方法、画像復号装置および画像符号化装置 Download PDFInfo
- Publication number
- WO2010125812A1 WO2010125812A1 PCT/JP2010/003041 JP2010003041W WO2010125812A1 WO 2010125812 A1 WO2010125812 A1 WO 2010125812A1 JP 2010003041 W JP2010003041 W JP 2010003041W WO 2010125812 A1 WO2010125812 A1 WO 2010125812A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- view
- nal unit
- base view
- flag
- mvc extension
- Prior art date
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/188—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a video data packet, e.g. a network abstraction layer [NAL] unit
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/44—Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/1887—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a variable length codeword
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
Definitions
- the present invention relates to an image decoding method for decoding multi-view video, an image decoding device for decoding multi-view video, an image encoding method for encoding multi-view video, and an image encoding device for encoding multi-view video. .
- the multi-view video coding (MVC) standard is specified as an extension of the ISO / IEC 14496-10 Advanced Video Coding (AVC) standard.
- Profiles in the MVC standard are designed to make some of the profiles defined in the AVC standard backward compatible.
- a decoder or player compliant with the new MVC profile can decode some of the bitstreams of the AVC profile.
- an AVC profile (especially high profile) legacy decoder should decode at least one view in a bitstream compliant with the new MVC profile.
- the encoded base view is required to be compatible with the profile defined in the AVC standard, so the legacy type conforming to the high profile of the AVC standard
- the decoder must be able to decode a base view in an MVC bitstream that conforms to the MVC profile.
- the encoded view is included in a network abstraction layer (NAL) unit. Different types of NAL units are distinguished by the value of the NAL unit type. Non-base views are included in NAL units that have NAL unit type values reserved in previous versions of the AVC standard. Thus, these NAL units should be ignored by traditional high profile decoders.
- NAL network abstraction layer
- Prefix NAL unit a special NAL unit called a prefix NAL unit (Prefix NAL unit) is required to be placed in front of each NAL unit including an encoded base view and transmitted together. Yes.
- the value of the NAL unit type of this prefix NAL unit is 14, which is reserved in the previous version of the AVC standard.
- the value of the NAL unit type for each NAL unit containing the encoded base view is 5 or 1.
- Prefix NAL unit includes additional parameters placed in the MVC extension of the NAL unit header. These parameters are associated with the base view and are used in the process of encoding and decoding the non-base view.
- FIG. 23 is a diagram relating to the storage location of the NAL unit header in the MVC extension syntax. Examples of parameters in the MVC extension of the NAL unit header include non-IDR flag (non_idr_flag), priority ID (priority_id), view ID (view_id), time ID (temporal_id), anchor picture flag (anchor_pic_flag), and between views There are a prediction flag (inter_view_flag), a reserved 1 bit (reserved_one_bit), and the like. The reserved 1 bit (reserved_one_bit) is one value, and is not used for the encoding and decoding processing of the non-base view.
- Patent Document 1 describes a technique for encoding a multi-view video using a NAL unit.
- legacy AVC decoders should ignore NAL units that have NAL unit type values defined as reserved values in previous versions of the AVC standard.
- a legacy AVC decoder should only decode NAL units that contain a base view and reconstruct only that base view.
- a compressed base view and a compressed non-base view can be distinguished using different stream identifiers.
- some decoders in the market cannot decode the base view due to the prefix NAL unit. This is a challenge in assisting the compressed MVC stream to be backward compatible with legacy AVC decoders.
- the prefix NAL unit must be placed in front of each NAL unit in the base view. Therefore, it is not easy to add another new view to a bit stream that targets a single view and does not have a prefix NAL unit. That is, it is difficult to change a bit stream generated based on the conventional standard to a bit stream corresponding to a multi-view video. Therefore, it has been difficult to use an existing bitstream generated based on a conventional standard as one view in a multi-view video.
- an object of the present invention is to provide an image decoding method capable of decoding a multi-view video and an image encoding method capable of encoding a multi-view video even when a prefix NAL unit is not used.
- an image decoding method for decoding a multi-view video, which parses a NAL unit including an image included in a view component of a base view, Decode the image contained in the view component of the base view, parse the NAL unit header, MVC extension parameters of the non-base view, and prepend the NAL unit to the view component of the base view If the prefix NAL unit does not exist in the view component of the base view, the NAL unit header of the base view, the MVC extension parameter is calculated, The prefix NAL unit is added to the view component of the base view.
- the NAL unit header and MVC extension parameters of the base view are parsed, and the NAL including an image included in the view component of the non-base view is included. Parsing a unit, and calculating or parsing the NAL unit header and MVC extension parameters of the base view and the NAL unit header and MVC extension parameters of the non-base view parsed. And decoding the image included in the view component of the non-base view.
- the NAL unit / header / MVC extension parameter of the base view and the NAL unit / header / MVC extension parameter of the non-base view respectively include a non-IDR flag (non_idr_flag) and a priority ID (priority_id). ), View ID (view_id), time ID (temporal_id), anchor picture flag (anchor_pic_flag), and inter-view prediction flag (inter_view_flag).
- non_idr_flag a non-IDR flag
- priority ID priority ID
- View ID view_id
- time ID temporary_id
- anchor picture flag anchor_pic_flag
- inter_view_flag inter-view prediction flag
- the value of the non-IDR flag (non_idr_flag) is calculated from the NAL unit header / MVC extension parameter of the non-base view.
- the non-IDR flag (non_idr_flag) of the NAL unit header MVC extension parameter of the base view is set to the non-IDR flag (non_idr_flag) of the non-base view.
- a value of an IDR flag (non_idr_flag) is assigned, a first predetermined value is assigned to the priority ID (priority_id) of the NAL unit header / MVC extension parameter of the base view, and the base view is assigned.
- a predetermined second value is assigned to the view ID (view_id) of the NAL unit header MVC extension parameter of the non-base view, and the time from the NAL unit header MVC extension parameter of the non-base view
- ID (temporal_id) is acquired, and the time ID (temporal_id) of the NAL unit header MVC extension parameter of the base view is added from the NAL unit header header MVC extension parameter of the non-base view.
- the value of the acquired time ID (temporal_id) is assigned, and the value of the anchor picture flag (anchor_pic_flag) is acquired from the NAL unit header MVC extension parameter of the non-base view.
- anchor_pic_flag The anchor picture obtained from the NAL unit header MVC extension parameter of the non-base view in the anchor picture flag (anchor_pic_flag) of the NAL unit header MVC extension parameter of the base view
- a value of a flag (anchor_pic_flag) may be assigned, and a predetermined third value may be assigned to the inter-view prediction flag (inter_view_flag) of the NAL unit header / MVC extension parameter of the base view.
- NAL unit, header, and MVC extension parameters of the base view are calculated more specifically.
- a value of a NAL unit type is acquired from the NAL unit header of the base view, and the NAL unit When the value of type (nal_unit_type) is 5, 0 is assigned to the non-IDR flag (non_idr_flag) of the NAL unit header MVC extension parameter of the base view, and the NAL unit type (nal_unit_type) When the value is 1, 1 is assigned to the non-IDR flag (non_idr_flag) of the NAL unit header MVC extension parameter of the base view, and the NAL unit of the base view A first predetermined value is assigned to the priority ID (priority_id) of the header / MVC extension parameter, and the view ID (view_id) of the NAL unit / header / MVC extension parameter of the base view is predetermined.
- the time ID (temporal_id) is obtained from the NAL unit header MVC extension parameter of the non-base view, and the NAL unit header header of the base view is obtained.
- a value of the time ID (temporal_id) acquired from the NAL unit header MVC extended parameter of the non-base view is assigned to the time ID (temporal_id) of the MVC extension parameter, and the non-base view N
- the value of the anchor picture flag (anchor_pic_flag) is acquired from the L unit header MVC extension parameter, and the anchor picture flag (anchor_pic_flag) of the NAL unit header MVC extension parameter of the base view is acquired.
- the anchor picture flag (anchor_pic_flag) value obtained from the NAL unit header MVC extension parameter of the non-base view is assigned, and the NAL unit header MVC extension parameter of the base view is assigned.
- a predetermined third value may be assigned to the inter-view prediction flag (inter_view_flag).
- the NAL unit, header, and MVC extension parameters of the base view are calculated more specifically. Further, by using the NAL unit header of the base view, a more accurate value is calculated.
- the predetermined first value assigned to the priority ID may be zero.
- the predetermined second value assigned to the view ID (view_id) may be zero.
- the view ID (view_id) of the base view becomes a fixed value that is easy to distinguish.
- the predetermined third value assigned to the inter-view prediction flag (inter_view_flag) may be 1.
- An image encoding method is an image encoding method for encoding a multi-view video, encoding a base view image, and encoding the base view without a prefix NAL unit.
- the generated view component including the image is written to the NAL unit, the NAL unit header / MVC extension parameter of the non-base view is specified, the NAL unit / header / MVC extension parameter of the base view is calculated, Using the calculated NAL unit header MVC extension parameter of the base view and the determined NAL unit header MVC extension parameter of the non-base view, the non-base view Encode an image and include the encoded image of the non-base view.
- the over components may be written in the NAL unit.
- the multi-view video is encoded without the front NAL unit.
- the NAL unit / header / MVC extension parameter of the base view and the NAL unit / header / MVC extension parameter of the non-base view respectively include a non-IDR flag (non_idr_flag) and a priority ID (priority_id). ), View ID (view_id), time ID (temporal_id), anchor picture flag (anchor_pic_flag), and inter-view prediction flag (inter_view_flag).
- non_idr_flag a non-IDR flag
- priority ID priority ID
- View ID view_id
- time ID temporary_id
- anchor picture flag anchor_pic_flag
- inter_view_flag inter-view prediction flag
- the value of the non-IDR flag (non_idr_flag) is calculated from the NAL unit header / MVC extension parameter of the non-base view.
- the non-IDR flag (non_idr_flag) of the NAL unit header MVC extension parameter of the base view is set to the non-IDR flag (non_idr_flag) of the non-base view.
- a value of an IDR flag (non_idr_flag) is assigned, a first predetermined value is assigned to the priority ID (priority_id) of the NAL unit header / MVC extension parameter of the base view, and the base view is assigned.
- a predetermined second value is assigned to the view ID (view_id) of the NAL unit header MVC extension parameter of the non-base view, and the time from the NAL unit header MVC extension parameter of the non-base view
- ID (temporal_id) is acquired, and the time ID (temporal_id) of the NAL unit header MVC extension parameter of the base view is added from the NAL unit header header MVC extension parameter of the non-base view.
- the value of the acquired time ID (temporal_id) is assigned, and the value of the anchor picture flag (anchor_pic_flag) is acquired from the NAL unit header MVC extension parameter of the non-base view.
- anchor_pic_flag The anchor picture obtained from the NAL unit header MVC extension parameter of the non-base view in the anchor picture flag (anchor_pic_flag) of the NAL unit header MVC extension parameter of the base view
- a value of a flag (anchor_pic_flag) may be assigned, and a predetermined third value may be assigned to the inter-view prediction flag (inter_view_flag) of the NAL unit header / MVC extension parameter of the base view.
- NAL unit, header, and MVC extension parameters of the base view are calculated more specifically.
- An image encoding method is an image encoding method for encoding a multi-view video, which specifies a NAL unit / header / MVC extension parameter of a base view and a NAL of a non-base view.
- the unit, header, and MVC extension parameters are specified, and whether to write the NAL unit, header, and MVC extension parameters of the base view to the prefix NAL unit is switched, and the NAL unit, header, and MVC of the base view are switched.
- the NAL unit header / MVC extension parameter of the specified base view is written to the prefix NAL unit, and the image of the base view is Encode the base view encoded image
- the NAL unit, and the NAL unit header and MVC extension parameter of the specified base view and the NAL unit header and MVC extension parameter of the specified non-base view Used to encode the non-base view image and write a view component including the non-base view encoded image to the NAL unit.
- the NAL unit / header / MVC extension parameter of the base view and the NAL unit / header / MVC extension parameter of the non-base view respectively include a non-IDR flag (non_idr_flag) and a priority ID (priority_id). ), View ID (view_id), time ID (temporal_id), anchor picture flag (anchor_pic_flag), and inter-view prediction flag (inter_view_flag), and identifies the NAL unit header MVC extension parameter of the base view In this case, a predetermined first value is set in the priority ID (priority_id) of the NAL unit / header / MVC extension parameter of the base view.
- a predetermined third value may be assigned to the inter-view prediction flag (inter_view_flag).
- a value of a NAL unit type is acquired from the NAL unit header of the base view, and the NAL unit When the value of type (nal_unit_type) is 5, 0 is assigned to the non-IDR flag (non_idr_flag) of the NAL unit header MVC extension parameter of the base view, and the NAL unit type (nal_unit_type) When the value is 1, 1 is assigned to the non-IDR flag (non_idr_flag) of the NAL unit header MVC extension parameter of the base view, and the NAL unit of the base view A first predetermined value is assigned to the priority ID (priority_id) of the header / MVC extension parameter, and the view ID (view_id) of the NAL unit / header / MVC extension parameter of the base view is predetermined.
- the time ID (temporal_id) is obtained from the NAL unit header MVC extension parameter of the non-base view, and the NAL unit header header of the base view is obtained.
- a value of the time ID (temporal_id) acquired from the NAL unit header MVC extended parameter of the non-base view is assigned to the time ID (temporal_id) of the MVC extension parameter, and the non-base view N
- the value of the anchor picture flag (anchor_pic_flag) is acquired from the L unit header MVC extension parameter, and the anchor picture flag (anchor_pic_flag) of the NAL unit header MVC extension parameter of the base view is acquired.
- the anchor picture flag (anchor_pic_flag) value obtained from the NAL unit header MVC extension parameter of the non-base view is assigned, and the NAL unit header MVC extension parameter of the base view is assigned.
- a predetermined third value may be assigned to the inter-view prediction flag (inter_view_flag).
- the NAL unit, header, and MVC extension parameters of the base view are calculated more specifically. Further, by using the NAL unit header of the base view, a more accurate value is calculated.
- the predetermined first value assigned to the priority ID may be zero.
- the predetermined second value assigned to the view ID (view_id) may be zero.
- the view ID (view_id) of the base view becomes a fixed value that is easy to distinguish.
- the predetermined third value assigned to the inter-view prediction flag (inter_view_flag) may be 1.
- An image decoding apparatus is an image decoding apparatus that decodes multi-viewpoint video, and a base view NAL unit syntax that parses a NAL unit including an image included in a view component of a base view.
- the base A view NAL unit, a header, a base for calculating an MVC extension parameter, an MVC extension parameter calculation unit and, if the prefix NAL unit exists in the view component of the base view, the prefix A base view / MVC extended parameter parsing unit for parsing the NAL unit / header / MVC extended parameter of the base view from the NAL unit, and an image included in the view component of the non-base view
- Using a header-MVC extension parameters may include a non-base view decoding unit for decoding the image included in the view components of the non-base view.
- An image encoding apparatus is an image encoding apparatus that encodes a multi-view video, and includes a base-view encoding unit that encodes a base-view image and a prefix NAL unit.
- a base view writer that writes a view component including the encoded image of the base view to a NAL unit, and a non base base that identifies a NAL unit header MVC extension parameter of the non base view
- a non-base view writing unit that writes components to the NAL unit may be provided.
- the multi-view video is encoded without the front NAL unit.
- An image encoding apparatus is an image encoding apparatus that encodes a multi-view video, and specifies a base / view / MVC extension parameter specifying base / view NAL unit / header / MVC extension parameter.
- a non-base-view / MVC extension parameter specifying unit for specifying a NAL unit / header / MVC extension parameter for a non-base view, and a NAL unit / header / MVC extension parameter for the base view. Specified when the front NAL unit write switching unit for switching whether or not to write to the NAL unit and the NAL unit / header / MVC extension parameter of the base view are written to the front NAL unit.
- the NAL unit head of the base view A pre-NAL unit writing unit that writes MVC extension parameters to the pre-NAL unit, a base view encoding unit that encodes the base view image, and the base view encoded image
- a base view writer for writing the containing view component to the NAL unit, the NAL unit header of the specified base view, the MVC extension parameter, and the NAL unit of the specified non-base view A non-base view encoding unit that encodes the non-base view image using a header MVC extension parameter; and a view component including the non-base view encoded image. Equipped with a non-base view writer that writes to the NAL unit It may be.
- the multi-view video is decoded and the multi-view video is encoded.
- the conventional AVC decoder that cannot decode the base view due to the prefix NAL unit can decode the base view image included in the multi-view video.
- no prefix NAL unit since no prefix NAL unit is used, it is also easy to add another view to a stream generated by encoding a single view. Then, it becomes easy to use an existing bit stream generated by a conventional AVC encoder as a base view of a multi-view video.
- FIG. 1 is a configuration diagram showing components of an image encoding device according to Embodiment 1 of the present invention.
- FIG. 2 is a flowchart showing the encoding process according to Embodiment 1 of the present invention.
- FIG. 3 is a block diagram showing components of the image decoding apparatus according to Embodiment 1 of the present invention.
- FIG. 4 is a flowchart showing the decoding process according to Embodiment 1 of the present invention.
- FIG. 5 is a flowchart showing processing for calculating the NAL unit / header / MVC extension parameter of the base view according to the first embodiment of the present invention.
- FIG. 6 is a diagram illustrating an example of an image coding apparatus according to Embodiment 1 of the present invention.
- FIG. 1 is a configuration diagram showing components of an image encoding device according to Embodiment 1 of the present invention.
- FIG. 2 is a flowchart showing the encoding process according to Embodiment 1 of the present invention.
- FIG. 7 is a diagram illustrating an example of the image decoding apparatus according to Embodiment 1 of the present invention.
- FIG. 8 is a flowchart showing processing for calculating the NAL unit / header / MVC extension parameter of the base view according to the second embodiment of the present invention.
- FIG. 9 is a diagram illustrating an example of an image coding apparatus according to Embodiment 2 of the present invention.
- FIG. 10 is a diagram illustrating an example of an image decoding apparatus according to Embodiment 2 of the present invention.
- FIG. 11 is a block diagram showing components of the image coding apparatus according to Embodiment 3 of the present invention.
- FIG. 12 is a flowchart showing a modification of the encoding process according to Embodiment 3 of the present invention.
- FIG. 13 is a diagram illustrating an example of an image coding apparatus according to Embodiment 3 of the present invention.
- FIG. 14 is a diagram illustrating an example of a data structure of an encoded stream in each embodiment of the present invention.
- FIG. 15 is a schematic diagram illustrating an example of the overall configuration of a content supply system that implements a content distribution service.
- FIG. 16 is a diagram illustrating an appearance of a mobile phone.
- FIG. 17 is a block diagram illustrating a configuration example of a mobile phone.
- FIG. 18 is a schematic diagram showing an example of the overall configuration of a digital broadcasting system.
- FIG. 19 is a block diagram illustrating a configuration example of a television.
- FIG. 19 is a block diagram illustrating a configuration example of a television.
- FIG. 20 is a block diagram illustrating a configuration example of an information reproducing / recording unit that reads and writes information from and on a recording medium that is an optical disk.
- FIG. 21 is a diagram illustrating a structure example of a recording medium that is an optical disk.
- FIG. 22 is a block diagram illustrating a configuration example of an integrated circuit that implements the image encoding method and the image decoding method according to each embodiment.
- FIG. 23 is a diagram showing the storage location of the NAL unit header in the MVC extended syntax.
- a new method for transmitting a parameter included in the MVC extension of the header of the prefix NAL unit to the MVC decoder without using the prefix NAL unit in the compressed MVC stream is introduced.
- the novel aspect of the present invention is that the base view of the encoded MVC stream required for non-base view decoding without the use of a prefix NAL unit that can cause problems in the decoding process by a conventional AVC decoder.
- the value associated with is to be specified.
- the present invention allows a traditional AVC decoder on the market to decode the base view of the MVC stream, and allows the MVC decoder of the present invention to decode all coded views in the MVC stream.
- FIG. 1 is a configuration diagram illustrating components of the image encoding device according to the first embodiment.
- a base / view encoding unit 100 includes a base / view encoding unit 100, a base / view writing unit 102, a non-base / view / MVC extended parameter specifying unit 104, and a base / view / MVC extended parameter calculating unit.
- 106 includes a non-base view encoding unit 108 and a non-base view writing unit 110.
- Each processing unit executes each process shown below.
- FIG. 2 is a flowchart showing the encoding process in the first embodiment of the present invention.
- the base view encoding unit 100 encodes a base view image (S100).
- Base-view images are encoded using the multi-view video encoding standard.
- Base-view images that are encoded using the multi-view video encoding standard can be decoded according to the Advanced Video Coding standard.
- the base view writing unit 102 writes the compressed base view component to the NAL unit (S102).
- Base view NAL units are written without using a prefix NAL unit as shown in FIG.
- the non-base view / MVC extension parameter specifying unit 104 specifies the NAL unit / header / MVC extension parameter of the non-base / view (S104).
- these parameters include a non-IDR flag (non_idr_flag), a priority ID (priority_id), a view ID (view_id), a time ID (temporal_id), an anchor picture flag (anchor_pic_flag), an inter-view prediction flag (inter_view_flag), etc.
- the base / view / MVC extended parameter calculation unit 106 calculates the NAL unit / header / MVC extended parameter of the base / view (S106).
- these parameters include a non-IDR flag (non_idr_flag), a priority ID (priority_id), a view ID (view_id), a time ID (temporal_id), an anchor picture flag (anchor_pic_flag), an inter-view prediction flag (inter_view_flag), etc.
- the NAL unit / header / MVC extension parameter of the base view is a parameter stored in the MVC extension part of the NAL unit / header of the prefix NAL unit, assuming that the prefix NAL unit is used.
- the non-base view encoding unit 108 encodes the non-base view image using the NAL unit header MVC extension parameters of the base view and the non-base view (S108). .
- non-base view writing unit 110 writes the non-base view component to the NAL unit (S110).
- FIG. 3 is a block diagram showing components of the image decoding apparatus according to Embodiment 1 of the present invention.
- a base / view / NAL unit syntax analysis unit 200 includes a base / view / NAL unit syntax analysis unit 200, a base / view decoding unit 202, a non-base / view / MVC extended parameter syntax analysis unit 204, and a prefix NAL unit search unit. 205, a base-view / MVC extended parameter calculation unit 206, a base-view / MVC extended parameter syntax analysis unit 207, a non-base-view / NAL unit syntax analysis unit 208, and a non-base-view decoding unit 210.
- Each processing unit executes each process shown below.
- FIG. 4 is a flowchart showing the decoding process in the first embodiment of the present invention.
- the base / view / NAL unit syntax analysis unit 200 parses the NAL unit of the base / view component (S200).
- the base view decoding unit 202 decodes the base view image using the encoding process described in the multi-view video encoding standard (S202).
- the non-base view / MVC extended parameter syntax analysis unit 204 parses the NAL unit / header / MVC extended parameters of the non-base view (S204). That is, the non-base view MVC extended parameter syntax analysis unit 204 parses the NAL unit header of the non-base view to acquire the non-base view MVC extended parameter.
- non_idr_flag a non-IDR flag
- priority_id priority ID
- view_id view_id
- time ID temporary_id
- anchor_pic_flag anchor_pic_flag
- inter_view_flag inter-view prediction flag
- the prefix NAL unit search unit 205 searches for the prefix NAL unit of the base view (S205).
- the base view / MVC extended parameter syntax analysis unit 207 determines that the base view NAL unit is based on the prefix NAL unit.
- the header and MVC extension parameters are parsed (S207). In other words, the base-view / MVC extended parameter syntax analysis unit 207 parses the NAL unit / header of the prefix NAL unit to obtain the NAL unit / header / MVC extended parameter of the base / view.
- the base view / MVC extension parameter calculation unit 206 calculates the NAL unit / header / MVC extension parameter of the base view. (S206).
- non_idr_flag a non-IDR flag
- priority_id priority ID
- view_id view_id
- time ID temporary_id
- anchor_pic_flag anchor_pic_flag
- inter_view_flag inter-view flag
- non-base view NAL unit syntax analysis unit 208 parses the NAL unit of the non-base view component (S208).
- the non-base view decoding unit 210 uses the NAL unit, header, and MVC extension parameters of the base view and the non-base view based on the decoding process specified by the multi-view video coding standard.
- the non-base view image is decoded (S210).
- FIG. 5 is a flowchart showing processing for calculating the NAL unit / header / MVC extension parameter of the base view according to the first embodiment of the present invention.
- these parameters include a non-IDR flag (non_idr_flag), a priority ID (priority_id), a view ID (view_id), a time ID (temporal_id), an anchor picture flag (anchor_pic_flag), and an inter-view flag (inter_view_flag). is there.
- the base / view / MVC extended parameter calculation unit 206 of the image decoding device 250 also executes similar processing.
- the base / view / MVC extended parameter calculation unit 106 reads the value of the non-IDR flag (non_idr_flag) from the non-base / view NAL unit / header / MVC extended parameter (S300).
- the base view / MVC extended parameter calculation unit 106 assigns the value of the non-IDR flag (non_idr_flag) of the non-base view to the non-IDR flag (non_idr_flag) of the base view (S302).
- the base / view / MVC extended parameter calculation unit 106 assigns a predetermined value to the priority flag (priority_id) of the base / view (S304).
- the value predetermined as the value of the priority flag (priority_id) is 0.
- the base / view / MVC extended parameter calculation unit 106 assigns a predetermined value to the view ID (view_id) (S306).
- the value predetermined as the value of the view ID (view_id) is also 0.
- the base view / MVC extended parameter calculation unit 106 acquires the value of the non-base view time ID (temporal_id) from the NAL unit / header / MVC extended parameter of the non-base view (S308). .
- the base view / MVC extended parameter calculation unit 106 assigns the value of the acquired time ID (temporal_id) of the non-base view to the time ID (temporal_id) of the base view (S310).
- the base view / MVC extension parameter calculation unit 106 acquires the value of the anchor picture flag (anchor_pic_flag) from the NAL unit / header / MVC extension parameter of the non-base view (S310).
- the base view / MVC extended parameter calculation unit 106 assigns the value of the anchor picture flag (anchor_pic_flag) acquired for the non-base view to the anchor picture flag (anchor_pic_flag) of the base view. (S314).
- the base-view / MVC extended parameter calculation unit 106 sets a predetermined value in the inter-view prediction flag (inter_view_flag) of the base view (S316).
- a predetermined value is 1 as the inter-view prediction flag (inter_view_flag) of the base view.
- FIG. 6 is a diagram illustrating an example of the image coding apparatus according to Embodiment 1 of the present invention.
- the apparatus includes a base / view encoding unit 500, a base / view writing unit 502, a base / view decoding unit 504, a base / view / NAL unit / header / MVC extended parameter calculation unit 506, a non-base /
- the base view image D001 is input to the base view encoding unit 500, and the compressed image D003 is output to both the base view writing unit 502 and the base view decoding unit 504.
- the base-view decoding unit 504 reads the base-view compressed image D003, and outputs the base-view decoded image D005 to the storage unit 510.
- the non-base view image D011 is input to both the non-base view, the NAL unit, the header, the MVC extension parameter specifying unit 508, and the non-base view encoding unit 512.
- the non-base view NAL unit header MVC extension parameter specifying unit 508 sets the non-base view NAL unit header MVC extension parameter D013 to the base view NAL unit header MVC extension. This is output to both the parameter calculation unit 506 and the non-base view encoding unit 512.
- the base / view / NAL unit / header / MVC extension parameter calculation unit 506 acquires a predetermined value D009 and a non-base view NAL unit / header / MVC extension parameter D013, and obtains a base view NAL unit. A value is assigned to the header / MVC extension parameter, and the base / view NAL unit / header / MVC extension parameter D 017 is output to the non-base / view encoding unit 512.
- the non base view encoding unit 512 includes a non base view image D011, a non base view NAL unit header MVC extension parameter D013, and a base view NAL unit header MVC extension parameter. D017 and a base-view decoded image D019 are acquired, and a non-base-view compressed image D021 is output.
- the base view writing unit 502 and the non-base view writing unit 514 obtain the compressed image D003 of the base view and the compressed image D021 of the non-base view, respectively, and the compressed image D007 and N007 in the NAL unit.
- a compressed image D023 is output.
- the image encoding device shown in FIG. 6 is a specific example of the image encoding device 150 shown in FIG.
- the processing performed by each component shown in FIG. 1 is executed by each component shown in FIG.
- the processing of the base view encoding unit 100 is executed by the base view encoding unit 500.
- the processing of the base view writing unit 102 is executed by the base view writing unit 502.
- the processing of the non-base view / MVC extension parameter specifying unit 104 is executed by the non-base / view / NAL unit / header / MVC extension parameter specifying unit 508.
- the processing of the base / view / MVC extended parameter calculation unit 106 is executed by the base / view / NAL unit / header / MVC extended parameter calculation unit 506.
- the processing of the non-base view encoding unit 108 is executed by the non-base view encoding unit 512.
- the processing of the non-base view writing unit 110 is executed by the non-base view writing unit 514.
- FIG. 7 is a diagram illustrating an example of the image decoding apparatus according to Embodiment 1 of the present invention.
- the apparatus includes a splitter unit 600, a prefix NAL unit search unit 601, a base / view decoding unit 602, a base / view / prefix NAL unit / header / MVC extended parameter syntax analysis unit 603, and a storage unit 604.
- a switch unit 605, a non-base view / NAL unit / header / MVC extended parameter syntax analysis unit 608, a base / view / NAL unit / header / MVC extended parameter calculation unit 606, and a non-base / view decoding unit 610. is there.
- the splitter unit 600 acquires one or more views of compressed video D020, a base-view compressed image D003 for the prefix NAL unit search unit 601, and a non-base-view NAL.
- the data is divided into a non-base view compressed image D021 for the unit header MVC extended parameter syntax analysis unit 608.
- the base-view compressed image D003 is also output from the splitter unit 600 to the base-view / prefix NAL unit / header / MVC extended parameter syntax analysis unit 603.
- the prefix NAL unit search unit 601 searches for the prefix NAL unit in the base view, and outputs a signal D016 indicating whether or not it exists to the switch unit 605.
- the prefix NAL unit search unit 601 also outputs the base-view compressed image D004 to the base-view decoding unit 602.
- the base view decoding unit 602 acquires the compressed image D004 of the base view and outputs the decoded image D005 of the base view.
- the output decoded image D005 of the base view is stored in the storage unit 604.
- the base view / prefix NAL unit / header / MVC extended parameter parsing unit 603 reads the compressed image D003 of the base view and reads the base view NAL unit. Outputs header and MVC extension parameter D008.
- the non-base view NAL unit / header / MVC extended parameter syntax analysis unit 608 acquires the compressed image D021 of the non-base view and sets the NAL unit / header / MVC extended parameter of the non-base view. D013 is output to both the base-view-NAL unit-header-MVC extension parameter calculation unit 606 and the non-base-view decoding unit 610.
- the non-base view NAL unit / header / MVC extended parameter syntax analysis unit 608 outputs the non-base view compressed image D015 to the non-base view decoding unit 610.
- the base / view / NAL unit / header / MVC extension parameter calculation unit 606 obtains a predetermined value D009 and a non-base / view NAL unit / header / MVC extension parameter D013 to obtain a base / view NAL unit.
- a value is assigned to the header / MVC extension parameter, and the NAL unit / header / MVC extension parameter D014 of the base view is output to the switch unit 605.
- the switch unit 605 calculates whether or not the prefix NAL unit exists in the compression base view.
- the base view NAL unit header MVC extension parameter D014 is delivered to the non-base view decoding unit 610. If it exists, the switch unit 605 passes the parsed base view NAL unit header MVC extension parameter D010 to the non-base view decoding unit 610.
- the non-base view decoding unit 610 includes a base-view decoded image D019, a base-view NAL unit header, an MVC extension parameter D010, and a parsed non-base-view NAL unit.
- the header MVC extension parameter D013 and the compressed image D015 of the non-base view are acquired, and the decoded image D025 of the non-base view is output.
- image decoding apparatus shown in FIG. 7 is a specific example of the image decoding apparatus 250 shown in FIG.
- the processing performed by each component shown in FIG. 3 is executed by each component shown in FIG.
- the processing of the base / view / NAL unit syntax analysis unit 200 and the base / view decoding unit 202 is executed by the base / view decoding unit 602.
- the processing of the non-base view / MVC extended parameter syntax analysis unit 204 is executed by the non-base / view / NAL unit / header / MVC extended parameter syntax analysis unit 608.
- the processing of the prefix NAL unit search unit 205 is executed by the prefix NAL unit search unit 601.
- the processing of the base / view / MVC extended parameter syntax analysis unit 207 is executed by the base / view / prefix NAL unit / header / MVC extended parameter syntax analysis unit 603.
- the processing of the base / view / MVC extended parameter calculation unit 206 is executed by the base / view / NAL unit / header / MVC extended parameter calculation unit 606.
- the processing of the non-base view NAL unit parsing unit 208 and the non-base view decoding unit 210 is executed by the non-base view decoding unit 610.
- the image encoding device 150 shown in Embodiment 1 encodes a multi-view video without a front NAL unit.
- the base view can be decoded even by the conventional AVC decoder that cannot decode the base view due to the prefix NAL unit included in the encoded multi-view video.
- the image decoding apparatus 250 shown in the first embodiment performs all the views included in the encoded multi-view video regardless of whether or not the encoded multi-view video has a prefix NAL unit. Can be decrypted.
- the prefix NAL unit can be omitted, the encoding efficiency is improved.
- the image decoding apparatus 250 parses the MVC extended parameter of the base view or calculates the MVC extended parameter of the base view depending on whether or not there is a prefix NAL unit. However, the image decoding apparatus 250 calculates the non-base view MVC extension parameter from the non-base view MVC extension parameter regardless of whether there is a prefix NAL unit or not. The image may be decoded.
- the prefix NAL unit search unit 205 the base / view / MVC extended parameter syntax analysis unit 207, and the processing executed by them are not required.
- the image decoding apparatus 250 does not include such a component, and can decode the encoded multi-view video even when the encoded multi-view video does not have a prefix NAL unit.
- FIGS. 2, 4 and 5 The processing flow shown in FIGS. 2, 4 and 5 is an example of the processing flow, and the processing flow executed by the image encoding device 150 and the image decoding device 250 is shown in each figure. It is not limited to the shown processing flow.
- the image encoding device 150 and the image decoding device 250 may execute the processes shown in the drawings in an order different from the order of the processes shown in the drawings, or perform the processes shown in the drawings. It may be executed in parallel.
- the predetermined values assigned to the priority ID (priority_id), the view ID (view_id), and the inter-view prediction flag (view_inter_flag) may be different from the values shown in the first embodiment.
- the image coding apparatus 150 according to the second embodiment includes the same components as the image coding apparatus 150 according to the first embodiment shown in FIG. Then, the image coding apparatus 150 according to the second embodiment performs the same coding process as the coding process according to the first embodiment shown in FIG. Further, the image decoding device 250 according to the second embodiment includes the same components as the image decoding device 250 according to the first embodiment shown in FIG. Then, the image decoding device 250 according to the second embodiment executes a decoding process similar to the decoding process according to the first embodiment shown in FIG.
- the processing for calculating the NAL unit / header / MVC extension parameter of the base view is changed as compared with the first embodiment.
- FIG. 8 is a flowchart showing processing for calculating the NAL unit / header / MVC extension parameter of the base view in the second embodiment of the present invention.
- these parameters include a non-IDR flag (non_idr_flag), a priority ID (priority_id), a view ID (view_id), a time ID (temporal_id), an anchor picture flag (anchor_pic_flag), an inter-view prediction flag (inter_view_flag), etc.
- the base / view / MVC extended parameter calculation unit 206 of the image decoding device 250 also executes similar processing.
- the base view / MVC extended parameter calculation unit 106 reads the value of the NAL unit type (nal_unit_type) from the NAL unit header of the base view (S400).
- the base view / MVC extended parameter calculation unit 106 assigns 0 to the non-IDR flag (non_idr_flag) (S402).
- the base-view / MVC extended parameter calculation unit 106 assigns 1 to the non-IDR flag (non_idr_flag) (S404).
- the base view / MVC extended parameter calculation unit 106 assigns a predetermined value to the priority ID (priority_id) of the base view (S406).
- the value predetermined as the value of the priority ID (priority_id) is 0.
- the base / view / MVC extended parameter calculation unit 106 assigns a predetermined value to the view ID (view_id) (S408).
- the value predetermined as the value of the view ID (view_id) is also 0.
- the base view / MVC extended parameter calculation unit 106 acquires the value of the non-base view time ID (temporal_id) from the NAL unit / header / MVC extended parameter of the non-base view (S410). .
- the base view / MVC extended parameter calculation unit 106 assigns the value of the non-base view acquired time ID (temporal_id) to the base view time ID (temporal_id) (S412).
- the base view / MVC extension parameter calculation unit 106 acquires the value of the anchor picture flag (anchor_pic_flag) from the NAL unit / header / MVC extension parameter of the non-base view (S414).
- the base view / MVC extended parameter calculation unit 106 assigns the value of the anchor picture flag (anchor_pic_flag) acquired for the non-base view to the anchor picture flag (anchor_pic_flag) of the base view. (S416).
- the base-view / MVC extended parameter calculation unit 106 sets a predetermined value in the inter-view prediction flag (inter_view_flag) of the base view (S418).
- a predetermined value is 1 as the inter-view prediction flag (inter_view_flag) of the base view.
- FIG. 9 is a diagram illustrating an example of an image coding apparatus according to Embodiment 2 of the present invention.
- the apparatus includes a base / view encoding unit 700, a base / view writing unit 702, a base / view decoding unit 704, a base / view / NAL unit / header / MVC extended parameter calculation unit 706, a non-base /
- the base view image D001 is input to the base view encoding unit 700, and the base view compressed image D003 is output to both the base view writing unit 702 and the base view decoding unit 704.
- the base-view decoding unit 704 reads the base-view compressed image D003, and outputs the base-view decoded image D005 to the storage unit 710.
- the base view writing unit 702 acquires the compressed image D003 of the base view, outputs the compressed image D007 of the base view in the NAL unit, and the value D018 of the NAL unit type (nal_unit_type) of the base view NAL unit. Is output to the base / view / NAL unit / header / MVC extended parameter calculation unit 706.
- the non-base view image D011 is input to both the non-base view, the NAL unit, the header, the MVC extension parameter specifying unit 708, and the non-base view encoding unit 712.
- the non-base-view / NAL unit / header / MVC extension parameter specifying unit 708 sets the non-base / view NAL unit / header / MVC extension parameter D013 to the base / view / NAL unit / header / MVC extension. This is output to both the parameter calculation unit 706 and the non-base view encoding unit 712.
- the base / view / NAL unit / header / MVC extension parameter calculation unit 706 includes a predetermined value D009, a non-base / view NAL unit / header / MVC extension parameter D013, and a base / view / NAL unit NAL unit.
- Obtain a value (D018) of type (nal_unit_type) assign a value to the NAL unit header of the base view, and an MVC extension parameter, and set the NAL unit header of the base view and the MVC extension parameter D017 to a non-base view code To the conversion unit 712.
- the non-base view encoding unit 712 includes a non-base view image D011, a non-base view NAL unit header MVC extension parameter D013, and a base view NAL unit header MVC extension parameter. D017 and a base-view decoded image D019 are acquired, and a non-base-view compressed image D021 is output.
- the non-base view writing unit 714 obtains the non-base view compressed image D021 and outputs the non-base view compressed image D023 in the NAL unit.
- image encoding device shown in FIG. 9 is a specific example of the image encoding device 150 shown in FIG.
- the processing performed by each component shown in FIG. 1 is executed by each component shown in FIG.
- the processing of the base view encoding unit 100 is executed by the base view encoding unit 700.
- the processing of the base view writing unit 102 is executed by the base view writing unit 702.
- the processing of the non-base view / MVC extension parameter specifying unit 104 is executed by the non-base / view / NAL unit / header / MVC extension parameter specifying unit 708.
- the processing of the base / view / MVC extended parameter calculation unit 106 is executed by the base / view / NAL unit / header / MVC extended parameter calculation unit 706.
- the processing of the non-base view encoding unit 108 is executed by the non-base view encoding unit 712.
- the processing of the non-base view writing unit 110 is executed by the non-base view writing unit 714.
- FIG. 10 is a diagram illustrating an example of an image decoding device according to Embodiment 2 of the present invention.
- the apparatus includes a splitter unit 800, a base / view / NAL unit / header / parameter syntax analysis unit 802, a prefix NAL unit search unit 803, a base / view decoding unit 804, a base / view / prefix NAL unit.
- the splitter unit 800 acquires one or more views of compressed video D020, and compresses a base view compressed image D003 to the base view NAL unit header parameter parsing unit 802.
- the image is divided into a non-base view compressed image D021 for the non-base view NAL unit header MVC extended parameter syntax analysis unit 810.
- the base view NAL unit header parameter parsing unit 802 acquires the compressed image D003 of the base view, and sets the value D018 of the NAL unit type (nal_unit_type) of the base view NAL unit to the base view The data is output to the NAL unit / header / MVC extended parameter calculation unit 808.
- the base / view / NAL unit / header / parameter syntax analysis unit 802 outputs the base-view compressed image D004 to the prefix NAL unit search unit 803.
- the prefix NAL unit search unit 803 searches for the prefix NAL unit in the base view, and outputs a signal D016 indicating whether or not it exists to the switch unit 807.
- the prefix NAL unit search unit 803 also outputs the base view compressed image D006 to the base view decoding unit 804.
- the base view decoding unit 804 acquires the compressed image D006 of the base view and outputs the decoded image D005 of the base view.
- the output decoded image D005 of the base view is stored in the storage unit 806.
- the base view / prefix NAL unit / header / MVC extended parameter parsing unit 805 reads out the compressed image D003 of the base view and reads the base view NAL unit. Outputs header and MVC extension parameter D008.
- the non-base view NAL unit / header / MVC extended parameter syntax analysis unit 810 obtains the compressed image D021 of the non-base view and obtains the NAL unit / header / MVC extended parameter D013 of the non-base view. The data is output to both the base-view-NAL unit-header-MVC extended parameter calculation unit 808 and the non-base-view decoding unit 812. Next, the non-base-view / NAL unit / header / MVC extended parameter syntax analysis unit 810 outputs the non-base-view compressed image D015 to the non-base view decoding unit 812.
- the base-view-NAL unit-header-MVC extended parameter calculation unit 808 includes a predetermined value D009, a base-view-NAL unit NAL unit type (nal_unit_type) value D018, and a non-base-view value.
- the NAL unit / header / MVC extension parameter D013 is acquired, the value is assigned to the NAL unit / header / MVC extension parameter of the base view, and the NAL unit / header / MVC extension parameter D014 of the base view is assigned to the switch unit 807. Output.
- the switch unit 807 calculates the presence of the prefix NAL unit in the compression base view depending on whether or not the prefix NAL unit exists.
- the base view NAL unit header MVC extension parameter D014 is delivered to the non-base view decoding unit 812. If it is present, the switch unit 807 passes the parsed base view NAL unit header MVC extension parameter D008 to the non-base view decoding unit 812.
- the non-base view decoding unit 812 includes a base-view decoded image D019, a base-view NAL unit header, an MVC extension parameter D010, and a parsed non-base-view NAL unit.
- the header MVC extension parameter D013 and the compressed image D015 of the non-base view are acquired, and the decoded image D025 of the non-base view is output.
- image decoding device shown in FIG. 10 is a specific example of the image decoding device 250 shown in FIG.
- the processing performed by each component shown in FIG. 3 is executed by each component shown in FIG.
- the processing of the base / view / NAL unit syntax analysis unit 200 and the base / view decoding unit 202 is executed by the base / view decoding unit 804.
- the processing of the non-base view / MVC extended parameter syntax analysis unit 204 is executed by the non-base / view / NAL unit / header / MVC extended parameter syntax analysis unit 810.
- the processing of the prefix NAL unit search unit 205 is executed by the prefix NAL unit search unit 803.
- the processing of the base / view / MVC extended parameter syntax analysis unit 207 is executed by the base / view / prefix NAL unit / header / MVC extended parameter syntax analysis unit 805.
- the processing of the base / view / MVC extended parameter calculation unit 206 is executed by the base / view / NAL unit / header / MVC extended parameter calculation unit 808.
- the processing of the non-base view NAL unit parsing unit 208 and the non-base view decoding unit 210 is executed by the non-base view decoding unit 812.
- the image encoding device 150 encodes a multi-view video without a front NAL unit, as in the first embodiment. Also, the image decoding apparatus 250 according to the second embodiment, as in the first embodiment, performs the encoded multi-view video regardless of whether or not the encoded multi-view video has a prefix NAL unit. Can be decrypted. Further, the image encoding device 150 and the image decoding device 250 according to the second embodiment use the base view NAL unit type when calculating the MVC extension parameter of the base view, so that the MVC extension with higher accuracy can be performed. Parameters can be calculated.
- the flow of processing shown in FIG. 8 is an example of the flow of processing, and the flow of processing executed by the image encoding device 150 and the image decoding device 250 is illustrated in FIG.
- the processing flow shown in FIG. the image encoding device 150 and the image decoding device 250 may execute the processes shown in FIG. 8 in an order different from the order of the processes shown in FIG. It may be executed in parallel.
- the predetermined values assigned to the priority ID (priority_id), the view ID (view_id), and the inter-view prediction flag (view_inter_flag) may be different from the values shown in the second embodiment.
- FIG. 11 is a block diagram showing components of the image coding apparatus according to Embodiment 3 of the present invention.
- the image encoding device 950 shown in FIG. 11 includes a base / view / MVC extended parameter specifying unit 900, a non-base / view / MVC extended parameter specifying unit 902, a pre-NAL unit write switching unit 904, and a pre-NAL unit.
- a writing unit 906, a base view coding unit 908, a base view writing unit 910, a non-base view coding unit 912, and a non-base view writing unit 914 are provided.
- Each processing unit executes each process shown below.
- FIG. 12 is a flowchart showing an encoding process according to the third embodiment of the present invention.
- the base view / MVC extension parameter specifying unit 900 specifies the NAL unit / header / MVC extension parameter of the base view (S900).
- these parameters include a non-IDR flag (non_idr_flag), a priority ID (priority_id), a view ID (view_id), a time ID (temporal_id), an anchor picture flag (anchor_pic_flag), an inter-view prediction flag (inter_view_flag), etc.
- the non-base view / MVC extension parameter specifying unit 902 specifies the NAL unit / header / MVC extension parameter of the non-base / view (S902).
- these parameters include a non-IDR flag (non_idr_flag), a priority ID (priority_id), a view ID (view_id), a time ID (temporal_id), an anchor picture flag (anchor_pic_flag), an inter-view prediction flag (inter_view_flag), etc.
- the prefix NAL unit write switching unit 904 switches whether to write the base / view NAL unit / header / MVC extension parameter to the prefix NAL unit (S904). For example, the prefix NAL unit write switching unit 904 switches whether to write the base / view NAL unit / header / MVC extension parameters to the prefix NAL unit based on preset information.
- the pre-NAL unit write switching unit 904 switches to write the base / view NAL unit / header / MVC extension parameters to the pre-NAL unit (Yes in step S904)
- the pre-NAL unit write unit 906 Writes the NAL unit, header, and MVC extension parameters of the base view into the prefix NAL unit (S906).
- the pre-NAL unit write switching unit 904 switches so as not to write the base / view NAL unit / header / MVC extension parameters to the pre-NAL unit (No in S904)
- the pre-NAL unit write unit 906 Does not write the base / view NAL unit / header / MVC extension parameters to the prefix NAL unit. That is, the prefix NAL unit is not generated.
- the base view encoding unit 908 encodes the base view image (S908).
- Base-view images are encoded using the multi-view video encoding standard.
- Base-view images that are encoded using the multi-view video encoding standard can be decoded according to the Advanced Video Coding standard.
- the base view writing unit 910 writes the encoded base view component into the NAL unit (S910).
- the non-base view encoding unit 912 encodes the non-base view image using the base view and the non-base view NAL unit header MVC extension parameters (S912). .
- non-base view writing unit 914 writes the non-base view component to the NAL unit (S914).
- FIG. 13 is a diagram illustrating an example of an image coding apparatus according to Embodiment 3 of the present invention.
- the apparatus includes a base / view encoding unit 500, a base / view writing unit 502, a base / view decoding unit 504, a base / view / NAL unit / header / MVC extended parameter specifying unit 1015, a non-base / A view / NAL unit / header / MVC extended parameter specifying unit 508, a storage unit 510, a non-base / view encoding unit 512, a non-base / view writing unit 514, a switch unit 1016, a front NAL unit writing unit 1017, and the like. is there.
- a base / view / NAL unit / header / MVC extended parameter specifying unit 1015 is added instead of the base / view / NAL unit / header / MVC extended parameter calculating unit 506. Yes. Further, a switch unit 1016 and a front NAL unit writing unit 1017 are added.
- the base / view / NAL unit / header / MVC extension parameter specifying unit 1015 sets a value in the base / view NAL unit / header / MVC extension parameter D017 and sets the base / view NAL unit / header / MVC extension parameter D017. And output to the non-base view encoding unit 512.
- the non-base view NAL unit / header / MVC extension parameter specifying unit 508 sets a value for the non-base view NAL unit / header / MVC extension parameter D013, and sets the value to the non-base view NAL unit / header.
- the MVC extension parameter D013 is output to the non-base view encoding unit 512.
- the base view image D001 is input to the base view encoding unit 500, and the base view compressed image D003 is output to both the base view writing unit 502 and the base view decoding unit 504.
- the base-view decoding unit 504 reads the base-view compressed image D003, and outputs the base-view decoded image D005 to the storage unit 510.
- the non-base view image D011 is input to the non-base view encoding unit 512.
- the non base view encoding unit 512 includes a non base view image D011, a non base view NAL unit header MVC extension parameter D013, and a base view NAL unit header MVC extension parameter.
- D017 and a base-view decoded image D019 are acquired, and a non-base-view compressed image D021 is output.
- the base view writing unit 502 and the non-base view writing unit 514 obtain the compressed image D003 of the base view and the compressed image D021 of the non-base view, respectively, and the base view in the NAL unit.
- the compressed image D007 and the non-base view compressed image D023 are output.
- the switch unit 1016 receives the NAL unit / header / MVC of the base / view from the base / view / NAL unit / header / MVC extension parameter specifying unit 1015.
- the extension parameter D017 is output to the prefix NAL unit writing unit 1017.
- the prefix NAL unit writing unit 1017 writes the NAL unit / header / MVC extension parameter of the base view into the prefix NAL unit and outputs the prefix NAL unit D022.
- the image encoding device shown in FIG. 13 is a specific example of the image encoding device 950 shown in FIG.
- the processing performed by each component shown in FIG. 11 is executed by each component shown in FIG.
- the processing of the base / view / MVC extension parameter specifying unit 900 is executed by the base / view / NAL unit / header / MVC extension parameter specifying unit 1015.
- the processing of the non-base view / MVC extension parameter specifying unit 902 is executed by the non-base / view / NAL unit / header / MVC extension parameter specifying unit 508.
- the processing of the pre-NAL unit write switching unit 904 is executed by the switch unit 1016.
- the processing of the prefix NAL unit writing unit 906 is executed by the prefix NAL unit writing unit 1017.
- the processing of the base view encoding unit 908 is executed by the base view encoding unit 500.
- the processing of the base view writing unit 910 is executed by the base view writing unit 502.
- the processing of the non-base view encoding unit 912 is executed by the non-base view encoding unit 512.
- the processing of the non-base view writing unit 110 is executed by the non-base view writing unit 5
- the image encoding device 950 according to Embodiment 3 can switch the presence / absence of the front NAL unit when encoding a multi-view video. Furthermore, the image coding apparatus 950 according to Embodiment 3 specifies the MVC extension parameters for the base view and the non-base view, respectively. In other words, the image encoding device 950 does not have to calculate the MVC extension parameter of the base view.
- the image coding apparatus 950 may not specify the MVC extension parameter of the base view from the MVC extension parameter of the non-base view.
- the image encoder 950 may use the independently specified base view MVC extension parameters for non-base view encoding.
- the conventional AVC decoder can decode the base view included in the encoded multi-view video.
- the image decoding device 250 according to the first and second embodiments can decode all the views included in the multi-view video encoded by the image encoding device 950.
- the image encoding device 950 switches whether to write the prefix NAL unit.
- the image encoding device 950 may be premised on not writing the prefix NAL unit. In this case, the front NAL unit write switching unit 904, the front NAL unit writing unit 906, and the processing executed by them are unnecessary.
- the flow of processing shown in FIG. 12 is an example of the flow of processing, and the flow of processing executed by the image encoding device 950 is illustrated in FIG.
- the process flow shown in FIG. the image encoding device 950 may execute the processes shown in FIG. 12 in an order different from the order of the processes shown in FIG. 12, or may execute the processes shown in FIG. 12 in parallel. Also good.
- the order of the process for specifying the MVC extension parameter for the base view (S900) and the process for specifying the MVC extension parameter for the non-base view (S902) may be reversed. Then, the processing for identifying those MVC extension parameters (S900, S902) is up to the processing for encoding the non-base view image (S912) and the processing for writing the MVC extension parameters (S906, S914). It only has to be executed.
- the priority ID (priority_id), the view ID (view_id), and the inter-view prediction flag (inter_view_flag) may be assigned a predetermined value. For example, 0 is assigned to the priority ID (priority_id), 0 is assigned to the inter-view prediction flag (inter_view_flag), and 1 is assigned to the inter-view prediction flag (inter_view_flag).
- the non-IDR flag (non_idr_flag), the time ID (temporal_id), and the anchor picture flag (anchor_pic_flag) are respectively
- the base view side and the non-base view side may be assigned the same value.
- the non-IDR flag (non_idr_flag) of the base view may be specified based on the NAL unit type as in the second embodiment.
- the image encoding device and the image decoding device shown in the first embodiment, the second embodiment, and the third embodiment encode a multi-view video even when a prefix NAL unit is not used.
- Multi-view video can be decoded.
- the base view can be decoded even by the conventional AVC decoder that cannot decode the base view due to the prefix NAL unit.
- a 2D animation image that has already been encoded by a conventional AVC encoder may be written on a BD (Blu-ray Disc). Then, a video obtained by adding parallax to the 2D animation video written on the BD may be encoded and distributed via the Internet.
- a multi-view video is formed with a 2D animation video written on the BD as a base view and a video obtained by adding parallax to the 2D animation video as a non-base view.
- the image decoding device in each embodiment can decode the multi-view video formed in this way. Then, the decoded multi-view video is reproduced as a 3D animation video. In this way, it is not necessary to add a prefix NAL unit to the base view stream, so that it is easy to reuse existing assets. It is also easy to add another view to the stream generated assuming a single view. Thus, for example, another encoded view may be provided on another recording medium.
- FIG. 14 is a diagram illustrating an example of a data structure of an encoded stream in each embodiment.
- the image encoding apparatus in each embodiment can generate a stream without a prefix NAL unit as shown in FIG. 14 by encoding a multi-view video. Then, the image decoding apparatus according to each embodiment decodes a multi-view video from a stream without a prefix NAL unit as shown in FIG. 14 using MVC extension parameters included in the non-base view. can do.
- the image encoding device in each embodiment may encode only the non-base view.
- the image decoding apparatus according to each embodiment calculates a base view MVC extension parameter from a non-base view MVC extension parameter, thereby encoding a base view encoded without a prefix NAL unit, and a new Can be decoded as a multi-view video.
- the storage medium may be any medium that can record a program, such as a magnetic disk, an optical disk, a magneto-optical disk, an IC card, and a semiconductor memory.
- FIG. 15 is a diagram showing an overall configuration of a content supply system ex100 that realizes a content distribution service.
- the communication service providing area is divided into desired sizes, and base stations ex106 to ex110, which are fixed radio stations, are installed in each cell.
- This content supply system ex100 includes a computer ex111, a PDA (Personal Digital Assistant) ex112, a camera ex113, a mobile phone ex114, a game machine via an Internet service provider ex102, a telephone network ex104, and base stations ex106 to ex110. Each device such as ex115 is connected.
- PDA Personal Digital Assistant
- each device may be directly connected to the telephone network ex104 without going through the base stations ex106 to ex110 which are fixed wireless stations.
- the devices may be directly connected to each other via short-range wireless or the like.
- the camera ex113 is a device that can shoot moving images such as a digital video camera
- the camera ex116 is a device that can shoot still images and movies such as a digital camera.
- the mobile phone ex114 is a GSM (Global System for Mobile Communications) method, a CDMA (Code Division Multiple Access) method, a W-CDMA (Wideband-Code Division Multiple Access L (Semiconductor Access) method, a W-CDMA (Wideband-Code Division Multiple Access L method, or a high access).
- GSM Global System for Mobile Communications
- CDMA Code Division Multiple Access
- W-CDMA Wideband-Code Division Multiple Access L (Semiconductor Access) method
- W-CDMA Wideband-Code Division Multiple Access L method
- a high access A High Speed Packet Access
- PHS Personal Handyphone System
- the camera ex113 and the like are connected to the streaming server ex103 through the base station ex109 and the telephone network ex104, thereby enabling live distribution and the like.
- the content for example, music live video
- the streaming server ex103 streams the content data transmitted to the requested client.
- the client include a computer ex111, a PDA ex112, a camera ex113, a mobile phone ex114, a game machine ex115, and the like that can decode the encoded data.
- Each device that has received the distributed data decodes and reproduces the received data.
- the encoded processing of the captured data may be performed by the camera ex113, the streaming server ex103 that performs the data transmission processing, or may be performed in a shared manner.
- the decryption processing of the distributed data may be performed by the client, the streaming server ex103, or may be performed in a shared manner.
- still images and / or moving image data captured by the camera ex116 may be transmitted to the streaming server ex103 via the computer ex111.
- the encoding process in this case may be performed by any of the camera ex116, the computer ex111, and the streaming server ex103, or may be performed in a shared manner.
- these encoding processing and decoding processing are generally performed in a computer ex111 and an LSI (Large Scale Integration) ex500 included in each device.
- the LSI ex500 may be configured as a single chip or a plurality of chips.
- image encoding and image decoding software is incorporated into some recording medium (CD-ROM, flexible disk, hard disk, etc.) that can be read by the computer ex111 and the like, and the encoding processing and decoding processing are performed using the software. May be.
- moving image data acquired by the camera may be transmitted. The moving image data at this time is data encoded by the LSI ex500 included in the mobile phone ex114.
- the streaming server ex103 may be a plurality of servers or a plurality of computers, and may process, record, and distribute data in a distributed manner.
- the encoded data can be received and reproduced by the client.
- the information transmitted by the user can be received, decrypted and reproduced in real time by the client, and even a user who does not have special rights or facilities can realize personal broadcasting.
- the image encoding method or the image decoding method described in each of the above embodiments may be used for encoding and decoding of each device constituting the content supply system.
- FIG. 16 is a diagram showing the mobile phone ex114 using the image encoding method and the image decoding method described in the above embodiment.
- the cellular phone ex114 includes an antenna ex601 for transmitting and receiving radio waves to and from the base station ex110, a video from a CCD camera, a camera unit ex603 capable of taking a still image, a video shot by the camera unit ex603, and an antenna ex601.
- a display unit ex602 such as a liquid crystal display that displays data obtained by decoding received video and the like, a main body unit composed of a group of operation keys ex604, an audio output unit ex608 such as a speaker for outputting audio, and a voice input Audio input unit ex605 such as a microphone, recorded moving image or still image data, received mail data, moving image data or still image data, etc., for storing encoded data or decoded data
- Recording media ex607 can be attached to media ex607 and mobile phone ex114 And a slot unit ex606 for.
- the recording medium ex607 stores a flash memory element, which is a kind of EEPROM, which is a nonvolatile memory that can be electrically rewritten and erased, in a plastic case such as an SD card.
- the mobile phone ex114 has a power supply circuit ex710, an operation input control unit ex704, an image encoding unit, and a main control unit ex711 configured to control the respective units of the main body unit including the display unit ex602 and the operation key ex604.
- Unit ex712, camera interface unit ex703, LCD (Liquid Crystal Display) control unit ex702, image decoding unit ex709, demultiplexing unit ex708, recording / reproducing unit ex707, modulation / demodulation circuit unit ex706, and audio processing unit ex705 are connected to each other via a synchronization bus ex713. It is connected.
- the power supply circuit ex710 activates the camera-equipped digital mobile phone ex114 by supplying power to each unit from the battery pack. .
- the cellular phone ex114 converts the audio signal collected by the audio input unit ex605 in the audio call mode into digital audio data by the audio processing unit ex705 based on the control of the main control unit ex711 including a CPU, a ROM, a RAM, and the like.
- the modulation / demodulation circuit unit ex706 performs spread spectrum processing, the transmission / reception circuit unit ex701 performs digital analog conversion processing and frequency conversion processing, and then transmits the result via the antenna ex601.
- the mobile phone ex114 amplifies the reception data received by the antenna ex601 in the voice call mode, performs frequency conversion processing and analog-digital conversion processing, performs spectrum despreading processing by the modulation / demodulation circuit unit ex706, and performs analog speech processing by the voice processing unit ex705. After the data is converted, it is output via the audio output unit ex608.
- text data of the e-mail input by operating the operation key ex604 on the main body is sent to the main control unit ex711 via the operation input control unit ex704.
- the main control unit ex711 performs spread spectrum processing on the text data in the modulation / demodulation circuit unit ex706, performs digital analog conversion processing and frequency conversion processing in the transmission / reception circuit unit ex701, and then transmits the text data to the base station ex110 via the antenna ex601.
- the image data captured by the camera unit ex603 is supplied to the image encoding unit ex712 via the camera interface unit ex703.
- the image data captured by the camera unit ex603 can be directly displayed on the display unit ex602 via the camera interface unit ex703 and the LCD control unit ex702.
- the image encoding unit ex712 includes the image encoding device described in the present invention, and the image encoding unit uses the image data supplied from the camera unit ex603 in the image encoding device described in the above embodiment.
- the image data is converted into encoded image data by compression encoding according to the method, and is sent to the demultiplexing unit ex708.
- the mobile phone ex114 sends the sound collected by the sound input unit ex605 during imaging by the camera unit ex603 to the demultiplexing unit ex708 via the sound processing unit ex705 as digital sound data.
- the demultiplexing unit ex708 multiplexes the encoded image data supplied from the image encoding unit ex712 and the audio data supplied from the audio processing unit ex705 by a predetermined method, and the resulting multiplexed data is a modulation / demodulation circuit unit Spread spectrum processing is performed in ex706, digital analog conversion processing and frequency conversion processing are performed in the transmission / reception circuit unit ex701, and then transmission is performed via the antenna ex601.
- the received data received from the base station ex110 via the antenna ex601 is subjected to spectrum despreading processing by the modulation / demodulation circuit unit ex706, and the resulting multiplexing is obtained.
- Data is sent to the demultiplexing unit ex708.
- the demultiplexing unit ex708 separates the multiplexed data into a bit stream of image data and a bit stream of audio data, and a synchronization bus
- the encoded image data is supplied to the image decoding unit ex709 via ex713 and the audio data is supplied to the audio processing unit ex705.
- the image decoding unit ex709 is configured to include the image decoding device described in the present application, and decodes a bit stream of image data using an image decoding method corresponding to the image encoding method described in the above embodiment.
- the reproduced moving image data is generated and supplied to the display unit ex602 via the LCD control unit ex702, whereby the moving image data included in the moving image file linked to the home page is displayed, for example.
- the audio processing unit ex705 converts the audio data into analog audio data, and then supplies the analog audio data to the audio output unit ex608.
- the audio data included in the moving image file linked to the home page is reproduced.
- a decoding device can be incorporated. Specifically, in the broadcasting station ex201, audio data, video data, or a bit stream in which those data are multiplexed is transmitted to a communication or broadcasting satellite ex202 via radio waves. In response, the broadcasting satellite ex202 transmits a radio wave for broadcasting, and a home antenna ex204 having a satellite broadcasting receiving facility receives the radio wave, and the television (receiver) ex300 or the set top box (STB) ex217 or the like. The device decodes the bitstream and reproduces it.
- the reader / recorder ex218 that reads and decodes a bitstream in which image data and audio data recorded on recording media ex215 and ex216 such as CD and DVD as recording media are multiplexed is also shown in the above embodiment. It is possible to implement an image decoding device. In this case, the reproduced video signal is displayed on the monitor ex219. Further, a configuration in which an image decoding device is mounted in a set-top box ex217 connected to a cable ex203 for cable television or an antenna ex204 for satellite / terrestrial broadcasting, and this is reproduced on the monitor ex219 of the television is also conceivable. At this time, the image decoding apparatus may be incorporated in the television instead of the set top box. In addition, a car ex210 having an antenna ex205 can receive a signal from a satellite ex202 or a base station and reproduce a moving image on a display device such as a car navigation ex211 included in the car ex210.
- audio data, video data recorded on a recording medium ex215 such as DVD or BD, or an encoded bit stream in which those data are multiplexed are read and decoded, or audio data, video data or these are recorded on the recording medium ex215.
- a recording medium ex215 such as DVD or BD
- an encoded bit stream in which those data are multiplexed are read and decoded
- audio data, video data or these are recorded on the recording medium ex215.
- the image decoding apparatus or the image encoding apparatus described in the above embodiments also in the reader / recorder ex218 that encodes the above data and records it as multiplexed data.
- the reproduced video signal is displayed on the monitor ex219.
- the recording medium ex215 on which the encoded bit stream is recorded allows other devices and systems to reproduce the video signal.
- the other reproduction device ex212 can reproduce the video signal on the monitor ex213 using the recording medium ex214 on which the encoded bitstream is copied.
- an image decoding device may be mounted in the set-top box ex217 connected to the cable ex203 for cable television or the antenna ex204 for satellite / terrestrial broadcasting and displayed on the monitor ex219 of the television.
- the image decoding apparatus may be incorporated in the television instead of the set top box.
- FIG. 19 is a diagram illustrating a television (receiver) ex300 that uses the image decoding method and the image encoding method described in the above embodiments.
- the television ex300 obtains or outputs a bit stream of video information via the antenna ex204 or the cable ex203 that receives the broadcast, and a tuner ex301 that outputs or outputs the encoded data that is received or demodulated.
- Modulation / demodulation unit ex302 that modulates data for transmission to the outside, and multiplexing / separation unit ex303 that separates demodulated video data and audio data, or multiplexes encoded video data and audio data Is provided.
- the television ex300 decodes each of the audio data and the video data, or encodes the respective information, the audio signal processing unit ex304, the signal processing unit ex306 including the video signal processing unit ex305, and the decoded audio signal. And an output unit ex309 including a display unit ex308 such as a display for displaying the decoded video signal.
- the television ex300 includes an interface unit ex317 including an operation input unit ex312 that receives an input of a user operation.
- the television ex300 includes a control unit ex310 that controls each unit in an integrated manner, and a power supply circuit unit ex311 that supplies power to each unit.
- the interface unit ex317 includes a bridge ex313 connected to an external device such as a reader / recorder ex218, a slot unit ex314 for enabling recording media ex216 such as an SD card, and an external recording such as a hard disk
- a driver ex315 for connecting to a medium, a modem ex316 for connecting to a telephone network, and the like may be included.
- the recording medium ex216 is capable of electrically recording information by using a nonvolatile / volatile semiconductor memory element to be stored.
- Each part of the television ex300 is connected to each other via a synchronous bus.
- the television ex300 receives a user operation from the remote controller ex220 or the like, and demultiplexes the video data and audio data demodulated by the modulation / demodulation unit ex302 by the multiplexing / separation unit ex303 based on the control of the control unit ex310 having a CPU or the like. . Further, the television ex300 decodes the separated audio data by the audio signal processing unit ex304, and the separated video data is decoded by the video signal processing unit ex305 using the image decoding method described in each of the above embodiments. The decoded audio signal and video signal are output to the outside from the output unit ex309.
- the television ex300 may read the encoded bitstream encoded from the recording media ex215 and ex216 such as a magnetic / optical disk and an SD card, not from broadcasting.
- the television ex300 encodes an audio signal and a video signal and transmits them to the outside or writes them to a recording medium or the like.
- the television ex300 receives a user operation from the remote controller ex220 or the like, encodes an audio signal with the audio signal processing unit ex304, and converts the video signal with the video signal processing unit ex305 based on the control of the control unit ex310.
- the image is encoded using the image encoding method described in the above.
- the encoded audio signal and video signal are multiplexed by the multiplexing / demultiplexing unit ex303 and output to the outside.
- these signals may be temporarily stored in the buffers ex320, ex321, etc. so that the audio signal and the video signal are synchronized.
- a plurality of buffers ex318 to ex321 may be provided as shown in the figure, or one or more buffers may be shared.
- data may be stored in the buffer as a buffer material that prevents system overflow and underflow even between the modulation / demodulation unit ex302 and the multiplexing / demultiplexing unit ex303, for example.
- the television ex300 In addition to acquiring audio data and video data from broadcast and recording media, the television ex300 has a configuration for receiving AV input of a microphone and a camera, and even if encoding processing is performed on the data acquired therefrom Good.
- the television ex300 has been described as a configuration capable of the above-described encoding processing, multiplexing, and external output. However, all of these processing cannot be performed, and the above reception, decoding processing, and external
- the configuration may be such that only one of the outputs is possible.
- the decoding process or the encoding process may be performed by either the television ex300 or the reader / recorder ex218.
- the television ex300 and the reader / recorder ex218 may be shared with each other.
- FIG. 20 shows a configuration of the information reproducing / recording unit ex400 when data is read from or written to an optical disk.
- the information reproducing / recording unit ex400 includes elements ex401 to ex407 described below.
- the optical head ex401 irradiates a laser spot on the recording surface of the recording medium ex215 that is an optical disc to write information, and detects information reflected from the recording surface of the recording medium ex215 to read the information.
- the modulation recording unit ex402 electrically drives a semiconductor laser built in the optical head ex401 and modulates the laser beam according to the recording data.
- the reproduction demodulator ex403 amplifies the reproduction signal obtained by electrically detecting the reflected light from the recording surface by the photodetector built in the optical head ex401, separates and demodulates the signal component recorded on the recording medium ex215, and is necessary. To play back information.
- the buffer ex404 temporarily holds information to be recorded on the recording medium ex215 and information reproduced from the recording medium ex215.
- the disk motor ex405 rotates the recording medium ex215.
- the servo control unit ex406 moves the optical head ex401 to a predetermined information track while controlling the rotational drive of the disk motor ex405, and performs a laser spot tracking process.
- the system control unit ex407 controls the entire information reproduction / recording unit ex400.
- the system control unit ex407 uses various types of information held in the buffer ex404, and generates and adds new information as necessary. This is realized by recording / reproducing information through the optical head ex401 while the unit ex403 and the servo control unit ex406 are operated cooperatively.
- the system control unit ex407 includes, for example, a microprocessor, and executes these processes by executing a read / write program.
- the optical head ex401 has been described as irradiating a laser spot, but it may be configured to perform higher-density recording using near-field light.
- FIG. 21 shows a schematic diagram of a recording medium ex215 that is an optical disk.
- Guide grooves grooves
- address information indicating the absolute position on the disc is recorded in advance on the information track ex230 by changing the shape of the groove.
- This address information includes information for specifying the position of the recording block ex231 which is a unit for recording data, and the recording and reproducing apparatus specifies the recording block by reproducing the information track ex230 and reading the address information. be able to.
- the recording medium ex215 includes a data recording area ex233, an inner peripheral area ex232, and an outer peripheral area ex234.
- the area used for recording user data is the data recording area ex233, and the inner circumference area ex232 and the outer circumference area ex234 arranged on the inner circumference or outer circumference of the data recording area ex233 are used for specific purposes other than recording user data. Used.
- the information reproducing / recording unit ex400 reads / writes encoded audio data, video data, or encoded data obtained by multiplexing these data, with respect to the data recording area ex233 of the recording medium ex215.
- an optical disk such as a single-layer DVD or BD has been described as an example.
- the present invention is not limited to these, and an optical disk having a multilayer structure and capable of recording other than the surface may be used. It also has a structure that performs multidimensional recording / reproduction, such as recording information using light of various different wavelengths at the same location on the disc, and recording different layers of information from various angles. It may be an optical disk.
- the car ex210 having the antenna ex205 can receive data from the satellite ex202 and the like, and the moving image can be reproduced on a display device such as the car navigation ex211 that the car ex210 has.
- the configuration of the car navigation ex211 may be, for example, a configuration in which a GPS receiving unit is added in the configuration shown in FIG.
- the mobile phone ex114 and the like can be used in three ways: a transmitting terminal having only an encoder and a receiving terminal having only a decoder. The implementation form of can be considered.
- the image encoding method or the image decoding method shown in each of the above embodiments can be used in any of the above-described devices and systems, and by doing so, the effects described in the above embodiments can be obtained. Can be obtained.
- FIG. 22 shows a configuration of an LSI ex500 that is made into one chip.
- the LSI ex500 includes elements ex501 to ex509 described below, and each element is connected via a bus ex510.
- the power supply circuit unit ex505 starts up to an operable state by supplying power to each unit when the power supply is in an on state.
- the LSI ex500 when performing the encoding process, inputs an AV signal from the microphone ex117, the camera ex113, and the like by the AV I / Oex 509 based on the control of the control unit ex501 having the CPU ex502, the memory controller ex503, the stream controller ex504, and the like. Accept.
- the input AV signal is temporarily stored in an external memory ex511 such as SDRAM.
- the accumulated data is appropriately divided into a plurality of times according to the processing amount and the processing speed, and sent to the signal processing unit ex507.
- the signal processing unit ex507 performs encoding of an audio signal and / or encoding of a video signal.
- the encoding process of the video signal is the encoding process described in each of the above embodiments.
- the signal processing unit ex507 further performs processing such as multiplexing the encoded audio data and the encoded video data according to circumstances, and outputs the result from the stream I / Oex 506 to the outside.
- the output bit stream is transmitted to the base station ex107 or written to the recording medium ex215. It should be noted that data should be temporarily stored in the buffer ex508 so as to be synchronized when multiplexing.
- the LSI ex500 is obtained by reading from the encoded data obtained via the base station ex107 by the stream I / Oex 506 or the recording medium ex215 based on the control of the control unit ex501.
- the encoded data is temporarily stored in the memory ex511 or the like.
- the accumulated data is appropriately divided into a plurality of times according to the processing amount and the processing speed and sent to the signal processing unit ex507.
- the signal processing unit ex507 performs decoding of audio data and / or decoding of video data.
- the decoding process of the video signal is the decoding process described in the above embodiments.
- each signal may be temporarily stored in the buffer ex508 or the like so that the decoded audio signal and the decoded video signal can be reproduced in synchronization.
- the decoded output signal is output from each output unit such as the mobile phone ex114, the game machine ex115, and the television ex300 through the memory ex511 or the like as appropriate.
- the memory ex511 has been described as an external configuration of the LSI ex500.
- a configuration included in the LSI ex500 may be used.
- the buffer ex508 is not limited to one, and a plurality of buffers may be provided.
- the LSI ex500 may be made into one chip or a plurality of chips.
- LSI LSI
- IC system LSI
- super LSI ultra LSI depending on the degree of integration
- the method of circuit integration is not limited to LSI, and implementation with a dedicated circuit or a general-purpose processor is also possible.
- An FPGA that can be programmed after manufacturing the LSI or a reconfigurable processor that can reconfigure the connection and setting of circuit cells inside the LSI may be used.
- the image encoding method, the image encoding device, the image decoding method, and the image decoding device according to the present invention have been described above based on the embodiments. However, the present invention is not limited to these embodiments. Absent. Unless it deviates from the meaning of the present invention, various forms conceived by those skilled in the art are applied to the embodiment, and other forms constructed by combining components and steps in different embodiments are also included in the present invention. It is included in the range.
- the present invention can be used as an image encoding method, an image decoding method, an image encoding device, and an image decoding device, for example, in an image recording device and a reproduction device.
- Base view coding unit 102 Base view writing unit 104, 902 Non-base view MVC extended parameter specifying unit 106, 206 Base view MVC extended parameter calculation Units 108, 512, 712, 912 Non-base view coding units 110, 514, 714, 914 Non-base view writing units 150, 950 Image coding apparatus 200
- Base view decoding unit 204 Non-base view MVC extended parameter syntax analysis unit 205, 601, 803 Prefix NAL unit search unit 207
- Base view MVC extended parameter syntax analysis unit 208 Non- Base view / NAL unit syntax analysis unit 210, 610, 812 Non-base view decoding unit 250 Image decoding device 506, 606, 706, 808 Base view / NAL unit / header / MVC extended parameter calculation unit 508, 708 Non Base view NAL unit header Header MVC extended parameter specifying unit 510, 604, 710, 8
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
まず、実施の形態1について説明する。
次に、実施の形態2について説明する。
次に、実施の形態3について説明する。
上記各実施の形態で示した画像符号化方法または画像復号方法の構成を実現するためのプログラムを記憶メディアに記録することにより、上記各実施の形態で示した処理を独立したコンピュータシステムにおいて簡単に実施することが可能となる。記憶メディアは、磁気ディスク、光ディスク、光磁気ディスク、ICカード、半導体メモリ等、プログラムを記録できるものであればよい。
上記各実施の形態で示した画像符号化方法および装置、画像復号方法および装置は、典型的には集積回路であるLSIで実現される。一例として、図22に1チップ化されたLSIex500の構成を示す。LSIex500は、以下に説明する要素ex501~ex509を備え、各要素はバスex510を介して接続している。電源回路部ex505は電源がオン状態の場合に各部に対して電力を供給することで動作可能な状態に起動する。
102、502、702、910 ベース・ビュー書き込み部
104、902 ノン・ベース・ビュー・MVC拡張パラメータ特定部
106、206 ベース・ビュー・MVC拡張パラメータ算出部
108、512、712、912 ノン・ベース・ビュー符号化部
110、514、714、914 ノン・ベース・ビュー書き込み部
150、950 画像符号化装置
200 ベース・ビュー・NALユニット構文解析部
202、504、602、704、804 ベース・ビュー復号部
204 ノン・ベース・ビュー・MVC拡張パラメータ構文解析部
205、601、803 前置NALユニット検索部
207 ベース・ビュー・MVC拡張パラメータ構文解析部
208 ノン・ベース・ビュー・NALユニット構文解析部
210、610、812 ノン・ベース・ビュー復号部
250 画像復号装置
506、606、706、808 ベース・ビュー・NALユニット・ヘッダ・MVC拡張パラメータ算出部
508、708 ノン・ベース・ビュー・NALユニット・ヘッダ・MVC拡張パラメータ特定部
510、604、710、806 記憶部
600、800 スプリッタ部
603、805 ベース・ビュー・前置NALユニット・ヘッダ・MVC拡張パラメータ構文解析部
605、807、1016 スイッチ部
608、810 ノン・ベース・ビュー・NALユニット・ヘッダ・MVC拡張パラメータ構文解析部
802 ベース・ビュー・NALユニット・ヘッダ・パラメータ構文解析部
900 ベース・ビュー・MVC拡張パラメータ特定部
904 前置NALユニット書き込み切り替え部
906、1017 前置NALユニット書き込み部
1015 ベース・ビュー・NALユニット・ヘッダ・MVC拡張パラメータ特定部
D001、D011 画像
D003、D004、D006、D007、D015、D021、D023 圧縮画像
D005、D019、D025 復号画像
D008、D010、D013、D014、D017 NALユニット・ヘッダ・MVC拡張パラメータ
D009、D018 値
D016 信号
D020 圧縮映像
D022 前置NALユニット
ex100 コンテンツ供給システム
ex101 インターネット
ex102 インターネットサービスプロバイダ
ex103 ストリーミングサーバ
ex104 電話網
ex106、ex107、ex108、ex109、ex110 基地局
ex111 コンピュータ
ex112 PDA
ex113、ex116 カメラ
ex114 カメラ付デジタル携帯電話(携帯電話)
ex115 ゲーム機
ex117 マイク
ex200 デジタル放送用システム
ex201 放送局
ex202 放送衛星(衛星)
ex203 ケーブル
ex204、ex205、ex601 アンテナ
ex210 車
ex211 カーナビゲーション(カーナビ)
ex212 再生装置
ex213、ex219 モニタ
ex214、ex215、ex216、ex607 記録メディア
ex217 セットトップボックス(STB)
ex218 リーダ/レコーダ
ex220 リモートコントローラ
ex230 情報トラック
ex231 記録ブロック
ex232 内周領域
ex233 データ記録領域
ex234 外周領域
ex300 テレビ
ex301 チューナ
ex302 変調/復調部
ex303 多重/分離部
ex304 音声信号処理部
ex305 映像信号処理部
ex306、ex507 信号処理部
ex307 スピーカ
ex308、ex602 表示部
ex309 出力部
ex310、ex501 制御部
ex311、ex505、ex710 電源回路部
ex312 操作入力部
ex313 ブリッジ
ex314、ex606 スロット部
ex315 ドライバ
ex316 モデム
ex317 インターフェース部
ex318、ex319、ex320、ex321、ex404、ex508 バッファ
ex400 情報再生/記録部
ex401 光ヘッド
ex402 変調記録部
ex403 再生復調部
ex405 ディスクモータ
ex406 サーボ制御部
ex407 システム制御部
ex500 LSI
ex502 CPU
ex503 メモリコントローラ
ex504 ストリームコントローラ
ex506 ストリームI/O
ex509 AV I/O
ex510 バス
ex511 メモリ
ex603 カメラ部
ex604 操作キー
ex605 音声入力部
ex608 音声出力部
ex701 送受信回路部
ex702 LCD制御部
ex703 カメラインターフェース部(カメラI/F部)
ex704 操作入力制御部
ex705 音声処理部
ex706 変復調回路部
ex707 記録再生部
ex708 多重分離部
ex709 画像復号部
ex711 主制御部
ex712 画像符号化部
ex713 同期バス
Claims (19)
- 多視点映像を復号する画像復号方法であって、
ベース・ビューのビュー・コンポーネントに含まれる画像を含むNALユニットを構文解析し、
前記ベース・ビューの前記ビュー・コンポーネントに含まれる前記画像を復号し、
ノン・ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを構文解析し、
前記ベース・ビューの前記ビュー・コンポーネントに前置NALユニットが存在しているか否かを検索し、
前記ベース・ビューの前記ビュー・コンポーネントに前記前置NALユニットが存在していない場合、前記ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを算出し、
前記ベース・ビューの前記ビュー・コンポーネントに前記前置NALユニットが存在している場合、前記前置NALユニットから、前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを構文解析し、
前記ノン・ベース・ビューのビュー・コンポーネントに含まれる画像を含むNALユニットを構文解析し、
算出または構文解析された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、構文解析された前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを用いて、前記ノン・ベース・ビューの前記ビュー・コンポーネントに含まれる前記画像を復号する
画像復号方法。 - 前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータは、それぞれ、非IDRフラグ(non_idr_flag)、優先ID(priority_id)、ビューID(view_id)、時間ID(temporal_id)、アンカー・ピクチャ・フラグ(anchor_pic_flag)およびビュー間予測フラグ(inter_view_flag)を含む
請求項1に記載の画像復号方法。 - 前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを算出する際には、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記非IDRフラグ(non_idr_flag)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記非IDRフラグ(non_idr_flag)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記非IDRフラグ(non_idr_flag)の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記優先ID(priority_id)に、予め定められた第1の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビューID(view_id)に、予め定められた第2の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記時間ID(temporal_id)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記時間ID(temporal_id)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記時間ID(temporal_id)の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記アンカー・ピクチャ・フラグ(anchor_pic_flag)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビュー間予測フラグ(inter_view_flag)に、予め定められた第3の値を割り当てる
請求項2に記載の画像復号方法。 - 前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを算出する際には、
前記ベース・ビューのNALユニット・ヘッダから、NALユニット・タイプ(nal_unit_type)の値を取得し、
前記NALユニット・タイプ(nal_unit_type)の値が5である場合、前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記非IDRフラグ(non_idr_flag)に、0を割り当て、
前記NALユニット・タイプ(nal_unit_type)の値が1である場合、前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記非IDRフラグ(non_idr_flag)に、1を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記優先ID(priority_id)に、予め定められた第1の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビューID(view_id)に、予め定められた第2の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記時間ID(temporal_id)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記時間ID(temporal_id)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記時間ID(temporal_id)の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記アンカー・ピクチャ・フラグ(anchor_pic_flag)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビュー間予測フラグ(inter_view_flag)に、予め定められた第3の値を割り当てる
請求項2に記載の画像復号方法。 - 前記優先ID(priority_id)に割り当てられる前記予め定められた第1の値は、0である
請求項3または請求項4に記載の画像復号方法。 - 前記ビューID(view_id)に割り当てられる前記予め定められた第2の値は、0である
請求項3~5のいずれか1項に記載の画像復号方法。 - 前記ビュー間予測フラグ(inter_view_flag)に割り当てられる前記予め定められた第3の値は、1である
請求項3~6のいずれか1項に記載の画像復号方法。 - 多視点映像を符号化する画像符号化方法であって、
ベース・ビューの画像を符号化し、
前置NALユニットなしで、前記ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込み、
ノン・ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを特定し、
前記ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを算出し、
算出された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、特定された前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを用いて、前記ノン・ベース・ビューの画像を符号化し、
前記ノン・ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込む
画像符号化方法。 - 前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータは、それぞれ、非IDRフラグ(non_idr_flag)、優先ID(priority_id)、ビューID(view_id)、時間ID(temporal_id)、アンカー・ピクチャ・フラグ(anchor_pic_flag)およびビュー間予測フラグ(inter_view_flag)を含む
請求項8に記載の画像符号化方法。 - 前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを算出する際には、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記非IDRフラグ(non_idr_flag)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記非IDRフラグ(non_idr_flag)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記非IDRフラグ(non_idr_flag)の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記優先ID(priority_id)に、予め定められた第1の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビューID(view_id)に、予め定められた第2の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記時間ID(temporal_id)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記時間ID(temporal_id)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記時間ID(temporal_id)の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記アンカー・ピクチャ・フラグ(anchor_pic_flag)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビュー間予測フラグ(inter_view_flag)に、予め定められた第3の値を割り当てる
請求項9に記載の画像符号化方法。 - 多視点映像を符号化する画像符号化方法であって、
ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを特定し、
ノン・ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを特定し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを前置NALユニットに書き込むか否かを切り替え、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを前記前置NALユニットに書き込むように切り替えられた場合、特定された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを前記前置NALユニットに書き込み、
前記ベース・ビューの画像を符号化し、
前記ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込み、
特定された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、特定された前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを用いて、前記ノン・ベース・ビューの画像を符号化し、
前記ノン・ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込む
画像符号化方法。 - 前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータは、それぞれ、非IDRフラグ(non_idr_flag)、優先ID(priority_id)、ビューID(view_id)、時間ID(temporal_id)、アンカー・ピクチャ・フラグ(anchor_pic_flag)およびビュー間予測フラグ(inter_view_flag)を含み、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを特定する際には、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記優先ID(priority_id)に、予め定められた第1の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビューID(view_id)に、予め定められた第2の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビュー間予測フラグ(inter_view_flag)に、予め定められた第3の値を割り当てる
請求項11に記載の画像符号化方法。 - 前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを算出する際には、
前記ベース・ビューのNALユニット・ヘッダから、NALユニット・タイプ(nal_unit_type)の値を取得し、
前記NALユニット・タイプ(nal_unit_type)の値が5である場合、前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記非IDRフラグ(non_idr_flag)に、0を割り当て、
前記NALユニット・タイプ(nal_unit_type)の値が1である場合、前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記非IDRフラグ(non_idr_flag)に、1を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記優先ID(priority_id)に、予め定められた第1の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビューID(view_id)に、予め定められた第2の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記時間ID(temporal_id)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記時間ID(temporal_id)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記時間ID(temporal_id)の値を割り当て、
前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから、前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を取得し、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記アンカー・ピクチャ・フラグ(anchor_pic_flag)に、前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータから取得された前記アンカー・ピクチャ・フラグ(anchor_pic_flag)の値を割り当て、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータの前記ビュー間予測フラグ(inter_view_flag)に、予め定められた第3の値を割り当てる
請求項9に記載の画像符号化方法。 - 前記優先ID(priority_id)に割り当てられる前記予め定められた第1の値は、0である
請求項10、12、13のいずれか1項に記載の画像符号化方法。 - 前記ビューID(view_id)に割り当てられる前記予め定められた第2の値は、0である
請求項10、12、13、14のいずれか1項に記載の画像符号化方法。 - 前記ビュー間予測フラグ(inter_view_flag)に割り当てられる前記予め定められた第3の値は、1である
請求項10、12、13、14、15のいずれか1項に記載の画像符号化方法。 - 多視点映像を復号する画像復号装置であって、
ベース・ビューのビュー・コンポーネントに含まれる画像を含むNALユニットを構文解析するベース・ビュー・NALユニット構文解析部と、
前記ベース・ビューの前記ビュー・コンポーネントに含まれる前記画像を復号するベース・ビュー復号部と、
ノン・ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを構文解析するノン・ベース・ビュー・MVC拡張パラメータ構文解析部と、
前記ベース・ビューの前記ビュー・コンポーネントに前置NALユニットが存在しているか否かを検索する前置NALユニット検索部と、
前記ベース・ビューの前記ビュー・コンポーネントに前記前置NALユニットが存在していない場合、前記ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを算出するベース・ビュー・MVC拡張パラメータ算出部と、
前記ベース・ビューの前記ビュー・コンポーネントに前記前置NALユニットが存在している場合、前記前置NALユニットから、前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを構文解析するベース・ビュー・MVC拡張パラメータ構文解析部と、
前記ノン・ベース・ビューのビュー・コンポーネントに含まれる画像を含むNALユニットを構文解析するノン・ベース・ビュー・NALユニット構文解析部と、
算出または構文解析された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、構文解析された前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを用いて、前記ノン・ベース・ビューの前記ビュー・コンポーネントに含まれる前記画像を復号するノン・ベース・ビュー復号部とを備える
画像復号装置。 - 多視点映像を符号化する画像符号化装置であって、
ベース・ビューの画像を符号化するベース・ビュー符号化部と、
前置NALユニットなしで、前記ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込むベース・ビュー書き込み部と、
ノン・ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを特定するノン・ベース・ビュー・MVC拡張パラメータ特定部と、
前記ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを算出するベース・ビュー・MVC拡張パラメータ算出部と、
算出された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、特定された前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを用いて、前記ノン・ベース・ビューの画像を符号化するノン・ベース・ビュー符号化部と、
前記ノン・ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込むノン・ベース・ビュー書き込み部とを備える
画像符号化装置。 - 多視点映像を符号化する画像符号化装置であって、
ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを特定するベース・ビュー・MVC拡張パラメータ特定部と、
ノン・ベース・ビューのNALユニット・ヘッダ・MVC拡張パラメータを特定するノン・ベース・ビュー・MVC拡張パラメータ特定部と、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを前置NALユニットに書き込むか否かを切り替える前置NALユニット書き込み切り替え部と、
前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを前記前置NALユニットに書き込むように切り替えられた場合、特定された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを前記前置NALユニットに書き込む前置NALユニット書き込み部と、
前記ベース・ビューの画像を符号化するベース・ビュー符号化部と、
前記ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込むベース・ビュー書き込み部と、
特定された前記ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータ、および、特定された前記ノン・ベース・ビューの前記NALユニット・ヘッダ・MVC拡張パラメータを用いて、前記ノン・ベース・ビューの画像を符号化するノン・ベース・ビュー符号化部と、
前記ノン・ベース・ビューの符号化された前記画像を含むビュー・コンポーネントをNALユニットに書き込むノン・ベース・ビュー書き込み部とを備える
画像符号化装置。
Priority Applications (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
MX2010011009A MX2010011009A (es) | 2009-04-28 | 2010-04-28 | Metodo de decodificacion de imagenes y aparato de decodificacion de imagenes. |
JP2010534314A JP4633866B2 (ja) | 2009-04-28 | 2010-04-28 | 画像復号方法および画像復号装置 |
EP10769512A EP2426924A1 (en) | 2009-04-28 | 2010-04-28 | Image decoding method, image coding method, image decoding device, and image coding device |
BRPI1001258A BRPI1001258A2 (pt) | 2009-04-28 | 2010-04-28 | método de decodificação de imagem, método de codifcação de imagem, aparelho de decodificação de imagem, e aparelho de codificação de imagem |
RU2010141215/07A RU2477009C2 (ru) | 2009-04-28 | 2010-04-28 | Способ декодирования изображений и устройство декодирования изображений |
AU2010227032A AU2010227032B2 (en) | 2009-04-28 | 2010-04-28 | Image decoding method, and image decoding apparatus |
CN2010800013124A CN101981936B (zh) | 2009-04-28 | 2010-04-28 | 图像解码方法及图像解码装置 |
CA2718447A CA2718447C (en) | 2009-04-28 | 2010-04-28 | Image decoding method, image coding method, image decoding apparatus, and image coding apparatus |
US12/900,824 US8149923B2 (en) | 2009-04-28 | 2010-10-08 | Image decoding method, image coding method, image decoding apparatus, and image coding apparatus |
US13/204,035 US8369414B2 (en) | 2009-04-28 | 2011-08-05 | Image decoding method, image coding method, image decoding apparatus, and image coding apparatus |
US13/204,018 US8908771B2 (en) | 2009-04-28 | 2011-08-05 | Image decoding method, image coding method, image decoding apparatus, and image coding apparatus |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17338009P | 2009-04-28 | 2009-04-28 | |
US61/173,380 | 2009-04-28 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/900,824 Continuation US8149923B2 (en) | 2009-04-28 | 2010-10-08 | Image decoding method, image coding method, image decoding apparatus, and image coding apparatus |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2010125812A1 true WO2010125812A1 (ja) | 2010-11-04 |
Family
ID=43031969
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2010/003041 WO2010125812A1 (ja) | 2009-04-28 | 2010-04-28 | 画像復号方法、画像符号化方法、画像復号装置および画像符号化装置 |
Country Status (13)
Country | Link |
---|---|
US (3) | US8149923B2 (ja) |
EP (1) | EP2426924A1 (ja) |
JP (3) | JP4633866B2 (ja) |
KR (2) | KR101097690B1 (ja) |
CN (3) | CN103124351A (ja) |
AU (1) | AU2010227032B2 (ja) |
BR (1) | BRPI1001258A2 (ja) |
CA (1) | CA2718447C (ja) |
MX (1) | MX2010011009A (ja) |
MY (1) | MY154325A (ja) |
RU (2) | RU2550552C2 (ja) |
TW (3) | TW201336313A (ja) |
WO (1) | WO2010125812A1 (ja) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4962525B2 (ja) * | 2009-04-08 | 2012-06-27 | ソニー株式会社 | 再生装置、再生方法、およびプログラム |
US10237565B2 (en) | 2011-08-01 | 2019-03-19 | Qualcomm Incorporated | Coding parameter sets for various dimensions in video coding |
KR102057194B1 (ko) | 2012-01-19 | 2019-12-19 | 삼성전자주식회사 | 시점 변환을 위한 다시점 비디오 예측 방법 및 그 장치, 시점 변환을 위한 다시점 비디오 예측 복원 방법 및 그 장치 |
US10447990B2 (en) * | 2012-02-28 | 2019-10-15 | Qualcomm Incorporated | Network abstraction layer (NAL) unit header design for three-dimensional video coding |
US20130272371A1 (en) * | 2012-04-16 | 2013-10-17 | Sony Corporation | Extension of hevc nal unit syntax structure |
CN104412602A (zh) * | 2012-04-23 | 2015-03-11 | 三星电子株式会社 | 多视点视频编码方法和装置以及多视点视频解码方法和装置 |
EP4436173A2 (en) | 2012-06-25 | 2024-09-25 | Huawei Technologies Co., Ltd. | Method for signaling a gradual temporal layer access picture |
JP5885604B2 (ja) * | 2012-07-06 | 2016-03-15 | 株式会社Nttドコモ | 動画像予測符号化装置、動画像予測符号化方法、動画像予測符号化プログラム、動画像予測復号装置、動画像予測復号方法及び動画像予測復号プログラム |
US9686542B2 (en) * | 2012-09-05 | 2017-06-20 | Qualcomm Incorporated | Network abstraction layer header design |
US9319657B2 (en) * | 2012-09-19 | 2016-04-19 | Qualcomm Incorporated | Selection of pictures for disparity vector derivation |
US9781416B2 (en) * | 2013-02-26 | 2017-10-03 | Qualcomm Incorporated | Neighboring block disparity vector derivation in 3D video coding |
US9596448B2 (en) | 2013-03-18 | 2017-03-14 | Qualcomm Incorporated | Simplifications on disparity vector derivation and motion vector prediction in 3D video coding |
RU2018128647A (ru) * | 2013-03-21 | 2018-10-05 | Сони Корпорейшн | Устройство кодирования изображения и способ и устройство декодирования изображения и способ |
US9602822B2 (en) * | 2013-04-17 | 2017-03-21 | Qualcomm Incorporated | Indication of cross-layer picture type alignment in multi-layer video coding |
AU2014288482A1 (en) * | 2013-07-12 | 2015-02-26 | Sony Corporation | Image coding device and method |
BR112021020634A2 (pt) * | 2019-05-20 | 2022-03-03 | Panasonic Ip Corp America | Dispositivo de codificação, dispositivo de decodificação, método de codificação e método de decodificação |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007126509A2 (en) | 2006-03-29 | 2007-11-08 | Thomson Licensing | Multi-view video coding method and device |
WO2008085909A2 (en) * | 2007-01-04 | 2008-07-17 | Thomson Licensing | Methods and apparatus for video error correction in multi-view coded video |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030123738A1 (en) * | 2001-11-30 | 2003-07-03 | Per Frojdh | Global motion compensation for video pictures |
JP2006518127A (ja) * | 2003-02-18 | 2006-08-03 | ノキア コーポレイション | ピクチャ復号化方法 |
CN1751518B (zh) | 2003-02-18 | 2010-12-01 | 诺基亚有限公司 | 图像编码方法 |
US7489342B2 (en) * | 2004-12-17 | 2009-02-10 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for managing reference pictures in multiview videos |
US7415069B2 (en) * | 2003-12-09 | 2008-08-19 | Lsi Corporation | Method for activation and deactivation of infrequently changing sequence and picture parameter sets |
JP4180095B2 (ja) * | 2004-07-01 | 2008-11-12 | 三菱電機株式会社 | 記録装置、及び記録方法、及び再生装置及び再生方法 |
JP4442420B2 (ja) * | 2004-12-28 | 2010-03-31 | ソニー株式会社 | 画像再生装置および方法、並びにプログラム |
JP2006294152A (ja) * | 2005-04-12 | 2006-10-26 | Toshiba Corp | 情報記憶媒体、情報記録装置、および情報再生装置 |
KR100943914B1 (ko) * | 2006-01-12 | 2010-03-03 | 엘지전자 주식회사 | 다시점 비디오의 처리 방법 및 장치 |
US8532178B2 (en) * | 2006-08-25 | 2013-09-10 | Lg Electronics Inc. | Method and apparatus for decoding/encoding a video signal with inter-view reference picture list construction |
KR101385808B1 (ko) * | 2006-10-16 | 2014-04-17 | 톰슨 라이센싱 | 비디오 동작 동안 nal 유닛을 이용하여 동시 디코딩 리프레시를 시그널링하는 방법 |
CN101401430B (zh) * | 2006-11-17 | 2012-02-29 | Lg电子株式会社 | 用于解码/编码视频信号的方法及装置 |
EP1985121A4 (en) * | 2006-11-17 | 2010-01-13 | Lg Electronics Inc | METHOD AND DEVICE FOR DECODING / CODING A VIDEO SIGNAL |
RU2420023C1 (ru) * | 2007-03-13 | 2011-05-27 | Нокиа Корпорейшн | Система и способ кодирования и декодирования видеосигналов |
WO2008130528A2 (en) * | 2007-04-17 | 2008-10-30 | Thomson Licensing | Hypothetical reference decoder for multiview video coding |
BR122012013077A2 (pt) * | 2007-04-18 | 2015-07-14 | Thomson Licensing | Sinal tendo parâmetros de decodificação para codificação de vídeo de múltiplas vistas |
CN101690220B (zh) * | 2007-04-25 | 2013-09-25 | Lg电子株式会社 | 用于解码/编码视频信号的方法和装置 |
US20080317124A1 (en) * | 2007-06-25 | 2008-12-25 | Sukhee Cho | Multi-view video coding system, decoding system, bitstream extraction system for decoding base view and supporting view random access |
US8855199B2 (en) * | 2008-04-21 | 2014-10-07 | Nokia Corporation | Method and device for video coding and decoding |
KR101260613B1 (ko) * | 2008-12-26 | 2013-05-03 | 닛뽕빅터 가부시키가이샤 | 화상 부호화 장치, 화상 부호화 방법 및 그 프로그램 및 화상 복호화 장치, 화상 복호화 방법 및 그 프로그램 |
KR101619450B1 (ko) | 2009-01-12 | 2016-05-10 | 엘지전자 주식회사 | 뎁스 정보를 이용한 비디오 신호 처리 방법 및 장치 |
KR101619451B1 (ko) * | 2009-04-17 | 2016-05-10 | 엘지전자 주식회사 | 다시점 비디오 신호의 처리 방법 및 장치 |
-
2010
- 2010-04-28 CN CN2013100185592A patent/CN103124351A/zh active Pending
- 2010-04-28 TW TW102102031A patent/TW201336313A/zh unknown
- 2010-04-28 TW TW099113476A patent/TW201101843A/zh unknown
- 2010-04-28 CN CN2013100185605A patent/CN103124352A/zh active Pending
- 2010-04-28 AU AU2010227032A patent/AU2010227032B2/en active Active
- 2010-04-28 TW TW100112654A patent/TWI489834B/zh active
- 2010-04-28 KR KR1020107022364A patent/KR101097690B1/ko active IP Right Grant
- 2010-04-28 RU RU2011121698/08A patent/RU2550552C2/ru active
- 2010-04-28 KR KR1020117021723A patent/KR20110132380A/ko not_active Application Discontinuation
- 2010-04-28 JP JP2010534314A patent/JP4633866B2/ja active Active
- 2010-04-28 BR BRPI1001258A patent/BRPI1001258A2/pt not_active IP Right Cessation
- 2010-04-28 MX MX2010011009A patent/MX2010011009A/es active IP Right Grant
- 2010-04-28 WO PCT/JP2010/003041 patent/WO2010125812A1/ja active Application Filing
- 2010-04-28 MY MYPI2010004691A patent/MY154325A/en unknown
- 2010-04-28 RU RU2010141215/07A patent/RU2477009C2/ru active
- 2010-04-28 CA CA2718447A patent/CA2718447C/en active Active
- 2010-04-28 CN CN2010800013124A patent/CN101981936B/zh active Active
- 2010-04-28 EP EP10769512A patent/EP2426924A1/en not_active Withdrawn
- 2010-09-14 JP JP2010206114A patent/JP5470206B2/ja active Active
- 2010-10-08 US US12/900,824 patent/US8149923B2/en active Active
-
2011
- 2011-08-05 US US13/204,035 patent/US8369414B2/en active Active
- 2011-08-05 US US13/204,018 patent/US8908771B2/en active Active
-
2013
- 2013-03-25 JP JP2013062646A patent/JP5470482B2/ja active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007126509A2 (en) | 2006-03-29 | 2007-11-08 | Thomson Licensing | Multi-view video coding method and device |
WO2008085909A2 (en) * | 2007-01-04 | 2008-07-17 | Thomson Licensing | Methods and apparatus for video error correction in multi-view coded video |
Non-Patent Citations (5)
Title |
---|
ANTHONY VETRO ET AL.: "Joint Draft 1.0 on Multiview Video Coding", JOINT VIDEO TEAM (JVT) OF ISO/IEC MPEG & ITU-T VCEG 21ST MEETING [JVT-U209], October 2006 (2006-10-01), HANGZHOU, CHINA, pages 1 - 20, XP008167549 * |
ANTHONY VETRO ET AL.: "Joint Draft 2.0 on Multiview Video Coding", JOINT VIDEO TEAM (JVT) OF ISO/IEC MPEG & ITU-T VCEG 22ND MEETING [JVT-V209], January 2007 (2007-01-01), MARRAKECH, MOROCCO, pages 1, XP002523392 * |
ANTHONY VETRO ET AL.: "Joint Draft 5.0 on Multiview Video Coding", JOINT VIDEO TEAM (JVT) OF ISO/IEC MPEG & ITU-T VCEG 25TH MEETING [JVT-Y209], October 2007 (2007-10-01), SHENZHEN, CHINA, pages 1,4 - 5,9-13, XP030007349 * |
THOMAS WIEGAND ET AL.: "Meeting Report, Draft 7", JOINT VIDEO TEAM (JVT) OF ISO/IEC MPEG & ITU-T VCEG 22ND MEETING, January 2007 (2007-01-01), MARRAKECH, MOROCCO, pages 1, 34, XP030007391 * |
WANG YE-KUI ET AL.: "System and Transport Interface of the Emerging SVC Standard", JOINT VIDEO TEAM (JVT) OF ISO/IEC MPEG & ITU-T VCEG 21ST MEETING [JVT-U151], October 2006 (2006-10-01), HANGZHOU, CHINA, pages 1, XP030113300 * |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5470482B2 (ja) | 画像復号方法および画像復号装置 | |
JP5574345B2 (ja) | 符号化方法、エラー検出方法、復号方法、符号化装置、エラー検出装置及び復号装置 | |
KR101680877B1 (ko) | 화상 복호 방법, 화상 부호화 방법, 화상 복호 장치, 화상 부호화 장치, 프로그램, 및 집적 회로 | |
EA021750B1 (ru) | Способ декодирования, декодирующее устройство, способ кодирования и кодирующее устройство | |
KR101440184B1 (ko) | 부호화 방법, 복호 방법, 부호화 장치, 복호 장치, 프로그램, 및 집적 회로 | |
AU2012227355B2 (en) | Image decoding method, and image decoding apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201080001312.4 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010534314 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2718447 Country of ref document: CA |
|
ENP | Entry into the national phase |
Ref document number: 20107022364 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2010/011009 Country of ref document: MX |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010141215 Country of ref document: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010227032 Country of ref document: AU |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10769512 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010769512 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: PI1001258 Country of ref document: BR Kind code of ref document: A2 Effective date: 20101007 |