WO2010123203A2 - 다시점 영상의 참조 픽쳐 리스트 변경 방법 - Google Patents
다시점 영상의 참조 픽쳐 리스트 변경 방법 Download PDFInfo
- Publication number
- WO2010123203A2 WO2010123203A2 PCT/KR2010/001453 KR2010001453W WO2010123203A2 WO 2010123203 A2 WO2010123203 A2 WO 2010123203A2 KR 2010001453 W KR2010001453 W KR 2010001453W WO 2010123203 A2 WO2010123203 A2 WO 2010123203A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- picture
- information
- reference picture
- view
- inter
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 44
- 230000007774 longterm Effects 0.000 claims description 38
- 238000012937 correction Methods 0.000 claims description 22
- 238000012986 modification Methods 0.000 claims description 10
- 230000004048 modification Effects 0.000 claims description 10
- 230000002123 temporal effect Effects 0.000 description 23
- 238000010586 diagram Methods 0.000 description 8
- 230000006835 compression Effects 0.000 description 7
- 238000007906 compression Methods 0.000 description 7
- 230000003044 adaptive effect Effects 0.000 description 6
- 239000013598 vector Substances 0.000 description 6
- 238000012545 processing Methods 0.000 description 3
- 238000009795 derivation Methods 0.000 description 2
- 238000005192 partition Methods 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000002715 modification method Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/24—Systems for the transmission of television signals using pulse code modulation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/597—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N13/00—Stereoscopic video systems; Multi-view video systems; Details thereof
- H04N13/10—Processing, recording or transmission of stereoscopic or multi-view image signals
- H04N13/106—Processing image signals
- H04N13/172—Processing image signals image signals comprising non-image signal components, e.g. headers or format information
- H04N13/178—Metadata, e.g. disparity information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/434—Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
- H04N21/4347—Demultiplexing of several video streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44004—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving video buffer management, e.g. video decoder buffer or video display buffer
Definitions
- the present invention relates to the processing of a multiview video signal.
- Compression coding refers to a series of signal processing techniques that transmit digitized information through a communication line or store the data in a form suitable for a storage medium.
- the object of compression encoding includes objects such as voice, video, text, and the like.
- a technique of performing compression encoding on an image is called video image compression.
- the general feature of the video image is that it has spatial redundancy and temporal redundancy.
- An object of the present invention is to increase the processing efficiency of a multi-view video signal.
- the present invention provides a more efficient method and apparatus for decoding a multiview video signal by identifying inter-view dependencies based on profile information representing a multiview video bitstream.
- the present invention provides a method and apparatus for more efficiently decoding a stereo image signal by defining profile information representing a stereo image.
- the present invention defines header information (eg, NAL unit header information, sequence parameter information, picture parameter information, slice header information, and the like) based on profile information representing a stereo image, thereby providing more efficient multiview image.
- header information eg, NAL unit header information, sequence parameter information, picture parameter information, slice header information, and the like
- profile information representing a stereo image thereby providing more efficient multiview image.
- the present invention seeks to efficiently code a multiview image signal by defining an interview prediction flag indicating whether a coded picture of a current NAL unit is used for inter-view prediction based on profile information representing a stereo image.
- the present invention seeks to efficiently code a multi-view video signal by acquiring interview reference information indicating a dependency relationship between viewpoints based on profile information representing a stereo image, and generating and managing a reference picture list using the reference reference information.
- the present invention seeks to efficiently code a multiview image signal by providing a method for managing reference pictures used for inter-view prediction based on profile information representing a stereo image.
- the present invention it is possible to code a multiview image signal more efficiently by defining profile information representing a stereo image.
- header information eg, NAL unit header information, sequence parameter information, picture parameter information, slice header information, etc.
- the number of bits to be transmitted can be reduced and the DPB You can also increase the coding speed by reducing the burden of the (Decoded Picture Buffer).
- more efficient coding may be performed by using various attribute information of a multiview image based on profile information representing a stereo image.
- FIG. 1 illustrates an example of a configuration of an NAL unit to which attribute information for a multiview image may be added.
- FIG. 2 is a diagram illustrating a prediction structure of a stereo image according to an embodiment to which the present invention is applied.
- 3 to 9 illustrate embodiments to which the present invention is applied and illustrate syntax for restricting coding of multiview image coding information when stereo image decoding.
- FIG. 10 illustrates, as an embodiment to which the present invention is applied, a reference picture for inter-view prediction according to a coding format of a picture of a reference view and a picture of a non-reference view.
- FIG. 11 is a schematic block diagram of a multi-view video signal decoding apparatus to which the present invention is applied.
- FIG. 12 is an embodiment to which the present invention is applied and shows a flowchart for generating a reference picture list.
- FIG. 13 illustrates a method for assigning a reference number to a picture of a reference view when a picture of a reference view is coded as a field according to an embodiment to which the present invention is applied.
- FIG. 14 is an embodiment to which the present invention is applied and shows an internal block diagram of the reference picture list corrector 620
- FIG. 15 is an embodiment to which the present invention is applied and shows the inside of the reference number changing units 623B and 625B. Represents a block diagram.
- 16 to 18 illustrate syntaxes for modifying a picture list referenced for temporal or inter-view prediction as an embodiment to which the present invention is applied.
- the video signal processing method parses a profile identifier and a slice type from a multiview image bitstream, obtains interview reference information based on the profile identifier, and obtains the interview reference information.
- the reference picture list may be initialized by using the reference picture list, and the reference picture list may be modified in consideration of the slice type.
- the interview reference information may be obtained when the multiview image bitstream is a bitstream coded into a stereo image according to the profile identifier.
- the interview reference information may include flag information indicating whether a random access picture is used for inter-view prediction and flag information indicating whether a non-random access picture is used for inter-view prediction. .
- the random access picture is an encoded picture in which all slices refer only to slices in the same access unit, and the non-random access picture may be a picture that is not a random access picture.
- the reference picture list obtains reference picture correction information based on flag information indicating whether to modify the reference picture list, and based on the reference picture correction information, the difference value of the picture number, the long-term picture number, and the like. Parse the difference value of the viewpoint information, and correct it based on the modified reference number derived using the difference value of the picture number, the long-term picture number and the difference information of the viewpoint information, but the slice type is not an I slice.
- the reference picture list may be modified.
- the reference picture modification information is information specifying a reference picture that is changed among reference pictures of the initialized reference picture list, and a difference value of the picture number is a picture number and prediction of the current picture.
- the predicted picture number is a number of a reference picture allocated immediately before
- the difference value of the view information means a difference between a view number of the current picture and a predicted view number.
- the predicted view number may mean a view number of a reference picture allocated immediately before.
- the difference value of the viewpoint information may be parsed based on the reference picture information and the profile identifier.
- a modified reference number is derived based on a predicted picture number and a difference value of the picture number, and if the reference picture is a long-term reference picture, it is assigned to the long-term picture number. Based on the modified reference number can be derived.
- Techniques for compressing and encoding video signals consider spatial redundancy, temporal redundancy, scalable redundancy, and inter-view redundancy.
- the compression coding technique considering the inter-view redundancy is only an embodiment of the present invention, and the technical idea of the present invention may be applied to temporal redundancy, scalable redundancy, and the like.
- the term coding in this specification may include both concepts of encoding and decoding, and may be flexibly interpreted according to the technical spirit and technical scope of the present invention.
- H.264 / AVC supports a parallel scan as the video format of the video signal.
- a parallel scan is a method of scanning pixel lines divided into even and odd lines. In other words, the scan is divided into two field signals.
- one frame includes two fields, a top field and a bottom field.
- the upper field is a field located spatially above two fields constituting one frame
- the lower field is a field located spatially below one of two fields constituting one frame.
- a frame picture or a field picture may be determined and coded for each picture. In this case, a frame picture or a field picture may be determined for each picture by using a picture coding structure flag (field_pic_flag).
- the picture coding structure flag may be a flag indicating whether the current picture is a frame picture or a field picture.
- the picture coding structure flag may be obtained at a slice level to indicate a coding structure of the current picture in units of slices in the current picture.
- the frame picture refers to a picture in which the two fields are collected to form and process one frame
- the field picture refers to a picture in which the two fields are configured and processed as two independent pictures.
- macro blocks in the field picture may also be coded into fields. This may be called a field macro block.
- macro blocks within the frame picture may be coded into frames. This may be referred to as a frame macro block.
- frame coding and field coding may be switched in units of units in which two macro blocks in the frame picture are vertically attached. This is called MB-AFF (Macroblock-Adaptive Frame-Field Coding).
- a field frame change flag (mb_adaptive_frame_field_flag) may be used for the MB-AFF.
- the field frame change flag (mb_adaptive_frame_field_flag) may mean a flag indicating whether there is a switch between a frame macro block and a field macro block in a picture.
- FIG. 1 illustrates an example of a configuration of an NAL unit to which attribute information for a multiview image may be added.
- the NAL unit may include a header of the NAL unit and a raw byte sequence payload (RBSP).
- the header of the NAL unit may include identification information nal_ref_idc indicating whether the NAL unit includes a slice of a reference picture and information nal_unit_type indicating the type of the NAL unit.
- the extended region of the NAL unit header may be limited. For example, when the information indicating the type of the NAL unit is related to scalable video coding or indicates a prefix NAL unit, the NAL unit may also include an extended area of the NAL unit header.
- attribute information of the multiview image may be added according to flag information for identifying whether the multiview image is a coded bitstream.
- the RBSP may include information about the sequence parameter set.
- the sequence parameter set may include an extended area of the sequence parameter set according to profile information.
- the profile information is a profile related to multi-view image coding or a profile related to stereo image coding
- the sequence parameter set may include an extended region of the sequence parameter set.
- the subset sequence parameter set may include an extended region of the sequence parameter set according to the profile information.
- the extended area of the sequence parameter set may include interview reference information indicating inter-view dependency and information related to the level of the bitstream.
- attribute information on a multiview image for example, attribute information that may be included in an extension region of a NAL unit header or attribute information that may be included in an extension region of a sequence parameter set, will be described in detail. do.
- View identification information refers to information for distinguishing a picture at a current view from a picture at a different view.
- POC Picture Order Count
- frame_num When a video video signal is coded, POC (Picture Order Count) and frame_num are used to identify the picture.
- identification information for distinguishing a picture at a current view from a picture at a different view is required.
- the view identification information may be obtained from a header area of a video signal.
- the header area may be an NAL header area or an extension area of the NAL header, or may be a slice header area.
- the view identification information may be used to obtain information of a picture that is different from the current picture, and the video signal may be decoded using the information of the picture at the other view. That is, the viewpoint identification information may be applied throughout the encoding / decoding process of the video signal.
- the viewpoint identification information may be used to indicate the inter-view dependency.
- information about the number of interview reference pictures, viewpoint identification information on the interview reference picture, and the like may be used.
- Information used to indicate the inter-view dependency is referred to as inter-view reference information.
- the viewpoint identification information of the interview reference picture may mean information representing a specific viewpoint used for inter-view prediction, and the specific viewpoint may be viewed as a viewpoint to which the interview reference picture used for inter-view prediction belongs.
- inter-view prediction means prediction using decoded samples of an interview reference picture that is different from the current picture in coding a current picture, and the current picture and the interview reference picture are It may belong to the same access unit.
- the access unit is a set of pictures existing in the same time zone, and the pictures may be defined as having the same picture order count.
- the picture When the picture is divided into a plurality of slices, the picture may be viewed as a set of slices.
- the interview reference picture may mean a reference picture used when performing inter-view prediction with respect to a current picture.
- the frame_num may be applied to a multi-view video coding as it is, using a frame_num in consideration of a view rather than a specific view identifier.
- the random access flag information refers to information for identifying whether a coded picture of a current NAL unit is a random access picture.
- the random access picture refers to an encoded picture in which all slices refer only to slices in the same access unit. That is, the random access picture performs inter-view prediction by coding with reference to slices at different views, but does not perform inter prediction by referring to slices at current views.
- interview reference information for inter-view prediction it may be acquired according to whether it is a random access picture or a non-random access picture, and may be obtained from a data region of a video signal. For example, it can be obtained from the sequence parameter set region.
- the number of all viewpoints may be obtained, and viewpoint identification information for distinguishing each viewpoint may be determined based on the number of all viewpoints.
- information about the number of interview reference pictures indicating the number of reference pictures with respect to the reference direction may be obtained at each viewpoint.
- View identification information of each interview reference picture may be acquired according to the number information of the interview reference picture.
- the interview reference information may be obtained, and the interview reference information may be divided into a case of a random access picture and a case of a non-random access picture.
- random access flag information indicating whether the coded slice in the current NAL is a random access picture.
- Such random access flag information may be obtained from an extended region of the NAL header, or may be obtained from a slice layer region.
- interview reference information acquired according to the random access flag information may be used.
- the reference picture list for the inter-view prediction may be added to the reference picture list.
- the interview reference information may be used when initializing or modifying a reference picture list for inter-view prediction. It may also be used to manage the added reference pictures for the inter-view prediction.
- the reference pictures may be divided into a random access picture and a non-random access picture, and the reference pictures not used when performing inter-view prediction may be marked not to be used.
- the random access flag information may also be applied to a hypothetical reference decoder.
- Inter-view prediction flag information refers to information indicating whether a coded picture of a current NAL unit is used for inter-view prediction.
- the interview prediction flag information may be used in a part where temporal prediction or inter-view prediction is performed.
- the NAL unit may be used together with identification information indicating whether the NAL unit includes a slice of the reference picture.
- the current NAL unit may be a reference picture used only for inter-view prediction.
- the current NAL unit may be used for temporal prediction and inter-view prediction.
- the NAL unit may be stored in the decoded picture buffer even if the NAL unit does not include a slice of the reference picture according to the identification information. This is because the current NAL coded picture according to the interview prediction flag information needs to be stored when it is used for inter-view prediction. In addition to using the flag information and the identification information together, it may indicate whether a coded picture of a current NAL unit is used for temporal prediction and / or inter-view prediction as one identification information.
- Temporal level information refers to information on a hierarchical structure for providing temporal scalability from a video signal.
- the temporal level information may provide an image of a user's desired time zone.
- Priority identification information means information for identifying priority in units of NAL. For example, when a user requests an image of a specific time point and time zone, it may obtain a bitstream (hereinafter, referred to as a sub bitstream) according to the request from the bitstream received from the encoder.
- the sub bitstream may be obtained using viewpoint identification information, temporal level information related to a viewpoint and time desired by a user, and the interview reference information mentioned above.
- the priority information may be allocated in the NAL unit with respect to the sub bitstream to indicate the priority in decoding the coded picture in the NAL unit.
- FIG. 2 is a diagram illustrating a prediction structure of a stereo image according to an embodiment to which the present invention is applied.
- the two viewpoints may be composed of a base view and a non-base view.
- the reference time point may mean a time point that can be coded independently from other time points.
- the reference time point may mean a time point that becomes a reference for decoding among multiviews. That is, it may correspond to a reference view for predicting an image of another view.
- this may mean at least one viewpoint for compatibility with an existing decoder (eg, H.264 / AVC, MPEG-2, MPEG-4, etc.).
- the image of the reference view may be encoded by an image codec method (MPEG-2, MPEG-4, H.26L series, etc.) to form an independent bitstream.
- the non-reference time point may mean a time point other than the reference time point.
- T0 to T3 on the horizontal axis represent frames over time
- V0 to V1 on the vertical axis represent frames according to viewpoints.
- the arrows shown in FIG. 2 indicate the prediction directions of the pictures, and the numbers in the pictures are only one embodiment indicating the decoding order. As such, assuming that only two viewpoints exist, one viewpoint V0 may be a reference viewpoint and the other viewpoint V1 may be a non-reference viewpoint.
- the reference time point V0 may be used as a reference time point of the non-reference time point V1, but the non-reference time point V1 may not be a reference time point of another time point. This is because the reference time point V0 is a time point that can be independently coded. Therefore, when decoding a stereo image as described above, the coding efficiency can be improved by limiting the coding of information required for multi-view image coding.
- 3 to 9 illustrate embodiments to which the present invention is applied and illustrate syntax for restricting coding of multiview image coding information when stereo image decoding.
- the received bitstream may include two viewpoint images. Therefore, the information indicating the number of all viewpoints can always be set to a value indicating only two viewpoints. In this case, information representing the total number of viewpoints obtained from the extended region of the sequence parameter set may not be transmitted. That is, it may be transmitted only when the profile identifier of the received bitstream does not represent a bitstream coded with a stereo image.
- a profile identifier of the received bitstream represents a bitstream coded with a stereo image (S320). If the profile identifier indicates a bitstream coded with a stereo image, information num_views_minus1 indicating the number of all viewpoints may not be obtained. However, if the profile identifier does not represent a bitstream coded with a stereo image, that is, if it represents a bitstream coded with a multiview image, information indicating the number of all viewpoints may be obtained (S330). Here, the information indicating the number of all viewpoints may be information indicating that at least three viewpoint images are present. Information indicating the number of all viewpoints may be obtained from an extended area of a sequence parameter set (S310).
- the non-reference viewpoint V1 is referred to the reference viewpoint V0. That is, the non-reference time point V1 goes only through the interview reference picture in the L0 direction. Therefore, it is not necessary to always obtain information related to the L1 direction among the interview reference information, and may be obtained only when the profile identifier of the received bitstream does not represent a bitstream coded with a stereo image.
- the information related to the L1 direction may be divided into a case where the current slice is a random access picture and a case where the non-random access picture is obtained to consider acquisition.
- information on the number of views of all viewpoints may be obtained from an extended region of a sequence parameter (S410).
- view identification information of each view may be acquired according to the number information of the entire views.
- information on the number of reference views in the L0 direction of the random access picture of each view may be obtained according to the information on the number of all views.
- view identification information of the reference view in the L0 direction of the random access picture may be obtained (S442).
- Information (S444, S445) for the L1 direction can be obtained in the same manner as the steps S441 and S442.
- the profile identifier of the received bitstream is a bitstream coded with a stereo image (S443). If the profile identifier indicates a bitstream coded with a stereo image, the profile identifier may skip without obtaining information about the L1 direction. However, when the profile identifier does not represent a bitstream coded with a stereo image, information S444 and S445 for the L1 direction is obtained.
- the bitstream received according to the profile identifier is a bitstream coded into a multiview image, it may be decoded by the decoder of the stereo profile according to the compatibility indication flag (constraint_setX_flag).
- the compatibility indication flag may mean information indicating which profile decoder the bitstream can be decoded. Referring to FIG. 5, if the bitstream is a bitstream coded as a multiview image and the bitstream is decoded in the decoder of the stereo profile according to the compatibility indication flag information, information on the L1 direction is not obtained. You can skip it.
- a profile identifier does not represent a bitstream coded into a stereo picture, and at the same time, the bitstream is not a bitstream coded into a multiview image or the bitstream is decoded in a decoder of a stereo profile according to the compatibility indication flag information. If not (S510) only to obtain the information (L520, S530) for the L1 direction.
- the processes S510 to S530 may be similarly applied to non-random access pictures. This is shown in the process S540 to S560, detailed description thereof will be omitted.
- the interview reference information described with reference to FIG. 1 is completely different.
- the interview reference information may be substituted by transmitting other information without transmitting.
- the two flag information is meaningful only when the random access picture or non-random access picture corresponds to the reference time point, and if the random access picture or non-random access picture corresponds to the non-reference time point.
- the two flag information may always have a value of false since it will not be used as a reference picture.
- the reference view point V0 can be independently coded without referring to another view point, no inter-view reference information is required, and the non-reference view point V1 refers only to the reference view point V0. Since it can be used as a viewpoint, it is not necessary to send both the number of reference viewpoints for the L0 direction and the L1 direction and the viewpoint identification information of the reference viewpoint.
- coding efficiency may be improved by acquiring interview reference information of a multiview image.
- bitstream received according to the profile identifier is a bitstream coded into a multiview image, it may be decoded by the decoder of the stereo profile according to the compatibility indication flag (constraint_setX_flag).
- the bitstream is a bitstream coded into a multiview image and the bitstream is decoded by a decoder of a stereo profile according to the compatibility indication flag information, or a bit coded into a stereo image according to a profile identifier.
- flag information anchor_ref_flag
- non_anchor_ref_flag flag information indicating whether a non-random access picture is used for inter-view prediction
- the profile identifier does not represent a bitstream coded into a stereo picture, and at the same time the bitstream is not a bitstream coded into a multiview image or the bitstream is decoded in the decoder of the stereo profile according to the compatibility indication flag information. If not (S710) only the interview reference information of the multi-view image can be obtained.
- the interview reference information of the multiview image may be adaptively utilized by checking whether the bitstream is encoded into the stereoscopic image using the profile identifier or the compatibility indication flag information.
- the stereo flag may mean information indicating whether a coded video sequence conforms to a stereo profile.
- the stereo flag may be obtained from an extended region of a sequence parameter set (S810).
- interview reference information of a multiview image may be obtained (S820).
- flag information anchor_ref_flag
- flag information non_anchor_ref_flag
- S850 non_anchor_ref_flag
- the stereo flag is obtained from an extended region of the sequence parameter set (S910), and as a result, when the coded video sequence according to the stereo flag conforms to a stereo profile, You can skip without obtaining information.
- the profile identifier does not represent a bitstream coded with a stereo image (S920)
- information (S930, S940) for the L1 direction is obtained. This may be equally applied to the case where the interview reference picture is a non-random access picture, and is shown in the processes S950 to S970.
- An access unit is a set of pictures existing in the same time zone, and it has been described above that the pictures are defined as having the same picture order count. Furthermore, pictures belonging to the same access unit may be defined as having the same image format. For example, when a picture of a reference view is coded into a frame, a picture of a non-reference view belonging to the same access unit as the reference view will also be coded into a frame, and when the picture of the reference view is coded into a field, The picture at the non-reference point of view will also be coded as a field. That is, the field picture structure flag (field_pic_flag) for the picture of the reference view and the picture of the non-reference view will have the same value.
- field_pic_flag field picture structure flag
- the non-reference picture will also be coded with a macro block adaptive frame / field. That is, the field frame change flag (mb_adaptive_frame_field_flag) for the picture of the reference view and the picture of the non-reference view will have the same value. If the picture of the reference time point is an upper field, the picture of the non-reference time point is also an upper field. If the picture of the reference time point is a lower field, the picture of the non-reference time point may also be a lower field. That is, the lower field indication flag (bottom_field_flag) for the picture of the reference view and the picture of the non-reference view will have the same value.
- the lower field indication flag (bottom_field_flag) may mean a flag indicating whether the current picture is an upper field or a lower field.
- a picture belonging to the non-reference view may perform inter-view prediction by using decoded samples of the picture belonging to the reference view.
- the picture of the reference time point and the picture of the non-reference time point may belong to the same access unit, and may follow the definition of the access unit described above.
- a field pair that is, an upper field and a lower field, for a picture coded as a field may be defined as belonging to the same access unit. It may be. This is hereinafter referred to as a modified access unit.
- FIG. 10 illustrates, as an embodiment to which the present invention is applied, a reference picture for inter-view prediction according to a coding format of a picture of a reference view and a picture of a non-reference view.
- the picture of the non-reference view uses the picture of the reference view as the interview reference picture.
- the picture at the non-reference time point must also be a higher field as described above.
- the present invention is not limited thereto, and according to the modified definition of the access unit, the upper field and the lower field of the reference time point belong to the same access unit.
- the lower field of the reference time point may be used as the interview reference picture.
- a picture of a reference view is coded into a field and a picture of a non-reference view is coded into a macroblock adaptive frame / field.
- the picture of the non-reference time point is the picture of the reference time, that is, the upper field.
- Inter-view prediction may be performed by using and subfields as an interview reference picture.
- the picture at the reference time point is coded with a macroblock adaptive frame / field
- the picture at the non-reference time point is coded with a field.
- the picture of the non-reference view that is, the upper field or the lower field
- FIG. 11 is a schematic block diagram of a multi-view video signal decoding apparatus to which the present invention is applied.
- the decoding apparatus includes an entropy decoding unit 100, an inverse quantization unit 200, an inverse transform unit 300, an intra prediction unit 400, a deblocking filter unit 500, a decoded picture buffer unit 600, and inter prediction. Section 700 and the like.
- the decoded picture buffer unit 600 includes a reference picture list initializer 610 and a reference picture list corrector 620.
- the parsed bitstream is entropy decoded by the entropy decoding unit 100, and coefficients, motion vectors, and the like of each macroblock are obtained.
- the inverse quantization 200 multiplies the received quantized value by a constant constant to obtain a transformed coefficient value, and the inverse transform unit 300 inversely transforms the coefficient value to restore the pixel value.
- the intra prediction unit 400 uses the reconstructed pixel value to perform intra prediction from the decoded samples in the current picture.
- the deblocking filter unit 500 is applied to each coded macroblock in order to reduce block distortion.
- the filter smoothes the edges of the block to improve the quality of the decoded frame. The choice of filtering process depends on the boundary strength and the gradient of the image samples around the boundary.
- the filtered pictures are output or stored in the decoded picture buffer unit 600 for use as a reference picture.
- FIG. 12 is an embodiment to which the present invention is applied and shows a flowchart for generating a reference picture list.
- the decoded picture buffer unit 600 stores or opens previously coded pictures in order to perform inter prediction.
- the picture may be identified using the frame number and the picture order count (POC) of the picture.
- POC picture order count
- Some of the previously coded pictures in MVC have pictures that are different from the current picture. Therefore, in order to utilize these pictures as reference pictures, not only the frame number and the POC but also point identification information identifying a picture's view point are also included. It is available. However, the reference picture used only for the inter-view prediction may be identified using the viewpoint identification information and the POC.
- the decoded picture buffer unit 600 stores pictures referred to for coding the current picture (S1210), and generates a list of pictures referenced for inter prediction.
- a reference picture list for inter-view prediction may be generated.
- a reference picture list for temporal prediction and inter-view prediction may be generated.
- a diagonal reference picture list may be generated.
- a reference picture list for temporal prediction may be indicated, if 1, a reference picture list for inter-view prediction may be indicated, and if 2, a reference picture list for temporal prediction and inter-view prediction may be indicated.
- the diagonal reference picture list may be generated using the reference picture list for the temporal prediction or the reference picture list for the inter-view prediction.
- reference pictures in a diagonal direction may be arranged in a reference picture list for temporal prediction.
- reference pictures in a diagonal direction may be arranged in a reference picture list for inter-view prediction.
- the reference picture list for the temporal prediction and the reference picture list for the inter-view prediction are mainly described. However, the concept of the present invention may be applied to the reference picture list in the diagonal direction.
- the decoded picture buffer unit 600 includes a variable derivation unit (not shown), a reference picture list initialization unit 610, and a reference picture list correction unit 620.
- the variable derivation unit (not shown) derives variables used for initializing the reference picture list (S1210).
- a first frame number FrameNum, a second frame number FrameNumWrap, a picture number PicNum, a long term frame number LongTermFrameIdx, and a long term picture number LongTermPicNum may be used as the variable.
- the first frame number may be determined using a frame number (frame_num) obtained from a slice header region of the short-term reference picture.
- the first frame number may be set to the same value as the frame number (frame_num) obtained from the slice header area of the short-term reference picture.
- the second frame number may be used by the decoded picture buffer unit 600 to allocate a small number for each reference picture, and may be derived based on the first frame number.
- the frame number (frame_num) obtained from the slice header area of the current picture is compared with the first frame number.
- the second frame number is the most among the first frame numbers.
- a large value MaxFrameNum may be derived by subtracting the first frame number.
- the second frame number may be derived to the same value as the first frame number.
- the current picture may mean a picture coded using the short-term reference picture.
- a picture number PicNum or a long term picture number LongTermPicNum to be allocated to a reference picture can be derived.
- the picture number PicNum or the long term picture number LongTermPicNum may mean an identification number of a picture used in the decoded picture buffer unit 600.
- the long-term picture number LongTermPicNum may be used.
- the picture number PicNum or long term picture number LongTermPicNum may be derived based on the field picture flag (field_pic_flag) and the lower field flag (bottom_field_flag) for the current picture.
- the field picture flag may be information indicating whether the current picture is a field picture or a frame picture
- the lower field flag may mean information indicating whether the current picture is a higher field or a lower field when the current picture is a field picture.
- the field picture flag and the lower field flag may be obtained from a slice header area for the current picture.
- the picture number PicNum or long term picture number LongTermPicNum may be derived as the second frame number or the long frame number, respectively.
- the long term frame number may mean a frame number assigned to the long term reference picture.
- a picture number PicNum or a long term picture number LongTermPicNum is derived based on the second frame number or the long frame number, respectively, and the current picture is It may be derived depending on whether the field is the same location as the reference field.
- variables when generating a reference picture list for inter-view prediction, variables may be derived in the same manner as described above, and the reference picture list for inter-view prediction may be initialized using the derived variables.
- the reference picture list initialization unit 610 initializes the reference picture list by using the variables.
- the method of initializing the reference picture list may vary depending on the slice type (S1220).
- the slice type is a P slice or an SP slice
- a reference picture list 0 is generated (S1230).
- reference numbers may be assigned based on the decoding order. For example, they may be arranged according to a picture number or a long picture number which is a variable derived from a first frame number or a long frame number. Short-term reference pictures may be initialized prior to the long-term reference picture.
- the order in which the short-term reference pictures are arranged may be arranged in order from the reference picture having the highest picture number among the reference pictures to the reference picture having the lowest picture number.
- the order in which the long-term reference pictures are arranged may be arranged in order from the reference picture having the lowest long-term picture number among the reference pictures to the reference picture having the highest long-term picture number.
- reference picture list 0 and reference picture list 1 are generated (S1240).
- reference pictures may be arranged according to a picture order count for a short reference picture, and reference pictures may be arranged according to a longterm picture number (LongtermPicNum) for a long reference picture.
- the short-term reference pictures may be initialized prior to the long-term reference picture.
- the order in which the short-term reference pictures of reference picture list 0 are arranged is the order of the reference pictures having the lowest picture output order from the reference picture having the highest picture output order among the reference pictures having the lower picture output order than the current picture.
- the order in which the long-term reference pictures of the reference picture list 0 are arranged may be arranged in the order of the reference picture having the lowest long-term picture number among the reference pictures and the reference picture having the highest long-term picture number.
- the interview reference picture used for the inter-view prediction may include a reference picture used only for inter-view prediction and a reference picture used for temporal and inter-view prediction, as described with reference to FIG. 1.
- the interview reference picture may be added to the reference picture list.
- the interview reference pictures may be arranged based on the interview reference information, and the interview reference information is obtained according to whether the current slice is a random access picture or a non-random access picture, and thus, if the current slice is a random access picture and non- It can be initialized by dividing into a case of a random access picture. It has been described in FIG. 1 that whether the current slice is a random access picture may be determined based on random access flag information. For example, when the current slice is a random access picture according to random access flag information, the interview belongs to the same access unit as the current slice and has the same viewpoint identification information as the viewpoint identification information of the interview reference picture for the current slice.
- a reference picture may be added to the reference picture list, which may be obtained based on the number information of the interview reference pictures for the current slice. For example, interview reference pictures may be added to the reference picture list by the number of the interview reference pictures. Similarly, when the current slice is a non-random access picture, the reference picture list for inter-view prediction may be initialized using the interview reference information for the current slice.
- the above-described reference picture list generation method may be applied in the same manner.
- Inter-view prediction flag information may be obtained for a picture of a reference view.
- a reference number for inter-view prediction is assigned to the picture of the reference view. It is allocated and may be added to a reference picture list for temporal prediction of the picture at the non-reference time point. Meanwhile, the picture of the reference time point may be added to the reference picture list for the temporal prediction based on a random access flag. That is, a reference picture list for inter-view prediction may be generated according to whether the picture of the non-reference view is a random access picture or a non-random access picture.
- anchor_ref_flag indicating whether the random access picture is used for inter-view prediction
- flag information non_anchor_ref_flag
- a picture coding structure flag (field_pic_flag) may be obtained for the picture at the reference view.
- a picture coding structure flag field_pic_flag
- a lower field indication flag bottom_field_flag may be additionally obtained.
- the lower field indication flag bottom_field_flag
- FIG. 13 illustrates a method for assigning a reference number to a picture of a reference view when a picture of a reference view is coded as a field according to an embodiment to which the present invention is applied.
- an upper field of a reference view in a reference picture list for inter-view prediction may be set to have a reference number smaller than a lower field of the reference view belonging to the same access unit.
- the lower field of the reference time point may be set to have a reference number smaller than the upper field of the reference time point belonging to the same access unit.
- Reference numbers may be assigned to pictures at the reference time, that is, the upper field and the lower field, based on the coding format of the picture at the non-reference time.
- a reference number smaller than a lower field may be allocated to an upper field among the pictures of the reference time point.
- a reference number smaller than an upper field may be allocated to a lower field among the pictures of the reference time point.
- the pixel value of the macro block in the picture at the non-reference time point may be predicted based on the generated reference picture list.
- the reference picture list corrector 620 improves the compression ratio by allocating a smaller number to a picture frequently referenced in the initialized reference picture list (S1250). This is because a reference number designating a reference picture is encoded in units of blocks, so that a smaller number of bits is assigned as the reference number becomes smaller.
- FIG. 14 is an embodiment to which the present invention is applied and shows an internal block diagram of the reference picture list corrector 620
- FIG. 15 is an embodiment to which the present invention is applied and shows the inside of the reference number changing units 623B and 625B. Represents a block diagram.
- the reference picture list corrector 620 includes a slice type checker 621, a reference picture list 0 corrector 623, and a reference picture list 1 corrector 625.
- the slice type checker 621 determines whether to modify the reference picture list 0 or the reference picture list 1 by checking the type of the slice to be decoded.
- the reference picture list 0 correction unit 623 and the reference picture list 1 correction unit 625 may modify the reference picture list 0 when the slice types are not I slice and SI slice, and the slice type is B slice. In this case, reference picture list 1 may be modified.
- the reference picture list 0 correction unit 623 or the reference picture list 1 correction unit 625 includes reference picture correction information acquisition units 623A and 625A and reference number change units 623B and 625B, respectively.
- reference picture modification information acquisition units 623A and 625A modify the reference picture list according to flag information (ref_pic_list_modification_flag) indicating whether to modify a reference picture list
- reference picture modification information (modification_of_pic_nums_idc). ) Can be obtained.
- the reference picture modification information may mean information for specifying a reference picture to be modified among reference pictures of the initialized reference picture list.
- the difference value of the picture number and the long-term picture number may be obtained based on the reference picture correction information.
- the difference value of the picture number may mean a difference between a picture number of a current picture and a predicted picture number, and the predicted picture number may mean a number of a reference picture allocated immediately before.
- the reference number changing unit 624A for the reference picture is operated, and the reference number changing unit 624A for the reference picture is the same.
- a difference value of the picture number can be obtained.
- the reference number changing unit 624B for the long-term reference picture is operated, and the reference number changing unit 624B for the long-term reference picture is operated.
- a long-term picture number of a long-term reference picture specified according to the reference picture correction information may be obtained.
- the reference number change end unit 624D may operate to terminate the reference number change.
- the reference number changing units 623B and 625B may modify the reference picture list by changing the reference number of the reference picture using the reference picture correction information, the difference value of the picture number, and the long-term picture number.
- a modified reference number may be derived based on the difference between the predicted picture number and the picture number.
- the modified reference number may be derived by subtracting or adding a difference value of the reference number from the predicted picture number according to the reference picture correction information.
- the reference picture list may be modified based on the long-term picture number.
- the aforementioned reference picture list modification method may be equally applied. Only the additional part of multi-view video coding will be mentioned here.
- reference picture modification information is a reference that is modified among reference pictures of the initialized reference picture list or interview reference pictures of the reference picture list for initialized inter-view prediction. It may mean information specifying a picture or an interview reference picture.
- the reference picture correction information obtaining unit may additionally obtain a difference value of viewpoint information based on the reference picture correction information in addition to the difference between the reference picture correction information, the picture number, and the long-term picture number.
- the difference value of the viewpoint information may mean a difference between a viewpoint number of a current picture and a predicted viewpoint number, and the predicted viewpoint number may mean a viewpoint number of a reference picture immediately allocated.
- the reference number of the interview reference picture of the reference picture list for inter-view prediction may be changed using the obtained difference value of the view information.
- the reference number changing unit 624C for the interview reference picture operates, and the reference number changing unit for the interview reference picture ( In operation 624C, a difference value of the viewpoint information may be obtained.
- the modified view number picViewIdxLX may be derived based on the difference value of the view information and the predicted view number.
- the modified view number may be derived by subtracting or adding a difference value of the view information from the predicted view number according to the reference picture correction information.
- a target view identifier targetViewID may be derived based on the modified view number.
- the target viewpoint identifier may be derived from viewpoint identification information of the interview reference picture having the modified viewpoint number.
- the target viewpoint identifier may be derived based on random access flag information. That is, the current slice may be derived according to whether the current slice is a random access picture or a non-random access picture.
- the reference picture list for inter-view prediction may be modified using an interview reference picture having the same view identification information as the target view identifier.
- 16 to 18 illustrate syntaxes for modifying a picture list referenced for temporal or inter-view prediction as an embodiment to which the present invention is applied.
- the above-described method of modifying the reference picture list may be applied in the same way, but the method of modifying the reference picture list for inter-view prediction may be different. Can be. This is because in the case of a bitstream coded with a stereo image, only two viewpoints exist, and a difference value of the viewpoint information may not be coded.
- a difference value between the viewpoint information may be obtained based on the reference picture correction information and a profile identifier.
- the reference picture modification information indicates that the reference number of the interview reference picture is to be changed, and the bitstream in which the profile identifier is received is coded into a stereo image. Only when it is indicated that the bitstream is not present (S1620), a difference value of the viewpoint information may be obtained (S1630).
- the reference picture modification information indicates that the reference number of the interview reference picture is to be changed, a difference value of the viewpoint information may not be obtained when the reference picture is a bitstream coded with the stereo image.
- the difference value of the viewpoint information may be obtained by the same process (S1650, S1660).
- the received bitstream according to the profile identifier is a bitstream coded as a multiview image
- it may be decoded in the decoder of the stereo profile according to the compatibility indication flag (constraint_setX_flag).
- the bitstream is a bitstream coded as a multiview image and the bitstream is decoded by the decoder of the stereo profile according to the compatibility indication flag information
- the difference value of the viewpoint information may not be obtained. .
- the reference picture modification information indicates that the reference number of the interview reference picture is to be changed, that the bitstream in which the profile identifier is received is not a bitstream coded into a stereo image, and that the bitstream in which the profile identifier is received is changed again.
- the difference value of the viewpoint information may be obtained only when the bitmap is not a bitstream coded as a point image or when the compatibility indication flag indicates that the bitstream cannot be decoded by the decoder of the stereo profile (S1710 and S1730). S1720, S1740).
- FIG. 18 illustrates a method of deriving a target view identifier in the case of a stereo image.
- the target view identifier may be derived according to the method of deriving the target view identifier described above in the case of not being a bitstream coded with a stereo image (S1810).
- the target viewpoint identifier may be derived based on the viewpoint identification information of the reference viewpoint (S1840). ).
- the reference picture manager (not shown) manages the reference picture in order to more flexibly implement inter prediction (S1260). For example, an adaptive memory management control method and a sliding window method may be used. This is to manage the memory of the reference picture and the non-reference picture into one memory and manage them efficiently with less memory. In multi-view video coding, since pictures in a view direction have the same picture order count, information for identifying the view point of each picture may be used for their marking. Reference pictures managed through this process may be used in the inter predictor 700.
- the inter prediction unit 700 may perform inter-prediction using a reference picture or an interview reference picture stored in the decoded picture buffer unit 600. Macro blocks coded in inter mode may be divided into macro block partitions, and each macro block partition may be predicted from one or two reference pictures or an interview reference picture.
- the inter prediction unit 700 compensates for the movement of the current block by using the information transmitted from the entropy decoding unit 100.
- a motion vector of blocks neighboring the current block may be obtained from a video signal, and a motion vector prediction value of the current block may be obtained from the obtained motion vector.
- the motion of the current block may be compensated using the obtained motion vector prediction value and the difference vector obtained from the video signal.
- the motion compensation may be performed using one reference picture or may be performed using a plurality of reference pictures.
- the direct prediction mode is a coding mode for predicting motion information of a current block from motion information of a coded block. This method improves compression efficiency because the number of bits necessary for encoding motion information is saved.
- a temporal direct mode predicts motion information of a current block by using motion information correlation in the time direction. Similar to this method, the present invention can predict the motion information of the current block using the motion information correlation in the view direction. Pictures predicted in the inter mode and pictures predicted in the intra mode through the above process are selected according to the prediction mode to reconstruct the current picture.
- the present invention can be used to process a multiview video signal.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Library & Information Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
Abstract
Description
Claims (8)
- 다시점 영상 비트스트림으로부터 프로파일 식별자 및 슬라이스 타입을 파싱하는 단계;상기 프로파일 식별자에 기초하여 인터뷰 참조 정보를 획득하는 단계;상기 인터뷰 참조 정보를 이용하여 참조 픽쳐 리스트를 초기화하는 단계; 및상기 슬라이스 타입을 고려하여 상기 참조 픽쳐 리스트를 수정하는 단계를 포함하는 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
- 제 1항에 있어서, 상기 프로파일 식별자에 따라 상기 다시점 영상 비트스트림이 스테레오 영상으로 코딩된 비트스트림인 경우에 상기 인터뷰 참조 정보를 획득하는 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
- 제 2항에 있어서, 상기 인터뷰 참조 정보는 랜덤 액세스 픽쳐가 시점간 예측을 위해 이용되는지 여부를 나타내는 플래그 정보와 넌-랜덤 액세스 픽쳐가 시점간 예측을 위해 이용되는지 여부를 나타내는 플래그 정보를 포함하는 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
- 제 3항에 있어서, 상기 랜덤 액세스 픽쳐라 함은 모든 슬라이스들이 동일 액세스 단위내의 슬라이스만을 참조하는 부호화된 픽쳐이며, 넌-랜덤 액세스 픽쳐는 랜덤 액세스 픽쳐가 아닌 픽쳐인 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
- 제 1항에 있어서, 상기 참조 픽쳐 리스트를 수정하는 단계는 참조 픽쳐 리스트를 수정할 지 여부를 나타내는 플래그 정보를 획득하는 단계;상기 플래그 정보에 기초하여 참조 픽쳐 수정 정보를 획득하는 단계;상기 참조 픽쳐 수정 정보에 기초하여 픽쳐 번호의 차이값, 장기 픽쳐 번호 및 시점 정보의 차이값을 파싱하는 단계; 및상기 픽쳐 번호의 차이값, 장기 픽쳐 번호 및 시점 정보의 차이값을이용하여 수정된 참조 번호를 유도하는 단계를 더 포함하는 것을 특징으로 하되,상기 슬라이스 타입이 I 슬라이스가 아닌 경우에 상기 참조 픽쳐 리스트를 수정하는 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
- 제 5항에 있어서, 상기 참조 픽쳐 수정 정보라 함은 상기 초기화된 참조 픽쳐 리스트의 참조 픽쳐들 중에서 변경되는 참조 픽쳐를 특정하는 정보이고,상기 픽쳐 번호의 차이값이라 함은 현재 픽쳐의 픽쳐 번호와 예측된 픽쳐 번호의 차이를 나타내고, 상기 예측된 픽쳐 번호란 직전에 할당된 참조 픽쳐의 번호이며, 상기 시점 정보의 차이값이라 함은 상기 현재 픽쳐의 시점 번호와 예측된 시점 번호의 차이를 의미하고, 상기 예측된 시점 번호란 직전에 할당된 참조 픽쳐의 시점 번호인 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
- 제 5항에 있어서, 상기 시점 정보의 차이값은 상기 참조 픽쳐 정보 및 상기 프로파일 식별자에 기초하여 파싱하는 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
- 제 5항에 있어서, 상기 참조 픽쳐가 단기 참조 픽쳐인 경우에는 예측된 픽쳐 번호와 픽쳐 번호의 차이값에 기초하여 수정된 참조 번호를 유도하고, 상기 참조 픽쳐가 장기 참조 픽쳐인 경우에는 장기 픽쳐 번호에 기초하여 수정된 참조 번호를 유도하는 것을 특징으로 하는 참조 픽쳐 리스트 변경 방법.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/265,657 US8933989B2 (en) | 2009-04-22 | 2010-03-09 | Reference picture list changing method of multi-view video |
EP10767218.0A EP2424240A4 (en) | 2009-04-22 | 2010-03-09 | METHOD FOR MODIFYING REFERENCE COPY LISTS FOR A MORE VIEWED VIDEO |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US17180609P | 2009-04-22 | 2009-04-22 | |
US61/171,806 | 2009-04-22 | ||
US22109709P | 2009-06-29 | 2009-06-29 | |
US22110009P | 2009-06-29 | 2009-06-29 | |
US61/221,097 | 2009-06-29 | ||
US61/221,100 | 2009-06-29 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2010123203A2 true WO2010123203A2 (ko) | 2010-10-28 |
WO2010123203A3 WO2010123203A3 (ko) | 2010-12-23 |
Family
ID=43011561
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/KR2010/001453 WO2010123203A2 (ko) | 2009-04-22 | 2010-03-09 | 다시점 영상의 참조 픽쳐 리스트 변경 방법 |
Country Status (4)
Country | Link |
---|---|
US (1) | US8933989B2 (ko) |
EP (1) | EP2424240A4 (ko) |
KR (1) | KR20110139304A (ko) |
WO (1) | WO2010123203A2 (ko) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9560369B2 (en) | 2012-09-28 | 2017-01-31 | Lg Electronics Inc. | Video decoding method and apparatus using the same |
CN112188246A (zh) * | 2020-09-30 | 2021-01-05 | 深圳技威时代科技有限公司 | 一种视频云存储方法 |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5267886B2 (ja) * | 2009-04-08 | 2013-08-21 | ソニー株式会社 | 再生装置、記録媒体、および情報処理方法 |
JP2012023651A (ja) * | 2010-07-16 | 2012-02-02 | Sony Corp | 画像処理装置と画像処理方法 |
US9118928B2 (en) * | 2011-03-04 | 2015-08-25 | Ati Technologies Ulc | Method and system for providing single view video signal based on a multiview video coding (MVC) signal stream |
US11647197B2 (en) | 2011-06-30 | 2023-05-09 | Velos Media, Llc | Context initialization based on slice header flag and slice type |
US9338465B2 (en) * | 2011-06-30 | 2016-05-10 | Sharp Kabushiki Kaisha | Context initialization based on decoder picture buffer |
EP3902258A1 (en) | 2011-06-30 | 2021-10-27 | Telefonaktiebolaget LM Ericsson (publ) | Reference picture signaling |
US9674525B2 (en) | 2011-07-28 | 2017-06-06 | Qualcomm Incorporated | Multiview video coding |
US9635355B2 (en) | 2011-07-28 | 2017-04-25 | Qualcomm Incorporated | Multiview video coding |
US10034018B2 (en) | 2011-09-23 | 2018-07-24 | Velos Media, Llc | Decoded picture buffer management |
US9264717B2 (en) | 2011-10-31 | 2016-02-16 | Qualcomm Incorporated | Random access with advanced decoded picture buffer (DPB) management in video coding |
US10003817B2 (en) | 2011-11-07 | 2018-06-19 | Microsoft Technology Licensing, Llc | Signaling of state information for a decoded picture buffer and reference picture lists |
US9258559B2 (en) | 2011-12-20 | 2016-02-09 | Qualcomm Incorporated | Reference picture list construction for multi-view and three-dimensional video coding |
ES2571863T3 (es) * | 2012-01-17 | 2016-05-27 | ERICSSON TELEFON AB L M (publ) | Gestión de listas de imágenes de referencia |
US9369710B2 (en) | 2012-02-06 | 2016-06-14 | Qualcomm Incorporated | Reference picture list modification for video coding |
CN107835428B (zh) * | 2012-03-02 | 2021-09-24 | 太阳专利托管公司 | 图像编码方法、图像解码方法、图像编码装置、图像解码装置及图像编码解码装置 |
US10200709B2 (en) | 2012-03-16 | 2019-02-05 | Qualcomm Incorporated | High-level syntax extensions for high efficiency video coding |
KR101456501B1 (ko) * | 2012-04-15 | 2014-11-03 | 삼성전자주식회사 | 참조픽처리스트 변경이 가능한 인터 예측 방법과 그 장치 |
WO2014010192A1 (ja) * | 2012-07-09 | 2014-01-16 | パナソニック株式会社 | 画像符号化方法、画像復号方法、画像符号化装置及び画像復号装置 |
US9992490B2 (en) * | 2012-09-26 | 2018-06-05 | Sony Corporation | Video parameter set (VPS) syntax re-ordering for easy access of extension parameters |
US9313500B2 (en) | 2012-09-30 | 2016-04-12 | Microsoft Technology Licensing, Llc | Conditional signalling of reference picture list modification information |
KR102091139B1 (ko) * | 2012-10-08 | 2020-03-20 | 삼성전자주식회사 | 다시점 비디오의 인터 레이어 예측 구조에 따른 비디오 스트림 부호화 방법 및 그 장치, 다시점 비디오의 인터 레이어 예측 구조에 따른 비디오 스트림 복호화 방법 및 그 장치 |
TW201415898A (zh) * | 2012-10-09 | 2014-04-16 | Sony Corp | 影像處理裝置及方法 |
WO2014081226A1 (ko) * | 2012-11-21 | 2014-05-30 | 엘지전자 주식회사 | 영상 디코딩 방법 및 이를 이용하는 장치 |
US10136119B2 (en) * | 2013-01-10 | 2018-11-20 | Qualcomm Incoporated | View synthesis in 3D video |
CN103414889B (zh) * | 2013-04-09 | 2016-06-22 | 宁波大学 | 一种基于双目恰可察觉失真的立体视频码率控制方法 |
WO2015025747A1 (ja) * | 2013-08-22 | 2015-02-26 | ソニー株式会社 | 符号化装置、符号化方法、送信装置、復号化装置、復号化方法および受信装置 |
WO2015099401A1 (ko) * | 2013-12-24 | 2015-07-02 | 주식회사 케이티 | 멀티 레이어 비디오 신호 인코딩/디코딩 방법 및 장치 |
WO2015147426A1 (ko) | 2014-03-24 | 2015-10-01 | 주식회사 케이티 | 멀티 레이어 비디오 신호 인코딩/디코딩 방법 및 장치 |
WO2016119746A1 (zh) * | 2015-01-29 | 2016-08-04 | 同济大学 | 图像编码方法及装置、图像解码方法及装置 |
EP3996463B1 (en) | 2018-04-05 | 2023-08-16 | Telefonaktiebolaget LM Ericsson (publ) | Multi-stage sidelink control information |
KR102642697B1 (ko) * | 2018-12-21 | 2024-03-04 | 레이아 인코포레이티드 | 뷰-말단 표시자를 갖는 멀티뷰 디스플레이 시스템, 멀티뷰 디스플레이 및 방법 |
WO2020185889A1 (en) * | 2019-03-11 | 2020-09-17 | Futurewei Technologies, Inc. | Sub-picture level filtering in video coding |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7710462B2 (en) * | 2004-12-17 | 2010-05-04 | Mitsubishi Electric Research Laboratories, Inc. | Method for randomly accessing multiview videos |
ZA200805337B (en) * | 2006-01-09 | 2009-11-25 | Thomson Licensing | Method and apparatus for providing reduced resolution update mode for multiview video coding |
BRPI0710048A2 (pt) | 2006-03-30 | 2011-08-02 | Lg Electronics Inc | método e aparelho para decodificar / codificar um sinal de vìdeo |
WO2008023967A1 (en) | 2006-08-25 | 2008-02-28 | Lg Electronics Inc | A method and apparatus for decoding/encoding a video signal |
KR20110123291A (ko) | 2006-10-16 | 2011-11-14 | 노키아 코포레이션 | 멀티뷰 비디오 코딩에서 효율적인 디코딩된 버퍼 관리를 구현하기 위한 시스템 및 방법 |
BRPI0809210A2 (pt) * | 2007-04-04 | 2014-09-02 | Thomson Licensing | Gerenciamento de lista de imagens de referência |
BRPI0910284A2 (pt) * | 2008-03-04 | 2015-09-29 | Thomson Licensing | vista de referência virtual |
-
2010
- 2010-03-09 EP EP10767218.0A patent/EP2424240A4/en not_active Withdrawn
- 2010-03-09 KR KR1020117025926A patent/KR20110139304A/ko not_active Application Discontinuation
- 2010-03-09 US US13/265,657 patent/US8933989B2/en active Active
- 2010-03-09 WO PCT/KR2010/001453 patent/WO2010123203A2/ko active Application Filing
Non-Patent Citations (2)
Title |
---|
None |
See also references of EP2424240A4 |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9560369B2 (en) | 2012-09-28 | 2017-01-31 | Lg Electronics Inc. | Video decoding method and apparatus using the same |
US9848200B2 (en) | 2012-09-28 | 2017-12-19 | Lg Electronics Inc. | Video decoding method and apparatus using the same |
US10390032B2 (en) | 2012-09-28 | 2019-08-20 | Lg Electronics Inc. | Video decoding method and apparatus using the same |
US10567783B2 (en) | 2012-09-28 | 2020-02-18 | Lg Electronics Inc. | Video decoding method and apparatus using the same |
US11259038B2 (en) | 2012-09-28 | 2022-02-22 | Lg Electronics Inc. | Video decoding method and apparatus using the same |
CN112188246A (zh) * | 2020-09-30 | 2021-01-05 | 深圳技威时代科技有限公司 | 一种视频云存储方法 |
Also Published As
Publication number | Publication date |
---|---|
EP2424240A2 (en) | 2012-02-29 |
WO2010123203A3 (ko) | 2010-12-23 |
US20120069903A1 (en) | 2012-03-22 |
EP2424240A4 (en) | 2013-04-10 |
US8933989B2 (en) | 2015-01-13 |
KR20110139304A (ko) | 2011-12-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2010123203A2 (ko) | 다시점 영상의 참조 픽쳐 리스트 변경 방법 | |
WO2010123198A2 (ko) | 다시점 비디오 신호 처리 방법 및 장치 | |
WO2010068020A2 (ko) | 다시점 영상 부호화, 복호화 방법 및 그 장치 | |
US7831102B2 (en) | Processing multiview video | |
KR100949978B1 (ko) | 비디오 신호를 디코딩/인코딩하기 위한 방법 및 장치 | |
KR101619451B1 (ko) | 다시점 비디오 신호의 처리 방법 및 장치 | |
WO2010087589A2 (ko) | 경계 인트라 코딩을 이용한 비디오 신호 처리 방법 및 장치 | |
WO2011008065A2 (en) | Method and apparatus for multi-view video coding and decoding | |
WO2014107083A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
KR20080007086A (ko) | 비디오 신호의 디코딩/인코딩 방법 및 장치 | |
WO2013133587A1 (ko) | 비디오 신호 처리 방법 및 장치 | |
KR101366289B1 (ko) | 비디오 신호의 디코딩/인코딩 방법 및 장치 | |
USRE44680E1 (en) | Processing multiview video | |
WO2012099352A2 (ko) | 다시점 영상 부호화/복호화 장치 및 방법 | |
KR20080023210A (ko) | 비디오 신호 디코딩 방법 및 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 10767218 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 20117025926 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2010767218 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13265657 Country of ref document: US |